Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for Automatic <py-env> with dependency + few tweaks to contributing and readme docs #36

Open
wants to merge 36 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
36 commits
Select commit Hold shift + click to select a range
26b1de5
import modules parsing to support py-env dependencies
leriomaggio Aug 11, 2022
96db47d
generator now includes import collection for files
leriomaggio Aug 11, 2022
07ba09a
basic template extended to support py-env
leriomaggio Aug 11, 2022
39c7fc2
Improved parsing with local namespace inspection
leriomaggio Aug 12, 2022
de00f05
Improved find_imports encapsulation
leriomaggio Aug 12, 2022
0e625ce
fixed typo in docstring
leriomaggio Aug 12, 2022
4862c57
finderresults integration
leriomaggio Aug 12, 2022
d94afdf
Integration of parsing results and new Warning msg
leriomaggio Aug 12, 2022
3fa0ecc
Jupyter notebook conversion supported
leriomaggio Aug 12, 2022
713d37e
Merge remote-tracking branch 'upstream/main' into feat_pyenv_support
leriomaggio Sep 6, 2022
94861f3
ignored spotlight index files
leriomaggio Sep 7, 2022
55407bf
ignored PyCharm dev env
leriomaggio Sep 7, 2022
d28f625
nbconvert and pre-commit as extra deps
leriomaggio Sep 7, 2022
77b5d78
Specialised contribution instructions for dev
leriomaggio Sep 7, 2022
efae1d6
Finder results are not optional anymore!
leriomaggio Sep 7, 2022
95591a3
Finder results are not optional anymore!
leriomaggio Sep 7, 2022
2e76c49
docstring to new Warning typer class
leriomaggio Sep 7, 2022
953a8a3
Integrated the new imports/pyenv support in new wrap hook command
leriomaggio Sep 7, 2022
1635b2e
New tests for wrapping code with imports
leriomaggio Sep 7, 2022
35f064b
Reformatted to not exceed line length
leriomaggio Sep 7, 2022
d9c18a1
Added ipykernel dep and example in README for notebooks conv.
leriomaggio Sep 7, 2022
ec40312
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Sep 7, 2022
9cefe98
Removed unspotted unused import
leriomaggio Sep 7, 2022
2ffb9eb
merge and removed import unused
leriomaggio Sep 7, 2022
a774ec4
Merge branch 'feat_pyenv_support' of github.com:leriomaggio/pyscript-…
leriomaggio Sep 7, 2022
686e446
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Sep 7, 2022
92ce5a0
ignored pyenv python version
leriomaggio Sep 7, 2022
46d848b
Merge branch 'feat_pyenv_support' of github.com:leriomaggio/pyscript-…
leriomaggio Sep 7, 2022
945535a
configuring isort to avoid conflicts with black
leriomaggio Sep 7, 2022
19d753f
Upgrading poetry-core dep to see if tests pass
leriomaggio Sep 7, 2022
072c202
Upgrading poetry-core dep to see if tests pass
leriomaggio Sep 7, 2022
8466380
Upgrading poetry-core dep to see if tests pass
leriomaggio Sep 7, 2022
1e96280
Comment out invalid poetry dependency section
mattkram Sep 7, 2022
40751a9
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Sep 7, 2022
fa93160
Fix type hings
mattkram Sep 7, 2022
29bfdda
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Sep 7, 2022
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 5 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -1,3 +1,6 @@
# OSx rubbish
.DS_Store

# Byte-compiled
__pycache__/
*.py[cod]
Expand All @@ -12,9 +15,11 @@ docs/_build/
.venv
env/
venv/
.idea

# Poetry lock file
poetry.lock
.python-version

# Unit test / coverage reports
htmlcov/
Expand Down
40 changes: 40 additions & 0 deletions CONTRIBUTING.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,45 @@
# Contribution guide for developers

## Development

### Setting up the environment

To get started, you will need to setup the development environment. This goes
through two main steps, namely [(1) Installing `pyscript-cli` dependencies](#install-pyscript-cli-dependencies)
and [(2) Installing `pyscript-cli` Dev dependencies](#install-the-development-dependencies).

### Install `pyscript-cli` dependencies

`pyscript-cli` requires [`poetry`](https://python-poetry.org/) to manage dependencies
and setup the environment.

Therefore, the first thing to do is to make sure that you have `poetry` installed
in your current Python environment. Please refer to the official `poetry`
[documentation](https://python-poetry.org/docs/master/#installation) for installation
instructions.

To install `pyscript-cli` dependencies, you will need to run the following command from the project root:

```shell
poetry install
```

### Install the development dependencies

There are a few extra dependencies that are solely required for development.
To install these packages, you will need to run the following command from the project root:

```shell
poetry install --with dev-dependencies
```

Once all the dependencies are installed, you will only need to setup the git hooks
via [`pre-commit`](https://pre-commit.com/):

```shell
pre-commit install
```

## Documentation

### Install the documentation dependencies
Expand Down
8 changes: 7 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -21,12 +21,18 @@ $ pip install pyscript

## Usage

### Embed a Python script into a PyScript HTML file
### Embed Python code into a PyScript HTML file

```shell
$ pyscript wrap <filename.py>
```

Alternatively you could also use a **Jupyter notebook** as input file:

```shell
$ python wrap <filename.ipynb>
```

This will generate a file called `<filename.html>` by default.
This can be overwritten with the `-o` or `--output` option:

Expand Down
10 changes: 9 additions & 1 deletion pyproject.toml
Original file line number Diff line number Diff line change
@@ -1,7 +1,10 @@
[build-system]
requires = ["poetry-core>=1.0.0"]
requires = ["poetry-core>=1.1.0b3"]
build-backend = "poetry.core.masonry.api"

[tool.isort]
profile = "black"

[tool.poetry]
name = "pyscript"
version = "0.2.4"
Expand Down Expand Up @@ -32,12 +35,17 @@ sphinx-autobuild = {version = "^2021.3.14", optional = true}
sphinx-autodoc-typehints = {version = "^1.19.2", optional = true}
myst-parser = {version = "^0.18.0", optional = true}
pydata-sphinx-theme = {version = "^0.9.0", optional = true}
requests = "^2.28.1"
six = "^1.16.0"
nbconvert = "^7.0.0"
ipykernel = "^6.15.2"

[tool.poetry.dev-dependencies]
coverage = "^6.3.2"
mypy = "^0.950"
pytest = "^7.1.2"
types-toml = "^0.10.8"
pre-commit = "^2.20.0"

[tool.poetry.extras]
docs = [
Expand Down
58 changes: 51 additions & 7 deletions src/pyscript/_generator.py
Original file line number Diff line number Diff line change
Expand Up @@ -5,21 +5,65 @@
import jinja2
import toml

_env = jinja2.Environment(loader=jinja2.PackageLoader("pyscript"))
from ._node_parser import FinderResult, _convert_notebook, find_imports


def string_to_html(input_str: str, title: str, output_path: Path) -> None:
class UnsupportedFileType(Exception):
pass


_env = jinja2.Environment(
loader=jinja2.PackageLoader("pyscript"), trim_blocks=True, lstrip_blocks=True
)


def string_to_html(
input_str: str, title: str, output_path: Path, pyenv: FinderResult = None
) -> None:
"""Write a Python script string to an HTML file template."""
template = _env.get_template("basic.html")
if pyenv is not None:
modules, paths = pyenv.packages, pyenv.paths
else:
modules = paths = ()
with output_path.open("w") as fp:
fp.write(template.render(code=input_str, title=title))
fp.write(
template.render(code=input_str, title=title, modules=modules, paths=paths)
)


def file_to_html(input_path: Path, title: str, output_path: Optional[Path]) -> None:
"""Write a Python script string to an HTML file template."""
def file_to_html(
input_path: Path, title: str, output_path: Optional[Path]
) -> FinderResult:
"""Write a Python script string to an HTML file template.

Warnings will be returned when scanning for environment, if any.
"""
output_path = output_path or input_path.with_suffix(".html")
with input_path.open("r") as fp:
string_to_html(fp.read(), title, output_path)

fname, extension = input_path.name, input_path.suffix
if extension == ".py":
with open(input_path, "rt") as f:
source = f.read()

elif extension == ".ipynb":
try:
import nbconvert # noqa
except ImportError as e: # pragma no cover
raise ImportError(
"Please install nbconvert to serve Jupyter Notebooks."
) from e
else:
source = _convert_notebook(input_path)

else:
raise UnsupportedFileType(
"{} is neither a script (.py) nor a notebook (.ipynb)".format(fname)
)

import_results = find_imports(source, input_path)
string_to_html(source, title, output_path, pyenv=import_results)
return import_results


def create_project(
Expand Down
194 changes: 194 additions & 0 deletions src/pyscript/_node_parser.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,194 @@
"""
ast-based parser to gather modules/package dependencies of a Python module.
Code adapted from the find-imports project, currently in graveyard archive.
"""
from __future__ import annotations

import ast
import os
import pkgutil
from collections import defaultdict
from itertools import chain, filterfalse
from pathlib import Path

from ._supported_packages import PACKAGE_RENAMES, PYODIDE_PACKAGES, STANDARD_LIBRARY


class NamespaceInfo:
def __init__(self, source_fpath: Path) -> None:
# expanding base_folder to absolute as pkgutils.
# FileFinder will do so - easier for later purging
self.base_folder = str(source_fpath.parent.absolute())
self.source_mod_name = source_fpath.stem
self._collect()
# storing this as it will be useful for multiple lookups
self._all_namespace = set(chain(self.modules, self._packages))

def _collect(self):
iter_modules_paths = [self.base_folder]
for root, dirs, files in os.walk(self.base_folder):
for dirname in dirs:
iter_modules_paths.append(os.path.join(root, dirname))

# need to consume generator as I will iterate
# two times for _packages, and modules
pkg_mods = tuple(pkgutil.iter_modules(iter_modules_paths))
modules = map(
lambda mi: os.path.join(mi.module_finder.path, mi.name),
filterfalse(
lambda mi: mi.ispkg or mi.name == self.source_mod_name, pkg_mods
),
)
_packages = map(
lambda mi: os.path.join(mi.module_finder.path, mi.name),
filter(lambda mi: mi.ispkg, pkg_mods),
)
self.modules = set(map(self._dotted_path, modules))
self._packages = set(map(self._dotted_path, _packages))

def _dotted_path(self, p: str):
p = p.replace(self.base_folder, "").replace(os.path.sep, ".")
if p.startswith("."):
p = p[1:]
return p

def __contains__(self, item: str) -> bool:
return item in self._all_namespace

def __str__(self) -> str:
return (
f"NameSpace info for {self.base_folder} \n\t "
f"Modules: {self.modules} \n\t Packages: {self._packages}"
)

def __repr__(self) -> str:
return str(self)


class FinderResult:
def __init__(self) -> None:
self._packages: set[str] = set()
self._locals: set[str] = set()
self._unsupported: defaultdict[str, set] = defaultdict(set)

def add_package(self, pkg_name: str) -> None:
self._packages.add(pkg_name)

def add_locals(self, pkg_name: str) -> None:
self._locals.add(pkg_name)

def add_unsupported_external_package(self, pkg_name: str) -> None:
self._unsupported["external"].add(pkg_name)

def add_unsupported_local_package(self, pkg_name: str) -> None:
self._unsupported["local"].add(pkg_name)

@property
def has_warnings(self):
return len(self._unsupported) > 0

@property
def unsupported_packages(self):
return self._unsupported["external"]

@property
def unsupported_paths(self):
return self._unsupported["local"]

@property
def packages(self):
return self._packages

@property
def paths(self):
pyenv_paths = map(
lambda l: "{}.py".format(l.replace(".", os.path.sep)), self._locals
)
return set(pyenv_paths)


# https://stackoverflow.com/a/58847554
class ModuleFinder(ast.NodeVisitor):
def __init__(self, context: NamespaceInfo, *args, **kwargs):
# list of all potential local imports
self.context = context
self.results = FinderResult()
super().__init__(*args, **kwargs)

def visit_Import(self, node):
for name in node.names:
# need to check for absolute module import here as they won't work in PyScript
# absolute package imports will be found later in _import_name
if len(name.name.split(".")) > 1 and name.name in self.context:
self.results.add_unsupported_local_package(name.name)
else:
self._import_name(name.name)

def visit_ImportFrom(self, node):
# if node.module is missing it's a "from . import ..." statement
# if level > 0 it's a "from .submodule import ..." statement
if node.module is not None:
self._import_name(node.module)

def _import_name(self, imported):
if imported in self.context:
if imported not in self.context._packages:
self.results.add_locals(imported)
else:
self.results.add_unsupported_local_package(imported)
else:
imported = imported.split(".")[0]
pkg_name = PACKAGE_RENAMES.get(imported, imported)
if pkg_name not in STANDARD_LIBRARY:
if pkg_name in PYODIDE_PACKAGES:
self.results.add_package(pkg_name)
else:
self.results.add_unsupported_external_package(pkg_name)


def _find_modules(source: str, source_fpath: Path):
fname = source_fpath.name
# importing all local modules from source_fpath
namespace_info = NamespaceInfo(source_fpath=source_fpath)
# passing mode='exec' just in case defaults will change in the future
nodes = ast.parse(source, fname, mode="exec")

finder = ModuleFinder(context=namespace_info)
finder.visit(nodes)
return finder.results


def _convert_notebook(source_fpath: Path) -> str:
from nbconvert import ScriptExporter

exporter = ScriptExporter()
source, _ = exporter.from_filename(source_fpath)

return source


def find_imports(
source: str,
source_fpath: Path,
) -> FinderResult:
"""
Parse the input source, and returns its dependencies, as organised in
the sets of external _packages, and local modules, respectively.
Any modules or package with the same name found in the local

Parameters
----------
source : str
Python source code to parse
source_fpath : Path
Path to the input Python module to parse

Returns
-------
FinderResult
Return the results of parsing as a `FinderResult` instance.
This instance provides reference to packages and paths to
include in the py-env, as well as any unsupported import.
"""

return _find_modules(source, source_fpath)
Loading