This document describes how to test the implementation of graphdatascience
as well as how to check that its code style is maintained.
Tests can be found in graphdatascience/tests
. In each of the folders there, unit
and integration
, the tests of that respective type reside.
Please see the section Specifically for this project of our contribution guidelines for how to set up an environment for testing and style checking.
NOTE: This document does not cover documentation testing. Please see the documentation README for that.
The unit tests are run without a connection a database. Typically the CollectingQueryRunner
is used to mock running queries.
To run the unit tests (with default options), simply call:
pytest graphdatascience/tests/unit
In order to run the integration tests one must have a Neo4j DBMS with the Neo4j Graph Data Science library installed running.
The tests will through the Neo4j Python driver connect to a Neo4j database based on the environment variables:
NEO4J_URI
(defaulting to "bolt://localhost:7687" if unset),NEO4J_USER
,NEO4J_PASSWORD
(defaulting to "neo4j" if unset),NEO4J_DB
(defaulting to "neo4j" if unset).
However, if NEO4J_USER
is not set the tests will try to connect without authentication.
Once the driver connects successfully to the Neo4j DBMS the tests will go on to execute against the NEO4J_DB
database.
To run the integration tests (with default options), simply call:
pytest graphdatascience/tests/integration
To include tests that require the Enterprise Edition of the Neo4j Graph Data Science library, you must specify the option --include-enterprise
.
Naturally, this requires access to a valid Neo4j GDS license key, which can be acquired via the Neo4j GDS product website.
To include tests that exercise storing and loading models, you must specify the option --include-model-store-location
.
Note however that this also requires you to have specified a valid path for the gds.model.store_location
configuration key of your database.
If the database you are targeting is an AuraDS instance, you should use the option --target-aura
which makes sure that tests of operations not supported on AuraDS are skipped.
In order to run integration tests that test encryption features, you must setup the Neo4j server accordingly:
# Enable the Arrow Flight Server (necessary if you run integration tests that require Arrow)
gds.arrow.enabled=true
# Allow bolt connections either encrypted or unencrypted
dbms.connector.bolt.tls_level=OPTIONAL
dbms.ssl.policy.bolt.enabled=true
dbms.ssl.policy.bolt.base_directory=<absolute-path-to-graph-datascience-client>/graphdatascience/tests/integration/resources
dbms.ssl.policy.bolt.public_certificate=arrow-flight-gds-test.crt
dbms.ssl.policy.bolt.private_key=arrow-flight-gds-test.key
dbms.ssl.policy.bolt.client_auth=NONE
To run only integration tests that are marked as encrypted_only
, call:
pytest graphdatascience/tests/integration --encrypted-only
There are integration tests that are only compatible with certain versions of the GDS library. For example, a procedure (which does not follow the standard algorithm procedure pattern) introduced in version 2.1.0 of the library will not exist in version 2.0.3, and so any client side integration tests that call this procedure should not run when testing against server library version 2.0.3. For this reason only tests compatible with the GDS library server version you are running against will run.
The code follows a rather opinionated style based on pep8.
You can check all code using all the below mentioned code checking tools by running the scripts/checkstyle
bash script.
There's also a scripts/makestyle
to do formatting.
To enforce pep8 conformity (with the exception of using max line length = 120) flake8 is used. To run it to check the entire repository, simply call:
flake8
from the root. See .flake8
for our custom flake8 settings.
For general formatting we use black with default settings. black can be run to format the entire repository by calling:
black .
from the root. See the [tool.black]
section of pyproject.toml
for our custom black settings.
Additionally isort is used for consistent import sorting. It can similarly be run to format all source code by calling:
isort .
from the root. See .isort.cfg
for our custom isort settings.
The code is annotated with type hints in order to provide documentation and allow for static type analysis with mypy.
Please note that the typing
library is used for annotation types in order to stay compatible with Python versions < 3.9.
To run static analysis on the entire repository with mypy, just run:
mypy .
from the root. See mypy.ini
for our custom mypy settings.