Skip to content

Commit

Permalink
Further doc updates
Browse files Browse the repository at this point in the history
  • Loading branch information
johnlees committed Aug 7, 2024
1 parent f665b83 commit 681a8b5
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion docs/model_distribution.rst
Original file line number Diff line number Diff line change
Expand Up @@ -12,7 +12,7 @@ Database contents
A database requires the following files:

- ``.h5``. The sketch database, a HDF5 file.
- ``.dists.pkl`` and ``.dists.npy`` files. Distances for all vs all samples in the sketch database.
- ``.dists.pkl`` file. Order and names of samples in the sketch database.
- ``_fit.npz`` and ``_fit.pkl`` files. Python files which describe the model fit.
- ``_graph.gt``. The network relating distances, fit and strain assignment for all samples in the sketch database.
- ``_clusters.csv``. The strain assignment of all samples in the sketch database.
Expand Down
2 changes: 1 addition & 1 deletion docs/query_assignment.rst
Original file line number Diff line number Diff line change
Expand Up @@ -18,7 +18,7 @@ as queries. The clusters assigned by PopPUNK are variable-length-k-mer clusters
A database called ``database`` will contain the following files, in ``database/``:

- ``database.h5`` -- the sketches of the reference sequences generated by ``pp-sketchlib``.
- ``database.dists.npy`` and ``database.dists.pkl`` -- the core and accessory distances for
- ``database.dists.pkl`` -- the order of the core and accessory distances for
all pairwise comparisons in the sketch database.
- ``database_fit.npz`` and ``database_fit.pkl`` -- the model fit to the core and accessory distances.
- ``database_graph.gt`` -- the network defining the fit (loadable with ``graph_tool``).
Expand Down

0 comments on commit 681a8b5

Please sign in to comment.