Manifest file for all graphs and subgraphs #1
Comments
As with KG-OBO, we would also like to keep track of broken links. This means the manifest should not be written anew with each update, at least not without first reading it to store all obsolete IDs.
As per Slack discussion: also keep track of KGX sources, CURIE namespaces, and Biolink types. (Split into issue #8)
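A minimal sketch of the read-before-write idea, assuming the manifest can be represented as a mapping from ID to a metadata dict. The function name `update_manifest` and the `obsolete` flag are hypothetical, not part of any existing KG-Hub code:

```python
from typing import Dict, Iterable


def update_manifest(previous: Dict[str, dict], current_ids: Iterable[str]) -> Dict[str, dict]:
    """Merge a fresh listing into the previous manifest without dropping old IDs.

    `previous` is the manifest as read from disk before the update;
    `current_ids` are the IDs found in this run. Both shapes are assumptions.
    """
    current = set(current_ids)
    merged: Dict[str, dict] = {}
    # Carry over every previously known entry and flag the ones that vanished.
    for entry_id, entry in previous.items():
        merged[entry_id] = {**entry, "obsolete": entry_id not in current}
    # Add entries that are seen for the first time in this run.
    for entry_id in sorted(current - set(previous)):
        merged[entry_id] = {"obsolete": False}
    return merged
```

With this shape, an ID that disappears from the listing stays in the manifest and is only marked obsolete, so broken links remain traceable across updates.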
Other misc TODOs:
Will also need to set up a Jenkinsfile to run this weekly or so. (I'll make that its own PR so it can have its own Jenkins test branch, and its own issue: see #7.)
There is a degree of structural validation the manifest step can accomplish, with items like the following:
Each DataPackage object (here, compressed files) should include a list of its components (here, node and edge lists).
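One way to collect those components is to list the members of each compressed file, sketched below. It assumes the archives are tar files and that node and edge lists follow a `*_nodes.tsv` / `*_edges.tsv` naming pattern; the function name `list_components` is hypothetical:

```python
import io
import tarfile
from typing import Dict, List


def list_components(archive: io.BufferedIOBase) -> Dict[str, List[str]]:
    """Group the regular files in a tar archive into nodes, edges, and other.

    The suffix checks below are an assumed naming convention, not a
    guarantee about every graph on KG-Hub.
    """
    components: Dict[str, List[str]] = {"nodes": [], "edges": [], "other": []}
    # "r:*" lets tarfile detect the compression (gzip, bz2, xz, or none).
    with tarfile.open(fileobj=archive, mode="r:*") as tar:
        for member in tar.getmembers():
            if not member.isfile():
                continue  # skip directories and links
            if member.name.endswith("_nodes.tsv"):
                components["nodes"].append(member.name)
            elif member.name.endswith("_edges.tsv"):
                components["edges"].append(member.name)
            else:
                components["other"].append(member.name)
    return components
```

An archive with an empty `nodes` or `edges` list could then be flagged during the manifest step as structurally incomplete.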
In its role as an index of KGs, it would be useful for KG-HUB to provide a list (e.g., a manifest file) of all graphs and their component subgraphs (in most cases besides KG-OBO, the source transforms) and to make this list publicly viewable. It should include metadata such as graph descriptions. In theory, these could be pulled from the download.yaml for each project.
See also the draft LinkML dataset distribution schema:
https://github.com/linkml/linkml-model/blob/main/linkml_model/model/schema/datasets.yaml
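As a rough illustration of what one manifest record might hold, the sketch below nests subgraphs under their parent graph and serializes the result as JSON. The class names, field names, and the choice of JSON are all assumptions made for this example; they are not taken from the LinkML schema linked above:

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List


@dataclass
class Subgraph:
    """A component of a graph, e.g. one source transform (hypothetical shape)."""
    name: str
    description: str = ""


@dataclass
class GraphEntry:
    """One graph in the manifest together with its subgraphs (hypothetical shape)."""
    name: str
    description: str = ""
    subgraphs: List[Subgraph] = field(default_factory=list)


def render_manifest(entries: List[GraphEntry]) -> str:
    """Serialize the entries as a JSON list suitable for public viewing."""
    return json.dumps([asdict(entry) for entry in entries], indent=2)
```

For example, `render_manifest([GraphEntry("example-kg", "An example graph", [Subgraph("example-source")])])` yields a single-element JSON list whose `subgraphs` field contains one record.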