Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Collections #103

Open
stuartmcalpine opened this issue Mar 29, 2024 · 1 comment
Open

Collections #103

stuartmcalpine opened this issue Mar 29, 2024 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@stuartmcalpine
Copy link
Collaborator

Similar to Rucio, we could add "collections". An ability to tag datasets and executions jointly under a single collection.

For example datareg.Registrar.collection.register(dataset_ids=[1,2,3], execution_ids=[4,5,6]).

This would be a way of tagging all the components of a single pipeline for example, making it easier to query.

@stuartmcalpine stuartmcalpine added the enhancement New feature or request label Mar 29, 2024
@stuartmcalpine stuartmcalpine self-assigned this Mar 29, 2024
@JoanneBogart
Copy link
Collaborator

Some form of collections could address a number of potential problem use cases. I can see uses for collections of datasets and for collections of executions. I'm not convinced datasets and executions should be lumped together in the same collection, however. Given an execution collection, one can already find all the datasets associated with them. Also, at some level the implementation will involve foreign keys, which will have to be defined as referencing a specific table.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants