Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cell type-aware embedding #9

Open
Nusob888 opened this issue Dec 12, 2024 · 3 comments
Open

cell type-aware embedding #9

Nusob888 opened this issue Dec 12, 2024 · 3 comments
Labels
enhancement New feature or request

Comments

@Nusob888
Copy link

Description of feature

Dear DRVI team,

This is a great tool and am enjoying using it a lot. I am just wondering if there are any plans to incorporate cell-type awareness into the model, similar to SCANVI and MRVI?

Many thanks

@Nusob888 Nusob888 added the enhancement New feature or request label Dec 12, 2024
@moinfar
Copy link
Collaborator

moinfar commented Dec 23, 2024

Dear Bo,

We're delighted to hear that you're enjoying it.

Incorporating supervised information, such as cell types, is indeed an interesting idea. However, DRVI is fundamentally designed for unsupervised discovery in single-cell omics. One of its key strengths is its ability to remain unbiased with respect to potentially imperfect supervised annotations, allowing DRVI to uncover patterns and insights that go beyond existing annotations.

Extending the model to incorporate supervised information is a lower priority than other directions outlined in the discussions section of the paper. Users often incorporate cell-type information to improve integration performance —when the integration performance is measured by the same annotations— introducing a circular logic that we aim to avoid. We’re always open to revisiting our priorities based on compelling use cases. Do you have a reasonable example or scenario where embedding such supervised information could significantly enhance users' analyses? We'd love to hear your perspective.

@Nusob888
Copy link
Author

Nusob888 commented Jan 3, 2025

Dear Amir,

Thanks for the great explanation. Agree re: the circular logic, I guess the case use I was imagining was something akin to Spectra from the Pe'er lab. Where cell type specific programs may be of interest. I guess the easiest way to get around this might be to re-train a model for a specific lineage e.g. Myeloid vs. lymphoid.

@moinfar
Copy link
Collaborator

moinfar commented Jan 21, 2025

Dear Bo,

Thanks for your valuable suggestion.

To acquire lineage-specific programs, one can train DRVI on the whole data and link (not a 1-1 correspondence just relevance) programs identified by DRVI to known variations such as cell types and lineages (this assignment, for example, can be done by mutual information). This approach has the potential to identify not only lineage-specific programs but also shared programs. We expect DRVI to uncover all the mentioned programs as we do not expect irrelevant programs to interfere in DRVI. Alternatively, one can follow your suggested approach and consider it as a nonlinear, batch-corrected version of Spectra.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants