Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How does sourceapp handle tied matches? #2

Open
bglindner opened this issue Jan 19, 2024 · 0 comments
Open

How does sourceapp handle tied matches? #2

bglindner opened this issue Jan 19, 2024 · 0 comments
Labels
documentation Improvements or additions to documentation enhancement New feature or request

Comments

@bglindner
Copy link
Owner

bglindner commented Jan 19, 2024

SourceApp is designed to do competitive read mapping. That is, we should only be counting each read's best alignment if that alignment is above user-specified criteria (e.g., percent identity, etc.). Although not necessarily super common, tied matches occur when a read has multiple best-scoring matches to multiple subjects.

This isn't a huge deal when the subject sequences are simply different contigs (or regions of the same) belonging to the same genome or even different genomes but belonging to the same source category. It's not quite the same when there are instances of tied matches belonging to genomes across different source categories. What should we do about that? Right now, we have --remove-crx as a step in sourceapp_build.py which serves a sort of stopgap for this issue (the idea being, if we remove genomes belonging to the same cluster, then this is less likely to occur).

Either way, depending on the read mapper used, primary alignments are usually just selected at random when there are ties like this. Should we handle this differently, perhaps retaining information on tied matches and creating some sort of error bounds from this (e.g., 2% +/-0.2%)?

@bglindner bglindner added documentation Improvements or additions to documentation enhancement New feature or request labels Jan 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant