Skip to content

Commit

Permalink
Remove SPOM_ prefix that NCBI Gene is adding to PomBase systematic names
Browse files Browse the repository at this point in the history
  • Loading branch information
kevinschaper committed Jul 11, 2024
1 parent 0299da3 commit e8dc36f
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions src/monarch_gene_mapping/cli_utils.py
Original file line number Diff line number Diff line change
Expand Up @@ -281,6 +281,7 @@ def generate_gene_mappings() -> DataFrame:
object_curie_prefix="NCBIGene:",
predicate_id="skos:exactMatch",
mapping_justification="semapv:UnspecifiedMatching")
pombase_to_ncbi['subject_id'] = pombase_to_ncbi['subject_id'].str.replace("SPOM_","") # remove SPOM_ prefix
valid_pombase_genes = pd.read_csv("data/pombase/gene_IDs_names_products.tsv",
sep="\t", usecols=["gene_systematic_id_with_prefix"])
# only keep rows where the subject_id is in valid_pombase_genes
Expand Down

0 comments on commit e8dc36f

Please sign in to comment.