question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Custom (local) UMLS subsets

See original GitHub issue

Hi scispacy team,

First of all, thanks for creating a great tool, I think it’s very useful!

I have a couple of questions related to the UMLS Entity Linker:

Generic UMLS linker From the paper, I understand that scispacy links entities to UMLS concepts from “sections 0, 1, 2 and 9 (SNOMED) of the UMLS 2017 AA release”. Is that still correct? I think it would be useful to add this information to the README as well.

Custom UMLS linker Somewhat related to #234: would it also be possible to link the entities to a local UMLS subset (installed with MetamorphoSys) for people with a UMLS license?

The reason I’m asking is twofold:

  1. UMLS is released twice a year in the first weeks of May and November. The current version is 2020AA and contains new concepts, such as COVID-19, that will currently not be detected by scispacy. Unless you’re planning to do frequent updates on the model, I’d like to be able to use the most frequent concepts.
  2. UMLS is highly customisable; users can select their own subsets of the many vocabularies. It would be great if this customisability is also applied in scispacy.

My understanding is that it is possible by:

Is this correct? Any help or more detailed instructions would be greatly appreciated!

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Comments:6

github_iconTop GitHub Comments

1reaction
gfhayworthcommented, Nov 21, 2022

I was able to get this working. I needed to include a couple of minor tweaks. Added: from scispacy.linking_utils import KnowledgeBase

I applied to code shown by @DeNeutoy then I was able to add_pipe with the new name: umls_nlp.add_pipe("scispacy_linker", config={"resolve_abbreviations": True, "linker_name": "umls2020" } ) see linker_name is changed to the custom value “umls2020”

1reaction
ChantalvanSoncommented, Jul 7, 2020

Hi @DeNeutoy,

Thanks for the alternative! I’m adopting that, because it’s indeed a bit nicer than my previous solution. I completely understand the reasons for implementing it as you did 😃 Providing a function that does this global mutation with intelligent errors sounds like a nice addition to scispacy!

Edit: I just posted another error here that appeared to be my own mistake, so I have deleted it again.

Read more comments on GitHub >

github_iconTop Results From Across the Web

MetamorphoSys Help - UMLS - National Library of Medicine
Install, Customize, and Subset the UMLS ... MetamorphoSys creates a top-level destination directory in local storage for the UMLS Knowledge Sources.
Read more >
MetamorphoSys - The UMLS Installation and Customization ...
MetamorphoSys is the UMLS installation wizard and Metathesaurus customization ... it enables you to create customized Metathesaurus subsets.
Read more >
Load UMLS into MySQL - Research
I have built a load script that will import the lexical tables whose structure I have been able to figure out into a...
Read more >
UMLS users and uses: a current overview - Oxford Academic
In production applications, the UMLS is used to map local terms to standard terminologies, annotate records with standard terminology to improve ...
Read more >
Cross-domain targeted ontology subsets for annotation
We use the SNOMED CORE subset as source; NDF-RT, RxNorm, and the UMLS ... The UMLS 2010AB release was installed in a local...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found