How is concept_aliases.json constructed?
See original GitHub issueHow concept_aliases.json
is constructed?
https://github.com/allenai/scispacy/search?q=“concept_aliases”&unscoped_q=“concept_aliases”
By using this table, I think, if mention is included in this table, mention-entity prior(=similarity) is fixed to 1.0. (If my understanding is incorrect, I’d appreciate if you’d write.)
Currently I’m curious about, how this concept_aliases.json
is constructed?
It seems that this is actually list, whose length is 4426690, but I can’t understand how this list was constructed.
Also, I supposed that alias table is something like,
{'entitiy1':[alias1_1,alias1_2,alias1_3], 'entity2':[alias2_1,alias2_2,...],...}
but concept_aliases.json
is list. I’d like to know what this list means.
Still I’m currently reading codes and maybe I missed some codes, which is related to constructing this table.
Issue Analytics
- State:
- Created 4 years ago
- Comments:8
Top Results From Across the Web
Working with JSON - Learn web development | MDN
In this article, we've given you a simple guide to using JSON in your programs, including how to create and parse JSON, and...
Read more >An Introduction to JSON | DigitalOcean
JSON — short for JavaScript Object Notation — is a format for ... At the broader level, values can also be made up...
Read more >JavaScript JSON - W3Schools
JSON is often used when data is sent from a server to a web page. ... Then, use the JavaScript built-in function JSON.parse()...
Read more >JSON Objects Explained! - ShapeDiver
Once the structure of the JSON objects is clearly defined, you can build some examples in order to use them as test inputs...
Read more >What is JSON? - Squarespace Developers
JSON, or JavaScript Object Notation, is a minimal, readable format for structuring data. It is used primarily to transmit data between a server...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Basically, the problem is that if you have some canonical entities like:
And you encounter the string “Ford County” in some text, there is no way to know which canonical entity (which all share the same alias which is an exact match) you should link to. So we default to returning all of them. Obviously this is the point at which Entity Linking becomes difficult and actually interesting, but the good thing is that this doesn’t happen hugely often 😄
Closing, please open more issues if you need any clarifications, or you want to collaborate or something!