question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

How is concept_aliases.json constructed?

See original GitHub issue

How concept_aliases.json is constructed? https://github.com/allenai/scispacy/search?q=“concept_aliases”&unscoped_q=“concept_aliases”

By using this table, I think, if mention is included in this table, mention-entity prior(=similarity) is fixed to 1.0. (If my understanding is incorrect, I’d appreciate if you’d write.)

Currently I’m curious about, how this concept_aliases.json is constructed? It seems that this is actually list, whose length is 4426690, but I can’t understand how this list was constructed. Also, I supposed that alias table is something like,

{'entitiy1':[alias1_1,alias1_2,alias1_3], 'entity2':[alias2_1,alias2_2,...],...}

but concept_aliases.json is list. I’d like to know what this list means.

Still I’m currently reading codes and maybe I missed some codes, which is related to constructing this table.

Issue Analytics

  • State:closed
  • Created 4 years ago
  • Comments:8

github_iconTop GitHub Comments

2reactions
DeNeutoycommented, Feb 7, 2020

Basically, the problem is that if you have some canonical entities like:

"Ford County, Texas": ["Ford County", "Ford County, Texas"]
"Ford County, Alabama": ["Ford County", "Ford County, Alabama"]

etc

And you encounter the string “Ford County” in some text, there is no way to know which canonical entity (which all share the same alias which is an exact match) you should link to. So we default to returning all of them. Obviously this is the point at which Entity Linking becomes difficult and actually interesting, but the good thing is that this doesn’t happen hugely often 😄

0reactions
DeNeutoycommented, Feb 17, 2020

Closing, please open more issues if you need any clarifications, or you want to collaborate or something!

Read more comments on GitHub >

github_iconTop Results From Across the Web

Working with JSON - Learn web development | MDN
In this article, we've given you a simple guide to using JSON in your programs, including how to create and parse JSON, and...
Read more >
An Introduction to JSON | DigitalOcean
JSON — short for JavaScript Object Notation — is a format for ... At the broader level, values can also be made up...
Read more >
JavaScript JSON - W3Schools
JSON is often used when data is sent from a server to a web page. ... Then, use the JavaScript built-in function JSON.parse()...
Read more >
JSON Objects Explained! - ShapeDiver
Once the structure of the JSON objects is clearly defined, you can build some examples in order to use them as test inputs...
Read more >
What is JSON? - Squarespace Developers
JSON, or JavaScript Object Notation, is a minimal, readable format for structuring data. It is used primarily to transmit data between a server...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found