question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Index Wikipedia's hierarchical categories and sub-categories as a FacetField

See original GitHub issue

@rmuir observed in this issue that Wikipedia already has labels/categories per page, and these labels have sub-categories, etc.

This would be another (in addition to the high cardinality but flat RandomLabel we recently added) great way of testing facet labels.

Issue Analytics

  • State:open
  • Created 2 years ago
  • Comments:6 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
mikemccandcommented, Dec 19, 2021

I think we have pretty good low cardinality facet fields. We have two flat ones (dayOfYear, with 365 unique values, and weekday with seven values), and one hierarchical YYYY/MM/DD.

0reactions
mdmarshmallowcommented, Dec 8, 2021

That is true, we can benchmark facets from both datasets separately then as a low and high cardinality test.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Hierarchical structure of the Big Five - Wikipedia
The Big Five personality characteristics represent one level in a hierarchy of traits. These traits can be subdivided into collections of aspects or...
Read more >
How to create Solr schema for hierarchical facet by splitting ...
I want to implement Solr hierarchical facet for my application where there is 2 level hierarchy between Category and SubCategory. I want to...
Read more >
Evolution of Wikipedia's Category Structure - ResearchGate
We investigate the evolution of the category structure of the English Wikipedia from its birth in 2004 to 2008. We treat the category...
Read more >
Facet Userguide - Apache Lucene
A category is an aspect of indexed documents which can be used to classify ... get facet results, which are lists of subcategories...
Read more >
Help:Categories - MediaWiki
Categories, a software feature of MediaWiki, provide automatic indexes that are useful as tables of contents. You can categorize pages and ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found