Index Wikipedia's hierarchical categories and sub-categories as a FacetField
See original GitHub issue@rmuir observed in this issue that Wikipedia already has labels/categories per page, and these labels have sub-categories, etc.
This would be another (in addition to the high cardinality but flat RandomLabel
we recently added) great way of testing facet labels.
Issue Analytics
- State:
- Created 2 years ago
- Comments:6 (4 by maintainers)
Top Results From Across the Web
Hierarchical structure of the Big Five - Wikipedia
The Big Five personality characteristics represent one level in a hierarchy of traits. These traits can be subdivided into collections of aspects or...
Read more >How to create Solr schema for hierarchical facet by splitting ...
I want to implement Solr hierarchical facet for my application where there is 2 level hierarchy between Category and SubCategory. I want to...
Read more >Evolution of Wikipedia's Category Structure - ResearchGate
We investigate the evolution of the category structure of the English Wikipedia from its birth in 2004 to 2008. We treat the category...
Read more >Facet Userguide - Apache Lucene
A category is an aspect of indexed documents which can be used to classify ... get facet results, which are lists of subcategories...
Read more >Help:Categories - MediaWiki
Categories, a software feature of MediaWiki, provide automatic indexes that are useful as tables of contents. You can categorize pages and ...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
I think we have pretty good low cardinality facet fields. We have two flat ones (
dayOfYear
, with 365 unique values, andweekday
with seven values), and one hierarchical YYYY/MM/DD.That is true, we can benchmark facets from both datasets separately then as a low and high cardinality test.