Find a good English dictionary for us!
See original GitHub issueHello spoooopyyy hackers 🎃
This is a Hacktoberfest only issue! 👻
This is also data-sciency!
The Problem
Our English dictionary contains words that aren’t English, and does not contain common English words.
Examples of non-common words in the dictionary:
"hlithskjalf",
"hlorrithi",
"hlqn",
"hm",
"hny",
"ho",
"hoactzin",
"hoactzines",
This is our current dictionary:
https://github.com/dwyl/english-words
What we want
An English dictionary without English words that are horrible, and with common English words in JSON format.
Ideas on how to achieve this
You’ll likely need to use data science, parse English text (such as books / stories) and find uncommon words to remove them. Also potentially adding more words.
I’m not the best data scientist in the world, so what you decide will be good.
You can also publish this work outside of Ciphey, such as in a separate GitHub repository – so long as Ciphey can use it ❤️
While I’m not an expert data scientist, I have studied it – so if you need help leave a comment 😃
Issue Analytics
- State:
- Created 3 years ago
- Comments:21 (16 by maintainers)
Top GitHub Comments
Issue-Label Bot is automatically applying the label
feature_request
to this issue, with a confidence of 0.62. Please mark this comment with 👍 or 👎 to give our bot feedback!Links: app homepage, dashboard and code for this bot.
Sorry we forgot to close this, we added a good English dictionary a while ago. We don’t need any new dictionaries.