Why are the supported languages hardcoded?
Hi all,
first of all, thank you for this awesome work!
I’m fairly new to NLP & co in general, so I’ve started studying as much as possible to get a grasp of it. Meanwhile, I’ve been tinkering with rasa_nlu and, after running the provided default examples, I tried to use it with Italian, but I can’t specify that in the configuration file (or on the CLI) because the supported languages are hardcoded (en and de).
I’m aware it needs a total_word_feature_extractor (at least for the MITIE backend). I’ve generated one from a relatively small Italian corpus, but I still can’t use it with rasa_nlu.
I’ve also read somewhere that it’s possible to do without a predefined language model at the cost of very low-quality results, but at the stage I’m at, that’s totally acceptable.
So, is it possible (or is it planned) to support more languages and/or work around the hardcoded languages?
Thanks in advance for the replies, and keep up the good work!
Hi @asssmidt - I’ll add some instructions to the README 😃
There is a group of people actively working on French support. Please email me & I’ll introduce you!