question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

how can we add Stanford CoreNLU Arabic language support to Rasa-core? or is there any other way to add support for arabic language?

See original GitHub issue

Rasa NLU version: master branch

Operating system (windows, osx, …): windows

Content of model configuration file:


Issue:

Issue Analytics

  • State:closed
  • Created 5 years ago
  • Comments:20 (9 by maintainers)

github_iconTop GitHub Comments

5reactions
91nscommented, Jun 19, 2018

@reza-ebrahimi

If you pull the latest version of Rasa NLU and use this pipeline it will work for Arabic Intent Classification + NER. I tried this configuration and it’s very good.

language: “ar”

pipeline:

  • name: “tokenizer_whitespace”
  • name: “ner_crf”
  • name: “intent_featurizer_count_vectors”
  • name: “intent_classifier_tensorflow_embedding” intent_tokenization_flag: true intent_split_symbol: “_”
4reactions
andreasstubercommented, May 26, 2018

@akelad , we just tested the latest PR with Arabic, bot for intent and entity recognition. Seems to work just fine with a small test (not yet representative). We’ll do some more exhaustive testing with some other languages (Urdu, Jeriza, Yoruba) and larger samples. This is really exciting!

Read more comments on GitHub >

github_iconTop Results From Across the Web

how can we add Stanford CoreNLU Arabic language support ...
how can we add Stanford CoreNLU Arabic language support to Rasa-core? or is there any other way to add support for arabic language?...
Read more >
Using CoreNLP on other human languages
To run Stanford CoreNLP on a supported language, you have to include the models jar for that language in your CLASSPATH. The jars...
Read more >
What tokenization of Arabic does the parser assume?
Stanford Arabic Parser IAQ. Questions. What tokenization of Arabic does the parser assume? What character encoding do you assume? What characters are encoded?...
Read more >
Arabic Natural Language Processing - Stanford NLP Group
Arabic is the largest member of the Semitic language family and is spoken by nearly 500 million people worldwide. It is one of...
Read more >
The Stanford CoreNLP Natural Language Processing Toolkit
Stanford CoreNLP, a Java (or at least JVM-based) ... quence of Annotators add information in an analy- ... Support for other languages is...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found