Write syllabifiers for Indian languages
See original GitHub issueThis ticket is for @soumyag213
As discussed by email, you’ll port this and related modules, to the CLTK, from the Indic NLP Library.
For a first step, I’d like to see this working in your own repo, which you have started at: https://github.com/soumyag213/cltk-beginning-indo. In the README for this, I would like to see an example of its API. For example, I imagine you showing something like this is the Python shell (BTW I like iPython):
In [1]: from indic_syllabifier import orthographic_syllabify
In [2]: orthographic_syllabify('supercalifragilisticexpialidocious', 'tamil')
Out[2]: 'su-per-cal-i-fra-gil-ist-ic-ex-pi-al-i-doc-ious'
Issue Analytics
- State:
- Created 7 years ago
- Reactions:1
- Comments:24 (24 by maintainers)
Top Results From Across the Web
NLP Libraries For Indian Languages - Analytics Vidhya
This article explores 3 nlp libraries for Indian languages- iNTLK, Indic NLP library, StanfordNLP. Learn nlp for indian languages and how to ...
Read more >Languages of India - Wikipedia
The Dravidian languages with the most speakers are Telugu, Tamil, Kannada and Malayalam.
Read more >A useful guide to all the languages spoken in India - Berlitz
Sanskrit, Urdu, Nepalese, Bengali and Gujarati all have some similarities with Hindi, either in spoken language or in the written script.
Read more >ISO 639-2 Language Code List - Library of Congress
ISO 639‑2 Code ISO 639‑1 Code German name of Language
aar aa Danakil‑Sprache
abk ab Abchasisch
ace Aceh‑Sprache
Read more >CHARACTERISTICS OF INDIAN LANGUAGES
There are 10-12 major scripts in India. The Devanagari script is the widest used one, being used to write Hindi (the most spoken...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Hi, since we’re done with this. I think we can close this now 😄
@soumyag213 I merged #316 and released it (https://github.com/cltk/cltk/releases/tag/v0.1.38). Thank you, it was perfect.
Only one more thing remains, and that is for you to write some docs for this at cltk/docs/sanskrit.rst. Since you, not me, are the expert on Indian languages, it is important that you explain clearly to the community why what you have done is valuable. Be sure to give examples, in multiple languages, which demonstrate every one of the user-facing functions you added.
Please work on this as soon as possible and make the PR. I will accept it and make minor revisions as necessary.