question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Write syllabifiers for Indian languages

See original GitHub issue

This ticket is for @soumyag213

As discussed by email, you’ll port this and related modules, to the CLTK, from the Indic NLP Library.

For a first step, I’d like to see this working in your own repo, which you have started at: https://github.com/soumyag213/cltk-beginning-indo. In the README for this, I would like to see an example of its API. For example, I imagine you showing something like this is the Python shell (BTW I like iPython):

In [1]: from indic_syllabifier import orthographic_syllabify
In [2]: orthographic_syllabify('supercalifragilisticexpialidocious', 'tamil')
Out[2]: 'su-per-cal-i-fra-gil-ist-ic-ex-pi-al-i-doc-ious'

Issue Analytics

  • State:closed
  • Created 7 years ago
  • Reactions:1
  • Comments:24 (24 by maintainers)

github_iconTop GitHub Comments

1reaction
soumyag213commented, Aug 6, 2016

Hi, since we’re done with this. I think we can close this now 😄

0reactions
kylepjohnsoncommented, Jul 16, 2016

@soumyag213 I merged #316 and released it (https://github.com/cltk/cltk/releases/tag/v0.1.38). Thank you, it was perfect.

Only one more thing remains, and that is for you to write some docs for this at cltk/docs/sanskrit.rst. Since you, not me, are the expert on Indian languages, it is important that you explain clearly to the community why what you have done is valuable. Be sure to give examples, in multiple languages, which demonstrate every one of the user-facing functions you added.

Please work on this as soon as possible and make the PR. I will accept it and make minor revisions as necessary.

Read more comments on GitHub >

github_iconTop Results From Across the Web

NLP Libraries For Indian Languages - Analytics Vidhya
This article explores 3 nlp libraries for Indian languages- iNTLK, Indic NLP library, StanfordNLP. Learn nlp for indian languages and how to ...
Read more >
Languages of India - Wikipedia
The Dravidian languages with the most speakers are Telugu, Tamil, Kannada and Malayalam.
Read more >
A useful guide to all the languages spoken in India - Berlitz
Sanskrit, Urdu, Nepalese, Bengali and Gujarati all have some similarities with Hindi, either in spoken language or in the written script.
Read more >
ISO 639-2 Language Code List - Library of Congress
ISO 639‑2 Code ISO 639‑1 Code German name of Language aar aa Danakil‑Sprache abk ab Abchasisch ace Aceh‑Sprache
Read more >
CHARACTERISTICS OF INDIAN LANGUAGES
There are 10-12 major scripts in India. The Devanagari script is the widest used one, being used to write Hindi (the most spoken...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found