question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Change corpus importer to accept full git URLS (not just fragments)

See original GitHub issue

This will let us point to repos that the CLTK org can’t host.

So for example, as with a corpora.py:

'name':        'punjabi_text_gurban',

… would become:

'name':        'https://github.com/cltk/punjabi_text_gurban.git'

And another path could be:

'name':        'https://github.com/some_other_user_or_org/punjabi_text_gurban.git'

I know I’ll need to change this for getting official repos: https://github.com/cltk/cltk/blob/master/cltk/corpus/utils/importer.py#L273

I’ll also update the docs explaining this, plus the wiki page How to add a corpus to the CLTK.

Issue Analytics

  • State:closed
  • Created 7 years ago
  • Comments:14 (12 by maintainers)

github_iconTop GitHub Comments

1reaction
SilentFlamecommented, Sep 6, 2016

Hey Kyle, Thanks for responding, and yes I’ll love to work on this.

  1. I use Linux(Ubuntu 14.04)
  2. yes I have a good experience with git and terminal applications.

Will be doing the above task now.

0reactions
kylepjohnsoncommented, Mar 18, 2017

@lazycoder1 Thank you for offering. This was done a while ago, but the ticket was not closed. Sorry about that.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Importing a Git repository using the command line - GitHub Docs
On the command line, make a "bare" clone of the repository using the external clone URL. This creates a full copy of the...
Read more >
Migrating to Git - Git SCM
If you have an existing codebase in another VCS but you've decided to start using Git, you must migrate your project one way...
Read more >
3 Processing Raw Text - NLTK
By using NLTK's corpus interface we were able to ignore the files that these texts had come from. The contents of a word,...
Read more >
How to Get Started with GIT and work with GIT Remote Repo
Take note that EACH Git repo is associated with a project directory (and its ... The " git status " output is divided...
Read more >
How to Utilize Submodules within Git Repos - Bits and Pieces
Submodules allow you to keep a Git repository as a subdirectory of ... Last thing to do: copy the URL of the repo...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found