feat(dl_manager): add support for ftp downloads
I am working on a new dataset (#302) and ran into a problem downloading it.
# This is the official download link from https://www-i6.informatik.rwth-aachen.de/~koller/RWTH-PHOENIX-2014-T/
_URL = "ftp://wasserstoff.informatik.rwth-aachen.de/pub/rwth-phoenix/2016/phoenix-2014-T.v3.tar.gz"
dl_manager.download_and_extract(_URL)
I get an error:
ValueError: unable to parse ftp://wasserstoff.informatik.rwth-aachen.de/pub/rwth-phoenix/2016/phoenix-2014-T.v3.tar.gz as a URL or as a local path
I checked, and indeed ftp is not treated as a remote file scheme:
https://github.com/huggingface/datasets/blob/4c2af707a6955cf4b45f83ac67990395327c5725/src/datasets/utils/file_utils.py#L188
Adding ftp to that list does not immediately solve the issue, so some extra work is probably needed.
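To make the two pieces of work concrete, here is a minimal sketch using only the Python standard library. It is not the code in file_utils.py: the names REMOTE_SCHEMES, is_remote_url and fetch are hypothetical, and the scheme list is an assumption for illustration. It shows why extending the scheme check alone would not be enough: the check only classifies the string, and a separate step still has to fetch the bytes with something that speaks FTP (here urllib.request.urlopen, which handles ftp:// URLs).

```python
import shutil
from urllib.parse import urlparse
from urllib.request import urlopen

# Hypothetical scheme list for illustration; not the library's actual list.
REMOTE_SCHEMES = ("http", "https", "ftp")


def is_remote_url(url_or_path: str) -> bool:
    """Return True if the string carries a scheme we treat as remote."""
    return urlparse(url_or_path).scheme in REMOTE_SCHEMES


def fetch(url: str, destination: str) -> str:
    """Stream a URL to a local file; urllib picks the handler from the scheme."""
    with urlopen(url) as response, open(destination, "wb") as out:
        shutil.copyfileobj(response, out)
    return destination
```

With this sketch, is_remote_url(_URL) accepts the ftp link, and fetch(_URL, "phoenix-2014-T.v3.tar.gz") would attempt the transfer, assuming the server is reachable.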
Issue Analytics
- State:
- Created 3 years ago
- Comments:11 (8 by maintainers)
The dataset loader is not ready yet because of that issue. If you just want to download the dataset the old-fashioned way, go to: https://www-i6.informatik.rwth-aachen.de/ftp/pub/rwth-phoenix/2016/phoenix-2014-T.v3.tar.gz (the ftp link is now broken, and the file is available over https)
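For anyone who prefers to script the manual download, a rough sketch with the standard library follows. The helper name download_and_unpack and the target directory are hypothetical, and it assumes the archive is a trusted gzip-compressed tarball, since extractall writes whatever paths the archive contains.

```python
import tarfile
import tempfile
from pathlib import Path
from urllib.request import urlretrieve


def download_and_unpack(url: str, target_dir: str) -> Path:
    """Download a .tar.gz archive to a temporary file and extract it into target_dir."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    with tempfile.TemporaryDirectory() as tmp:
        archive = Path(tmp) / "archive.tar.gz"
        # urlretrieve streams the response to disk instead of holding it in memory.
        urlretrieve(url, str(archive))
        with tarfile.open(archive, "r:gz") as tar:
            tar.extractall(target)
    return target
```

Calling download_and_unpack with the https link above and a directory such as "phoenix-2014-T" would fetch and unpack the dataset, provided the link is still live.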
@hoanganhpham1006 yes. See pull request https://github.com/huggingface/datasets/pull/722 , it has a loader for this dataset, mostly ready. There’s one issue that delays it being merged - https://github.com/huggingface/datasets/issues/741 - regarding memory consumption.