question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

"dvc pull" fails to find files in a pushed directories subfolder on Windows

See original GitHub issue

Setup info Ubuntu:

  • DVC version: 0.41.3
  • Python version: 3.6.7
  • Platform: Linux-4.15.0-51-generic-x86_64-with-debian-buster-sid
  • Installed with pip

Windows:

  • DVC version: 0.41.3
  • Python version: 3.6.7
  • Platform: Windows-10-10.0.16299-SP0
  • Installed with pip

After pushing a folder from Ubuntu, which included some model files and one subfolder with some variable files, I tried to pull it from Windows. It managed to get all the files in the initial folder, but ran into an error with the files in the one subfolder.

dvc pull very-long_folder-name_with;semicolons;and;fillerXxxYyyZzz;fillerXxxYyyZzz;fillerXxxYyyZzz;fillerXxxYyyZzz;_and-some-others.dvc
c:\users\srg\appdata\local\continuum\anaconda3\envs\tf-keras\lib\site-packages\git\repo\base.py:128: UserWarning: The use of environment variables in paths is deprecated
for security reasons and may be removed in the future!!
  "\nfor security reasons and may be removed in the future!!")
Preparing to download data from 'F:\localpath\dvc-q99hd-cache'
Preparing to collect status from F:\localpath\dvc-q99hd-cache
[##############################] 100% Collecting information
[##############################] 100% Analysing status.
[####                          ] 16% very-long_folder-name_with;semicolons;and;fillerXxxYyyZzz;fillerXxxYyyZzz;fillerXxxYyyZzz;fillerXxxYyyZzz;_and-some-others
[#########                     ] 33% very-long_folder-name_with;semicolons;and;fillerXxxYyyZzz;fillerXxxYyyZzz;fillerXxxYyyZzz;fillerXxxYyyZzz;_and-some-others
[###############               ] 50% very-long_folder-name_with;semicolons;and;fillerXxxYyyZzz;fillerXxxYyyZzz;fillerXxxYyyZzz;fillerXxxYyyZzz;_and-some-others
[###################           ] 66% very-long_folder-name_with;semicolons;and;fillerXxxYyyZzz;fillerXxxYyyZzz;fillerXxxYyyZzz;fillerXxxYyyZzz;_and-some-others
ERROR: failed to pull data from the cloud - [Errno 2] No such file or directory: 'F:\\localpath\\localpath\\localpath\\localpath\\localpath\\localpath\\very-long_folder-name_with;semicolons;and;fillerXxxYyyZzz;fillerXxxYyyZzz;fillerXxxYyyZzz;fillerXxxYyyZzz;_and-some-others\\variables\\variables.data-00000-of-00001'

After a brief converation with Ruslan over email, he suspect the following:

I suspect it is most likey the filename that is to blame. It is either not handled by windows at all, or somehow butchered or mishandled by dvc, so I need to know more about that filename.

Issue Analytics

  • State:closed
  • Created 4 years ago
  • Reactions:1
  • Comments:12 (8 by maintainers)

github_iconTop GitHub Comments

2reactions
efiopcommented, Jun 18, 2019

We will be sure to add it to our user guide: https://github.com/iterative/dvc.org/issues/395 . Thanks a lot for the feedback @sremm ! 😃

2reactions
sremmcommented, Jun 18, 2019

@efiop I enabled long paths following the guide you sent and then the pull command worked, all the files were pulled without error

Read more comments on GitHub >

github_iconTop Results From Across the Web

pull | Data Version Control - DVC
dvc pull downloads tracked data from remote storage to the cachecache, and links (or copies) the files or directories to the workspaceworkspace (refer...
Read more >
Troubleshooting | Data Version Control - DVC
Failed to pull data from the cloud · Too many open files error · Unable to find credentials · Unable to connect ·...
Read more >
.dvcignore Files | Data Version Control · DVC
If DVC finds a .dvcignore file inside a dependency or output directory, it raises an error. Ignoring files inside such directories should be...
Read more >
add | Data Version Control - DVC
This command can be used to track large files, models, dataset directories, etc. that are too big for Git to handle directly. This...
Read more >
Is it possible to only pull/get a subfolder from a existing repo
Each dataset is placed in a subfolder within a directory, which has been pushed to the save location on the network drive using...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found