CelebA download is broken
See original GitHub issueThe download of all CelebA files except identity_CelebA.txt
is broken. For example, the URL to download img_align_celeba.zip
resolves to https://drive.google.com/uc?id=0B7EVK8r0v71pZjFTYXZWM3FlRnM&export=download. This link is publicly accessible, but you have to be logged into Google. Otherwise you’ll see a 404 page.
I’ll have a look if it is possible to get a general download link from the ID.
Issue Analytics
- State:
- Created a year ago
- Comments:11 (6 by maintainers)
Top Results From Across the Web
Failed to download CelebA dataset using download=True
Bug It fails to download the following files img_align_celeba.zip Rather than the zip file, it downloads a html file "Google Drive - Quota ......
Read more >python - Error downloading celebA dataset using torchvision
Using the torchvision module datasets, I can't download the celebA image dataset. I am pretty sure that I am doing everything right.
Read more >CelebA dataset download errors - vision - PyTorch Forums
Dear Sir/Madam, I am having issues downloading the CelebA dataset. It appears that some of the data is not in .zip format which...
Read more >datasets/celeba.py · neilisaac/torchvision - Gemfury
Args: root (string): Root directory where images are downloaded to. split (string): One of {'train', 'valid', 'test', 'all'}.
Read more >CelebFaces Attributes (CelebA) Dataset - Kaggle
Over 200k images of celebrities with 40 binary attribute annotations. ... CelebFaces Attributes (CelebA) Dataset ... file_downloadDownload (1 GB) ...
Read more >
Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free
Top Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
I don’t think it will come back. This is not a limitation by GDrive, but as explained in https://github.com/pytorch/vision/issues/5705#issuecomment-1082778379 a conscious decision by the author to limit access. I’ve contacted them twice and asked to revert it, but got no response.
If we want to keep it open, we should have some kind scheduled test or the like if the download is publicly accessible again. Otherwise we’ll just forget about this and will have a stale issue. At least I will forget to regularly check the dataset if the author changed permissions.
Should we keep it open? Ultimately we’ll want to put back the download feature, if the Gdrive becomes available again?