question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Feature request: a public dataset

See original GitHub issue

Similar to df = dd.read_parquet('gs://dask-nyc-taxi/yellowtrip.parquet', storage_options={'token': 'anon'}).

Something like df = dd.read_parquet('az://yellowtrip.parquet', storage_options={'account_name': 'dask-nyc-taxi'})

cc. @martindurant @TomAugspurger

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Comments:7 (5 by maintainers)

github_iconTop GitHub Comments

2reactions
lostmygithubaccountcommented, Sep 29, 2020

for Open Datasets it is @meyetman

you can access via adlfs with the anonymous access:

from adlfs import AzureBlobFileSystem

storage_options = {'account_name': 'azureopendatastorage'}

fs = AzureBlobFileSystem(**storage_options)
fs.ls('isdweatherdatacontainer')
1reaction
TomAugspurgercommented, Jan 26, 2022

@rabernat you provide the “storage account name” which is the https://<storage-account-name>.blob.windows.net part. The “storage container” is in the path. It’s a bit confusing at first:

In [1]: import adlfs

In [2]: fs = adlfs.AzureBlobFileSystem("ai4edataeuwest")

In [3]: fs.ls("/ecmwf")
Out[3]:
['ecmwf/20220121',
 'ecmwf/20220122',
 'ecmwf/20220123',
 'ecmwf/20220124',
 'ecmwf/20220125',
 'ecmwf/20220126']
Read more comments on GitHub >

github_iconTop Results From Across the Web

Feature Request : Download dataset sample - Kaggle
So if we have a new feature to download just the sample instead of the whole dataset, which is auto generative according to...
Read more >
Issue tracking system and product feature requests
Discover how to use Google Cloud's issue tracking system to report issues, submit and vote for product feature requests from the issue tracker...
Read more >
Creating a public app for feature requests - no code required
1. Create your Feature Requests list. Create a table in Jestor, name it “Feature Requests”, and add all the fields you'll need for...
Read more >
Feature Request. Small batches or well know data sets. #157
It might be interesting to have smaller batches of the MNIST, CIFAR -10, or other popular data sets as a selection option. This...
Read more >
Site feature request - Suggested Dataset - Data.gov.ie
We welcome suggestions for publication of appropriate Public Sector datasets, which are not currently linked to the portal as Open Data. Let us...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found