Support creating DataSet from streaming object
See original GitHub issueThe use case is for netCDF files stored on s3 or other generic cloud storage
import requests, xarray as xr
fp = 'http://nasanex.s3.amazonaws.com/NEX-GDDP/BCSD/rcp45/day/atmos/tasmax/r1i1p1/v1.0/tasmax_day_BCSD_rcp45_r1i1p1_MPI-ESM-LR_2029.nc'
data = requests.get(fp, stream=True)
ds = xr.open_dataset(data.content) # raises TypeError: embedded NUL character
Ideal would be integration with the (hopefully) soon-to-be implemented dask.distributed features discussed in #798.
Issue Analytics
- State:
- Created 7 years ago
- Reactions:4
- Comments:16 (11 by maintainers)
Top Results From Across the Web
Stream - Hugging Face
Loading a dataset in streaming mode creates a new dataset type instance (instead of the classic Dataset object), known as an IterableDataset.
Read more >Structured Streaming Programming Guide - Apache Spark
Structured Streaming supports joining a streaming Dataset/DataFrame with a static Dataset/DataFrame as well as another streaming Dataset/DataFrame. The result ...
Read more >Real-time streaming in Power BI - Microsoft Learn
This article shows you how to set up real-time streaming dataset in ... When the dataset is created, the Power BI service automatically ......
Read more >How To: Create a Streaming Data Loader for PyTorch -- Visual ...
In situations where all of the training data will fit into machine memory, the most common approach is to define a problem-specific Dataset...
Read more >Power BI streaming data sets: The good, the great, and the ...
Streaming Dataset : As the name implies, streaming datasets have limited or no history and just look at the most current values for...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
yes! Thanks @jhamman and @shoyer. I hadn’t tried it yet, but just did. worked great!
Just to clarify: I wrote about that we use could support initializing a Dataset from a netCDF4 file image. But this wouldn’t help yet for streaming access.
Initializing a Dataset from a netCDF4 file image should actually work with the latest versions of xarray and netCDF4-python: