Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Opening dataset without loading any indexes?

See original GitHub issue

Is your feature request related to a problem?

Within pangeo-forge’s internals we would like to call open_dataset, then to_dict(), and end up with a schema-like representation of the contents of the dataset. This works, but it also has the side-effect of loading all indexes into memory, even if we are loading the data values “lazily”.

Describe the solution you’d like

@benbovy do you think it would be possible to (perhaps optionally) also avoid loading indexes upon opening a dataset, so that we actually don’t load anything? The end result would act a bit like ncdump does.

Describe alternatives you’ve considered

Otherwise we might have to try using xarray-schema or something but the suggestion here would be much neater and more flexible.

xref: https://github.com/pangeo-forge/pangeo-forge-recipes/issues/256

cc @rabernat @jhamman @cisaacstern

Issue Analytics

State:
Created a year ago
Reactions:2
Comments:8 (8 by maintainers)

Top GitHub Comments

2reactions

rabernatcommented, May 25, 2022

Yes it is definitely a pathological example. 💣 But the fact remains that there are many cases where we just want to discover dataset contents as quickly as possible and want to avoid the cost of loading coordinates and creating indexes.

0reactions

dcheriancommented, May 31, 2022

This would also fix #2233

Top Results From Across the Web

pandas read_csv index_col=None not working with delimiters ...

Quick Answer. Use index_col=False instead of index_col=None when you have delimiters at the end of each line to turn off index column inference...

Read CSV File without Unnamed Index Column in Python

In this article you'll learn how to load a CSV file without an unnamed index column in Python programming. The article consists of...

Export Pandas to CSV without Index & Header

pandas DataFrame to CSV with no index can be done by using index=False param of to_csv() method. With this, you can specify ignore...

How to avoid Python/Pandas creating an index in a saved csv?

1 Answer. The first and most preferable way would be to set your index value as index=False while you are converting your data...

pandas.read_csv — pandas 1.5.2 documentation

Detect missing value markers (empty strings and the value of na_values). In data without any NAs, passing na_filter=False can improve the performance of...