question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

ENH: show progress while fetching rows for query

See original GitHub issue

As reported by @QuinRiva in https://github.com/pydata/pandas-gbq/issues/12#issuecomment-391212920

Progress is written to logging while a query is running, but no progress is reported between when a query finishes and while the data is being downloaded to be added to a DataFrame. The problem is that we call list(rows_iter) to fetch all pages. Previously, progress was written as each page was downloaded.

https://github.com/pydata/pandas-gbq/blob/08166685d3305a57fbfd3bc4c41a1cf5df98ebcf/pandas_gbq/gbq.py#L294-L299

A possible solution is to loop over rows_iter.pages instead. After the first page is fetched, the rows_iter.total_rows property is available, so it would be possible to display a percent complete or even use tqdm as done in https://github.com/pydata/pandas-gbq/pull/166.

Issue Analytics

  • State:closed
  • Created 5 years ago
  • Comments:7 (4 by maintainers)

github_iconTop GitHub Comments

1reaction
daklcommented, Oct 25, 2019

Now that to_dataframe in google-cloud-bigquery supports progress_bar_type, can we make use of that to add a progress bar to read_gbq? @tswast

1reaction
tswastcommented, Mar 29, 2019

Yeah, once I make a new release of google-cloud-bigquery (I expect to next week), we can start populating the progress bar argument. I’d be okay always populating it as 'tqdm', but it is nice to have the special version for notebooks, so an argument to read_gbq makes sense.

Read more comments on GitHub >

github_iconTop Results From Across the Web

ENH: show progress while fetching rows for query · Issue #182
The problem is that we call list(rows_iter) to fetch all pages. Previously, progress was written as each page was downloaded. https://github.com ...
Read more >
OpenEdge 4GL Query concepts (FOR EACH, FIND, GET ...
When a BY clause is used with a query, PROGRESS must produce the query results in the specified order. It does so in...
Read more >
c# - How to indicate the current progress of querying a large ...
For example, I run a Select statement that query a large data table which might take few seconds. I want the progress bar...
Read more >
need progress bar in excel during processing and some ...
Hi All, I'm using an excel which allows user to choose Json file and with help of power query and macro, json is...
Read more >
Real-Time SQL Monitoring - Oracle
Real-time SQL monitoring, introduced in Oracle Database 11g, provides a very effective way to identify run-time performance problems with resource intensive.
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found