question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

bug: fix verify_meta=False usage in dask udfs

See original GitHub issue

Fix/Remove usage of verify_meta=False in the dask udf code for scalar outputs as that’s blocking us from updating dask.

Issue Analytics

  • State:closed
  • Created 2 years ago
  • Comments:5 (3 by maintainers)

github_iconTop GitHub Comments

1reaction
gerrymanoimcommented, Jan 14, 2022

I put this fix up https://github.com/ibis-project/ibis/pull/3316, but this implementation required basically doing extracting all the pieces of dd.from_delayed besides the meta verification. Given that, I’m going to just close this issue and change my PR to simply remove the shim we don’t need anymore.

1reaction
cpcloudcommented, Jan 14, 2022

@gerrymanoim Did you get a chance to look into this?

Read more comments on GitHub >

github_iconTop Results From Across the Web

Managing Memory — Dask.distributed 2022.12.1 documentation
Dask.distributed stores the results of tasks in the distributed memory of the worker nodes. The central scheduler tracks all data on the cluster...
Read more >
Tackling unmanaged memory with Dask - Coiled
Shed light on the common error message “Memory use is high but worker has no data to store to disk. Perhaps some other...
Read more >
[dask] [python] lightgbm.dask hangs indefinitely after an error
Reproducible example​​ Using dask and distributed v2021. 10.0, after a run which produces an error out on the cluster, the next test gets...
Read more >
Dask Unmanaged Memory | How to Find & Fix | Matt Rocklin
Learn more at https://bit.ly/3Q5aMmpUnmanaged memory often presents as a memory leak, but is actually really easy to identify and resolve.
Read more >
How to fix a Dask memory error for database table with over a ...
If anyone has any ideas of how to use dask.dataframe.read_sql_table for reading 100 million row tables it would be greatly appreciated.
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found