question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Toolz dropin API for dask.bag

See original GitHub issue

@wrobstory suggested that dask.bag actually follow the toolz API directly. Additionally it might be nice to have functions that could work equally well on dask.bag objects as on normal Python Iterators and Iterables.

@eriknw might find this interesting.

Issue Analytics

  • State:closed
  • Created 8 years ago
  • Comments:6 (5 by maintainers)

github_iconTop GitHub Comments

1reaction
mrocklincommented, May 3, 2016

This doesn’t seem like its going to happen. I suspect that there is already sufficient inertia behind dask.bag so that this is more likely a fun project than an essential project. Closing.

0reactions
mrocklincommented, Aug 10, 2015

Toolz generally uses bare functions. We tend to use currying and a pipe function for repeated function composition.

Read more comments on GitHub >

github_iconTop Results From Across the Web

API - Dask documentation
Create a dask Bag from Python sequence. from_delayed (values) ... Write dask Bag to disk, one filename per partition, one line per element....
Read more >
Bag - Dask documentation
Dask Bag implements operations like map , filter , fold , and groupby on collections of generic Python objects. It does this in...
Read more >
Dask Bags — Dask Examples documentation
Dask Bags are often used to do simple preprocessing on log files, JSON records, or other user defined Python objects. Full API documentation...
Read more >
dask.bag.Bag.foldby - Dask documentation
Combined reduction and groupby. Foldby provides a combined groupby and reduce for efficient parallel split-apply-combine tasks. The computation. >>> b ...
Read more >
Spark, Dask, and Ray: Choosing the Right Framework
We cover the history, use-cases, strengths and weaknesses of Spark, Dask and Ray, and how to select the right framework for specific data ......
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found