Visualize TaskGroups

We should find other representations to replace the Task stream. See https://github.com/dask/distributed/issues/4260

One view of that data is aggregated within the TaskGroups. A TaskGroup collects many related tasks together. For example, one dd.read_csv call may generate 10,000 tasks, but will generate only one task group. These correspond to high-level layers on the client side, or Spark layers.
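As a rough illustration of the many-tasks-to-one-group relationship, here is a minimal sketch in plain Python. It assumes task keys shaped like `("read-csv-abc123", 7)` and uses a hypothetical `group_name` helper as a simplified stand-in for however the scheduler actually derives group names; the key names are made up for the example.

```python
from collections import Counter


def group_name(key):
    """Hypothetical, simplified rule: a tuple key belongs to the group
    named by its first element; a plain string key is its own group."""
    return key[0] if isinstance(key, tuple) else key


# Assumed example: one read_csv-style call that produced 10,000 tasks,
# plus a single follow-up task with a plain string key.
keys = [("read-csv-abc123", i) for i in range(10_000)]
keys.append("sum-def456")

# Count how many tasks fall into each group.
group_sizes = Counter(group_name(k) for k in keys)
print(len(keys), "tasks in", len(group_sizes), "groups")
```

The point is only that a visualization built on groups has a handful of nodes to draw, even when the task count is in the thousands.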

Task Groups contain information that is potentially useful to convey. Here is a subset:

  • start and stop time of every group
  • how long we’ve spent on the group, both in computation and in data transfer and other activities
  • amount of data processed / currently in storage
  • dependency relationships to other taskgroups
  • how far along we are in computing them, as well as if we’ve had any errors (this is the same information we have in the progress bars in the status page of the dashboard today)
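To make the list above concrete, here is a sketch of a per-group record that a plot could consume. The `GroupSummary` class and all of its field names are hypothetical and chosen for illustration; they are not the scheduler's actual TaskGroup attributes. Which states count as "done" is also an assumption.

```python
from dataclasses import dataclass, field


@dataclass
class GroupSummary:
    """Hypothetical per-group record mirroring the bullets above."""
    name: str
    start: float                  # time the first task started (seconds)
    stop: float                   # time the last task finished (seconds)
    durations: dict = field(default_factory=dict)     # activity -> seconds
    nbytes: int = 0               # data currently in storage
    dependencies: set = field(default_factory=set)    # names of upstream groups
    states: dict = field(default_factory=dict)        # task state -> count

    # Assumption: tasks in these states are treated as finished.
    DONE_STATES = ("memory", "released")

    def fraction_done(self):
        total = sum(self.states.values())
        done = sum(self.states.get(s, 0) for s in self.DONE_STATES)
        return done / total if total else 0.0

    def has_errors(self):
        return self.states.get("erred", 0) > 0


summary = GroupSummary(
    name="read-csv",
    start=0.0,
    stop=42.0,
    durations={"compute": 30.0, "transfer": 8.0},
    nbytes=2_000_000,
    states={"memory": 600, "released": 150, "processing": 200, "waiting": 50},
)
print(summary.fraction_done(), summary.has_errors())
```

A record like this keeps the visual encoding decisions (color, size, shading) separate from how the numbers are collected.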

How should we convey this information visually to the user? As mentioned above, we convey the progress of tasks within a taskgroup today in the progress chart. Great, what else? We could consider doing something like these graphs from Spark:

[image: example graphs from Spark]

But perhaps updated in real time, with color/size/shading differences that reflect the updated information that we have.

I walked down this path briefly in the attached notebook, using start and stop times to inform layout. I found that, due to overlap, this was hard/impossible. I’m now of the opinion that layout should be purely informed by dependency graph structure (similar to the Spark image above). However, I think that once we have that rough layout there is a lot that we can do with regards to color/size/shading that will be fun. Layout is still an interesting problem though, especially when trying to make the general case robustly laid out.
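One way to read "layout purely informed by dependency graph structure" is a layered layout: each group's column is one more than the deepest column among its dependencies. The sketch below is a simple longest-path layering under that assumption, not what the notebook or the dashboard does; the function names and the example group names are hypothetical.

```python
def layer_groups(dependencies):
    """Map each group name to a layer: 0 if it has no dependencies,
    otherwise 1 + the largest layer among its dependencies."""
    layers = {}

    def visit(name, stack=()):
        if name in layers:
            return layers[name]
        if name in stack:
            raise ValueError(f"dependency cycle at {name!r}")
        deps = dependencies.get(name, ())
        layers[name] = 1 + max(
            (visit(d, stack + (name,)) for d in deps), default=-1
        )
        return layers[name]

    for name in dependencies:
        visit(name)
    return layers


def layout(dependencies):
    """Give each group an (x, y) grid position: x is its layer,
    y is its rank among groups in that layer (sorted by name)."""
    layers = layer_groups(dependencies)
    next_row = {}
    positions = {}
    for name in sorted(layers, key=lambda n: (layers[n], n)):
        x = layers[name]
        y = next_row.get(x, 0)
        next_row[x] = y + 1
        positions[name] = (x, y)
    return positions


# Assumed example graph: a mean computed from a CSV column.
deps = {
    "read-csv": [],
    "getitem": ["read-csv"],
    "sum-chunk": ["getitem"],
    "count-chunk": ["getitem"],
    "truediv": ["sum-chunk", "count-chunk"],
}
positions = layout(deps)
print(positions)
```

With positions fixed by structure alone, timing, byte counts, and progress are free to drive color, size, and shading. This sketch ignores edge crossings, which is part of what makes the general case hard to lay out robustly.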

I think that we have all the information that we need in TaskGroups. Right now the next thing to do is to think about visualization, which should be fun.

Issue Analytics

  • State: closed
  • Created 2 years ago
  • Comments: 6 (5 by maintainers)

Top GitHub Comments

mrocklin commented, May 4, 2021 (2 reactions)

@ian-r-rose you may also find this interesting. I think that you and @ncclementi might be a good pairing here. (James’ idea, I just wanted to make sure that this got out there)

On Tue, May 4, 2021 at 2:28 PM Benjamin Zaitlen wrote:

Visualizing some kind of timeseries of task groups would be really cool! I wanted to note that last year we added a visualization for aggregate timing information using TaskPrefixes and aggregate action information.

xref: #3792 https://github.com/dask/distributed/pull/3792

ncclementi commented, Apr 30, 2021 (1 reaction)

Thanks James, I’ll start looking at this.
