question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

[App]: Default Line Plot sometimes does not show outliers, even without enabled smoothing

See original GitHub issue

Current Behavior

The Line plot sometimes does not visualize values as it should and skips steps in plots with the same x-axis.

I’m currently analyzing the average step time in seconds and there is a big spike after each epoch starts due to the dataset reshuffling. image

The second Line plot tracks the data loading time, but it seems to be there is no spike at step 1564.

When zooming into the step_time_s plot and navigating the step 1564, the value 1.183s pops up, but at the full_dataload_time_s plot the step selected is 1563 with the value 0.0007579s.

image

However, when zooming into the target region of step 1564 in the data loading plot, it switches the y-axis and displays the correct spike with the value 1.155s. It also correctly displays the same step 1564.

image

This happens to all line plots that have a similar frequency, i.e., logging at every step.

Expected Behavior

While it may be impossible to visualize such relatively speaking dense data in a Line plot, I would love to have the value at the same timestep in the tooltip, e.g., it should show the value 1.155s when navigating to step 1564 in the step_time_s plot.

Steps To Reproduce

Code to reproduce. Run with pip3 install -r requirements-cpu.txt; wandb login; python torch10-cifar.py line-plot-reproduction.zip

Screenshots

No response

Environment

OS: Ubuntu 20.04

Browsers: Brave

Version: Version 1.42.97 Chromium: 104.0.5112.102 (Official Build) (64-bit)

Additional Context

No response

Issue Analytics

  • State:open
  • Created a year ago
  • Comments:6

github_iconTop GitHub Comments

1reaction
cirquitcommented, Oct 4, 2022

Thanks for the explanation on Weave. While it looks very powerful, it seems that it is bound by the same 1.5k sampling issue as one-off outliers are sometimes included and sometimes not.

I think for now I will just try to minimize the sampling rate to get accurate live plots and zoom into important passages whenever necessary. You can either close this issue or keep it open to close off in the future whenever the dynamic sampling is implemented. Either way, I’m looking forward to the new release 👍

0reactions
thanos-wandbcommented, Oct 9, 2022

Hi @cirquit thank you for letting me know that you have a workaround for now and sorry that this isn’t currently feasible on our side. We have an internal ticketing system and I will keep this ticket on-hold (here it will remain open) so as to update you once we release a feature to control the sampling points.

Read more comments on GitHub >

github_iconTop Results From Across the Web

matplotlib: disregard outliers when plotting - Stack Overflow
Sometimes in a test I happen to have one outlier (say 0.1), while all other values are three orders of magnitude smaller. With...
Read more >
Data Smoothing and Outlier Detection - MATLAB & Simulink
Eliminate unwanted noise or behavior in data, and find, fill, and remove outliers. ... Not all data is suitable for smoothing with a...
Read more >
A ggplot2 Tutorial for Beautiful Plotting in R - Cédric Scherer
An extensive tutorial containing a general introduction to ggplot2 as well as many examples how to modify a ggplot, step by step.
Read more >
Outliers in scatter plots (article) - Khan Academy
Scatter plots often have a pattern. We call a data point an outlier if it doesn't fit the pattern.
Read more >
ScottPlot 4.1 Cookbook
By default primary axes are totally visible, and secondary axes have ticks hidden and no label. Sometimes the top axis (XAxis2) is given...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found