question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Crashing browser when generating histogram for fairly large dataset

See original GitHub issue

The dataset I have (provided in the following link) has the following dimensions: (185459, 29). matplotlib has no trouble generating one with plt.hist(data_2015['net_value'], bins=50), for instance.

from altair import *
import pandas as pd
from urllib.request import urlretrieve

url = 'https://cl.ly/2T2Q0O1c2k35/download/2016-08-08-current-year.xz'
path = '/tmp/2016-08-08-current-year.xz'
urlretrieve(url, path)

data_2015 = pd.read_csv(path)
Chart(data_2015).mark_bar().encode(
    x=X('net_value', bin=Bin(maxbins=50)),
    y='count(*)',
)

Issue Analytics

  • State:closed
  • Created 7 years ago
  • Comments:12 (9 by maintainers)

github_iconTop GitHub Comments

2reactions
jakevdpcommented, Oct 14, 2016

If you’re creating a gist, you could add the dataset CSV files to the gist as well and reference them by URL.

Long term we should have a better story for this, though. I’ll put some thought into it. If you have thoughts on how we might do that most effectively, please let us know.

1reaction
jakevdpcommented, Oct 10, 2016

I think just dataframe.csv alone should work in the current version, if it’s in the same directory as your .ipynb file.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Re: root crashes when creating large histogram/histograms ...
On the questions about the analysis requirements: > The large histograms we create are symmetric. We have many smaller ones > which are...
Read more >
Browser Crashing When Displaying Large CSV Interactive ...
Hi All, I'm trying to create a tool for displaying large, interactive line chart plots for some massive data log files.
Read more >
Using Altair on data aggregated from large datasets
I am for now just trying to histogram by month which is only 6 data points, but doing this with either json or...
Read more >
Working efficiently with large datasets - Coding Club
In this part of the tutorial, we will use two datasets, one from the Global Biodiversity Information Facility (GBIF) and one from Flickr,...
Read more >
Graphing lots of data crashes my browser
Your results are already limited to 50,000 records, which performs adequately in all browsers. The issue here is (most likely, since it's hard...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found