Crashing browser when generating histogram for fairly large dataset
See original GitHub issueThe dataset I have (provided in the following link) has the following dimensions: (185459, 29)
. matplotlib has no trouble generating one with plt.hist(data_2015['net_value'], bins=50)
, for instance.
from altair import *
import pandas as pd
from urllib.request import urlretrieve
url = 'https://cl.ly/2T2Q0O1c2k35/download/2016-08-08-current-year.xz'
path = '/tmp/2016-08-08-current-year.xz'
urlretrieve(url, path)
data_2015 = pd.read_csv(path)
Chart(data_2015).mark_bar().encode(
x=X('net_value', bin=Bin(maxbins=50)),
y='count(*)',
)
Issue Analytics
- State:
- Created 7 years ago
- Comments:12 (9 by maintainers)
Top Results From Across the Web
Re: root crashes when creating large histogram/histograms ...
On the questions about the analysis requirements: > The large histograms we create are symmetric. We have many smaller ones > which are...
Read more >Browser Crashing When Displaying Large CSV Interactive ...
Hi All, I'm trying to create a tool for displaying large, interactive line chart plots for some massive data log files.
Read more >Using Altair on data aggregated from large datasets
I am for now just trying to histogram by month which is only 6 data points, but doing this with either json or...
Read more >Working efficiently with large datasets - Coding Club
In this part of the tutorial, we will use two datasets, one from the Global Biodiversity Information Facility (GBIF) and one from Flickr,...
Read more >Graphing lots of data crashes my browser
Your results are already limited to 50,000 records, which performs adequately in all browsers. The issue here is (most likely, since it's hard...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
If you’re creating a gist, you could add the dataset CSV files to the gist as well and reference them by URL.
Long term we should have a better story for this, though. I’ll put some thought into it. If you have thoughts on how we might do that most effectively, please let us know.
I think just
dataframe.csv
alone should work in the current version, if it’s in the same directory as your.ipynb
file.