Make dataset example self-contained

I’ve consolidated the example datasets under a geopandas-like geoplot.datasets namespace.

This makes every example in the geoplot documentation, especially those in the gallery, immediately reproducible for the user. The drawback is that I also have to distribute the example data with the library. After hemming and hawing every which way, I’ve gotten that down to a ~10 MB examples.zip file.

@choldgraf I would like your feedback on this idea. Is a 10 MB add-on like this an acceptable load for such a library? Or should I maybe provide an nltk-like downloader instead?
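
For comparison, here is a minimal sketch of what the downloader alternative could look like. It is not geoplot's actual API: the function name `fetch_examples` and its `url` and `cache_dir` parameters are hypothetical, and it only assumes the example data is published somewhere as a single zip archive. The archive is fetched on first use and the unpacked copy is reused afterwards, so the library itself would not need to ship the ~10 MB file.

```python
import shutil
import zipfile
from pathlib import Path
from urllib.request import urlopen


def fetch_examples(url, cache_dir):
    """Download and unpack an examples archive once; reuse it afterwards.

    `url` and `cache_dir` are hypothetical parameters, not geoplot API.
    """
    cache_dir = Path(cache_dir)
    marker = cache_dir / ".complete"
    if marker.exists():
        # A previous call finished successfully: skip the download.
        return cache_dir

    cache_dir.mkdir(parents=True, exist_ok=True)
    archive = cache_dir / "examples.zip"

    # Stream the archive to disk rather than holding it all in memory.
    with urlopen(url) as response, open(archive, "wb") as out:
        shutil.copyfileobj(response, out)

    with zipfile.ZipFile(archive) as zf:
        zf.extractall(cache_dir)

    archive.unlink()   # keep only the unpacked files
    marker.touch()     # written last, so a partial download is retried
    return cache_dir
```

The trade-off is the usual one: installs stay small, but the first run of any gallery example needs network access.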

Issue Analytics

  • State: closed
  • Created 6 years ago
  • Comments: 11 (5 by maintainers)

Top GitHub Comments

5 reactions
pybokeh commented, Nov 3, 2017

@ResidentMario Thank you! All I did was run:

import quilt
quilt.install("ResidentMario/geoplot_data")

and was then able to run the example.

1 reaction
ResidentMario commented, Feb 21, 2019

The code snippet pybokeh included in his comment should do the job. I think the new URL is https://docs.quiltdata.com/get-started/step-by-step.
