question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Dataset Viewer issue for allocine

See original GitHub issue

Link

https://huggingface.co/datasets/allocine

Description

Not sure if this is a problem with bz2 compression, but I thought these datasets could be streamed:

Status code:   400
Exception:     AttributeError
Message:       'TarContainedFile' object has no attribute 'readable'

Owner

No

Issue Analytics

  • State:closed
  • Created a year ago
  • Comments:5 (5 by maintainers)

github_iconTop GitHub Comments

1reaction
lewtuncommented, Jun 24, 2022

Thanks for the quick fix @albertvillanova

0reactions
lewtuncommented, Jun 27, 2022

Note that the underlying issue is that datasets containing TAR files are not streamable out of the box: they need being iterated with dl_manager.iter_archive to avoid performance issues because they access their file content sequentially (no random access).

Ah thanks for the clarification! I’ll look out for this next time and implement the fix myself 😃

Read more comments on GitHub >

github_iconTop Results From Across the Web

allocine · Datasets at Hugging Face
The Allociné dataset is a French-language dataset for sentiment analysis. The texts are movie reviews written between 2006 and 2020 by members of...
Read more >
Allocine french movie reviews - Kaggle
French sentiment analysis using the Allocine movie review Dataset. The dataset was collected and created by Theophile Blard ... expand_moreView more.
Read more >
AlloCine Dataset - Papers With Code
A new dataset for sentiment analysis, scraped from Allociné.fr user reviews. It contains 100k positive and 100k negative reviews divided into 3 balanced...
Read more >
huggingface/datasets: 2.2.0 - Zenodo
add-dataset.md 615 Bytes; bug-report.md 591 Bytes; config.yml 174 Bytes; dataset-viewer.yml 763 Bytes; feature-request.md 552 Bytes.
Read more >
Interconnection scheme between RDF data of the application ...
This operation is more difficult to realize with datasets that are not in RDF. ... to add or view our appointments ; •...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found