Dataset viewer issues
I was looking through the dataset viewer and I like it a lot. Version numbers, citation information, everything’s there! I’ve spotted a few issues/bugs though:
- the URL is still under nlp; perhaps an alias for datasets can be made
- when I remove a feature (and the feature list is empty), I get an error. This is probably expected, but perhaps a better error message can be shown to the user:
IndexError: list index out of range
Traceback:
File "/home/sasha/streamlit/lib/streamlit/ScriptRunner.py", line 322, in _run_script
exec(code, module.__dict__)
File "/home/sasha/nlp-viewer/run.py", line 316, in <module>
st.table(style)
File "/home/sasha/streamlit/lib/streamlit/DeltaGenerator.py", line 122, in wrapped_method
return dg._enqueue_new_element_delta(marshall_element, delta_type, last_index)
File "/home/sasha/streamlit/lib/streamlit/DeltaGenerator.py", line 367, in _enqueue_new_element_delta
rv = marshall_element(msg.delta.new_element)
File "/home/sasha/streamlit/lib/streamlit/DeltaGenerator.py", line 120, in marshall_element
return method(dg, element, *args, **kwargs)
File "/home/sasha/streamlit/lib/streamlit/DeltaGenerator.py", line 2944, in table
data_frame_proto.marshall_data_frame(data, element.table)
File "/home/sasha/streamlit/lib/streamlit/elements/data_frame_proto.py", line 54, in marshall_data_frame
_marshall_styles(proto_df.style, df, styler)
File "/home/sasha/streamlit/lib/streamlit/elements/data_frame_proto.py", line 73, in _marshall_styles
translated_style = styler._translate()
File "/home/sasha/.local/share/virtualenvs/lib-ogGKnCK_/lib/python3.7/site-packages/pandas/io/formats/style.py", line 351, in _translate
* (len(clabels[0]) - len(hidden_columns))
- there seems to be an encoding issue in the default view: the dataset examples are shown as raw monospace text, without proper character encoding. That makes them hard to read for languages that use a lot of special characters. Take for instance the cs-en WMT19 set. This problem goes away when you enable “List view”, because then some syntax highlighter is used, and the special characters are rendered correctly.
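For the empty feature list case, a guard before the table is rendered could turn the IndexError into a readable message. The following is only a minimal sketch, not the viewer's actual code: the helper `select_features`, its return shape, and the message wording are all hypothetical, and the sample row is made up for illustration. In the Streamlit app, the returned message could be shown with `st.warning` instead of calling `st.table` on an empty selection.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Row = Dict[str, object]


def select_features(
    rows: Sequence[Row], features: Sequence[str]
) -> Tuple[List[Row], Optional[str]]:
    """Restrict each row to the chosen features.

    Returns (table_rows, message). `message` is None when the table can be
    rendered, otherwise a user-facing explanation of why it cannot.
    Hypothetical helper: not part of the viewer or of any library.
    """
    # Empty selection: nothing to render, so explain instead of failing later.
    if not features:
        return [], "No features selected. Pick at least one feature to display examples."

    # Reject names that are not present in the data (checked on the first row).
    unknown = [name for name in features if rows and name not in rows[0]]
    if unknown:
        return [], "Unknown feature(s): " + ", ".join(unknown)

    return [{name: row[name] for name in features} for row in rows], None


if __name__ == "__main__":
    sample = [{"cs": "Dobrý den", "en": "Good day"}]  # made-up example row
    table, message = select_features(sample, [])
    # With an empty selection, the message is printed rather than a table.
    print(message if message is not None else table)
```

Returning a message instead of raising keeps the decision about how to display it (warning box, placeholder text) in the UI layer.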
Issue Analytics
- State:
- Created 3 years ago
- Comments: 5 (2 by maintainers)
Thanks for reporting! cc @srush for the empty feature list issue and the encoding issue. cc @julien-c: maybe we can update the URL and just have a redirect from the old URL to the new one?