question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Dataset viewer issues

See original GitHub issue

I was looking through the dataset viewer and I like it a lot. Version numbers, citation information, everything’s there! I’ve spotted a few issues/bugs though:

  • the URL is still under nlp, perhaps an alias for datasets can be made
  • when I remove a feature (and the feature list is empty), I get an error. This is probably expected, but perhaps a better error message can be shown to the user
IndexError: list index out of range
Traceback:
File "/home/sasha/streamlit/lib/streamlit/ScriptRunner.py", line 322, in _run_script
    exec(code, module.__dict__)
File "/home/sasha/nlp-viewer/run.py", line 316, in <module>
    st.table(style)
File "/home/sasha/streamlit/lib/streamlit/DeltaGenerator.py", line 122, in wrapped_method
    return dg._enqueue_new_element_delta(marshall_element, delta_type, last_index)
File "/home/sasha/streamlit/lib/streamlit/DeltaGenerator.py", line 367, in _enqueue_new_element_delta
    rv = marshall_element(msg.delta.new_element)
File "/home/sasha/streamlit/lib/streamlit/DeltaGenerator.py", line 120, in marshall_element
    return method(dg, element, *args, **kwargs)
File "/home/sasha/streamlit/lib/streamlit/DeltaGenerator.py", line 2944, in table
    data_frame_proto.marshall_data_frame(data, element.table)
File "/home/sasha/streamlit/lib/streamlit/elements/data_frame_proto.py", line 54, in marshall_data_frame
    _marshall_styles(proto_df.style, df, styler)
File "/home/sasha/streamlit/lib/streamlit/elements/data_frame_proto.py", line 73, in _marshall_styles
    translated_style = styler._translate()
File "/home/sasha/.local/share/virtualenvs/lib-ogGKnCK_/lib/python3.7/site-packages/pandas/io/formats/style.py", line 351, in _translate
    * (len(clabels[0]) - len(hidden_columns))
  • there seems to be an encoding issue in the default view, the dataset examples are shown as raw monospace text, without a decent encoding. That makes it hard to read for languages that use a lot of special characters. Take for instance the cs-en WMT19 set. This problem goes away when you enable “List view”, because then some syntax highlighteris used, and the special characters are coded correctly.

Issue Analytics

  • State:closed
  • Created 3 years ago
  • Comments:5 (2 by maintainers)

github_iconTop GitHub Comments

1reaction
lhoestqcommented, Nov 27, 2020

Thanks for reporting ! cc @srush for the empty feature list issue and the encoding issue cc @julien-c maybe we can update the url and just have a redirection from the old url to the new one ?

0reactions
mmkarlosmmcommented, Dec 1, 2020

‏⠀‏‏‏⠀‏‏‏⠀ ‏⠀

Read more comments on GitHub >

github_iconTop Results From Across the Web

Issues · huggingface/datasets-viewer - GitHub
datasets -viewer · Code · Issues · Pull requests · Actions · Projects · Security · Insights.
Read more >
Dataset preview not showing for uploaded DatasetDict
I created a DatasetDict and pushed it here. I'm getting the message Server Error Status code: 400 Exception: Status400Error Message: could not get...
Read more >
The "Open Diagnostic Data Viewer" is broken in Windows 10
The "Open Diagnostic Data Viewer" is broken in Windows 10 (64) Version 2004. When I click on the Open Diagnostic Data Viewer under...
Read more >
Visual Studio 2019 Data Viewer Does not show
The data viewer simply comes up as a blank white or black screen. See attachment. Visual Studiowindows 10.0visual studio 2019 version 16.3.. This...
Read more >
Using the Data Viewer in the RStudio IDE - Posit Support
Introduction Starting the viewer Sorting Filtering Searching Advanced topics Auto-refreshing Labels Restrictions and Performance Saving...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found