question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 936: ordinal not in range(128)

See original GitHub issue

Describe the bug UnicodeDecodeError: ‘ascii’ codec can’t decode byte 0xe2 in position 936: ordinal not in range(128)

Affected dataset(s) ‘msmarco-passage/train’ To Reproduce Steps to reproduce the behavior: Just run the official demo code: `import ir_datasets

if name == “main”: dataset = ir_datasets.load(‘msmarco-passage/train’) # Documents for doc in dataset.docs_iter(): print(doc) `

Expected behavior get normal output

Additional context Add any other context about the problem here.

Issue Analytics

  • State:closed
  • Created 2 years ago
  • Comments:8

github_iconTop GitHub Comments

1reaction
catqaqcommented, Jan 14, 2022

Okay, we’ve been suffering without some easy-to-use IR dataset interface for a long time, thanks for your excellent work!

0reactions
seanmacavaneycommented, Jan 14, 2022

Thanks!

Read more comments on GitHub >

github_iconTop Results From Across the Web

'ascii' codec can't decode byte 0xe2 in position 13: ordinal not ...
The file is being read as a bunch of str s, but it should be unicode s. Python tries to implicitly convert, but...
Read more >
'ascii' codec can't decode byte 0xe2 in position 13: ordinal not ...
PYTHON : UnicodeDecodeError : ' ascii ' codec can't decode byte 0xe2 in position 13: ordinal not in range ( 128 ) [...
Read more >
ERROR - 'ascii' codec can't decode byte 0xc3 in position 8
I'm running the tutorial example to define a pipeline, and when i ran this command : $ sudo airflow test flowtest print_date 2016-03-11...
Read more >
UnicodeDecodeError: 'ascii' codec can't decode byte
The Python "UnicodeDecodeError: 'ascii' codec can't decode byte in position" occurs when we use the ascii codec to decode bytes that were encoded...
Read more >
'ascii' codec can't decode byte 0xe9 in position 0: ordinal not ...
而Python在进行编码方式之间的转换时,会将unicode 作为“中间编码”,但unicode 最大只有128 那么长,所以这里当尝试将ascii 编码字符串转换成"中间编码" unicode 时 ...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found