question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Encoding special characters (ISO 639-1)

See original GitHub issue

Hello,

I use rss-parser on some feeds in the Serbian language, which has several special characters, like: Š, Č, Ć, Ž. All of these are (in rss-parser 2.10.1) replaced with \ufffd.

Is there any way encoding can be improved to cover these (latin-extended, ISO 639-1 codes) characters? Same would apply to several other languages - Slovenian, Croatian, Bosnian, etc.

I love the simplicity of use, despite the lack of async/await support 😃 but I will have to find an alternative if I can’t solve this encoding issue.

Issue Analytics

  • State:closed
  • Created 6 years ago
  • Comments:8 (5 by maintainers)

github_iconTop GitHub Comments

1reaction
rbrencommented, Jan 3, 2018

You’re right, we should set the encoding based on the content-type header if available.

I will try and put this into v3, as well as an option for specifying encoding manually. In the meantime, you can always download the URL outside of rss-parser and use parseString.

0reactions
fmsfcommented, Jan 17, 2018

That’s awesome! thanks

Read more comments on GitHub >

github_iconTop Results From Across the Web

List of ISO 639-1 codes - Wikipedia
ISO language name 639‑1 639‑2/T 639‑2/B 639‑3 Abkhazian ab abk abk abk Afar aa aar aar aar Afrikaans af afr afr afr
Read more >
ISO 639-2 Language Code List - Library of Congress
ISO 639‑2 Code ISO 639‑1 Code German name of Language aar aa Danakil‑Sprache abk ab Abchasisch ace Aceh‑Sprache
Read more >
ISO 639-1
No information is available for this page.
Read more >
ISO 639-1 Language Codes - Java2s.com
ISO 639-1 Language Codes ; Arabic, ar ; Aragonese, an ; Armenian, hy ; Assamese, as.
Read more >
ISOcodes: Selected ISO Codes - R Project
Encoding UTF-8 ... currency codes, ISO 15924 script codes, and the ISO 8859 character codes ... Alpha_2 (the corresponding ISO 639-1 alpha-2 code...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found