question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Subtitle encoding detection fail

See original GitHub issue

Hello,

subliminal converting subtitles to UTF-8 and for some players not a good idea. For example XBMC using user interface language for determine subtitles language. In my case my XBMC is Turkish and my subtitles downloaded from subliminal showing wrong characters.

Debug of subliminal: http://pastebin.com/raw.php?i=FDKCxZzw

I download same subtitle. This is compare of subliminal vs. opensubtitles web download:

$ charade downloaded_by_subliminal.srt downloaded_from_opensubtitles_website.srt 
downloaded_by_subliminal.srt: utf-8 with confidence 0.99
downloaded_from_opensubtitles_website.srt: ISO-8859-2 with confidence 0.874910791543
$ 

I am going to paste sneak peek of this two subtitles. You can see differences: http://pastebin.com/raw.php?i=5bE7YZJJ

Issue Analytics

  • State:closed
  • Created 10 years ago
  • Comments:49 (23 by maintainers)

github_iconTop GitHub Comments

1reaction
Diaoulcommented, Oct 20, 2015

Subliminal doesn’t use any encoding detection by default. It is only triggered when using the -e argument of the CLI.

In the previous versions, subliminal used the detected encoding to decode the subtitle and write it on the filesystem using utf-8. Due to many bug reports without the required information to make the detection algorightm better I decided to deactivate it by default and leave encoding detection to media players.

0reactions
Diaoulcommented, Oct 22, 2015

SickRage uses a custom version of subliminal, any issue related to SickRage must be reported there.

Read more comments on GitHub >

github_iconTop Results From Across the Web

ffmpeg: Unable to recode subtitle event - Super User
You can try -sub_charenc ISO-8859-1 (from output of file in2.srt ) or use iconv or some other tool to change the character encoding...
Read more >
SRT subtitles fail processing - General/Windows - Emby
It appears the problem was that the subtitle file was encoded as UTF-8 BOM rather than plain UTF-8. Converting it to plain UTF-8...
Read more >
Detect if subtitle streams do not contain valid utf-8 - FFmpeg Wiki
Problem Description: Import and encoding of subtitles which containing special characters using ffmpeg from text based *.srt and *.ass-subtitle files into an *....
Read more >
Subtitle encoding problem - Universal Media Server | Forum
Hi, recently updated from 6.4.0 to 6.5.3 UMS. After that subtitle encoding is not working anymore. Subtitles are
Read more >
How to convert included subtitles in not UTF-8 encoding when ...
Normally, WebVTT must always be UTF-8 per the WebVTT specification. This is part of the reason FFmpeg does not attempt to detect and...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found