Subtitle encoding detection fail
See original GitHub issueHello,
subliminal converting subtitles to UTF-8 and for some players not a good idea. For example XBMC using user interface language for determine subtitles language. In my case my XBMC is Turkish and my subtitles downloaded from subliminal showing wrong characters.
Debug of subliminal: http://pastebin.com/raw.php?i=FDKCxZzw
I download same subtitle. This is compare of subliminal vs. opensubtitles web download:
$ charade downloaded_by_subliminal.srt downloaded_from_opensubtitles_website.srt
downloaded_by_subliminal.srt: utf-8 with confidence 0.99
downloaded_from_opensubtitles_website.srt: ISO-8859-2 with confidence 0.874910791543
$
I am going to paste sneak peek of this two subtitles. You can see differences: http://pastebin.com/raw.php?i=5bE7YZJJ
Issue Analytics
- State:
- Created 10 years ago
- Comments:49 (23 by maintainers)
Top Results From Across the Web
ffmpeg: Unable to recode subtitle event - Super User
You can try -sub_charenc ISO-8859-1 (from output of file in2.srt ) or use iconv or some other tool to change the character encoding...
Read more >SRT subtitles fail processing - General/Windows - Emby
It appears the problem was that the subtitle file was encoded as UTF-8 BOM rather than plain UTF-8. Converting it to plain UTF-8...
Read more >Detect if subtitle streams do not contain valid utf-8 - FFmpeg Wiki
Problem Description: Import and encoding of subtitles which containing special characters using ffmpeg from text based *.srt and *.ass-subtitle files into an *....
Read more >Subtitle encoding problem - Universal Media Server | Forum
Hi, recently updated from 6.4.0 to 6.5.3 UMS. After that subtitle encoding is not working anymore. Subtitles are
Read more >How to convert included subtitles in not UTF-8 encoding when ...
Normally, WebVTT must always be UTF-8 per the WebVTT specification. This is part of the reason FFmpeg does not attempt to detect and...
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Subliminal doesn’t use any encoding detection by default. It is only triggered when using the
-e
argument of the CLI.In the previous versions, subliminal used the detected encoding to decode the subtitle and write it on the filesystem using utf-8. Due to many bug reports without the required information to make the detection algorightm better I decided to deactivate it by default and leave encoding detection to media players.
SickRage uses a custom version of subliminal, any issue related to SickRage must be reported there.