Zero-width spaces are used in Khmer
Hi,
I just found that from 4.3.x the Zwsp class was included in the code, and the trim method removes zero-width spaces from the text. In Khmer we currently use zwsp as word boundaries, because many operating systems and applications lack word-breaking support for Khmer. I’m a WordPress user, and a few years ago I opened a ticket here discussing this too.
I’m not sure whether zwsp can cause any problems?
Thanks 😃
PS: I found that the zwsp trimming appears in the code (Serializer.serialize()) again. It was removed once in 3.5.5. I’m not sure whether zwsp is the BOM, but Wikipedia and some other sources say that the BOM character is \uFEFF.
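To illustrate the distinction raised above: U+200B (zero-width space) and U+FEFF (the BOM character) are different code points, so a trim routine can in principle strip one without touching the other. Below is a minimal JavaScript sketch of that idea. The function names are hypothetical and are not the editor's actual Zwsp or Serializer API.

```javascript
// Hypothetical helpers for illustration only; not the editor's real API.
const ZWSP = '\u200B'; // ZERO WIDTH SPACE, used as a Khmer word boundary
const BOM = '\uFEFF';  // ZERO WIDTH NO-BREAK SPACE, also known as the BOM

// Strip only U+FEFF, leaving U+200B word boundaries intact.
function trimBomOnly(text) {
  return text.replace(/\uFEFF/g, '');
}

// Strip both characters. This is the kind of trimming that would
// silently remove Khmer word boundaries from the content.
function trimAllZeroWidth(text) {
  return text.replace(/[\u200B\uFEFF]/g, '');
}

// Two Khmer words separated by a zero-width space, plus a stray U+FEFF.
const sample = 'ភាសា' + ZWSP + 'ខ្មែរ' + BOM;

const safe = trimBomOnly(sample);        // keeps the ZWSP between the words
const lossy = trimAllZeroWidth(sample);  // loses the ZWSP between the words
```

With this split, text cleaned by trimBomOnly still contains the zero-width space between the two words, while trimAllZeroWidth joins them into one unbroken run.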
The issue with the BOM character is that it has a height that shows in Firefox when the font size is 11px or less, so the line jumps up and down when you navigate around contentEditable=false inline elements. But I guess fixing Khmer might be more important than that visual glitch, and it isn’t a problem for WordPress since the font size is larger there. We will probably bump the default text size in upcoming versions; however, that is kind of a breaking change, so I don’t want to do that in a patch release.
Yes, we will look into this one.