question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Zero-width spaces are used in Khmer

See original GitHub issue

Hi, I just found that from 4.3.x, Zwsp class was included in the code and the trim method removes them from the text. Currently in Khmer, we use zwsp as word boundaries due to the lack of word breaking support for Khmer on many OSs and softwares. I’m a WordPress user, and few years ago I opened a ticket here discussing about this too.

I’m not sure if zwsp can make any problem?

Thanks 😃

PS: I found that the zwsp trimming appears in code (Serializer.serialize()) again. It was removed once in 3.5.5. I’m not sure if zwsp is BOM, but I read in Wikipedia and some sources and they said that the BOM character is \uFEFF.

Issue Analytics

  • State:closed
  • Created 7 years ago
  • Reactions:1
  • Comments:15 (6 by maintainers)

github_iconTop GitHub Comments

3reactions
spockecommented, Aug 24, 2016

The issue with the BOM character is that it has a height that shows in Firefox when the font size is 11px or less. So the line jumps up and down if you navigate around contentEditable=false inline elements. But I guess fixing Kamer might be more important than that visual glitch and it isn’t a problem for wordpress since the font size is higher there. We will probably bump the default text size in upcoming versions however that is kind of a braking change so I don’t want to do that in a patch release.

3reactions
spockecommented, Apr 29, 2016

Yes, we will look into this one.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Khmer Layout Requirements (Draft) - W3C
In order to manually fine-tune word-boundary detection, the invisible character U+200B ZERO WIDTH SPACE (ZWSP) can be used to create breaks. To ...
Read more >
Topic: Invisible Space (in Khmer unicode) is removed automatic!
*(Invisible space is currently used in Laos unicode, Khmer unicode,… etc) ... Invisible Space we also can call it as Zero-width Space.
Read more >
Khmer Unicode Zero Width Space disappears after posting - bug
Hi there, I just installed discourse and found out that it didn't support Zero Width Space at all. It is Khmer Unicode special...
Read more >
Microsoft Office breaks lines in the middle of Khmer words (part
Normal spaces appear as dots. Zero-width spaces (the most commonly used in Khmer) are shown as double rectangles (one inside another). I've ...
Read more >
L2/08-344 - Unicode
Naturally, the ZERO WIDTH SPACE (ZWSP) was placed in the SPACES group ... The use of ZWSP has been quite successful, specially in...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found