Cannot analyze symbols with Chinese characters
Chinese function names and variable names cannot be analyzed. Please add support for analyzing them. LuaJIT supports Chinese function and variable names encoded in GBK or UTF-8.
Example: `function 中文函数名(参数1, 参数2) local 中文变量 = "Chinese variable name" end`
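To make the request concrete, here is a minimal sketch of a byte-level identifier scanner that lets names like the ones above through. It is written in Python only for illustration, since the analyzer's implementation language is not stated here. It assumes that every byte at or above 0x80 may appear in an identifier, which is enough for UTF-8 source; GBK is not handled, because GBK trail bytes can fall in the ASCII range. `IDENT`, `LUA_KEYWORDS`, and `identifiers` are hypothetical names, not part of any existing tool.

```python
import re

# Assumption: any byte >= 0x80 is an identifier byte, so UTF-8 encoded
# names are matched without decoding individual characters first.
IDENT = re.compile(rb"[A-Za-z_\x80-\xff][A-Za-z0-9_\x80-\xff]*")

# Lua 5.1 reserved words, filtered out so only user-defined names remain.
LUA_KEYWORDS = {
    "and", "break", "do", "else", "elseif", "end", "false", "for",
    "function", "if", "in", "local", "nil", "not", "or", "repeat",
    "return", "then", "true", "until", "while",
}

def identifiers(source):
    """Return the non-keyword identifiers found in UTF-8 Lua source text.

    This sketch does not skip string literals or comments.
    """
    data = source.encode("utf-8")
    words = (m.group().decode("utf-8") for m in IDENT.finditer(data))
    return [w for w in words if w not in LUA_KEYWORDS]

src = "function 中文函数名(参数1, 参数2) local 中文变量 = 1 end"
print(identifiers(src))
```

A stricter analyzer would decode the source and check each character's Unicode category instead of accepting every high byte.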
Issue Analytics
- State:
- Created 6 years ago
- Comments: 9 (6 by maintainers)
Top GitHub Comments
@fstirlitz
I was able to convert a UnicodeData.txt into a lightweight 2MB string map of general categories recently. It’s something like
'²²²²²²²²²²²²²²²²²²²²²²²²²²²²²²²²¯ªªª¬ªªª¦§ª«ª¥ªªCCCCCCCCCCªª«««ªª ¦ª§D!!!!!!!!!!!!!!!!!!!!!!!!!!¦«§«²²²²²²²²²²²²²²²²²²²²²²²²²²²²²²²²²¯ª¬¬¬¬®®®!¨«²®®«¤¤!®ª [more GCs]'
There’s also the UnicodeSet tool for that, but it outputs a pattern-like set with range elements instead. Range checking is generally slower than indexing into a string literal map.
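The two lookup strategies mentioned above can be sketched side by side. This is an illustration only, not the code from the comment: it uses Python's `unicodedata` module rather than a parsed UnicodeData.txt, covers only the Basic Multilingual Plane, and uses an arbitrary one-character encoding for each general category. All names in it are hypothetical.

```python
import bisect
import unicodedata

LIMIT = 0x10000  # assumption: BMP only, to keep the sketch small

cats = [unicodedata.category(chr(cp)) for cp in range(LIMIT)]
names = sorted(set(cats))
# Arbitrary encoding: one printable character per general category.
code = {name: chr(0x21 + i) for i, name in enumerate(names)}

# Approach 1: a flat string with one character per code point.
# A lookup is a single index into the string.
STRING_MAP = "".join(code[c] for c in cats)

def category_by_index(cp):
    return names[ord(STRING_MAP[cp]) - 0x21]

# Approach 2: run-length ranges, each starting where the category changes.
# A lookup is a binary search over the range starts.
starts = [cp for cp in range(LIMIT) if cp == 0 or cats[cp] != cats[cp - 1]]
run_cats = [cats[cp] for cp in starts]

def category_by_range(cp):
    return run_cats[bisect.bisect_right(starts, cp) - 1]

print(category_by_index(ord("中")), category_by_range(ord("中")))
```

Both functions return the same category for every code point in the covered range. The flat map trades memory for a constant-time lookup, while the range table is far smaller but needs a search on every query.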
Implemented in 71729404772a771e588a6c0ca2c70d3db7f9f254. Consider it unstable, however. I might still revisit the encoding issue.