[Libre-soc-isa] [Bug 794] SVP64 REMAP for utf8
bugzilla-daemon at libre-soc.org
bugzilla-daemon at libre-soc.org
Wed Mar 30 15:17:05 BST 2022
https://bugs.libre-soc.org/show_bug.cgi?id=794
Jacob Lifshay <programmerjake at gmail.com> changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |programmerjake at gmail.com
--- Comment #1 from Jacob Lifshay <programmerjake at gmail.com> ---
additional useful links:
converting utf-8 <-> utf-16 (useful for JS and Java)
https://web.archive.org/web/20210625032530/https://researcher.watson.ibm.com/researcher/files/jp-INOUEHRS/IPSJPRO2008_SIMDdecoding.pdf
validating UTF-8 (useful for JSON decoding and many many other things)
https://github.com/rusticstuff/simdutf8
it's very common to only care if you have correct utf-8 and where the first
error is rather than needing to decode the unicode codepoints -- the unicode
codepoints aren't that much more useful than the bytes for many purposes --
parsing (e.g. JSON) is nearly always faster on just the utf-8 bytes rather than
having to decode to utf-32 first.
--
You are receiving this mail because:
You are on the CC list for the bug.
More information about the Libre-SOC-ISA
mailing list