[Libre-soc-isa] [Bug 794] SVP64 REMAP for utf8

bugzilla-daemon at libre-soc.org bugzilla-daemon at libre-soc.org
Wed Mar 30 15:17:05 BST 2022


https://bugs.libre-soc.org/show_bug.cgi?id=794

Jacob Lifshay <programmerjake at gmail.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |programmerjake at gmail.com

--- Comment #1 from Jacob Lifshay <programmerjake at gmail.com> ---
additional useful links:
converting utf-8 <-> utf-16 (useful for JS and Java)
https://web.archive.org/web/20210625032530/https://researcher.watson.ibm.com/researcher/files/jp-INOUEHRS/IPSJPRO2008_SIMDdecoding.pdf

validating UTF-8 (useful for JSON decoding and many many other things)
https://github.com/rusticstuff/simdutf8

it's very common to only care if you have correct utf-8 and where the first
error is rather than needing to decode the unicode codepoints -- the unicode
codepoints aren't that much more useful than the bytes for many purposes --
parsing (e.g. JSON) is nearly always faster on just the utf-8 bytes rather than
having to decode to utf-32 first.

-- 
You are receiving this mail because:
You are on the CC list for the bug.


More information about the Libre-SOC-ISA mailing list