Issue 761 - v8 - Incorrect UTF-8 encoding/decoding for non-BMP characters in String related functions - V8 JavaScript Engine - Google Project Hosting
february 2012 by TomC
V8 bug tracking thread following some of the same unicode/js issues I've been investigating.
v8
chrome
nodejs
javascript
programming
unicode
text
february 2012 by TomC
isaacs's gist: 1850768 — Gist
february 2012 by TomC
Gist driven side-debate (also on Twitter) with Isaac of Node and Brendan of Javascript, about unicode escaping/encoding issues and the prospect of full VM-level support (fixing String.length, substring etc)
brendaneich
isaacschlueter
nodejs
javascript
unicode
github
gist
text
encoding
escaping
programming
code
february 2012 by TomC
New full Unicode for ES6 idea
february 2012 by TomC
Brendan Eich's followup to last week's debate with the node.js community about full unicode support in Javascript.
javascript
nodejs
brendaneich
ecmascript
standards
es6
programming
text
unicode
february 2012 by TomC
The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) - Joel on Software
february 2012 by TomC
The most commonly recommended (and decent) intro to unicode for programmers.
unicode
programming
code
encoding
utf8
utf16
text
joelspolsky
articles
february 2012 by TomC
RFC 4627 - The application/json Media Type for JavaScript Object Notation (JSON)
february 2012 by TomC
JSON text uses UTF-16 escapes.
json
unicode
utf16
specifications
data
encoding
escaping
formats
february 2012 by TomC
RFC 4627 - The application/json Media Type for JavaScript Object Notation (JSON)
february 2012 by TomC
JSON default encoding is UTF8.
json
unicode
utf8
specifications
data
encoding
escaping
formats
february 2012 by TomC
Emoji for PHP
february 2012 by TomC
Cal's big table of emoji code-points and equivalencies for various different implementations, where available.
emoji
unicode
iamcal
php
text
february 2012 by TomC
How can you make the Github API accept Unicode characters in JSON? — Gist
february 2012 by TomC
More generally: how to escape UTF-16 characters (including surrogate pairs) in JSON.stringified javascript data.
unicode
javascript
json
stringify
encoding
escaping
github
gist
nodejs
february 2012 by TomC
JavaScript’s internal character encoding: UCS-2 or UTF-16? · Mathias Bynens
february 2012 by TomC
Good explanation of surrogate pair encoding/escaping in UTF-16.
unicode
javascript
text
programming
code
utf16
ucs2
february 2012 by TomC
Tweet Compressor
march 2010 by TomC
"Here's the list: cc, ms, ns, ps, in, ls, fi, fl, ffl, ffi, iv, ix, vi, oy, ii, xi, nj, ". " (period space), and ", " (comma space). ... All of these letter groups will be replaced with a single character, using unicode. This is possible because unicode has single character replacements for things like roman numerals (iv, ix), scientific abbreviations (ms, ns), and more."
twitter
unicode
hacks
march 2010 by TomC
Understanding Bidirectional (BIDI) Text in Unicode
march 2009 by TomC
Nice article from Cal Henderson that moves us on a bit further past "The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)". Which is good, because there are some folks out there who still think that ASCII will do.
unicode
rtl
calhenderson
arabic
web
html
php
languages
text
algorithms
programming
bidirectional
bidi
march 2009 by TomC
related tags
algorithms ⊕ api ⊕ arabic ⊕ articles ⊕ bidi ⊕ bidirectional ⊕ brendaneich ⊕ calhenderson ⊕ chrome ⊕ code ⊕ data ⊕ ecmascript ⊕ emoji ⊕ encoding ⊕ es6 ⊕ escaping ⊕ formats ⊕ gist ⊕ github ⊕ hacks ⊕ heiscal ⊕ html ⊕ iamcal ⊕ ios ⊕ iphone ⊕ isaacschlueter ⊕ japan ⊕ javascript ⊕ joelspolsky ⊕ json ⊕ languages ⊕ nodejs ⊕ perl ⊕ php ⊕ programming ⊕ python ⊕ rtl ⊕ specifications ⊕ standards ⊕ stringify ⊕ text ⊕ twitter ⊕ typography ⊕ ucs2 ⊕ unicode ⊖ utf8 ⊕ utf16 ⊕ v8 ⊕ web ⊕ wikipedia ⊕Copy this bookmark: