Unicode and Everything
Slides and additional notes from an SF Perl talk at Geekspace in San Francisco on March 3rd, 2014
Unicode and Everything
|
|
Perl Credits
|
|
Introduction to Unicode
|
|
Unicode Overview
|
|
Raw Data Files
|
|
Unicode::Tussle
|
|
Unicode::Tussle unichars
|
|
Shapecatcher
|
|
Other Aspects
|
|
Case Folding
|
|
Collation
|
|
Precomposed vs Combining
|
|
Bidi
|
|
HTML5 <bdo> <bdi> Tags
|
|
HTML with Unicode Format Chars
|
|
Unifont Dream
|
|
Fonts On the Web
|
|
Gaps in Story: (1) FOUC and (2) Font by Codepoint
|
|
Historical Approach
|
|
ASCII
|
|
ISO-8859-1 aka Latin-1
|
|
Beyond 8 Bits: Unicode
|
|
16 Bits: Basic Multilingual Plane
|
|
21 Bits: 17 Planes
|
|
Standards Releases
|
|
Han Disunification
|
|
Encodings
|
|
UCS-2
|
|
UTF-16
|
|
UTF-8
|
|
UTF-8 Plus 2
|
|
UTF-8 Plus 3
|
|
The Real UTF-8
|
|
Self-Sync Problems
|
|
Classic I/O Model
|
|
Classic Perl I/O
|
|
Perl UTF-8 I/O
|
|
Perl UTF-8 I/O in 2 Steps
|
|
Binmode Existing Filehandles
|
|
Making It Just Work
|
|
Perl I/O More Ways
|
|
Decomposing and Composing
|
|
Perl UTF-8 I/O Lax Form
|
|
Perl Internal Format
|
|
Full Unicode Support
|
|
Digit Class Gotcha
|
|
JavaScript
|
|
Python
|
|
Ruby
|
|
MySQL
|
|
PostgreSQL
|
|
Source As UTF-8
|
|
Gimme Some UTF-8 Script 1
|
|
Gimme Some UTF-8 Results
|
|
Gimme Some UTF-8 Fixed
|
|
Gimme Some UTF-8 or Is It
|
|
Gimme Some UTF-8 Emacs Language Encoding
|
|
Gimme Some UTF-8 Safe Form
|
|
Further Reading
|
|
CPAN Rules Upsidedown
|
|
doom@kzsu.stanford.edu