Unicode and Everything

Slides and additional notes from an SF Perl talk at Geekspace in San Francsico, on March 3rd, 2014

March 12, 2014
Unicode and Everything                             (Click any to enlarge)
Perl Credits
Introduction to Unicode
Unicode Overview
Raw Data Files
Unicode::Tussle
Unicode::Tussle Unichars
Shapecatcher
Other Aspects
Case Folding
Collation
Precomposed vs Combining
Bidi
Html5 <Bdo> <Bdi> Tags
Html with Unicode Format Chars
Unifont Dream
Fonts On the Web
Gaps in Story: (1) Fouc and (2) Font by Codepoint
Historical Approach
Ascii
Iso-8859-1 aka Latin-1
Beyond 8 Bits Unicode
16bits Basic Multilingual Plane
21bits 17 Planes
Standards Releases
Han Disunification
Encodings
UCS-2
UTF-16
UTF-8
UTF-8 Plus 2
UTF-8 Plus 3
The Real UTF-8
Selfsync Problems
Classic I/O Model
Classic Perl I/O
Perl UTF-8 I/O
Perl UTF-8 I/O in 2 Steps
Binmode Existing Filehandles
Making It Just Work
Perl I/O More Ways
Decomposing and Composing
Perl UTF-8 I/O Lax Form
Perl Internal Format
Full Unicode Support
Digit Class Gotcha
Javascript
Python
Ruby
Mysql
Postgresql
Source As UTF-8
Gimme Some UTF-8 Script 1
Gimme Some UTF-8 Results
Gimme Some UTF-8 Fixed
Gimme Some UTF-8 or Is It
Gimme Some UTF-8 Emacs Language Encoding
Gimme Some UTF-8 Safe Form
Further Reading
Cpan Rules Upsidedown

doom@kzsu.stanford.edu