Unicode and Everything
Slides and additional notes from an SF Perl talk at Geekspace in San Francisco on March 3rd, 2014
| Unicode and Everything
|
|
| Perl Credits
|
|
| Introduction to Unicode
|
|
| Unicode Overview
|
|
| Raw Data Files
|
|
| Unicode::Tussle
|
|
| Unicode::Tussle Unichars
|
|
| Shapecatcher
|
|
| Other Aspects
|
|
| Case Folding
|
|
| Collation
|
|
| Precomposed vs Combining
|
|
| Bidi
|
|
| HTML5 <bdo> <bdi> Tags
|
|
| HTML with Unicode Format Chars
|
|
| Unifont Dream
|
|
| Fonts On the Web
|
|
| Gaps in Story: (1) FOUC and (2) Font by Codepoint
|
|
| Historical Approach
|
|
| ASCII
|
|
| ISO-8859-1 aka Latin-1
|
|
| Beyond 8 Bits: Unicode
|
|
| 16 Bits: Basic Multilingual Plane
|
|
| 21 Bits: 17 Planes
|
|
| Standards Releases
|
|
| Han Disunification
|
|
| Encodings
|
|
| UCS-2
|
|
| UTF-16
|
|
| UTF-8
|
|
| UTF-8 Plus 2
|
|
| UTF-8 Plus 3
|
|
| The Real UTF-8
|
|
| Selfsync Problems
|
|
| Classic I/O Model
|
|
| Classic Perl I/O
|
|
| Perl UTF-8 I/O
|
|
| Perl UTF-8 I/O in 2 Steps
|
|
| Binmode Existing Filehandles
|
|
| Making It Just Work
|
|
| Perl I/O More Ways
|
|
| Decomposing and Composing
|
|
| Perl UTF-8 I/O Lax Form
|
|
| Perl Internal Format
|
|
| Full Unicode Support
|
|
| Digit Class Gotcha
|
|
| JavaScript
|
|
| Python
|
|
| Ruby
|
|
| MySQL
|
|
| PostgreSQL
|
|
| Source As UTF-8
|
|
| Gimme Some UTF-8 Script 1
|
|
| Gimme Some UTF-8 Results
|
|
| Gimme Some UTF-8 Fixed
|
|
| Gimme Some UTF-8 or Is It
|
|
| Gimme Some UTF-8 Emacs Language Encoding
|
|
| Gimme Some UTF-8 Safe Form
|
|
| Further Reading
|
|
| CPAN Rules Upside Down
|
|
doom@kzsu.stanford.edu