KTUGFaq
KTUG FAQ
"It seems strange to meet computer geeks who're still primarily running Windows... as if they were still cooking on a wood stove or something." - mbp
Karnes/2009-02&value=Yhchoe/ġ̵&value=Karnes/2009-02&value=Yhchoe/ġ̵ › DVIPDFMx/Example › TeX&value=п۲ › 트占쏙옙타占쌉글꼴삼옙占쏙옙歐占 › TeX › UTF-8
1 UTF-8 ¶
UTF-8 Unicode ڸ 1Ʈ 4Ʈ Ʈ Ʈ ڵѴ.
ϳ Unicode ڸ Ʈ ڵ ش ڿ Ҵ ڵ尪 (Unicode Scalar Value) ִ. U+007F(127) 1 Ʈ, U+0080(128) U+07FF(2047) 2 Ʈ, U+0800(2048) U+FFFF(65535) 3 Ʈ, U+10000(65536) U+10FFFF(1114111) 4 Ʈ . US-ASCII ϴ ڴ U+0000 (NULL) ؼ UTF-8 Ʈ ǥ ִ. Ư US-ASCII ȣȯ ؾ ϴ н ý[UTF-8], SMTP (ͳ ) ؽƮ ͳ ݿ ϴ.
Unicode ڵϴ δ UTF-7, UTF-8, UTF-16, UTF-32 ִ.
TeX ַ UTF-8 ϴµ, CJK ڴ U+0800 Ŀ ҴǾ Ƿ UTF-8 3 Ʈ Ἥ Ÿ Ѵ. ݸ鿡 UTF-16 쿡 CJK ڸ ؼ BMP (Basic Multilingual Plane : Unicode ó 65,536 ڵ Ʈ) ϴ ڴ 2 Ʈ Ÿ. ̷ UTF-16 ȣϴ 찡 (UTF-8 UTF-16 ִ 1.5 /۽ ð ϹǷ) US-ASCII ȣȯ 쿡 ߿ϹǷ, Unix(Mac OS X) BeOS ؽƮ İ Ŀ UTF-8 ַ . TeX/Omega UTF-8 ַ . ݸ鿡 Win32 ؽƮ ĵ ⺻δ UTF-16 Ѵ.
ؽƮ İ OS α Ȥ ̺귯 ο ڵ Ĵ ̴. Linux glibc UTF-32 , Mac OS X, Win32, Omega, ICU (International Component for Unicode), Java, ECMAscript[1], Mozilla UTF-16 , BeOS, glib, Perl UTF-8 . Python UTF-32 Ȥ UCS-2 (2byte Ȥ 4byte ڵ UTF-16 UCS-2 2byte ڵ BMP ִ.) ִ.
UTF-32 ڵ UTF-16 ٷµ [2] ִ ݸ鿡 UTF-8 ִ 4 (US-ASCII ϴ ڶ), UTF-16 ؼ ִ 2 (BMP ϴ ) ٴ ִ.
ڼ ű⼭ MS, Apache ִ Ͻʽÿ : --
- http://www.w3.org/International/questions/qa-utf8-bom.html
- http://www.unicode.org/unicode/faq/utf_bom.html
- UTF-8 IETF RFC : RFC3629
- UTF-8 Bob Pike ̸ :http://www.cl.cam.ac.uk/~mgk25/ucs/UTF-8-history.txt
- Unicode Glossary: http://www.unicode.org/glossary/