KTUGFaq
KTUG FAQ
WinEdtTip/GhostScript������&value=WinEdtTip/DOSɽ › TocLofLot&value= › 占쏙옙占쏙옙철&value=LaTeX/HelpOnFormatting › MicroImp › LittleTree/ReadingTeXbook/2006-08 › BOM
BOM, Byte Order Mark
Ʈ ⺻ UTF-8 UCS-2 UTF-16 2 Ʈ ⺻ ̰, UTF-32 4 Ʈ ⺻ ̹Ƿ Ʈ ٲ ֻ Ʈ (Big Endian) ߿ (Little Endian) ٸ ǥ ϴ. ǥϱ ο ġ ƹ U+FEFF (Zero Width Non-Breaking Space) ξ Ʈ ڵ ˾Ƴ ִ. , UTF-16 Little Edian BOM FF FE ݸ Big Edian FE FF ȭѴ. UTF-32LE FF FE 00 00̰, UTF-32BE 00 00 FE FF ̴. BOM(U+FEFF) UTF-8 Ÿ EF BB BF̴.
UTF-16 LE | FF FE |
UTF-16 BE | FE FF |
UTF-32 LE | FF FE 00 00 |
UTF-32 BE | 00 00 FE FF |
UTF-8 | EF BB BF |
UTF-8 ⸦ ̿ LaTeX/Lambda ۾ BOM ¶
Win32 Ϻ α (Notepad, Wordpad, MS Word) UTF-8 BOM Win 9x/ME ̴ ANSI[1] code page (ѱ Windows codepage 949 - Ȯ ϼ) ۼ ؽƮ ϰ ϱ ǥ Ѵ. , UTF-16, UTF-32, UTF-8 BOM UTF-16, UTF-32, UTF-8 νϰ ANSI code page Ѵ. XML ǥؿ UTF-8 BOM BOM ڵ ϳ ϰ ִ. , UTF-8 BOM POSIX OS cat file1 file2 file3 | another_program ŰǷ ״ ȯ Ѵ. TeX/Omega 忡 Notepad, Wordpad BOM UTF-8 ̴ ״ ݰ ƴϴ. TeX/Omega compiler .tex ο ǹ̸ ڰ ִ Ƿ . ׳ <Enter> ġ ϸ ó پ ´. --
----
- [1] ANSI ƹ , MS .
KTUGCollection2006 BOM ¶
Ϳ BOM ¶
Ʒѱ(ѱ2005 ), (NotePad), е, UltraEdit UTF-8 ϸ ڵ BOM ٿش. ݸ Windows Yudit Linux BOM ʴ´. ü UTF-8 ڵ LaTeX BOM óϿ ̴.
UTF-8 BOM ݵ ʿѰ ǹ̴. . NotePad UltraEdit ڵ .tex ҽ ڵ(UTF-16) libiconv ̿ؼ UTF-8 ȯϸ UTF-8 BOM ش.
ٿ ʿ䰡 , ¥ϴ. ݸ鿡 ڵ ڵ ˾Ƴµ ϱ մϴ. Perlδ ؼ ֽϴ. (Perl Unicode list ö Դϴ.)
#!/usr/bin/perl -pi~ -0777 # program to remove a leading [UTF-8] BOM from a file # works both STDIN -> STDOUT and on the spot (with filename as argument) if ($] > 5.007) { require Encode; Encode::_utf8_off($_); } s/^\xEF\xBB\xBF//s;ٸ ֽϴ (Perl̹Ƿ ٸ .) --
UltraEdit ¶
ɼ %SystemRoot%\UEdit32.INI ־ָ, BOM ִ
UltraEdit , Advanced -> General BOM ÿθ ֽϴ.
Write UTF-8 BOM = 0 Write UTF-8 BOM NF = 0ù UTF-8 , BOM ΰ ϸ, ι° UTF-8 Ͽ , BOM Ѵ. (0 , 1 )