Unicode(Re: Tu-a sia ho e lunbun)
taigu "Kiatgak"
taigu "Kiatgak"
Goa chiu goa sou liau-kai e pou-hun kan-tan po-ko--1-e, ki-tha e pou-hun
chiah ma-hoan Khaisu a-si Aki pou-chhiong.
1. Unicode e lek-su
1991 V1.0 28,302 ji
1996 V2.0 38,885 ji
2000 V3.0 49,194 ji
Iau-ku leh cheng-ka sin e ji jip--khi, m-ku i-keng ti lai-te--e to be koh
kai.
2. Unicode e hoan-ui
Oah e (iau leh su-iong--e) gi-gian. Phou-thong e bun-ji (plain text).
Unicode chhu-li ji e piau-si hong-hoat, tak e ji seN-choe siaN-khoan si
leng-goa 1 chan tai-chi.
3. Unicode e bok-phiau
Hou choan se-kai bun-ji e tian-chu chu-liau e-tang liu-thong kau-oaN.
I-cha e siat-ke long si 1-e/1-koa gi-gian eng 1 tho phiau-chun (kio-choe
codepage). Kap pat-chiong gi-gian e tian-chu chu-liau bo hoat-tou kau-oaN.
Li na u su-iau eng 2 chiong chha khah che e gi-gian (phi-ju kong Hoat-gi,
Jit-gi), to thau thiaN--a. Ai su-iong tek-piat e nng-the; chu-liau kap
pat-lang ma bo it-teng e-tang kau-oaN.
Unicode e siat-ke hou bo kang gi-gian e chu-liau e-tang khng cho-hoe,
kau-oaN ma bo bun-toe--a. Na khoaN bo, to chng ji-heng; na beh sia, to chng
hit-chiong gi-gian e su-jip-hoat to ho.
Chia u 1 e chhu-bi e koan-tiam. Unicode sui-bong khoaN--khi-lai si kong
"thong-it-be", m-ku i e siat-ke si beh hou jiok-se gi-gian ma e-tang kap
kiong-se gi-gian u kang-khoan e te-ui, long e-tang eng. Tai-oan cheng-hu
teng e phiau-chun tian-to-peng: "Li na e-sai eng chit tho, pat-hang long
be-sai eng--chit."
4. Unicode e hian-hong
Windows NT/2000 long si eng Unicode lai siat-ke--e.
Word 2000 ma si. (Word 97 tui Unicode e chi-oan bo oan-cheng.)
HTML 4.0 e phiau-chun kui-teng eng Unicode. Chit-ma IE 5.x, Netscape 6.x
long si eng Unicode.
Java tit-chiap eng Unicode.
Ki-tha chi-oan unicode e nng-the iau chiaN che. (E ju-lai ju-che.) M-ku
bak-cheng toa pou-hun e nng-the bo chi-oan Unicode.
5. Unicode e pian-be
Unicode eng 16 bit lai piau-si 1 ji (character). Sou-i e-tang piau-si
2^16=65536 ji. M-ku i koh siat-ke eng D800-DBFF(1024 ji) kap DC00-DFFF
(1024 ji) 2 ji lai piau-si chin-chiaN e 1 ji, sou-i koh ke-chhut
1024*1024=1,048,576 ji. Long-chong e-tang piau-si 65536 + 1048576= 1,114,112
ji (0x110000).
0-127 e pou-hou ham ASCII sio-siang.
Chin-chiaN tian-chu chu-liau e piau-si hong-hoat u UTF-16, UTF-16LE,
UTF-16BE, UTF-8, ... Siong chiap eng--e si UTF-8. UTF-8 e-sai kap toa
pou-hun ku e (chi-oan ASCII--e) nng-the sio-thong.
Unicode pau-koat toa pou-hun hian-iu e pian-be phiau-chun. Chiu-si kong li
e-tang ka hian-iu bo kang gi-gian e chu-liau long choan-choe Unicode,
chiah-e chu-liau to e-tang kau-oaN, bo khi chhiong--tioh-a.
Unicode e pian-be u i ka-ki e kui-chek, bo it-teng ham lan phou-thong kong e
1 e ji sio-siang. Phi-ju kong 1 e Arabic e ji (grapheme) kho-leng ai 4 e
Unicode e ji lai piau-si. Phi-ju kong a2 e-sai eng Unicode 00E1 1 e ji lai
piau-si (precomposed character), ma e-sai eng Unicode 'a' kap Unicode 0301 2
e ji lai piau-si (combining character sequence).
6. Unicode kap ISO 10646
Kan-tan kong 2 e e-sai kong si kang-khoaN--e.
Pian-be hong-bin, Unicode ui V1.1 i-au to long kap ISO 10646 i-chhi i-ti.
M-ku, Unicode u ke 1-koa ji e teng-gi. Sou-i e-sai kong hu-hap ISO 10646--e
to e hu-hap Unicode.
7. Unicode kap ji-heng
Unicode chhu-li pian-be kap ji e teng-gi, ji-heng chhu-li beh chaiN-iuN
hian-si, che si 2 chan tai-chi. Bo kang e he-thong su-iong bo kang e hian-si
hong-hoat.
Beh hian-si Unicode sio-khoa ma-hoan. Thong-siong su-iong .OTF e ji-heng
tong-an. Nng-the hong-bin, Microsoft u the-kiong 1 e Uniscribe hou nng-the
siat-ke-chia hong-pian su-iong. Ti Mac thiaN kong u 1 e ATSUI. Leng-goa koh
u 1 e Freetype thiaN kong Windows/Mac/Linux long thong-iong.
Goa siat-ke e ji-heng (leng-goa 1 e e-mail) si .TTF e tong-an na-tiaN, iau
bo hoat-tou chi-oan Unicode. M-chai A-ki hiaN e ji-heng e-tang chi-oan
Unicode--be?
8. Unicode kap La-teng-ji
Lan it-poaN tiaN khoaN--tioh-e e La-teng-ji, Unicode long u chi-oan. Phi-ju
kong
Basic Latin (0020-007F: Eng-gi),
Latin-1 (00A0-00FF: Hoat-gi, Tek-gi, ..),
Latin Extended-A (0100-017F: Latvian, Romanian, Polish, Lithuanian,
Croatian, Esperanto, Maltese, Irish, Czech, ...),
Latin Extended-B (0180-0233: Zulu, Pan-Nigerian, Zhuang, Acrican, Ewe,
Pinyin, ...),
Latin Extended Additional (1EF0-1EF9: Livonian, Vietnamese, ...)
9. Unicode kap POJ
ChhiaN chham-kho A-ki hiaN "Tai-oan Tek-su ji-bo e pio".
Lai-te peh--e kap chhiN--e chiu-si Unicode i-keng tit-chiap teng-gi e ji
(precomposed).
Kam-a-sek--e si Unicode bo tit-chiap teng-gi e ji, kho-leng e-tang eng
kui-e-a Unicode e ji lai chou-hap. Phi-ju kong a8 e-sai eng Unicode 'a' kap
Unicode 030D lai chou-hap.
Kam e-sai hiong Unicode cho-chit sin-chheng chiah-e su-iau chou-hap e ji
(chiaN-choe precomposed e ji)? Chiau in e goan-chek, na e-tang eng
chou-hap--e, to be chiap-siu.
10. Unicode e bang-chi
unicode chou-chit:
[http://www.unicode.org](<http://www.unicode.org/>)
Khai-su:
<http://www.egt.ie/standards/la/taioan.html>
Proposal to add Latin characters required by Latinized Taiwanese languages
to ISO/IEC 10646
<http://www.taioan.com/unicode/unicode.html>
POJ unicode
A-ki:
<http://www.taioan.com/unicode/unicode.html>
<http://www.taioan.com/unicode/kihosamp.html>
11. Kiat-lun
Tai-oan ai:
Pun-thou-hoa --> su-iong 100 goa tang e pun-thou bun-ji, POJ. (im-phiau eng
IPA to e-sai--chit)
Kok-che-hoa --> kap bo-kang bun-ji, bun-hoa e kok-ka kau-liu. (eng POJ kap
IPA siong sek-hap)
Chu-sin-hoa --> eng Unicode m-taN e-sai chhu-li POJ, koh e-sai kap choan
se-kai kau-oaN tian-chu chu-liau. (pun-thou-hoa kiam kok-che-hoa)
Sui-bong u e POJ Unicode bo tit-chiap teng-gi, m-ku chiah-e ki-sut-seng e
bun-toe long e-tang kai-koat. Kap toa pou-hun e gi-gian pi--khi-lai be khah
khun-lan.
I-siong chhiaN chham-kho.
Kiatgak
> Goa u thak Ungian e lunbun, pouhun e POJ inui hethong chengchha e bunte bo
> hoattou chimchiok liaukai, mkoh kui-e lai khoaN si chin u ketat e lunbun.
> Goa ma hibang Khaisu, Aki, Kiatgak hiaN etang sia koa khah thongsiok e
> bunchiuN. Kisit inui koe beh 3~4tang a, chin che lang iausi sa bo
Unicode,
> ISO, Lateng/POJ ji-the chi-kan e koanhe.
>
> --Honggiau
>
> > > Goa e kamkah si, goa tui Unicode bosiaN liaukai, siongho si
> > > Kiatgak hiaN iahsi Khaisu etang sia 1 phiN khah thongsiok e bunchiuN
> > > lai siaukai chitpouhun e buntoe.
> >
> > Khai-su kap A-ki tui Unicode u khah chhim e gian-kiu, goa iau-koh bo
siaN
> > liau-kai.
> >
> >
> > I-siong chhiaN chham-kho.
> >
> > Kiatgak
> >
> >
>