Ungian, Goa khah ke-po, ka li kong Eng-gi bo tho-tong e sou-chai. Ang-sek si goan-bun, chhenn-sek si goa kian-gi. Babuza | Taiwanese language and literature course became the formal course in junior high school and promary(primary) school of Taiwan since year 2001, and the importance of written Taiwanese has been promoted by considerable(considerably). But the furtherance of written Taiwanese confronts two main problems : the selection of spelling systems and the characters usage. This project intends to suggest the characters usage of written Taiwanese via the syllables and words count from the Taiwanese corpus.
One of main effort in this project is to colloect at least 3,000,000 syllables Taiwanese corpus which can unfold the reality of Taiwanese writing style from the above raw material.
We use this corpus to count the syllables and words frequency for the purpose of the Taiwanese characters usage suggestion. We also hope it will be the most important basis of the following related Taiwanese natural language processing research such as Taiwanese teaching material editing, lexicography, automatic part-of-speech tagging, concordancer, collocation, sentense parsing, auto-correction, input method, automatic document abstraction … etc.
---
Keywordï¼ corpus, written Taiwanese, syllable frequency, word frequency

----- Original Message ----- From: Iunn Un-gian Sent: Thursday, August 04, 2005 4:16 PM Subject: [TGB] Giankiu ke-oe chuliau chiuN-bang

Takke ho :
http://iug.csie.dahan.edu.tw/giankiu/keoe/KKH/guliau-supin/guliau-supin.asp Che si goa 2004/8~2005/7 chip-heng e kok-kho-hoe ke-oe, chiamsi ko 1-toaN-loh, choekun e chhe sikan lioksiok chiong seng-ko chiuN-bang. Chit-e ke-oe tittioh chinchoe pengiu e pang-chan, iuki si guliau e the-kiong. Choe-au e thong-ke chuliau si Kiatgak hiaN sia Program chengli chhutlai e. Li ma esai ui chia jipkhi http://iug.csie.dahan.edu.tw/taigu.asp Soan "gian-kiu / gian-kiu ke-oe", laitoe u liat 4 e ke-oe, mkoh toapouhun long iah bo siaNme chuliau. goa hibong lan e giankiu sengko long etang chiuN-bang, hou kohkhah choe lang chaiiaN lan phahpiaN e sengko. Ungian 8/4

--
IuN Un-gian æ¥å è¨
Tai-han Chu-kang-he Chou-kau-siu大漢è³å·¥ç³»å©ææ
Tai-tai Chu-kang-he Phok-su-pan hak-sengå°å¤§è³å·¥ç³»å士ç­å­¸ç
http://iug.csie.dahan.edu.tw