Free on-line access of spoken data

From: Lau Seng-hian
Date: 2009-04-08 17:30:14

政治大學華語、客語、台語語料庫。

http://140.119.172.200/


TAIWAN LINGUIST LIST 台灣語言學討論區

=============================================================

本討論區提供台灣語言學者交換意見以及張貼公告。

若有貼文請寄:

將內容載於本文,請勿用附件寄送。


Dear colleagues,

We would like to inform you about the free on-line access of Mandarin, Hakka, and Southern Min spoken data in the NCCU Corpus of Spoken Chinese. The NCCU Spoken Corpus consists of three sub-corpora:

(1) The Corpus of Spoken Mandarin

There are seventeen extracts of daily face-to-face conversations, totaling seven hours of talk. Besides the orthographic transcription, nine extracts also include phonetic transcription, English glosses, and free translation.

(2) The Corpus of Spoken Hakka

There are six extracts of daily face-to-face conversations, totaling two hours of talk.

(3) The Corpus of Spoken Southern Min

[in preparation]

We have been collecting spoken data since 2006 to document the daily use of Mandarin, Hakka, and Southern Min spoken in Taiwan. The data can now be accessed on-line via internet at http://140.119.172.200/ for research and teaching purposes.


TAIWAN LINGUIST LIST 台灣語言學討論區

Moderator 管理人: 黃慧娟      email:

欲訂閱或退訂本討論區,請至清華大學校務宣導電子報網址

http://list.net.nthu.edu.tw/

欲加入台灣語言學學會(LST),請見學會網址 http://linguist.tw


-- Lau, Seng-hian

~無 koh 在定,ma 無放外外~


Re: Re: Free on-line access of spoken data

From: Lûi Bêng-Hàn
Date: 2009-04-09 07:18:20

Chhanlian, Tâi-gí--ê iáu bô. Nā bô góa siūⁿ-tio̍h 1-kóa-á chhù-bī ê lō͘-iōng. Pí-lūn-kóng, kā i sàng khì hùn-liān voice recognition ê ianjin, ē-sái jīn voice command, chiap tī kán-tan ê tian-tōng-á, chhin- chhiuⁿ ku-ōe-tô͘ hit-khoán--ê, án-ne gín-á ē sńg kah chin hoaⁿ-hí.Siūⁿ bóng siūⁿ, bô êng thang chò.Bêng-Hàn

On 4月8日, 上午2時30分, Lau Seng-hian wrote:

政治大學華語、客語、台語語料庫。

http://140.119.172.200/


TAIWAN LINGUIST LIST 台灣語言學討論區 ============================================================= 本討論區提供台灣語言學者交換意見以及張貼公告。 若有貼文請寄: 將內容載於本文,請勿用附件寄送。 **********************

Dear colleagues,

We would like to inform you about the free on-line access of Mandarin, Hakka, and Southern Min spoken data in the NCCU Corpus of Spoken Chinese. The NCCU Spoken Corpus consists of three sub-corpora:

(1) The Corpus of Spoken Mandarin

There are seventeen extracts of daily face-to-face conversations, totaling seven hours of talk. Besides the orthographic transcription, nine extracts also include phonetic transcription, English glosses, and free translation.

(2) The Corpus of Spoken Hakka

There are six extracts of daily face-to-face conversations, totaling two hours of talk.

(3) The Corpus of Spoken Southern Min [in preparation]

We have been collecting spoken data since 2006 to document the daily use of Mandarin, Hakka, and Southern Min spoken in Taiwan. The data can now be

accessed on-line via internet athttp://140.119.172.200/for research and

teaching purposes.


TAIWAN LINGUIST LIST 台灣語言學討論區 Moderator 管理人: 黃慧娟      email: 欲訂閱或退訂本討論區,請至清華大學校務宣導電子報網址http://list.net.nthu.edu.tw/

欲加入台灣語言學學會(LST),請見學會網址http://linguist.tw


-- Lau, Seng-hian

~無 koh 在定,ma 無放外外~