Chulalongkorn University Theses and Dissertations (Chula ETD)

พารามิเตอร์ทางเสียงสำหรับการจำแนกลักษณะการเปล่งเสียงในเสียงพูดต่อเนื่องภาษาไทย

Other Title (Parallel Title in Other Language of ETD)

Acoustic parameters for manner of articulation classification in Thai continuous speech

วิทยา โรจน์กิตติเจริญ, คณะวิศวกรรมศาสตร์

Year (A.D.)

2011

Document Type

Thesis

First Advisor

อติวงศ์ สุชาโต

Second Advisor

โปรดปราน บุณยพุกกณะ

Faculty/College

Faculty of Engineering (คณะวิศวกรรมศาสตร์)

Degree Name

วิศวกรรมศาสตรมหาบัณฑิต

Degree Level

ปริญญาโท

Degree Discipline

วิศวกรรมคอมพิวเตอร์

DOI

10.58837/CHULA.THE.2011.1592

Abstract

ในการพัฒนาระบบรู้จำเสียงแบบอื่นเช่น ระบบรู้จำเสียงแบบแลนมาร์ค จะต้องทำการหาตำแหน่งของแลนมาร์ค ของเสียงที่เราให้ความสนใจ เช่นตำแหน่งของเสียงพยัญชนะ หรือตำแหน่งของเสียงสระ เป็นต้น เพื่อใช้เป็นข้อมูลขาเข้าในการรู้จำเสียงพูด ดังนั้นเป้าหมายงานวิทยานิพนธ์นี้จึง ได้เน้นไปที่การจำแนกลักษณะการเปล่งเสียงในเสียงพูดต่อเนื่องภาษาไทย เพื่อสามารถนำไปใช้ในการพัฒนาระบบรู้จำเสียงพูดแบบแลนแลนมาร์คได้ โดยที่งานวิทยานิพนธ์นี้ได้ทำการปรับปรุงชุดพารามิเตอร์ทางเสียงสำหรับเพื่อให้เหมาะสมกับภาษาไทย ซึ่งประกอบด้วยโดยได้ปรับให้มีการใช้ 1) จุดศูนย์ถ่วงของสเปกตรัม 2) อัตราการตัดศูนย์ในช่วงเวลา 3) อัตราส่วนพลังงานในช่วงความถี่ [0-400] Hz ต่อ พลังงานในช่วงความถี่ [400-6000] Hz เพิ่มเติม จากผลการทดลองจำแนกสมบัติทางสมบัติทางสวนสัทศาสตร์ แสดงให้เห็นว่ามีความผิดพลาดในการจำแนกสมบัติทางสมบัติทางสวนสัทศาสตร์ ลดลง 28.09%, 11.0%, 2.41% สำหรับการจำแนกสมบัติทางสวนสัทศาสตร์ [คอนทินิวแอนท์], [ซิลลาบิค] และ [ไซเรนท์] ตามลำดับ เมื่อทำการเปรียบเทียบกับ ชุดพารามิเตอร์ทางเสียงที่ใช้ในการจำแนกสมบัติทางสวนสัทศาสตร์สำหรับเสียงภาษาอังกฤษ และเมื่อทำการตัดแบ่งเสียงเพื่อทำการหาตำแหน่งเสียงพยัญชนะ และ เสียงสระ พบว่าได้ความถูกต้องในการตัดแบ่ง 80.46% โดยมีความผิดพลาดในการตัดแบ่งลดลง 23.46% เมื่อเทียบกับระบบอ้างอิงที่ใช้การรู้จำเสียงพูดแบบอาศัยแบบจำลองฮิดเดนมาร์คอฟ ในการทดลองสุดท้ายพบว่าเมื่อทำการเทียบผลการรู้จำในระดับพยางค์ ในรูปแบบ พยัญชนะต้น-สระ-ตัวสะกด ระบบที่เสนอกับระบบอ้างอิงให้ความถูกต้องในระดับเดียวกัน

Other Abstract (Other language abstract of ETD)

In landmark-based speech recognition system. We need to locate the landmark of speech such a consonant landmark or a vowel landmark. For using that kind of landmark as an input data to speech recognition system. This thesis focuses on finding broad manner class of Thai speech. For developing the landmark-based speech recognition system This thesis is aimed at the improvement of the acoustic parameters for the Thai automatic speech recognition system. We proposed acoustic parameters that capture the characteristics of broad manner class of Thai speech. These acoustic parameters are: 1) spectral center of gravity 2) short time zero crossing rate to 3) the energy ratio E[0-400] to E[400-6000]. The results showed 28.09%, 11.0% and 2.41% error reductions for the continuant, the syllabic and the silence features, respectively, when compared to acoustic parameters used in English. The accuracy of 80.46% was obtained from the speech segmentation task and also introduced a 23.46% error reduction when compared to the baseline HMM-MFCC based broad class segmentation. We also found similar performance for word classification in the CVC context when compared to the baseline HMM-MFCC in word recognition tasks.

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Recommended Citation

โรจน์กิตติเจริญ, วิทยา, "พารามิเตอร์ทางเสียงสำหรับการจำแนกลักษณะการเปล่งเสียงในเสียงพูดต่อเนื่องภาษาไทย" (2011). Chulalongkorn University Theses and Dissertations (Chula ETD). 68679.
https://digital.car.chula.ac.th/chulaetd/68679

Link to Full Text

COinS

Chulalongkorn University Theses and Dissertations (Chula ETD)

พารามิเตอร์ทางเสียงสำหรับการจำแนกลักษณะการเปล่งเสียงในเสียงพูดต่อเนื่องภาษาไทย

Other Title (Parallel Title in Other Language of ETD)

Year (A.D.)

Document Type

First Advisor

Second Advisor

Faculty/College

Degree Name

Degree Level

Degree Discipline

DOI

Abstract

Other Abstract (Other language abstract of ETD)

Creative Commons License

Recommended Citation

Search

Browse

Author Corner

Chulalongkorn University Theses and Dissertations (Chula ETD)

พารามิเตอร์ทางเสียงสำหรับการจำแนกลักษณะการเปล่งเสียงในเสียงพูดต่อเนื่องภาษาไทย

Other Title (Parallel Title in Other Language of ETD)

Author

Year (A.D.)

Document Type

First Advisor

Second Advisor

Faculty/College

Degree Name

Degree Level

Degree Discipline

DOI

Abstract

Other Abstract (Other language abstract of ETD)

Creative Commons License

Recommended Citation

Share

Search

Browse

Author Corner