Scholars@Duke publication: The multi timescale phoneme acquisition model of the self-organizing based on the dynamic features

The multi timescale phoneme acquisition model of the self-organizing based on the dynamic features

Publication , Journal Article

Kouki, M; Hideaki, M; Hideaki, K; Reiko, M

Published in: Proceedings of the Annual Conference of the International Speech Communication Association Interspeech

December 1, 2011

It is unclear as to how infants learn the acoustic expression of each phoneme of their native languages. In recent studies, researchers have inspected phoneme acquisition by using a computational model. However, these studies have used a limited vocabulary as input and do not handle a continuous speech that is almost comparable to a natural environment. Therefore, we use a natural continuous speech and build a self-organization model that simulates the cognitive ability of the humans, and we analyze the quality and quantity of the speech information that is necessary for the acquisition of the native phoneme system. Our model is designed to learn values of the acoustic features of a continuous speech and to estimate the number and boundaries of the phoneme categories without using explicit instructions. In a recent study, our model could acquire the detailed vowels of the input language. In this study, we examined the mechanism necessary for an infant to acquire all the phonemes of a language, including consonants. In natural speech, vowels have a stationary feature; hence, our recent model is suitable for learning them. However, learning consonants through the past model is difficult because most consonants have more dynamic features than vowels. To solve this problem, we designed a method to separate "stable" and "dynamic" speech patterns using a feature-extraction method based on the auditory expressions used by human beings. Using this method, we showed that the acquisition of an unstable phoneme was possible without the use of instructions. Copyright © 2011 ISCA.

Duke Scholars

Author Reiko Mazuka Psychology & Neuroscience

Published In

Proceedings of the Annual Conference of the International Speech Communication Association Interspeech

EISSN

1990-9772

Publication Date

December 1, 2011

Start / End Page

749 / 752

Citation

APA

Chicago

ICMJE

MLA

NLM

Kouki, M., Hideaki, M., Hideaki, K., & Reiko, M. (2011). The multi timescale phoneme acquisition model of the self-organizing based on the dynamic features. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, 749–752.

Kouki, M., M. Hideaki, K. Hideaki, and M. Reiko. “The multi timescale phoneme acquisition model of the self-organizing based on the dynamic features.” Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, December 1, 2011, 749–52.

Kouki M, Hideaki M, Hideaki K, Reiko M. The multi timescale phoneme acquisition model of the self-organizing based on the dynamic features. Proceedings of the Annual Conference of the International Speech Communication Association Interspeech. 2011 Dec 1;749–52.

Kouki, M., et al. “The multi timescale phoneme acquisition model of the self-organizing based on the dynamic features.” Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, Dec. 2011, pp. 749–52.

Published In

Proceedings of the Annual Conference of the International Speech Communication Association Interspeech

EISSN

1990-9772

Publication Date

December 1, 2011

Start / End Page

749 / 752