The encoding of vowel features in Mel-Frequency Cepstral Coefficients
DOI:
https://doi.org/10.17469/O2104AISV000001Keywords:
Mel-Frequency Cepstral Coefficients, vowel featuresAbstract
Most work on acoustic phonetics uses formant frequencies as the parameterization of the phonetic signal for understanding the acoustic difference between the sounds of the world’s languages. Work in speech technology, however, has relied for several decades on Linear Prediction Coefficients (LPC) and Mel-Frequency Cepstral Coefficients (MFCC’s), due to their greater invariance to physical differences between speakers. This paper explores the phonetics of the MFCC’s, asking whether these coefficients can be used by phoneticians to develop a greater understanding of the phonetic nature of speech segments. This is done through an analysis of the ability of individual coefficients to distinguish between American English vowels in the Hillenbrand database.
Downloads
Published
Issue
Section
License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.