Abstract
The method of speaker normalization has been known as the successful method for improving the speech recognition at speaker independent speech recognition system. This paper propose a new power spectrum warping approach to making improvement of speaker normalization better than a frequency warping. The power spectrum warping uses Mel-frequency cepstral of Mel filter bank in MFCC. Also, this paper proposes the hybrid VTN combined the power spectrum warping and a frequency warping. Experiment of this paper did a comparative analysis about the recognition performance of the SKKU PBW DB applied each the power spectrum is 3.06%, and hybrid VTN is 4.07% word error rate reduction as word recognition performance of baseline system.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lee, L., Rose, R.: A Frequency Warping Approach to Speaker Normalization. IEEE Transactions on Speech and Audio Processing 6(1) (January 1998)
Welling, L., Ney, H., Kanthak, S.: Speaker Adaptive Modeling by Vocal Tract Normalization. IEEE Transaction on Speech and Audio Processing 10(6) (September 2002)
Andreou, A., Kam, T., Cohen, J.: Experiments in Vocal Tract Normalization. In: Proc. CAIP Workshop: Frontiers in Speech Recognition II (1994)
Seltzer, M.: SPHINX III Signal Processing Front End Specification, CMU Speech Group (August 1999)
Linde, Y., Duzo, A., Gray, R.M.: An Algorithm for Vector Quantizer Design. IEEE Transaction on COM. 28 (January 1980)
Youn, J.S., Chung, K.W., Hong, K.S.: A Continuous Digit Speech Recognition Applied Vowel Sequence and VCCV Unit HMM. In: Proceeding of the Acoustical Society of Korea, vol. 20(2) (2001)
Rossing, T.D., Wheeler, P., Moore, F.R.: The Science of Sound. Addition Wesley. Addison Wesley, London (2002)
Roth, R., et al.: Dragon systems 1994 Large Vocabulary Continuous Speech Recognizer. In: Proc. Spoken Language Systems Technology Workshop (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Roh, YW., Kim, JH., Kim, DJ., Hong, KS. (2006). A Hybrid Warping Method Approach to Speaker Warping Adaptation. In: Bloch, I., Petrosino, A., Tettamanzi, A.G.B. (eds) Fuzzy Logic and Applications. WILF 2005. Lecture Notes in Computer Science(), vol 3849. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11676935_18
Download citation
DOI: https://doi.org/10.1007/11676935_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32529-1
Online ISBN: 978-3-540-32530-7
eBook Packages: Computer ScienceComputer Science (R0)