Abstract
In designing classifiers for automatic speech recognitions, one of the problems the user faces is to cope with an unwanted variability in the environment such as changes in the speaker or the acoustics. To overcome this problem, various adaptation schemes have been proposed in the literature. In this short paper, rather than selecting a single acoustic model as being representative of a category, we adaptively find the optimal or near-optimal number of hidden Markov models during the Baum-Welch (BW) learning process through splitting and merging operations. This scheme is based on incorporating the split-merge operations into the HMM parameter re-estimation process of the BW algorithm. In the splitting phase, an acoustic model is divided into two sub-models based on a suitable criterion. On the other hand, in the merging phase, two models are combined into a single one. The experimental results demonstrate that the proposed mechanism can efficiently resolve the problem by adjusting the number of acoustic models while increasing the classification accuracy. The results also demonstrate that the advantage gained in the case of multi-modally distributed data sets is significant.
This work was generously supported by the Korea Research Foundation Grant funded by the Korea Government (MOEHRD-KRF-2005-042-D00265).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Anastasakos, T., McDonough, J., Schwartz, R., Kakhoul, J.: A compact model for speaker-adaptive training. In: Proceedings of ICSLP, pp. 1137–1140 (1996)
Bahl, L.R., Jelinek, F., Mercer, R.L.: A maximum likelihood approach to continues speech recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence PAMI-5, 179–190 (1983)
Bahl, L.R., Brown, P.F., de SouzaBrown, P.V., Mercer, R.L.: A new algorithm for estimation of hidden Markov model parameters. In: Proceedings of ICASSP 1988, New York, pp. 493–496 (April 1988)
Ben-Yishai, A., Burshtein, D.: A discriminative training algorithm for hidden Markov models. IEEE Transactions on Speech and Audio Processing 12(3), 204–217 (2004)
Gales, M.J.F.: Cluster adaptive training for hidden Markov models. IEEE Transactions on Speech and Audio Processing 8(4), 417–428 (2000)
Lee, K.F.: Automatic Speech Recognition - The development of the SPHINX System. Kluwer Academic Publishers, Boston (1989)
Nakagawa, S.: A survey on automatic speech recognition. IEICE Transactions on Information and Systems E85-D(3), 465–486 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, SW., Oh, SH. (2006). On Adaptively Learning HMM-Based Classifiers Using Split-Merge Operations. In: Ali, M., Dapoigny, R. (eds) Advances in Applied Artificial Intelligence. IEA/AIE 2006. Lecture Notes in Computer Science(), vol 4031. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11779568_72
Download citation
DOI: https://doi.org/10.1007/11779568_72
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-35453-6
Online ISBN: 978-3-540-35454-3
eBook Packages: Computer ScienceComputer Science (R0)