Abstract
Word sense disambiguation plays an important role in natural language processing, such as information retrieval, text summarization, machine translation etc. This paper proposes a corpus-based Chinese word sense disambiguation approach using HowNet. The method is based on the co-occurrence frequency between the relatives (such as synonym, antonymy, meronymy) of target word and each word in the context. Further, domains have been used to characterize the senses of polysemous word. To our knowledge, this is the first time a Chinese word sense disambiguation method using domain knowledge is reported. The accuracy is 73.2% at present. The experimental result shows that the method is very promising for Chinese word sense disambiguation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Dash, N.S., Chaudhuri, B.B.: Using Text Corpora for Understanding Polysemy in Bangla. In: Proceedings of the Language Engineering Conference, pp. 99–109 (2002)
Ahlswede, T., Lorand, D.: Word Sense Disambiguation by Human Subjects: Computational and Psycholinguistic Implications. In: Proceedings of the Workshop on Acquisitions of Lexical Knowledge from Text, Columbus,Ohio, pp. 1–9 (1993)
Seo, H.C., Chung, H.J., Rim, H.C., Myaeng, S.H., Kim, S.H.: Unsupervised Word Sense Disambiguation Using WordNet Relatives. Computer Speech and Language 18, 253–273 (2004)
Gliozzo, A., Strapparava, C., Dagan, I.: Unsupervised and Supervised Exploitation of Semantic Domains in Lexical Disambiguation. Computer Speech and Language 18, 275–299 (2004)
Dong, Z.D., Dong, Q.: HowNet, http://www.keenage.com
Dong, Z.D., Dong, Q.: HowNet – A Hybrid Language and Knowledge Resource. In: Proceeding of the International Conference on Natural Language Process and Knowledge Engineering, pp. 820–824 (2003)
Preiss, J.: Probabilistic Word Sense Disambiguation. Computer Speech and Language 18, 319–337 (2004)
Yarowsky, D.: Unsupervised Word-Sense Disambiguation Rival Supervised Methods. In: Proceeding of the 33rd Annual Meeting of the Association for Computational Languistics, pp. 189–196 (1995)
Zhang, Y.T., Gong, L., Wang, Y.C., Yin, Z.H.: An Effective Concept Extraction Method for Improving Text Classification Performance. Geo-spatial Information Science 6(4), 66–72 (2003)
Sebastiani, F.: Machine Learning in Automated Text Categorization. ACM Computing Surveys 34(1), 1–47 (2002)
Yi, G., Wang, X.L., Kong, X.Y., Zhao, J.: Quantifying Semantic Similarity of Chinese Words from HowNet. In: Proceedings of the First International Conference on Machine Learning and Cybernetics, pp. 234–239 (2002)
Stevenson, M., Wilks, Y.: Combining Weak Knowledge Sources for Sense Disambiguation. In: Proceeding of the International Joint Conference for Artificial Intelligence, pp. 884–889 (1999)
Yarowsky, D.: Hierarchical Decision Lists for Word Sense Disambiguation. Computers and the Humanities 34(1/2), 321–349 (2001)
Preiss, J.: Probabilistic Word Sense Disambiguation. Computer Speech and Language 18, 319–337 (2004)
Liu, Z.M., Liu, T., Zhang, G., Li, S.: Word Sense Disambiguation Based on Dependency Relationship Analysis and Bayes Model (in Chinese). High Technology Letter 5, 1–7 (2003)
Lu, S., Bai, S., Huang, X.: An Unsuptervised Approach to Word Sense Disambiguation Based on Sense-words in Vector Space Model (in Chinese). Journal of Software 13(6), 1082–1089 (2002)
Kilgarriff, A.: Gold Standard Datasets for Evaluating Word Sense Disambiguation Programs. Computer Speech and Language 12, 453–472 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, Y., Gong, L., Wang, Y. (2005). Chinese Word Sense Disambiguation Using HowNet. In: Wang, L., Chen, K., Ong, Y.S. (eds) Advances in Natural Computation. ICNC 2005. Lecture Notes in Computer Science, vol 3610. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539087_123
Download citation
DOI: https://doi.org/10.1007/11539087_123
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28323-2
Online ISBN: 978-3-540-31853-8
eBook Packages: Computer ScienceComputer Science (R0)