Abstract
Conceptual modeling has been fundamental to the management of structured data. However, its value is increasingly being recognized for knowledge management in general. In trying to develop suitable conceptual models for unstructured information, issues such as the level of representation and complexity of processing techniques arise. Here, we investigate the use of a conceptual model that is simple enough to allow efficient automatic extraction from documents. Our model focused on the problem-solution relationship that is central to the analysis of scientific papers. It also consists of supporting relationships such as benefits and drawbacks, assumptions, methods, extensions, and claims. Our study considered two kinds of documents - scientific research papers and patents. We evaluated the utility of the approach by building a prototype system and our user evaluation shows promising results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Berners-Lee, T., Hendler, J., Lassila, The Semantic Web. Scientific American (2001)
Weibel, S., Godby, J., Miller, E., R. Daniel, J.: OCLC/NCSA metadata workshop report. (1995)
MARC: US MARC Standards. (2001)
MUC-7: Proceedings of the 7th Message Understanding Conference (MUC-7). (1998)
Porter, M.: An algorithm for suffix stripping. Program 14(3) (1980) 130–137
Nielsen, J.: Usability Engineering. Academic Press, Inc. (1993)
Sparck-Jones, K., Galliers, J.: Evaluating natural language processing systems: an analysis and review. New York: Springer (1996)
Hovy, E., Marcu, D.: Automated Text Summarization: Tutorial Notes. COLING-ACL’98, University of Montreal, Montreal, Quebec, Canada. (1998)
Teufel, S., Moens, M.: Discourse-level argumentation in scientific articles: human and automatic annotation. In: ACL Workshop on Towards Standards and Tools for Discourse Tagging. (1999)
Jing, H., Barzilay, R., McKeown, K., Elhadad, M.: Summarization Evaluation Methods: Experiments and Analysis. In: AAAI Intelligent Text Summarization Workshop. (1998) 60–68
Overmyer, S., Lavoie, B. Rambow, O.: Conceptual Modeling through Linguistic Analysis Using LIDA. In: Proceedings of the 23rd International Conference on Software Engineering (ICSE 2001). Volume Toronto, Canada. (2001)
Maybury, M.: Tools for the Knowledge Analyst: An Information Superiority Visionary Demonstration. IEEE COMPSAC 98. 22nd Annual International Computer Software and Applications Conference, Vienna, Austria. (1998)
TREC-10: Proceedings of the 10th Text REtrieval Conference (TREC-10). (2001)
Embley, D., Campbell, D., Jiang, Y., Liddle, S., Lonsdale, D., Ng, Y., Smith, R.: Conceptual-Model-Based Data Extraction from Multiple-Record Web Pages. Data & Knowledge Engineering 31 (1999) 227–251
Loh, S., Wives, L., de Oliveira, J.: Concept-Based Knowledge Discovery in Texts Extracted from the Web. SIGKDD Explorations 2:1 (2000)
Aussenac-Gilles, N., Biébow, B. Szulman, S.: Corpus analysis for conceptual modelling. In: CEUR Workshop Proceedings. Volume 51. (2000)
Shum, S.B., Motta, E., Dominique, J.: ScholOnto: An Ontology-Based Digital Library Server for Research Documents and Discourse. International Journal on Digital Libraries 3(3) (2000) 237–248
Rounds, W.: The relevance of computational complexity theory to natural language processing. MIT Press (1991)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hui, B., Yu, E. (2002). Extracting Conceptual Relationships from Specialized Documents. In: Spaccapietra, S., March, S.T., Kambayashi, Y. (eds) Conceptual Modeling — ER 2002. ER 2002. Lecture Notes in Computer Science, vol 2503. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45816-6_26
Download citation
DOI: https://doi.org/10.1007/3-540-45816-6_26
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44277-6
Online ISBN: 978-3-540-45816-6
eBook Packages: Springer Book Archive