Link to original content: https://api.crossref.org/works/10.1145/3596603
{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,10,20]],"date-time":"2024-10-20T10:10:01Z","timestamp":1729419001844,"version":"3.27.0"},"reference-count":78,"publisher":"Association for Computing Machinery (ACM)","issue":"9","funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["U20A20229, 61922073, and 62106244"],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"name":"The provincial projects on quality engineering for colleges and universities in Anhui Province","award":["2020zdxsjg400"]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2023,11,30]]},"abstract":"In recent years, due to the explosive growth of patent applications, patent mining has drawn extensive attention and interest. An important issue of patent mining is that of recognizing the technologies contained in patents, which serves as a fundamental preparation for deeper analysis. To this end, in this article, we make a focused study on constructing a technology portrait for each patent, i.e., to recognize technical phrases concerned in it, which can summarize and represent patents from a technical perspective. Along this line, a critical challenge is how to analyze the unique characteristics of technical phrases and illustrate them with definite descriptions. Therefore, we first generate the detailed descriptions about the technical phrases existing in extensive patents based on different criteria, including various previous works, practical experience, and statistical analyses. 
Then, considering the unique characteristics of technical phrases and the complex structure of patent documents, such as multi-aspect semantics and multi-level relevances, we further propose a novel unsupervised model, namely TechPat, which can not only automatically recognize technical phrases from massive patents but also avoid the need for expensive human labeling. After that, we evaluate the extraction results from various aspects. Specifically, we propose a novel evaluation metric called Information Retrieval Efficiency (IRE) to quantify the performance of extracted technical phrases from a new perspective. Extensive experiments on real-world patent data demonstrate that the TechPat model can effectively discriminate technical phrases in patents and greatly outperform existing methods. We further apply extracted technical phrases to two practical application tasks, namely patent search and patent classification, where the experimental results confirm the wide application prospects of technical phrases. 
Finally, we discuss the generalization ability of our proposed methods.","DOI":"10.1145\/3596603","type":"journal-article","created":{"date-parts":[[2023,5,13]],"date-time":"2023-05-13T11:14:21Z","timestamp":1683976461000},"page":"1-31","update-policy":"http:\/\/dx.doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["TechPat: Technical Phrase Extraction for Patent Mining"],"prefix":"10.1145","volume":"17","author":[{"ORCID":"http:\/\/orcid.org\/0000-0003-3436-7620","authenticated-orcid":false,"given":"Ye","family":"Liu","sequence":"first","affiliation":[{"name":"University of Science and Technology of China, China and State Key Laboratory of Cognitive Intelligence, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0002-5599-0625","authenticated-orcid":false,"given":"Han","family":"Wu","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, China and State Key Laboratory of Cognitive Intelligence, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0003-1661-0420","authenticated-orcid":false,"given":"Zhenya","family":"Huang","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, China and State Key Laboratory of Cognitive Intelligence, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0001-9921-2078","authenticated-orcid":false,"given":"Hao","family":"Wang","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, China and State Key Laboratory of Cognitive Intelligence, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0001-5713-3531","authenticated-orcid":false,"given":"Yuting","family":"Ning","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, China and State Key Laboratory of Cognitive Intelligence, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0001-5322-0638","authenticated-orcid":false,"given":"Jianhui","family":"Ma","sequence":"additional","affiliation":[{"name":"University of 
Science and Technology of China, China and State Key Laboratory of Cognitive Intelligence, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0001-6956-5550","authenticated-orcid":false,"given":"Qi","family":"Liu","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, China and State Key Laboratory of Cognitive Intelligence, China"}]},{"ORCID":"http:\/\/orcid.org\/0000-0002-4835-4102","authenticated-orcid":false,"given":"Enhong","family":"Chen","sequence":"additional","affiliation":[{"name":"University of Science and Technology of China, China and State Key Laboratory of Cognitive Intelligence, China"}]}],"member":"320","published-online":{"date-parts":[[2023,6,15]]},"reference":[{"key":"e_1_3_2_2_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.111"},{"key":"e_1_3_2_3_2","doi-asserted-by":"publisher","DOI":"10.1109\/WI-IAT.2012.82"},{"volume-title":"Natural Language Processing with Python: Analyzing Text With the Natural Language Toolkit","year":"2009","author":"Bird Steven","key":"e_1_3_2_4_2","unstructured":"Steven Bird, Ewan Klein, and Edward Loper. 2009. Natural Language Processing with Python: Analyzing Text With the Natural Language Toolkit. \u201cO\u2019Reilly Media, Inc.\u201d."},{"key":"e_1_3_2_5_2","doi-asserted-by":"crossref","unstructured":"Saroj Kr Biswas Monali Bordoloi and Jacob Shreya. 2018. A graph based keyword extraction model using collective node weight. Expert Systems with Applications 97 (2018) 51\u201359.","DOI":"10.1016\/j.eswa.2017.12.025"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/N18-2105"},{"key":"e_1_3_2_7_2","doi-asserted-by":"crossref","unstructured":"Sergey Brin and Lawrence Page. 1998. The anatomy of a large-scale hypertextual web search engine. 
Computer Networks and ISDN Systems 30 1\u20137 (1998) 107\u2013117.","DOI":"10.1016\/S0169-7552(98)00110-X"},{"key":"e_1_3_2_8_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v33i01.33016268"},{"key":"e_1_3_2_9_2","doi-asserted-by":"publisher","DOI":"10.1145\/2506182.2506198"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1145\/963770.963776"},{"key":"e_1_3_2_11_2","unstructured":"Jacob Devlin Ming-Wei Chang Kenton Lee and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies Vol. 1 4171\u20134186."},{"key":"e_1_3_2_12_2","first-page":"2733","volume-title":"IJCAI","author":"Downey Doug","year":"2007","unstructured":"Doug Downey, Matthew Broadhead, and Oren Etzioni. 2007. Locating complex named entities in web text. In IJCAI, Vol. 7. 2733\u20132739."},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.24963\/ijcai.2021\/200"},{"volume-title":"NTCIR","year":"2007","author":"Fujii Atsushi","key":"e_1_3_2_14_2","unstructured":"Atsushi Fujii, Makoto Iwayama, and Noriko Kando. 2007. Overview of the patent retrieval task at the NTCIR-6 workshop. In NTCIR."},{"key":"e_1_3_2_15_2","unstructured":"Suyu Ge Fangzhao Wu Chuhan Wu Tao Qi Yongfeng Huang and Xing Xie. 2020. Fedner: Privacy-preserving medical named entity recognition with federated learning. arXiv:2003.09288. Retrieved from https:\/\/arxiv.org\/abs\/2003.09288."},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-12511-4_11"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-1119"},{"key":"e_1_3_2_18_2","unstructured":"Matthew Honnibal and Ines Montani. 2017. spaCy 2: Natural language understanding with Bloom embeddings convolutional neural networks and incremental parsing. 
To appear 7 1 (2017) 411\u2013420."},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.3390\/e20020104"},{"issue":"4","key":"e_1_3_2_20_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/2651444","article-title":"Recognition of patient-related named entities in noisy tele-health texts","volume":"6","author":"Kim Mi-Young","year":"2015","unstructured":"Mi-Young Kim, Ying Xu, Osmar R. Zaiane, and Randy Goebel. 2015. Recognition of patient-related named entities in noisy tele-health texts. ACM Transactions on Intelligent Systems and Technology (TIST) 6, 4 (2015), 1\u201323.","journal-title":"ACM Transactions on Intelligent Systems and Technology (TIST)"},{"key":"e_1_3_2_21_2","doi-asserted-by":"crossref","unstructured":"Guillaume Lample Miguel Ballesteros Sandeep Subramanian Kazuya Kawakami and Chris Dyer. 2016. Neural Architectures for Named Entity Recognition. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies . 260\u2013270.","DOI":"10.18653\/v1\/N16-1030"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2018.00042"},{"issue":"5","key":"e_1_3_2_23_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3201407","article-title":"Employing semantic context for sparse information extraction assessment","volume":"12","author":"Li Peipei","year":"2018","unstructured":"Peipei Li, Haixun Wang, Hongsong Li, and Xindong Wu. 2018. Employing semantic context for sparse information extraction assessment. ACM Transactions on Knowledge Discovery from Data (TKDD) 12, 5 (2018), 1\u201336.","journal-title":"ACM Transactions on Knowledge Discovery from Data (TKDD)"},{"key":"e_1_3_2_24_2","doi-asserted-by":"crossref","unstructured":"Tuohang Li Liang Hu Hongtu Li Chengyu Sun Shuai Li and Ling Chi. 2021. TripleRank: An unsupervised keyphrase extraction algorithm. 
Knowledge-Based Systems 219 (2021) 106846.","DOI":"10.1016\/j.knosys.2021.106846"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.emnlp-main.14"},{"key":"e_1_3_2_26_2","doi-asserted-by":"crossref","unstructured":"Bill Yuchen Lin Dong-Ho Lee Ming Shen Ryan Moreno Xiao Huang Prashant Shiralkar and Xiang Ren. 2020. TriggerNER: Learning with Entity Triggers as Explanations for Named Entity Recognition. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics . 8503\u20138511.","DOI":"10.18653\/v1\/2020.acl-main.752"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1145\/2723372.2751523"},{"key":"e_1_3_2_28_2","first-page":"5052","volume-title":"IJCAI","author":"Liu Qi","year":"2018","unstructured":"Qi Liu, Han Wu, Yuyang Ye, Hongke Zhao, Chuanren Liu, and Dongfang Du. 2018. Patent litigation prediction: A convolutional tensor factorization approach. In IJCAI. 5052\u20135059."},{"issue":"3","key":"e_1_3_2_29_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3046941","article-title":"An influence propagation view of pagerank","volume":"11","author":"Liu Qi","year":"2017","unstructured":"Qi Liu, Biao Xiang, Nicholas Jing Yuan, Enhong Chen, Hui Xiong, Yi Zheng, and Yu Yang. 2017. An influence propagation view of pagerank. 
ACM Transactions on Knowledge Discovery from Data (TKDD) 11, 3 (2017), 1\u201330.","journal-title":"ACM Transactions on Knowledge Discovery from Data (TKDD)"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-32049-6_24"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM50108.2020.00139"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.5555\/1699510.1699544"},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/D15-1104"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P16-1101"},{"key":"e_1_3_2_35_2","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P14-5010"},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2017.12"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.21105\/joss.00205"},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.naacl-main.396"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/P17-1054"},{"key":"e_1_3_2_40_2","first-page":"404","volume-title":"Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing","author":"Mihalcea Rada","year":"2004","unstructured":"Rada Mihalcea and Paul Tarau. 2004. Textrank: Bringing order into text. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing. 404\u2013411."},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2015.01.050"},{"key":"e_1_3_2_42_2","first-page":"875","volume-title":"Proceedings of the 8th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)","author":"Pan Liangming","year":"2017","unstructured":"Liangming Pan, Xiaochen Wang, Chengjiang Li, Juanzi Li, and Jie Tang. 2017. Course concept extraction in moocs via embedding-based graph propagation. In Proceedings of the 8th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 
875\u2013884."},{"key":"e_1_3_2_43_2","first-page":"311","volume-title":"Proceedings of the 40th Annual Meeting on Association for Computational Linguistics","author":"Papineni Kishore","year":"2002","unstructured":"Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics. Association for Computational Linguistics, 311\u2013318."},{"key":"e_1_3_2_44_2","doi-asserted-by":"crossref","unstructured":"Youngjin Park and Janghyeok Yoon. 2017. Application technology opportunity discovery from technology portfolios: Use of patent classification and collaborative filtering. Technological Forecasting and Social Change 118 (2017) 170\u2013183.","DOI":"10.1016\/j.techfore.2017.02.018"},{"key":"e_1_3_2_45_2","unstructured":"Fabian Pedregosa Ga\u00ebl Varoquaux Alexandre Gramfort Vincent Michel Bertrand Thirion Olivier Grisel Mathieu Blondel Peter Prettenhofer Ron Weiss Vincent Dubourg and others. 2011. Scikit-learn: Machine learning in Python. The Journal of Machine Learning Research 12 (2011) 2825\u20132830."},{"key":"e_1_3_2_46_2","doi-asserted-by":"crossref","unstructured":"Qi Peng Changmeng Zheng Yi Cai Tao Wang Haoran Xie and Qing Li. 2021. Unsupervised cross-domain named entity recognition using entity-aware adversarial training. Neural Networks 138 (2021) 68\u201377.","DOI":"10.1016\/j.neunet.2020.12.027"},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1007\/s12559-019-09706-3"},{"key":"e_1_3_2_48_2","first-page":"1","article-title":"Automatic keyword extraction from individual documents","volume":"1","author":"Rose Stuart","year":"2010","unstructured":"Stuart Rose, Dave Engel, Nick Cramer, and Wendy Cowley. 2010. Automatic keyword extraction from individual documents. 
Text Mining: Applications and Theory 1 (2010), 1\u201320.","journal-title":"Text Mining: Applications and Theory"},{"key":"e_1_3_2_49_2","doi-asserted-by":"crossref","unstructured":"Walid Shalaby and Wlodek Zadrozny. 2019. Patent retrieval: a literature review. Knowledge and Information Systems 61 (2019) 631\u2013660.","DOI":"10.1007\/s10115-018-1322-7"},{"key":"e_1_3_2_50_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2018.2812203"},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v36i10.21381"},{"key":"e_1_3_2_52_2","doi-asserted-by":"publisher","DOI":"10.1145\/3477539"},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.1145\/2487575.2487659"},{"key":"e_1_3_2_54_2","doi-asserted-by":"publisher","DOI":"10.1145\/2339530.2339741"},{"key":"e_1_3_2_55_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v34i05.6435"},{"key":"e_1_3_2_56_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.ipm.2006.11.011"},{"key":"e_1_3_2_57_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICALT.2009.215"},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-16-6372-7_19"},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2018.00075"},{"issue":"20","key":"e_1_3_2_60_2","first-page":"1","article-title":"The improvements of text rank for domain-specific key phrase extraction","volume":"17","author":"Wang Zhijuan","year":"2016","unstructured":"Zhijuan Wang, Yinghui Feng, and Fuxian Li. 2016. The improvements of text rank for domain-specific key phrase extraction. International Journal of Simulation Systems, Science & Technology 17, 20 (2016), 1\u201311.","journal-title":"International Journal of Simulation Systems, Science & Technology"},{"volume-title":"Proceedings of the 23rd International Joint Conference on Artificial Intelligence","year":"2013","author":"Wang Zhichun","key":"e_1_3_2_61_2","unstructured":"Zhichun Wang, Juanzi Li, and Jie Tang. 2013. 
Boosting cross-lingual knowledge linking via concept annotation. In Proceedings of the 23rd International Joint Conference on Artificial Intelligence."},{"key":"e_1_3_2_62_2","doi-asserted-by":"publisher","DOI":"10.1109\/WAINA.2015.37"},{"key":"e_1_3_2_63_2","doi-asserted-by":"publisher","DOI":"10.1145\/3414901"},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","DOI":"10.1145\/3308558.3313743"},{"key":"e_1_3_2_65_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2019.00180"},{"key":"e_1_3_2_66_2","unstructured":"Han Xiao. 2018. bert-as-service. Retrieved from https:\/\/github.com\/hanxiao\/bert-as-service."},{"key":"e_1_3_2_67_2","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v35i16.17669"},{"key":"e_1_3_2_68_2","doi-asserted-by":"publisher","DOI":"10.1145\/1645953.1646295"},{"key":"e_1_3_2_69_2","doi-asserted-by":"publisher","DOI":"10.1145\/1571941.1572139"},{"key":"e_1_3_2_70_2","doi-asserted-by":"publisher","DOI":"10.1093\/jamia\/ocaa189"},{"key":"e_1_3_2_71_2","doi-asserted-by":"publisher","DOI":"10.1145\/3051127"},{"key":"e_1_3_2_72_2","unstructured":"Jifan Yu Chenyu Wang Gan Luo Lei Hou Juanzi Li Jie Tang and Zhiyuan Liu. 2019. Course concept expansion in moocs with external knowledge and interactive game. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics . 4292\u20134302."},{"key":"e_1_3_2_73_2","doi-asserted-by":"publisher","DOI":"10.1145\/564376.564398"},{"key":"e_1_3_2_74_2","doi-asserted-by":"publisher","DOI":"10.1145\/2783702.2783704"},{"key":"e_1_3_2_75_2","doi-asserted-by":"publisher","DOI":"10.1145\/2600428.2609518"},{"key":"e_1_3_2_76_2","doi-asserted-by":"publisher","DOI":"10.1145\/3201408"},{"key":"e_1_3_2_77_2","doi-asserted-by":"crossref","unstructured":"Feng Zhao Xianyu Gui Yafan Huang Hai Jin and Laurence T. Yang. 2020. Dynamic entity-based named entity recognition under unconstrained tagging schemes. 
IEEE Transactions on Big Data 8 4 (2020) 1059\u20131072.","DOI":"10.1109\/TBDATA.2020.2998770"},{"key":"e_1_3_2_78_2","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/2021.acl-long.485"},{"key":"e_1_3_2_79_2","unstructured":"Decong Li Sujian Li Wenjie Li Wei Wang and Weiguang Qu. 2010. A semi-supervised key phrase extraction approach: learning from title phrases through a document semantic network. In Proceedings of the ACL 2010 conference short papers . 296\u2013300."}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3596603","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,20]],"date-time":"2024-10-20T09:40:42Z","timestamp":1729417242000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3596603"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,15]]},"references-count":78,"journal-issue":{"issue":"9","published-print":{"date-parts":[[2023,11,30]]}},"alternative-id":["10.1145\/3596603"],"URL":"https:\/\/doi.org\/10.1145\/3596603","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"type":"print","value":"1556-4681"},{"type":"electronic","value":"1556-472X"}],"subject":[],"published":{"date-parts":[[2023,6,15]]},"assertion":[{"value":"2022-01-12","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-05-03","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-06-15","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}
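
The record above follows the Crossref work-message layout: a top-level `message` object holding `title`, `author`, `issued`, `reference`, and related fields. As a minimal sketch of how such a record might be read, the Python below parses a trimmed copy of it and pulls out a few citation fields. The helper names `issued_date` and `summarize` and the `SAMPLE` constant are hypothetical, introduced only for illustration; the sample keeps two of the eight authors and two of the 78 references to stay short.

```python
import json
from datetime import date

# Trimmed copy of the record above (hypothetical constant name). Only fields
# that appear in the source record are used; author and reference lists are cut.
SAMPLE = json.loads(r'''
{"status": "ok", "message-type": "work", "message": {
  "DOI": "10.1145\/3596603",
  "title": ["TechPat: Technical Phrase Extraction for Patent Mining"],
  "container-title": ["ACM Transactions on Knowledge Discovery from Data"],
  "volume": "17", "issue": "9", "page": "1-31",
  "issued": {"date-parts": [[2023, 6, 15]]},
  "author": [
    {"given": "Ye", "family": "Liu", "sequence": "first"},
    {"given": "Enhong", "family": "Chen", "sequence": "additional"}
  ],
  "reference": [
    {"key": "e_1_3_2_2_2", "DOI": "10.18653\/v1\/2021.acl-long.111"},
    {"key": "e_1_3_2_66_2", "unstructured": "Han Xiao. 2018. bert-as-service."}
  ]
}}
''')


def issued_date(message):
    """Convert the first `date-parts` entry to a date.

    `date-parts` may hold only a year, or a year and month, so missing
    month/day values are padded with 1 (an assumption of this sketch).
    """
    parts = message["issued"]["date-parts"][0]
    year, month, day = (list(parts) + [1, 1])[:3]
    return date(year, month, day)


def summarize(message):
    """Collect a few citation fields from a work message into a flat dict."""
    authors = [
        f'{a.get("given", "")} {a.get("family", "")}'.strip()
        for a in message.get("author", [])
    ]
    references = message.get("reference", [])
    return {
        "doi": message["DOI"],
        # `title` and `container-title` are lists in the record; take the first.
        "title": message["title"][0],
        "journal": message["container-title"][0],
        "authors": authors,
        "issued": issued_date(message).isoformat(),
        # Some references carry a DOI, others only an unstructured string.
        "references_with_doi": sum(1 for r in references if "DOI" in r),
    }


if __name__ == "__main__":
    print(json.dumps(summarize(SAMPLE["message"]), indent=2))
```

Note that `json.loads` decodes the escaped `\/` sequences to plain `/`, so the DOI comes back as `10.1145/3596603`. Reading the full record would work the same way once the hard line wraps inside string values are joined back into one line.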