Abstract
Gene normalization is a process of automatically detecting gene names in the literature and linking them to database records. It is critical for improving the coverage of annotation in gene databases. Automatic association of a gene with a species, also known as species assignment, is an essential step of gene normalization. In this article, we propose a new species assignment method which explores the structure of full length article. Experimental results show our method outperforms state-of-art systems on full length article level species assignment. Thus, we believe our work can be used in the process of full length article gene normalization.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
The UniProt Consortium: UniProt: the universal protein knowledgebase. Nucleic Acids Research, 45(D1), D158–D169 (2017)
Wei, C.-H., Kao, H.-Y.: Cross-species gene normalization by species inference. BMC Bioinform. 12(Suppl 8), S5 (2011)
Gerner, M., Nenadic, G., Bergman, C.M.: LINNAEUS: a species name identification system for biomedical literature. BMC Bioinform. 11, 85 (2010). https://doi.org/10.1186/1471-2105-11-85
Krallinger, M., Leitner, F., Rodriguez-Penagos, C., Valencia, A.: Overview of the protein-protein interaction annotation extraction task of BioCreative II. Genome Biol. 9(Suppl 2), S4 (2008). https://doi.org/10.1186/gb-2008-9-s2-s4
Wei, C.-H., Kao, H.-Y., Lu, Z.: SR4GN: a species recognition software tool for gene normalization. PLoS ONE 7(6), e38460 (2012)
Wei, C.-H., Kao, H.-Y., Lu, Z.: GNormPlus: an integrative approach for tagging genes, gene families, and protein domains. Biomed. Res. Int. 2015, 918710 (2015)
Lu, Z., Kao, H.-Y., Wei, C.-H., Huang, M., Liu, J., Kuo, C.-J., Wilbur, W.J.: The gene normalization task in BioCreative III. BMC Bioinform. 12(Suppl 8), S2 (2011)
Ding, R., Arighi, C.N., Lee, J.-Y., Wu, C.H., Vijay-Shanker, K.: pGenN, a gene normalization tool for plant genes and proteins in scientific literature. PLoS ONE 10(8), e0135305 (2015)
Acknowledgements
The work was supported by Guangdong University of Foreign Studies (299-X5219112, 299-X5218168).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Ding, R., Chen, H., Liu, J., Kuang, J. (2020). Species Assignment for Gene Normalization Through Exploring the Structure of Full Length Article. In: Popescu, E., Hao, T., Hsu, TC., Xie, H., Temperini, M., Chen, W. (eds) Emerging Technologies for Education. SETE 2019. Lecture Notes in Computer Science(), vol 11984. Springer, Cham. https://doi.org/10.1007/978-3-030-38778-5_31
Download citation
DOI: https://doi.org/10.1007/978-3-030-38778-5_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-38777-8
Online ISBN: 978-3-030-38778-5
eBook Packages: Computer ScienceComputer Science (R0)