iBet uBet web content aggregator. Adding the entire web to your favor.
iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: https://api.crossref.org/works/10.1093/BIOINFORMATICS/BTM495
{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2023,5,14]],"date-time":"2023-05-14T13:10:13Z","timestamp":1684069813816},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"23","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,12,1]]},"abstract":"Abstract<\/jats:title>Motivation: The rate at which gene-related findings appear in the scientific literature makes it difficult if not impossible for biomedical scientists to keep fully informed and up to date. The importance of these findings argues for the development of automated methods that can find, extract and summarize this information. This article reports on methods for determining the molecular function claims that are being made in a scientific article, specifically those that are backed by experimental evidence.<\/jats:p>Results: The most significant result is that for molecular function claims based on direct assays, our methods achieved recall of 70.7% and precision of 65.7%. Furthermore, our methods correctly identified in the text 44.6% of the specific molecular function claims backed up by direct assays, but with a precision of only 0.92%, a disappointing outcome that led to an examination of the different kinds of errors. These results were based on an analysis of 1823 articles from the literature of Saccharomyces cerevisiae (budding yeast).<\/jats:p>Availability: The annotation files for S.cerevisiae are available from ftp:\/\/genome-ftp.stanford.edu\/pub\/yeast\/data_download\/literature_curation\/gene_association.sgd.gz. The draft protocol vocabulary is available by request from the first author.<\/jats:p>Contact: \u00a0crangle@converspeech.com<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm495","type":"journal-article","created":{"date-parts":[[2007,10,18]],"date-time":"2007-10-18T00:44:48Z","timestamp":1192668288000},"page":"3232-3240","source":"Crossref","is-referenced-by-count":9,"title":["Mining experimental evidence of molecular function claims from the literature"],"prefix":"10.1093","volume":"23","author":[{"given":"Colleen E.","family":"Crangle","sequence":"first","affiliation":[{"name":"1 Converspeech LLC, 60 Kirby Place, Palo Alto, CA 94301 and 2Department of Genomics, Stanford University, Stanford, CA 94025, USA"}]},{"given":"J. Michael","family":"Cherry","sequence":"additional","affiliation":[{"name":"1 Converspeech LLC, 60 Kirby Place, Palo Alto, CA 94301 and 2Department of Genomics, Stanford University, Stanford, CA 94025, USA"}]},{"given":"Eurie L.","family":"Hong","sequence":"additional","affiliation":[{"name":"1 Converspeech LLC, 60 Kirby Place, Palo Alto, CA 94301 and 2Department of Genomics, Stanford University, Stanford, CA 94025, USA"}]},{"given":"Alex","family":"Zbyslaw","sequence":"additional","affiliation":[{"name":"1 Converspeech LLC, 60 Kirby Place, Palo Alto, CA 94301 and 2Department of Genomics, Stanford University, Stanford, CA 94025, USA"}]}],"member":"286","published-online":{"date-parts":[[2007,10,17]]},"reference":[{"key":"2023041107551270500_","volume-title":"Current Protocols in Molecular Biology","author":"Ausubel","year":"2007"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"304","DOI":"10.1093\/nar\/28.1.304","article-title":"The ENZYME database in 2000","volume":"28","author":"Bairoch","year":"2000","journal-title":"Nucleic Acids Res"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"511","DOI":"10.1093\/nar\/gkl972","article-title":"BRENDA, AMENDA and FRENDA: the enzyme information system in 2007","volume":"35","author":"Barthelmes","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"S16","DOI":"10.1186\/1471-2105-6-S1-S16","article-title":"Evaluation of BioCreAtIvE assessment of task 2","volume":"6","author":"Blaschke","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"S1.7","DOI":"10.1186\/1471-2105-6-S1-S17","article-title":"An evaluation of GO annotation retrieval for BioCreAtIvE and GOA","volume":"6","author":"Camon","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"2024","DOI":"10.1038\/sj.emboj.7600684","article-title":"RMI1\/NCE4, a suppressor of genome instability, encodes a member of the RecQ helicase\/Topo III complex","volume":"24","author":"Chang","year":"2005","journal-title":"EMBO J"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"1417","DOI":"10.1093\/bioinformatics\/btg160","article-title":"MeKE: discovering the functions of gene products from biomedical literature via sentence alignment","volume":"19","author":"Chiang","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041107551270500_","first-page":"2004","article-title":"Extracting Functional Annotations of Proteins Based on Hybrid Text Mining Approaches","author":"Chiang","year":"2004","journal-title":"In Proceedings of the BioCreAtIvE Challenge Evaluation Workshop"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"S21","DOI":"10.1186\/1471-2105-6-S1-S21","article-title":"Finding genomic ontology terms in text using evidence content","volume":"6","author":"Couto","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","DOI":"10.1007\/3-540-46019-5_24","article-title":"Text summarization in data mining","volume-title":"Soft-Ware 2002, LNCS 2311","author":"Crangle","year":"2002"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","article-title":"Identifying gene ontology concepts in natural-language text","author":"Crangle","year":"2004","DOI":"10.1109\/IEMBS.2004.1403805"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"S23","DOI":"10.1186\/1471-2105-6-S1-S23","article-title":"Data-poor categorization and passage retrieval for gene ontology annotation in Swiss-Prot","volume":"6","author":"Ehrler","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023041107551270500_","first-page":"25","article-title":"The Gene Ontology Consortium","volume":"25","author":"Gene Ontology: tool for the unification of biology","year":"2000","journal-title":"Nat. Genet"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"S1","DOI":"10.1186\/1471-2105-6-S1-S1","article-title":"Overview of BioCreAtIvE: critical assessment of information extraction for biology","volume":"6","author":"Hirschman","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"2759","DOI":"10.1093\/bioinformatics\/bti390","article-title":"Literature mining and database annotation of protein phosphorylation using a rule-based system","volume":"21","author":"Hu","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"329","DOI":"10.1016\/S0076-6879(02)50972-1","article-title":"Saccharomyces Genome Database","volume":"350","author":"Issel-Tarver","year":"2002","journal-title":"Meth. Enzymol"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"551","DOI":"10.1142\/S0219720004000739","article-title":"BioIE: retargetable information extraction and ontological annotation of biological interactions from literature","volume":"2","author":"Kim","year":"2004","journal-title":"J. Bioinform. Comput. Biol"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"1227","DOI":"10.1093\/bioinformatics\/bti084","article-title":"Automatic extraction of gene\/protein biological functions from biomedical text","volume":"21","author":"Koike","year":"2005","journal-title":"Bioinformatics"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"3245","DOI":"10.1128\/MCB.19.5.3237","article-title":"Glycogen synthase phosphatase interacts with heat shock factor to activate CUP1 gene transcription in Saccharomyces cerevisiae","volume":"19","author":"Lin","year":"1999","journal-title":"Mol. Cell. Biol"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"235","DOI":"10.1093\/ijl\/3.4.235","article-title":"Introduction to WordNet: an on-line lexical database","volume":"3","author":"Miller","year":"1990","journal-title":"Int. J. Lexicogr"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"3089","DOI":"10.1093\/bioinformatics\/btl534","article-title":"Building an abbreviation dictionary using a term recognition approach","volume":"22","author":"Okazaki","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"2084","DOI":"10.1093\/bioinformatics\/bth207","article-title":"Gene annotation from scientific literature using mappings between keyword systems","volume":"20","author":"P\u00e9rez","year":"2004","journal-title":"Bioinformatics"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"130","DOI":"10.1108\/eb046814","article-title":"An algorithm for suffix stripping","volume":"14","author":"Porter","year":"1980","journal-title":"Program"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"S18","DOI":"10.1186\/1471-2105-6-S1-S18","article-title":"Learning statistical models for annotating proteins with function information using biomedical text","volume":"6","author":"Ray","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"902","DOI":"10.1038\/nbt0806-902","article-title":"Protein annotation by EBIMed","volume":"24","author":"Rebholz-Schuhmann","year":"2006","journal-title":"Nat. Biotechnol"},{"key":"2023041107551270500_","article-title":"Rule-based extraction of experimental evidence in the biomedical domain \u2013 the Kdd Cup (Task 1)","volume-title":"SIGKDD Explor., 4","author":"Regev","year":"2002"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"1816","DOI":"10.1128\/MCB.20.5.1816-1824.2000","article-title":"A DNA helicase required for maintenance of the functional mitochondrial genome in Saccharomyces cerevisiae","volume":"20","author":"Sedman","year":"2000","journal-title":"Mol. Cell. Biol"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1093\/nar\/gkg094","article-title":"The FlyBase database of the Drosophila genome projects and community literature","volume":"31","author":"The FlyBase Consortium","year":"2003","journal-title":"Nucleic Acids Res"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1089\/omi.2006.10.199","article-title":"FuGO working group. Development of FuGO: an ontology for functional genomics investigations","volume":"10","author":"Whetzel","year":"2006","journal-title":"OMICS"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"i331","DOI":"10.1093\/bioinformatics\/btg1046","article-title":"Evaluation of text data mining for database curation: lessons learned from the KDD Challenge Cup","volume":"19","author":"Yeh","year":"2003","journal-title":"Bioinformatics"},{"key":"2023041107551270500_","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1016\/j.jbi.2006.06.001","article-title":"Using MEDLINE as a knowledge source for disambiguating abbreviations and acronyms in full-text biomedical journal articles","volume":"40","author":"Yu","year":"2007","journal-title":"J. Biomed. Inform"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/23\/3232\/49822753\/bioinformatics_23_23_3232.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/23\/3232\/49822753\/bioinformatics_23_23_3232.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,14]],"date-time":"2023-05-14T12:35:53Z","timestamp":1684067753000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/23\/3232\/289972"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,10,17]]},"references-count":31,"journal-issue":{"issue":"23","published-print":{"date-parts":[[2007,12,1]]}},"URL":"http:\/\/dx.doi.org\/10.1093\/bioinformatics\/btm495","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2007,12,1]]},"published":{"date-parts":[[2007,10,17]]}}}