iBet uBet web content aggregator. Adding the entire web to your favor.
iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: https://pubmed.ncbi.nlm.nih.gov/22645317/
dbCAN: a web resource for automated carbohydrate-active enzyme annotation - PubMed Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Jul;40(Web Server issue):W445-51.
doi: 10.1093/nar/gks479. Epub 2012 May 29.

dbCAN: a web resource for automated carbohydrate-active enzyme annotation

Affiliations

dbCAN: a web resource for automated carbohydrate-active enzyme annotation

Yanbin Yin et al. Nucleic Acids Res. 2012 Jul.

Abstract

Carbohydrate-active enzymes (CAZymes) are very important to the biotech industry, particularly the emerging biofuel industry because CAZymes are responsible for the synthesis, degradation and modification of all the carbohydrates on Earth. We have developed a web resource, dbCAN (http://csbl.bmb.uga.edu/dbCAN/annotate.php), to provide a capability for automated CAZyme signature domain-based annotation for any given protein data set (e.g. proteins from a newly sequenced genome) submitted to our server. To accomplish this, we have explicitly defined a signature domain for every CAZyme family, derived based on the CDD (conserved domain database) search and literature curation. We have also constructed a hidden Markov model to represent the signature domain of each CAZyme family. These CAZyme family-specific HMMs are our key contribution and the foundation for the automated CAZyme annotation.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Flowchart of our procedure for identifying and defining signature domain models for an example CAZyme family. Here this family contains four full-length proteins with different lengths. The red box is the signature domain regions defining the CAZyme family. It could be either identified by searching against annotated functional domain models in the CDD database or retrieved from literature curation. Boxes in other colors are non-overlapped domain regions annotated by other CDD models. The CDD search is done by RPS-BLAST, the multiple sequence alignment is done by MAFFT (default parameters), the building of HMM is done by hmmbuild and all other processes are done by self-developed perl scripts.
Figure 2.
Figure 2.
Snapshots of dbCAN annotation server. (A) The query page, where users can paste some FASTA format protein sequences in the text box or upload a text file containing the FASTA sequences. Clicking on ‘submit’ will invoke the hmmscan program in the backend server to search the queried sequences against the dbCAN HMMs. (B) The result page, where users can download the raw output from the hmmscan run and view the processed tabular format output (if alignment length >80 amino acids, use E-value < 1e − 5, otherwise use E-value < 1e − 3). A diagram is shown in the bottom to illustrate the CAZyme domain architecture according to the positional information in the tabular output.

Similar articles

Cited by

References

    1. Cantarel BL, Coutinho PM, Rancurel C, Bernard T, Lombard V, Henrissat B. The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics. Nucleic Acids Res. 2009;37:D233–D238. - PMC - PubMed
    1. Li LL, McCorkle SR, Monchy S, Taghavi S, van der Lelie D. Bioprospecting metagenomes: glycosyl hydrolases for converting biomass. Biotechnol. Biofuels. 2009;2:10. - PMC - PubMed
    1. Tasse L, Bercovici J, Pizzut-Serin S, Robe P, Tap J, Klopp C, Cantarel BL, Coutinho PM, Henrissat B, Leclerc M, et al. Functional metagenomics to mine the human gut microbiome for dietary fiber catabolic enzymes. Genome Res. 2010;20:1605–1612. - PMC - PubMed
    1. Allgaier M, Reddy A, Park JI, Ivanova N, D’Haeseleer P, Lowry S, Sapra R, Hazen TC, Simmons BA, VanderGheynst JS, et al. Targeted discovery of glycoside hydrolases from a switchgrass-adapted compost community. PloS One. 2010;5:e8812. - PMC - PubMed
    1. Hess M, Sczyrba A, Egan R, Kim TW, Chokhawala H, Schroth G, Luo S, Clark DS, Chen F, Zhang T, et al. Metagenomic discovery of biomass-degrading genes and genomes from cow rumen. Science. 2011;331:463–467. - PubMed

Publication types