MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics

Research output: Contribution to journalArticle

Standard

MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics. / Kushwaha, Sandeep; Manoharan, Lokeshwaran; Meerupati, Tejashwari; Hedlund, Katarina; Ahrén, Dag.

In: BMC Bioinformatics, Vol. 16, 65, 2015.

Research output: Contribution to journalArticle

Harvard

APA

CBE

MLA

Vancouver

Author

RIS

TY - JOUR

T1 - MetCap: a bioinformatics probe design pipeline for large-scale targeted metagenomics

AU - Kushwaha, Sandeep

AU - Manoharan, Lokeshwaran

AU - Meerupati, Tejashwari

AU - Hedlund, Katarina

AU - Ahrén, Dag

PY - 2015

Y1 - 2015

N2 - Background: Massive sequencing of genes from different environments has evolved metagenomics as central to enhancing the understanding of the wide diversity of micro-organisms and their roles in driving ecological processes. Reduced cost and high throughput sequencing has made large-scale projects achievable to a wider group of researchers, though complete metagenome sequencing is still a daunting task in terms of sequencing as well as the downstream bioinformatics analyses. Alternative approaches such as targeted amplicon sequencing requires custom PCR primer generation, and is not scalable to thousands of genes or gene families. Results: In this study, we are presenting a web-based tool called MetCap that circumvents the limitations of amplicon sequencing of multiple genes by designing probes that are suitable for large-scale targeted metagenomics sequencing studies. MetCap provides a novel approach to target thousands of genes and genomic regions that could be used in targeted metagenomics studies. Automatic analysis of user-defined sequences is performed, and probes specifically designed for metagenome studies are generated. To illustrate the advantage of a targeted metagenome approach, we have generated more than 300,000 probes that match more than 400,000 publicly available sequences related to carbon degradation, and used these probes for target sequencing in a soil metagenome study. The results show high enrichment of target genes and a successful capturing of the majority of gene families. MetCap is freely available to users from: http://soilecology.biol.lu.se/metcap/. Conclusion: MetCap is facilitating probe-based target enrichment as an easy and efficient alternative tool compared to complex primer-based enrichment for large-scale investigations of metagenomes. Our results have shown efficient large-scale target enrichment through MetCap-designed probes for a soil metagenome. The web service is suitable for any targeted metagenomics project that aims to study several genes simultaneously. The novel bioinformatics approach taken by the web service will enable researchers in microbial ecology to tap into the vast diversity of microbial communities using targeted metagenomics as a cost-effective alternative to whole metagenome sequencing.

AB - Background: Massive sequencing of genes from different environments has evolved metagenomics as central to enhancing the understanding of the wide diversity of micro-organisms and their roles in driving ecological processes. Reduced cost and high throughput sequencing has made large-scale projects achievable to a wider group of researchers, though complete metagenome sequencing is still a daunting task in terms of sequencing as well as the downstream bioinformatics analyses. Alternative approaches such as targeted amplicon sequencing requires custom PCR primer generation, and is not scalable to thousands of genes or gene families. Results: In this study, we are presenting a web-based tool called MetCap that circumvents the limitations of amplicon sequencing of multiple genes by designing probes that are suitable for large-scale targeted metagenomics sequencing studies. MetCap provides a novel approach to target thousands of genes and genomic regions that could be used in targeted metagenomics studies. Automatic analysis of user-defined sequences is performed, and probes specifically designed for metagenome studies are generated. To illustrate the advantage of a targeted metagenome approach, we have generated more than 300,000 probes that match more than 400,000 publicly available sequences related to carbon degradation, and used these probes for target sequencing in a soil metagenome study. The results show high enrichment of target genes and a successful capturing of the majority of gene families. MetCap is freely available to users from: http://soilecology.biol.lu.se/metcap/. Conclusion: MetCap is facilitating probe-based target enrichment as an easy and efficient alternative tool compared to complex primer-based enrichment for large-scale investigations of metagenomes. Our results have shown efficient large-scale target enrichment through MetCap-designed probes for a soil metagenome. The web service is suitable for any targeted metagenomics project that aims to study several genes simultaneously. The novel bioinformatics approach taken by the web service will enable researchers in microbial ecology to tap into the vast diversity of microbial communities using targeted metagenomics as a cost-effective alternative to whole metagenome sequencing.

KW - Bioinformatics

KW - Environmental sequencing

KW - Functional genes

KW - Metagenome

KW - Probe design pipeline

KW - Targeted metagenomics

KW - Sequence capture

KW - MetCap

UR - https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-015-0843-2

U2 - 10.1186/s12859-015-0501-8

DO - 10.1186/s12859-015-0501-8

M3 - Article

VL - 16

JO - BMC Bioinformatics

T2 - BMC Bioinformatics

JF - BMC Bioinformatics

SN - 1471-2105

M1 - 65

ER -