BASALT refines binning from metagenomic data and increases resolution of genome-resolved metagenomic analysis

Zhiguang Qiu, Li Yuan, Chun Ang Lian, Bin Lin, Jie Chen, Rong Mu, Xuejiao Qiao, Liyu Zhang, Zheng Xu, Lu Fan, Yunzeng Zhang, Shanquan Wang, Junyi Li, Huiluo Cao, Bing Li, Baowei Chen, Chi Song, Yongxin Liu, Lili Shi, Yonghong TianJinren Ni, Tong Zhang, Jizhong Zhou, Wei Qin Zhuang, Ke Yu*

*此作品的通讯作者

科研成果: 期刊稿件文章同行评审

11 引用 (Scopus)

摘要

Metagenomic binning is an essential technique for genome-resolved characterization of uncultured microorganisms in various ecosystems but hampered by the low efficiency of binning tools in adequately recovering metagenome-assembled genomes (MAGs). Here, we introduce BASALT (Binning Across a Series of Assemblies Toolkit) for binning and refinement of short- and long-read sequencing data. BASALT employs multiple binners with multiple thresholds to produce initial bins, then utilizes neural networks to identify core sequences to remove redundant bins and refine non-redundant bins. Using the same assemblies generated from Critical Assessment of Metagenome Interpretation (CAMI) datasets, BASALT produces up to twice as many MAGs as VAMB, DASTool, or metaWRAP. Processing assemblies from a lake sediment dataset, BASALT produces ~30% more MAGs than metaWRAP, including 21 unique class-level prokaryotic lineages. Functional annotations reveal that BASALT can retrieve 47.6% more non-redundant opening-reading frames than metaWRAP. These results highlight the robust handling of metagenomic sequencing data of BASALT.

源语言英语
文章编号2179
期刊Nature Communications
15
1
DOI
出版状态已出版 - 12月 2024
已对外发布

指纹

探究 'BASALT refines binning from metagenomic data and increases resolution of genome-resolved metagenomic analysis' 的科研主题。它们共同构成独一无二的指纹。

引用此