AWGRS: Automates paired-end whole genome re-sequencing data analysis framework

Xiujuan Sun, Fa Zhang, Xiaohua Wan, Jinzhi Zhang

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

In order to enable people to avoid too many cumbersome and complex operations of the command line and repeated parameter adjustments, automates pair-end whole genome re-sequence (aWGRS) data processing whereby pre-installed dependencies are presented in this paper, which are used to map reads to a reference and realign variations. This method presents aWGRS which is a method that takes as input paired-end reads and a reference genome and returns re-sequencing information. The concept behind the development of this tool is that re-sequencing requires several steps: alignment to the reference, single nucleotide polymorphisms (SNPs) calling, Insertion / Deletion (InDels) calling, structure variant (SVs) calling, and annotation. By introducing and adjusting a new concept called the recall rate, the coverage rate and accuracy rate can be met at the same time. Within the range of recall rate, a variation is evaluated by two criteria: the quality value and the number of reads that support it, and one read with higher quality value and larger supported number will be picked out finally. Genome-wide genetic variations between precocious trifoliate orange and its wild type are identified in [1], and empirical results show that there is a big reduction in the amount of variation and great improvement of accuracy between the results of aWGRS and [1] which offered by the Beijing Genomics Institute (BGI). Overall, the adjustable parameters adopted in aWGRS can affect the results of the experiment and the default filtering strategy using the mutation recall rate also can attain good results automatically.

源语言英语
主期刊名Proceedings - 2016 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2016
编辑Kevin Burrage, Qian Zhu, Yunlong Liu, Tianhai Tian, Yadong Wang, Xiaohua Tony Hu, Qinghua Jiang, Jiangning Song, Shinichi Morishita, Kevin Burrage, Guohua Wang
出版商Institute of Electrical and Electronics Engineers Inc.
910-916
页数7
ISBN(电子版)9781509016105
DOI
出版状态已出版 - 17 1月 2017
已对外发布
活动2016 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2016 - Shenzhen, 中国
期限: 15 12月 201618 12月 2016

出版系列

姓名Proceedings - 2016 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2016

会议

会议2016 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2016
国家/地区中国
Shenzhen
时期15/12/1618/12/16

指纹

探究 'AWGRS: Automates paired-end whole genome re-sequencing data analysis framework' 的科研主题。它们共同构成独一无二的指纹。

引用此