Analysis of codon usage patterns in citrus based on coding sequence data

Zenan Shen, Zhimeng Gan, Fa Zhang, Xinyao Yi, Jinzhi Zhang, Xiaohua Wan*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

27 Citations (Scopus)

Abstract

Background: Codon usage is an important determinant of gene expression levels that can help us understand codon biology, evolution and mRNA translation of species. The majority of previous codon usage studies have focused on single species analysis, although few studies have focused on the species within the same genus. In this study, we proposed a multispecies codon usage analysis workflow to reveal the genetic features and correlation in citrus. Results: Our codon usage analysis workflow was based on the GC content, GC plot, and relative synonymous codon usage value of each codon in 8 citrus species. This approach allows for the comparison of codon usage bias of different citrus species. Next, we performed cluster analysis and obtained an overview of the relationship in citrus. However, traditional methods cannot conduct quantitative analysis of the correlation. To further estimate the correlation among the citrus species, we used the frequency profile to construct feature vectors of each species. The Pearson correlation coefficient was used to quantitatively analyze the distance among the citrus species. This result was consistent with the cluster analysis. Conclusions: Our findings showed that the citrus species are conserved at the genetic level and demonstrated the existing genetic evolutionary relationship in citrus. This work provides new insights into codon biology and the evolution of citrus and other plant species.

Original languageEnglish
Article number234
JournalBMC Genomics
Volume21
DOIs
Publication statusPublished - Dec 2020
Externally publishedYes

Keywords

  • Citrus
  • Codon usage
  • Correlation
  • Evolution
  • GC biology

Fingerprint

Dive into the research topics of 'Analysis of codon usage patterns in citrus based on coding sequence data'. Together they form a unique fingerprint.

Cite this