The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana

Steven B. Cannon, Arvind Mitra, Andrew Baumgarten, Nevin D Young, Georgiana May

Research output: Contribution to journalArticlepeer-review

1701 Scopus citations

Abstract

Background: Most genes in Arabidopsis thaliana are members of gene families. How do the members of gene families arise, and how are gene family copy numbers maintained? Some gene families may evolve primarily through tandem duplication and high rates of birth and death in clusters, and others through infrequent polyploidy or large-scale segmental duplications and subsequent losses. Results: Our approach to understanding the mechanisms of gene family evolution was to construct phylogenies for 50 large gene families in Arabidopsis thaliana, identify large internal segmental duplications in Arabidopsis, map gene duplications onto the segmental duplications, and use this information to identify which nodes in each phylogeny arose due to segmental or tandem duplication. Examples of six gene families exemplifying characteristic modes are described. Distributions of gene family sizes and patterns of duplication by genomic distance are also described in order to characterize patterns of local duplication and copy number for large gene families. Both gene family size and duplication by distance closely follow power-law distributions. Conclusions: Combining information about genomic segmental duplications, gene family phylogenies, and gene positions provides a method to evaluate contributions of tandem duplication and segmental genome duplication in the generation and maintenance of gene families. These differences appear to correspond meaningfully to differences in functional roles of the members of the gene families.

Original languageEnglish (US)
Article number10
JournalBMC plant biology
Volume4
DOIs
StatePublished - Jun 1 2004

Fingerprint

Dive into the research topics of 'The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana'. Together they form a unique fingerprint.

Cite this