Publications des agents du Cirad

Cirad

Construction of a comprehensive library of repeated sequences for the annotation of Citrus genomes

Giraud D., Choisne N., Summo M., Sidibé-Bocs S., Vassilieff H., Costantino G., Droc G., Teycheney P.Y., Maumus F., Ollitrault P., Luro F.. 2025. BMC Genomic Data, 26 : 13 p..

DOI: 10.57745/ZVYEBU

DOI: 10.1186/s12863-025-01321-6

Background: The comprehensive annotation of repeated sequences in genomes is an essential prerequisite for studying the dynamics of these sequences over time and their involvement in gene regulation. Currently, the diversity of repeated sequences in Citrus genomes is only partially characterized because the annotations have been performed using heterogeneous bioinformatics tools, each with its specificity and dedicated only to the annotation of transposable elements. Results: We combined complementary repeat-finding programs including REPET, CAULIFINDER, and TAREAN, to enable the identification of all types of repetitive sequences found in plant genomes, including transposable elements, endogenous caulimovirids, and satellite DNAs. A fine-grained annotation method was first developed to create a consensus sequence library of repeated sequences identified in the genome assemblies of C. medica, C. micrantha, C. reticulata, and C. maxima, the four ancestral parental species involved in the formation of economically valuable cultivated Citrus varieties. A second, faster annotation method was developed to enrich the dataset by adding new repeated sequences retrieved from genome assemblies of other Citrus species and closely related species belonging to the Aurantioideae subfamily. The final reference library contains 3,091 consensus sequences, of which 94.5% are transposable elements. The diversity of endogenous caulimovirids was characterized for the first time within the genus Citrus, contributing 160 consensus sequences to the final reference library. Finally, 10 satellite DNAs were also identified. Conclusion: Combining multiple repeat detection methods enables the comprehensive annotation of all repeated sequences in Citrus genomes. Using the final reference library reported in this work will improve our understanding of the dynamics of repeated sequences during Citrus speciation, particularly following the genome duplication and hybridization events that led to mod

Mots-clés : citrus; génome; séquence nucléotidique; citrus medica; citrus maxima; citrus reticulata; bioinformatique; phylogénie; identification; transposon; génomique; multiplication végétative; génie génétique

Documents associés

Article (a-revue à facteur d'impact)

Agents Cirad, auteurs de cette publication :