Publications des agents du Cirad

Cirad

TOGGLe (Toolbox for Generic NGS Analyses) A framework to quickly build pipelines and to perform large-scale HTS analysis

Orjuela Castellanos J., Ravel S., Dereeper A., Tando N., Sabot F., Tranchant-Dubreuil C.. 2018. In : JOBIM 2018 Abstracts. Marseille : SFBI, p. 330-331. Journées Ouvertes Biologie, Informatique et Mathématiques (JOBIM 2018), 2018-07-03/2018-07-06, Marseille.

High throughput sequencing (HTS) data analyses are done every day for biologist and bioinformatics in order to give biological sense of their data. These analyses must to be reproducible, robust and efficient. A generic tool TOGGLe (Toolbox for Generic NGS Analyses) was developed to allow running simple and complex pipelines without require any programming skills. This workflow manager is friendly to users and transparent for developers. User only need basic Linux commands and to specify freely their favorite software parameters through a simple text file given. TOGGLe manages, controls, verifies and concatenates every step in your favorite workflow. This workflow manager checks structure and compatibility of steps given in the software parameters file, it builds a workflow and launches it. TOGGLe reports parameters, commands executed, software versions as well as errors if they occur. These informations are kept in logs and reports files. Results are organized in a structured tree of directories. TOGGLe allows compressing or removing intermediate data and it uses scheduler machinery. TOGGLe integrates a large panel of tools for HTS analyses (demultiplexing, cleaning, trimming, calling, assembly, structural variation detection, transcriptomic ...) and post-analysis (haplotype detection, population structure ...). This workflow manager is highly flexible on the data type, working on sequencing raw data (Illumina, 454 or Pacific Biosciences), as well as on various other formats (e.g. SAM, BAM, VCF). This HTS workflow manager allows running parallel analysis or global, where several samples will be analysed together. TOGGLe was used on numerous sequencing projects with high number of samples, or/and high depth sequencing. It was shown to be highly adaptable to various biological questions as well as to a large array of computing architecture and data. In this poster, we are going to show how users can enjoy of TOGGLe in their data analysis.

Documents associés

Communication de congrès

Agents Cirad, auteurs de cette publication :