OATAO - Open Archive Toulouse Archive Ouverte Open Access Week

TomExpress, a unified tomato RNA-Seq platform for visualization of expression data, clustering and correlation networks

Zouine, Mohamed and Maza, Elie and Djari, Anis and Lauvernier, Mattieu and Frasse, Pierre and Smouni, Abdelaziz and Pirrello, Julien and Bouzayen, Mondher TomExpress, a unified tomato RNA-Seq platform for visualization of expression data, clustering and correlation networks. (2017) Plant Journal, 92 (4). 727 -735. ISSN 0960-7412

(Document in English)

PDF (Author's version) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader

Official URL: http://dx.doi.org/10.1111/tpj.13711


The TomExpress platform was developed to provide the tomato research community with a browser and integrated web tools for public RNA-Seq data visualization and data mining. To avoid major biases that can result from the use of different mapping and statistical processing methods, RNA-Seq raw sequence data available in public databases were mapped de novo on a unique tomato reference genome sequence and post-processed using the same pipeline with accurate parameters. Following the calculation of the number of counts per gene in each RNA-Seq sample, a communal global normalization method was applied to all expression values. This unifies the whole set of expression data and makes them comparable. A database was designed where each expression value is associated with corresponding experimental annotations. Sample details were manually curated to be easily understandable by biologists. To make the data easily searchable, a user-friendly web interface was developed that provides versatile data mining web tools via on-the-fly generation of output graphics, such as expression bar plots, comprehensive in planta representations and heatmaps of hierarchically clustered expression data. In addition, it allows for the identification of co-expressed genes and the visualization of correlation networks of co-regulated gene groups. TomExpress provides one of the most complete free resources of publicly available tomato RNA-Seq data, and allows for the immediate interrogation of transcriptional programs that regulate vegetative and reproductive development in tomato under diverse conditions. The design of the pipeline developed in this project enables easy updating of the database with newly published RNA-Seq data, thereby allowing for continuous enrichment of the resource.

Item Type:Article
Additional Information:Thanks to Wiley editor. The definitive version is available at : http://onlinelibrary.wiley.com/doi/10.1111/tpj.13711/epdf
HAL Id:hal-01607612
Audience (journal):International peer-reviewed journal
Uncontrolled Keywords:
Institution:Université de Toulouse > Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE)
French research institutions > Institut National de la Recherche Agronomique - INRA (FRANCE)
Other partners > Université Mohammed V-Rabat - UM5 (MOROCCO)
Laboratory name:
Christophe Klopp - TULIP (ANR-10-LABX-41) - European COST Action FA1106
Deposited On:16 Nov 2017 08:32

Repository Staff Only: item control page