Abstract
Accurate characterisation of splice junctions (SJs) as well as transcription start and end sites in reference transcriptomes allows precise quantification of transcripts from RNA-seq data, and enables detailed investigations of transcriptional and post-transcriptional regulation. Using novel computational methods and a combination of PacBio Iso-seq and Illumina short-read sequences from 20 diverse tissues and conditions, we generated a comprehensive and highly resolved barley reference transcript dataset from the European 2-row spring barley cultivar Barke (BaRTv2.18). Stringent and thorough filtering was carried out to maintain the quality and accuracy of the SJs and transcript start and end sites. BaRTv2.18 shows increased transcript diversity and completeness compared with an earlier version, BaRTv1.0. The accuracy of transcript level quantification, SJs and transcript start and end sites have been validated extensively using parallel technologies and analysis, including high-resolution reverse transcriptase-polymerase chain reaction and 5'-RACE. BaRTv2.18 contains 39 434 genes and 148 260 transcripts, representing the most comprehensive and resolved reference transcriptome in barley to date. It provides an important and high-quality resource for advanced transcriptomic analyses, including both transcriptional and post-transcriptional regulation, with exceptional resolution and precision.
Original language | English (US) |
---|---|
Pages (from-to) | 1183-1202 |
Number of pages | 20 |
Journal | Plant Journal |
Volume | 111 |
Issue number | 4 |
DOIs | |
State | Published - Aug 2022 |
Bibliographical note
Funding Information:The authors acknowledge the Research/Scientific Computing teams at The James Hutton Institute and NIAB for providing computational resources and technical support for the "UK's Crop Diversity Bioinformatics HPC" (BBSRC grant BB/S019669/1), use of which has contributed to the results reported. We thank Susanne König and Axel Himmelbach from IPK Gatersleben for preparing and sequencing the Iso-seq libraries. The authors acknowledge the support of the Biotechnology and Biological Sciences Research Council (Grant Numbers BB/S020160/1, BB/S004610/1, BB/R014582/1) to MC, RZ, MB, WG, RWa and JWSB; The Rural & Environment Science & Analytical Services Division of the Scottish Government (Grant Number SRP WP2.1 and 2.2) to RZ, WG, MB, LM, NM, MS, CS, JF and RWa; a joint PhD scholarship from the James Hutton Institute and the University of Dundee/BBSRC DTP (BB/M010996/1) to JSWB, RZ and JCE; German Research Council (DFG) (Grant number STE 1102/15-1) to NS and RWo; and the National Science Foundation (Grant number ERA-CAPS 1844331) to GM and AH.
Funding Information:
The authors acknowledge the Research/Scientific Computing teams at The James Hutton Institute and NIAB for providing computational resources and technical support for the "UK's Crop Diversity Bioinformatics HPC" (BBSRC grant BB/S019669/1), use of which has contributed to the results reported. We thank Susanne König and Axel Himmelbach from IPK Gatersleben for preparing and sequencing the Iso‐seq libraries. The authors acknowledge the support of the Biotechnology and Biological Sciences Research Council (Grant Numbers BB/S020160/1, BB/S004610/1, BB/R014582/1) to MC, RZ, MB, WG, RWa and JWSB; The Rural & Environment Science & Analytical Services Division of the Scottish Government (Grant Number SRP WP2.1 and 2.2) to RZ, WG, MB, LM, NM, MS, CS, JF and RWa; a joint PhD scholarship from the James Hutton Institute and the University of Dundee/BBSRC DTP (BB/M010996/1) to JSWB, RZ and JCE; German Research Council (DFG) (Grant number STE 1102/15‐1) to NS and RWo; and the National Science Foundation (Grant number ERA‐CAPS 1844331) to GM and AH.
Publisher Copyright:
© 2022 The Authors. The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Keywords
- Iso-seq
- RNA-seq
- barley
- reference transcript datasets
- transcriptomic analysis