Draft:3’ mRNA-seq

From Wikipedia, the free encyclopedia
  • Comment: Notable but needs many more refs, thank you Ozzie10aaaa (talk) 12:59, 9 May 2024 (UTC)

3' mRNA-seq is a quantitative, genome-wide transcriptomic technique based on the barcoding of the 3' untranslated region (UTR) of mRNA molecules. Unlike standard bulk RNA-seq, where short sequencing reads are generated along the entire length of mRNA transcripts, only the 3' end of polyadenylated RNAs are sequenced in 3' mRNA-seq. This approach results in a need for fewer reads to quantify the expression of a gene and reduces the sequencing depth required per sample while providing robust and reliable transcriptome-wide read-outs of gene expression levels comparable to full-length RNA-seq methods.[1][2]

Sample barcoding and the reduced per-sample sequencing depth also allow higher levels of sample multiplexing per experiment and lower the cost of transcriptome sequencing compared to full-length RNA-seq methods. These factors are crucial for large-scale, ultra-high-throughput gene expression studies or studies assessing differential gene expression between different experimental conditions or cell types[2]

Some 3' mRNA-seq technologies, like Bulk RNA Barcoding and Sequencing (BRB-seq) commercialized by Alithea Genomics further streamline the library preparation process by pooling up to 384 samples very early in the workflow for a cost per sample tantamount to profiling four individual genes using conventional qRT-PCR, in a workflow requiring less than two and a half hours hands-on time[2]. An increasing number of 3' mRNA-seq techniques also include unique molecular identifiers (UMIs) in sample barcodes to uniquely label each mRNA molecule and to distinguish between original mRNA transcripts and duplicates that result from PCR amplification.

History[edit]

The sample barcoding approach used in 3' mRNA-seq was first established in the field of single-cell transcriptomics, where sample and mRNA barcoding allowed hundreds to thousands of single cells to be multiplexed in one experiment[3]. Single-cell RNA profiling technologies like CEL-seq2, SCRB-seq, and STRT-seq also allowed the pooling of large sets of samples into one unique sequencing library at an early stage in the protocol due to the addition of sample barcodes recognizing the 3’ poly(A)-tail of mRNA molecules[4][5][6].

However, while early iterations of 3’ mRNA-seq methods employed oligo-dT priming to enrich for the 3’ poly(A) regions of mRNA molecules, they often did not include the option to multiplex samples early in the workflow or to include UMIs to correct for amplification errors (Moll et al., 2014). Subsequent iterations and refinements of the method now often include combinations of UMIs and sample barcodes, with workflows optimized specifically for early multiplexing, and suitable for ultra-high-throughput sequencing experiments[2] .

Method[edit]

Numerous 3’ mRNA-seq methods exist, such as BRB-seq, QuantSeq, 3’Pool-seq, TagSeq, and QIAseq[2][7][8][9].

Each method relies on an initial reverse transcription step in which mRNAs are labeled with sample barcodes. Reverse transcription can be performed with oligo dT primers, barcoded oligo dT primers, or template-switching oligos[2][6][7]. In contrast, bulk RNA-seq library preparation methods like Illumina TruSeq mRNA Stranded kit use random priming of pre-fragmented RNA for reverse transcription to ensure reads are generated along the entire length of mRNA transcripts [1].

Second-strand synthesis is then performed in each method by DNA polymerase 1  nick translation or PCR, resulting in double-stranded complementary DNA (cDNA). This is followed by a process called tagmentation, in which double-stranded cDNA is fragmented and tagged using Tn5 transposase, which cleaves the cDNA and ligates adaptors for library amplification. Some methods use random primers for this stage[2][7].

Library indexing and PCR amplification then take place, resulting in libraries enriched for the 3’ untranslated region of mRNAs and suitable for short-read sequencing on Illumina or MGI sequencing instruments[2][7][6].

Advantages of 3' mRNA-seq[edit]

3’ mRNA-seq methods are generally cheaper per sample than standard bulk RNA-seq methods[2][7][8][9]. This is because of the lower sequencing depth required due to only the 3’ end of mRNA molecules being sequenced instead of the whole length of entire transcripts. Read depths of between one million and five million reads are recommended in commercialized 3’ mRNA-seq protocols and are suitable for detecting the majority of highly expressed genes[10][11]. This also allows more samples to be sequenced in the same sequencing run. The sample throughput for 3’ mRNA-seq library preparation differs per method but can allow up to 384 samples to be processed in plates, with options for automation[2][11]. For methods where samples are pooled early in the workflow, consumable use and cost are further reduced. For instance, BRB-seq is up to 25 times cheaper than Illumina TruSeq stranded mRNA library preparations, with a cost equivalent to assessing four genes by RT-qPCR[2].

The methods are largely insensitive to RNA degradation because only the 3’ region of mRNA transcripts are prepared for sequencing, regardless of how fragmented the rest of the mRNA molecules are due to degradation. This makes 3’ mRNA-seq methods suitable for both high-quality and degraded RNA with RIN <6 and results in data of a quality similar to full-length RNA-seq methods[2]. However, as only the 3’ region of mRNA molecules are sequenced, 3’ mRNA-seq methods are not suitable for the analysis of full-length transcripts, splice variants, fusion genes, or RNA editing.

References[edit]

  1. ^ Ma, Feiyang; Fuqua, Brie K.; Hasin, Yehudit; Yukhtman, Clara; Vulpe, Chris D.; Lusis, Aldons J.; Pellegrini, Matteo (2019-01-07). "A comparison between whole transcript and 3' RNA sequencing methods using Kapa and Lexogen library preparation methods". BMC Genomics. 20 (1): 9. doi:10.1186/s12864-018-5393-3. ISSN 1471-2164. PMC 6323698. PMID 30616562.
  2. ^ a b c d e f g h i j k l Alpern, Daniel; Gardeux, Vincent; Russeil, Julie; Mangeat, Bastien; Meireles-Filho, Antonio C. A.; Breysse, Romane; Hacker, David; Deplancke, Bart (December 2019). "BRB-seq: ultra-affordable high-throughput transcriptomics enabled by bulk RNA barcoding and sequencing". Genome Biology. 20 (1): 71. doi:10.1186/s13059-019-1671-x. ISSN 1474-760X. PMC 6474054. PMID 30999927.
  3. ^ Ziegenhain, Christoph; Vieth, Beate; Parekh, Swati; Reinius, Björn; Guillaumet-Adkins, Amy; Smets, Martha; Leonhardt, Heinrich; Heyn, Holger; Hellmann, Ines; Enard, Wolfgang (February 2017). "Comparative Analysis of Single-Cell RNA Sequencing Methods". Molecular Cell. 65 (4): 631–643.e4. doi:10.1016/j.molcel.2017.01.023. ISSN 1097-2765. PMID 28212749.
  4. ^ Islam, Saiful; Kjällquist, Una; Moliner, Annalena; Zajac, Pawel; Fan, Jian-Bing; Lönnerberg, Peter; Linnarsson, Sten (2012-04-05). "Highly multiplexed and strand-specific single-cell RNA 5′ end sequencing". Nature Protocols. 7 (5): 813–828. doi:10.1038/nprot.2012.022. ISSN 1754-2189. PMID 22481528.
  5. ^ Hashimshony, Tamar; Senderovich, Naftalie; Avital, Gal; Klochendler, Agnes; de Leeuw, Yaron; Anavy, Leon; Gennert, Dave; Li, Shuqiang; Livak, Kenneth J.; Rozenblatt-Rosen, Orit; Dor, Yuval; Regev, Aviv; Yanai, Itai (2016-04-28). "CEL-Seq2: sensitive highly-multiplexed single-cell RNA-Seq". Genome Biology. 17 (1): 77. doi:10.1186/s13059-016-0938-8. ISSN 1474-760X. PMC 4848782. PMID 27121950.
  6. ^ a b c Soumillon, Magali; Cacchiarelli, Davide; Semrau, Stefan; Oudenaarden, Alexander van; Mikkelsen, Tarjei S. (2014-03-05), Characterization of directed differentiation by high-throughput single-cell RNA-Seq, doi:10.1101/003236, retrieved 2024-05-29
  7. ^ a b c d e Moll, Pamela; Ante, Michael; Seitz, Alexander; Reda, Torsten (2014-11-25). "QuantSeq 3′ mRNA sequencing for RNA quantification". Nature Methods. 11 (12): i–iii. doi:10.1038/nmeth.f.376. ISSN 1548-7091.
  8. ^ a b Lohman, Brian K.; Weber, Jesse N.; Bolnick, Daniel I. (2016-04-22). "Evaluation of TagSeq, a reliable low-cost alternative for <scp>RNA</scp>seq". Molecular Ecology Resources. 16 (6): 1315–1321. doi:10.1111/1755-0998.12529. ISSN 1755-098X. PMID 27037501.
  9. ^ a b Sholder, Gabriel; Lanz, Thomas A.; Moccia, Robert; Quan, Jie; Aparicio-Prat, Estel; Stanton, Robert; Xi, Hualin S. (2020-01-20). "3'Pool-seq: an optimized cost-efficient and scalable method of whole-transcriptome gene expression profiling". BMC Genomics. 21 (1): 64. doi:10.1186/s12864-020-6478-3. ISSN 1471-2164. PMC 6971924. PMID 31959126.
  10. ^ "MERCURIUS™ BRB-seq kit for Illumina® | Alithea Genomics". alitheagenomics.com. Retrieved 2024-05-29.
  11. ^ a b "QuantSeq 3' mRNA-Seq Family". Lexogen. Retrieved 2024-05-29.