Pseudoalignment and Expectation Maximization

แชร์
ฝัง
  • เผยแพร่เมื่อ 23 ส.ค. 2024
  • Aligning RNA-seq consists of a two step process. The first step identifies coordinates of a read relative to a reference genome or reference transcripts. Due to similarities between genes and transcripts most reads can not be aligned uniquely, resulting in ambiguously aligned reads. To accurately estimate the correct transcript counts, a second step proportionally assigns these counts to transcripts. This step is called the Expectation Maximization of transcript counts. While the math can be a bit confusing, the actual implementation is easy. The Jupyter notebook with an example implementation can be found here:
    github.com/lac...

ความคิดเห็น • 3

  • @Muuip
    @Muuip ปีที่แล้ว

    Great approach to explain and test! Thanks much appreciated!👍

  • @citron2725
    @citron2725 9 หลายเดือนก่อน +1

    i guess because longer transcripts have higher probability for having more reads mapping to them than short transcripts during sequencing?

  • @Muuip
    @Muuip ปีที่แล้ว

    Anything on VOOM normalization?