3. Introduction to Data Preprocessing

แชร์
ฝัง
  • เผยแพร่เมื่อ 6 ม.ค. 2025

ความคิดเห็น •

  • @nikitamaurya4518
    @nikitamaurya4518 8 หลายเดือนก่อน

    love this!

  • @NIPTMED
    @NIPTMED ปีที่แล้ว

    Bowtie2. I use it for aligment with reference before I call variant by GATK ?

  • @changfengchen9250
    @changfengchen9250 3 ปีที่แล้ว

    What if there happen to be two independent reads that have the same 5' and 3' ends?

    • @ChipsterTutorials
      @ChipsterTutorials  3 ปีที่แล้ว +2

      Assuming this is regarding duplicate marking, here's a nice documentation regarding MarkDuplicates function: gatk.broadinstitute.org/hc/en-us/articles/360037052812-MarkDuplicates-Picard-

  • @edoardoabeni6450
    @edoardoabeni6450 4 ปีที่แล้ว +1

    At 9:17 the speaker did not finish to explain why there are 2 matches!

    • @dawei6697
      @dawei6697 4 ปีที่แล้ว +1

      Horrible speakers struggling with drawing complete sentences.... she only explained what the 2 matches are NOT, but murmured through what they really are..”match the positions in the reference genome”? Hello? Base matches positions? Orange matches Apple?

    • @ChipsterTutorials
      @ChipsterTutorials  3 ปีที่แล้ว +3

      This is a recording of a live lecture -the audience members are asking some questions in between, which are unfortunately not completely audible. The lecturer continues to explain the same issue with the 3M in the beginning of the read: the message is that it is checked whether the bases positions align, regardless if they are mismatches or matches. So insertions and deletions are sort of the beef here. Here's for example a nice text explaining the CIGAR: sites.google.com/site/bioinformaticsremarks/bioinfo/sam-bam-format/what-is-a-cigar