You explained CIGAR string very good. Thanks a lot❤. I find some string such as 74M2S, 55M2S ... in this column. What does "S" stand for? and what does that mean?
Humble request that can u make a short video of what is Sam, bam, sorted bam and index bam files are and how we can interpret these files? I would greatly appreciate it!
Can you write a linux comman on how to extract specfic gene sequence from file forexample chromsome x start-end postion is known and seprate that sequence in fasta format and save it
Happy to see a successful Khushbu! From 3:08 - 4:02 your explanation was not ideal / correct. At that position, both A and G have been accepted because the two are similar - in being Purines of the Nucleic Acids. Similarly within two DNA strands C=T, while with two RNA strands C=T, all three C (Cytosine), T (Thymine) and U (Uracil) being the three Pyrimidines of the Nucleic Acids. Am I right Ms. Patel?
Thank you for pointing it out! As far as my understanding goes, and please correct me if I have understood it wrong, at position 14 even though there are different bases in reference and query, it is still reported as M because it is in alignment match i.e. read base lines up with a base in the reference. There is definitely a sequence mismatch. If it's not an alignment match, then CIGAR reports out insertion or deletion or alignment gap.
I plan on making a video on RNA-Seq alignment and gene quantification. Once we get the count matrices, usually we perform differential expression analysis. The further downstream steps can be subjective and depend on what questions we are asking and the goal of our analysis.
Thank you for the video. I intend learning bioinformatics myself. I am new to the field though I have some knowledge of (molecular)biology. Please, can you provide a guide/path that I can follow to start learning from scratch.
I have compiled a list of resources for someone who is starting out: khushbupatel.notion.site/Bioinformatics-Starter-Pack-ae982892a9fb44569f735abc52ec9dd8 Besides, I think the best way to introduce yourself to this field is to take up an online course. There are many platforms that offers beginner courses, some of them are Coursera and Udemy. Check out the linked document where you will find access to many other helpful resources!
One of the best explaination of SAM/BAM file, Thanks a bunch, keep it up
Neat and clear explanation of SAM/BAM format. Nice work! Thanks!
nice pills of knowledge to refresh the basics after a long period of lab-only work! thanks!
Really awesome, valuable, & well put together content. Thank you :)
You explained CIGAR string very good. Thanks a lot❤. I find some string such as 74M2S, 55M2S ... in this column. What does "S" stand for? and what does that mean?
Thank you for this video !!
Please share this presentation, It will be so helpful
Thank you
Humble request that can u make a short video of what is Sam, bam, sorted bam and index bam files are and how we can interpret these files? I would greatly appreciate it!
Can you write a linux comman on how to extract specfic gene sequence from file forexample chromsome x start-end postion is known and seprate that sequence in fasta format and save it
Thank you for explaining this! I have a doubt in the CIGAR string the 3M1I3M1D5M what does M I and D stand for here?
M stands for Match, D for Deletion and I for Insertion
Happy to see a successful Khushbu! From 3:08 - 4:02 your explanation was not ideal / correct. At that position, both A and G have been accepted because the two are similar - in being Purines of the Nucleic Acids. Similarly within two DNA strands C=T, while with two RNA strands C=T, all three C (Cytosine), T (Thymine) and U (Uracil) being the three Pyrimidines of the Nucleic Acids. Am I right Ms. Patel?
Thank you for pointing it out! As far as my understanding goes, and please correct me if I have understood it wrong, at position 14 even though there are different bases in reference and query, it is still reported as M because it is in alignment match i.e. read base lines up with a base in the reference. There is definitely a sequence mismatch. If it's not an alignment match, then CIGAR reports out insertion or deletion or alignment gap.
Hey your videos are really great and informative but can you please provide sources from where you got the images that you present in your videos!
Can u plz make series of rna seq data analysis step by step
I plan on making a video on RNA-Seq alignment and gene quantification. Once we get the count matrices, usually we perform differential expression analysis. The further downstream steps can be subjective and depend on what questions we are asking and the goal of our analysis.
Thank you for the video.
I intend learning bioinformatics myself.
I am new to the field though I have some knowledge of (molecular)biology.
Please, can you provide a guide/path that I can follow to start learning from scratch.
I have compiled a list of resources for someone who is starting out: khushbupatel.notion.site/Bioinformatics-Starter-Pack-ae982892a9fb44569f735abc52ec9dd8
Besides, I think the best way to introduce yourself to this field is to take up an online course. There are many platforms that offers beginner courses, some of them are Coursera and Udemy. Check out the linked document where you will find access to many other helpful resources!
Alright.
Thank you very much.
Thanks a lot. Which terminal are you using to view sam/bam file in such a way?
I think she has a mac, so probably the mac command line. It should be very similar to linux.
That's right. I use mac's terminal to perform command line operations.
To be more precise, bash or zsh terminal.
@@francescosilvestro2092 I am using bash
Thanks👍👍👍👍👍👍👍👍👍👍!!!!!!!!!!!!
nice explations.but it would be nicer if you explain slowly or take a pause.
agree but I usually set audio at 0.75X and that does it. But thanks again Ms Patel.
can i get your email ?
You can find my email in the description of every video.