Understanding VCF file | Variant Call Format Part 3/3

แชร์
ฝัง
  • เผยแพร่เมื่อ 18 ก.ย. 2024

ความคิดเห็น • 35

  • @ibabiome189
    @ibabiome189 3 ปีที่แล้ว

    Amazing videos, crisp and worth the time invested watching.
    Solution to KCQ 5 is as below:
    POS : 2
    REF : TCG
    ALT1 : TG - {Deletion of C at position 3}
    ALT2 : T - {Deletion of C and G at position 3 and 4 respectively}
    ALT3 : TCAG - {Insertion of A at position 4, which pushes G from position 4 to position 5}

    • @aspangle1
      @aspangle1 2 ปีที่แล้ว +1

      But why are these 3 different mutations not listed as separate lines of the VCF file? Why separate them by commas on the same line?

  • @elhafafadoua7552
    @elhafafadoua7552 3 ปีที่แล้ว +1

    Hello, thank you for your efforts, it is well explained. For the C-->G, it is a transversion, because C and G do not belong to the same family ( C is pyrimidine and G is a purine ).

    • @LiquidBrain
      @LiquidBrain  3 ปีที่แล้ว +1

      Thanks for posting this, mutation naming scheme is confusing 😅😅

    • @russiachand
      @russiachand 3 ปีที่แล้ว +1

      transitions (AT or GC) or transversions (AG, AC, GT or CT)

  • @Dr.WinniesBioWorld
    @Dr.WinniesBioWorld 5 หลายเดือนก่อน

    great couple of videos. Very helpful

  • @helenabiasibettibrendler2818
    @helenabiasibettibrendler2818 3 ปีที่แล้ว +1

    Thank you! The videos are super insightful and helpful! Keep up the awesome work!

  • @ollelinux
    @ollelinux 3 ปีที่แล้ว +2

    Thanks a bunch! It helped me greatly in my work :)

  • @arwabashanfar5037
    @arwabashanfar5037 4 ปีที่แล้ว +2

    Hello. where can i check the answer? i did not find it :( .. thank you very much

  • @lmarkal
    @lmarkal 3 ปีที่แล้ว

    Thanks for the videos. Very informative for my next job interview!

  • @genomicsandbioinformatics9628
    @genomicsandbioinformatics9628 2 ปีที่แล้ว

    Great explanation, would you explain how ref and alt alleles are assigned in a vcf file. Is it assigned on the basis of allele frequency? As in a larger population there may be different types of snps such as A, C, T, G, then how only one snp is assigned as Alt allele? Is it assigned on the basis of its frequency in the population? E.g In different individuals of a population, there may be many possible snps at a specific position such as A, T, C, G. So who can we know that which snp could be the Alt allele?

  • @musicspinner
    @musicspinner 3 ปีที่แล้ว +1

    Why wouldn't example 3 (t=3:50) have the POS=3, REF=C, and ALT=CA instead? Wouldn't that be the same but more efficient?
    Great set of videos by the way 👌

    • @musicspinner
      @musicspinner 3 ปีที่แล้ว

      Do VCF's then not necessarily use the most efficient way of reporting then?

    • @LiquidBrain
      @LiquidBrain  3 ปีที่แล้ว +1

      It was displayed as two/three digit since the position was on 2, and thus the REF and the ALT is displayed as that. To be honest I am not really sure why it was design this way, and different libraries sometimes do something different with their output. but both method of representation actually describe the same outcome.

  • @annuranagamechangeroflife542
    @annuranagamechangeroflife542 3 ปีที่แล้ว

    How we can split VCF files from single VCF file and how we convert single VCF file to pfam format and how we can use plinkseq ?

  • @elenips7231
    @elenips7231 2 ปีที่แล้ว

    Very helpful video , thank you!! I am not really familiar with bioinformatics and in this part of my project, I am trying two compare two VCF files corresponding to the results of healthy tissue and tumor tissue. I want to compare these VCF files and remove their similarities. More specific I want to remove the information of the healthy tissue from the tumor one. Have you any suggestions on which tool I should use or any way that I can do my analysis? thank you in advance!

  • @HaileG-2020
    @HaileG-2020 7 หลายเดือนก่อน

    Thank you very much, Sir.
    Can you share the bash script for genomic variant format (GT) conversion from 0/0, 0/1, 1/1, ./. to 0, 1, 2, and NA, from 0, 1, 2, and NA to letters/nucleotide bases (diploid form: AA, CC, GG, TT) or directly from 0/0, 0/1, 1/1, ./. to letters/nucleotide bases (diploid form) and vice versa. I think these are the backbone for any downstream data analysis, and I am also facing many problems related to those. The script for file form and genomic variance conversion (GT) may be also in R script or Python script. Waiting for your kind response.

  • @sagek7949
    @sagek7949 9 หลายเดือนก่อน

    This is super helpful! Subscribed :)

  • @scottieteichmer3777
    @scottieteichmer3777 2 ปีที่แล้ว

    I am working with a vcf file and am looking for information on how missense and nonsense data is represented without going to the summary html. Could you point me in the right direction?

    • @LiquidBrain
      @LiquidBrain  2 ปีที่แล้ว

      Not sure what would be your down stream analysis on the vcf, but maybe have a look at the vcftools (vcftools.sourceforge.net/), or there's a lot of tools available in the Galaxy.org (usegalaxy.org/) (VCFfilter) to get the statistics around the the vcf data.
      I am not exactly how to do it, but do email us at liquidbrain.r@gmail.com and I can see what i can do

  • @bill9623
    @bill9623 2 ปีที่แล้ว

    Thank you for this making this video

  • @chesterhung6154
    @chesterhung6154 2 ปีที่แล้ว

    Example 5 mutation is called "Translocation"

  • @lekshmirk3252
    @lekshmirk3252 3 ปีที่แล้ว

    Nice lecture big thanks. Could you pls make a video on analysis using tablet

  • @ssssteve5283
    @ssssteve5283 3 ปีที่แล้ว

    Nice video. Please keep going. Thank you very much.

  • @jemimahbanganan1759
    @jemimahbanganan1759 2 ปีที่แล้ว

    Hi! I was wondering if is it possible to create my own .vcf file? If it is, how do you create one? Because I have my genotypic data with SNP data but in .csv file. I need to convert it to vcf file to use it for GWAS. I do appreciate if you can answer this. Thank you.

    • @LiquidBrain
      @LiquidBrain  2 ปีที่แล้ว

      For vcf you can generate them from the bam file after you mapping step, there's a great guide here from EMBL's European Bioinformatics Institute :)
      www.ebi.ac.uk/sites/ebi.ac.uk/files/content.ebi.ac.uk/materials/2014/140217_AgriOmics/dan_bolser_snp_calling.pdf

  • @kubectlgetpo
    @kubectlgetpo ปีที่แล้ว

    Hey Brandon -- could you update the slides link please?

  • @RenanSantos-px9ml
    @RenanSantos-px9ml 3 ปีที่แล้ว

    Hello, I have a VCF file, but I am not sure how to open it. Do you have an advice?

    • @LiquidBrain
      @LiquidBrain  3 ปีที่แล้ว

      Hi, how big is the file? Small file can open in notepad directly, if it is too large you can try to use galaxy to process the file and view from.there

  • @darklordsemih
    @darklordsemih 3 ปีที่แล้ว

    Nice tutorial!

  • @YH_C1111
    @YH_C1111 3 ปีที่แล้ว

    Thank you so much! It is useful for me job interview~

  • @ismailalsalom1779
    @ismailalsalom1779 3 ปีที่แล้ว

    thank you

  • @danielnakamura6430
    @danielnakamura6430 3 ปีที่แล้ว

    thank youuu