Import Sequences From NCBI in R

แชร์
ฝัง
  • เผยแพร่เมื่อ 15 ม.ค. 2025

ความคิดเห็น • 9

  • @swarupdas8403
    @swarupdas8403 2 ปีที่แล้ว +2

    Sir, I find your videos very helpful please upload more like this

  • @BRYANJESUSCHEROCALLE
    @BRYANJESUSCHEROCALLE ปีที่แล้ว +1

    Thank you so much Sir

  • @MrArunavadatta
    @MrArunavadatta ปีที่แล้ว +1

    Thanks a lot ☺️

  • @sophiedelitene5049
    @sophiedelitene5049 ปีที่แล้ว +1

    Hello
    Thanks for this video 🤗
    I want to know if it’s possible to import information for 1000 sequences through a file containing differents accession numbers of sequences.
    Thanks for you help

    • @bioinformaticswithease2904
      @bioinformaticswithease2904  ปีที่แล้ว +1

      Yes it is not difficult. It is however somewhat slow due to the speed at which the records are pulled from ncbi but not nearly as slow as downloading the sequences manually. The code to do this is as follows:
      ```{r}
      library(traits)
      #First read each line from the text file and put it into a df cell
      accessionsdf

    • @bioinformaticswithease2904
      @bioinformaticswithease2904  ปีที่แล้ว +1

      You can also download any of the other values for each accession number. The available values that are shown at www.rdocumentation.org/packages/traits/versions/0.5.0/topics/ncbi_byid, are:
      taxon - taxonomic name (may include some junk, but hard to parse off)
      taxonomy - organism lineage
      gene_desc - gene description
      organelle - if mitochondrial or chloroplast
      gi_no - GI number
      acc_no - accession number
      keyword - if official DNA barcode
      specimen_voucher - museum/lab accession number of vouchered material
      lat_lon - longitude/latitude of specimen collection event
      country - country/location of specimen collection event
      paper_title - title of study
      journal - journal study published in (if published)
      first_author - first author of study
      uploaded_date - date sequence was uploaded to GenBank
      length - sequence length
      sequence - sequence character string

    • @sophiedelitene5049
      @sophiedelitene5049 ปีที่แล้ว

      Thanks a lot for your help 🥳
      I do this successfully without major problem
      A little problem was the number of sequences to insert in the loop but finally I managed to download sequences and informations by step of 200 and after I did a “rbind” for merging all informations in one data frame
      Thanks again 🙏🏼🙏🏼

    • @motminhdihoc
      @motminhdihoc ปีที่แล้ว

      @@bioinformaticswithease2904 Thank you very much for your help, Sir. I want to ask how to import downloaded sequences or my lab own sequences data set to R?

  • @brendanmaher8267
    @brendanmaher8267 ปีที่แล้ว

    How do we do this the other way? I’m trying to give R a dna sequence to search blast with and print the results of the blast search