Cleaning and manipulating data with the tidyverse: dplyr, readr, and stringr in action (CC121)

แชร์
ฝัง
  • เผยแพร่เมื่อ 6 ม.ค. 2025

ความคิดเห็น • 27

  • @Riffomonas
    @Riffomonas  3 ปีที่แล้ว +2

    What part of working with regular expressions do you find the most frustrating?

  • @getachewaga7734
    @getachewaga7734 ปีที่แล้ว

    HI Pat, great presentation. I love your tutorials. Many thanks for your generous gift.

  • @timmytesla9655
    @timmytesla9655 2 ปีที่แล้ว +2

    I learnt about rename_all() for the first time. Thank you.

    • @Riffomonas
      @Riffomonas  2 ปีที่แล้ว

      Wonderful! So glad you learned something new. I mainly use tolower, but you can use any function with rename_all

  •  2 ปีที่แล้ว +2

    Great example of real use for manipulating data. Thanks!

    • @Riffomonas
      @Riffomonas  2 ปีที่แล้ว

      Thanks Victor! 🤓

  • @haraldurkarlsson1147
    @haraldurkarlsson1147 ปีที่แล้ว

    Clear and purpose-driven!

  • @m.shihamadam4682
    @m.shihamadam4682 ปีที่แล้ว

    I am loving it - just found about the channel!

  • @marialeticiabonatelli5768
    @marialeticiabonatelli5768 2 ปีที่แล้ว

    So happy, it worked! And I had a completely different raw data - from dada2!

  • @mocabeentrill
    @mocabeentrill ปีที่แล้ว

    Thanks Pat. Really great tutorial.

  • @mikodine
    @mikodine ปีที่แล้ว

    Hi Pat, great tutorial! Thanks a lot for this :)

  • @davidjackson7675
    @davidjackson7675 ปีที่แล้ว

    Nice job.

  • @shrez28
    @shrez28 3 ปีที่แล้ว +1

    Will machine learning methods such as performing random forest on compositional data (as we have relative abundances for the genera) require any type of data transformation?

    • @Riffomonas
      @Riffomonas  3 ปีที่แล้ว

      We’ll do some transformations but not for compositionality. Depending on your application that may be something to experiment with

    • @shrez28
      @shrez28 3 ปีที่แล้ว

      @@Riffomonas Thanks, will wait for the tutorials. Your videos are really helpful :)

  • @ahmed007Jaber
    @ahmed007Jaber 2 ปีที่แล้ว +1

    Hi Pat, thank you for this. Is there a way to specify in the regular expression
    if you don't find this regex, look for 2nd form, 3rd form etc???

    • @Riffomonas
      @Riffomonas  2 ปีที่แล้ว

      Tha is for watching! Sorry. Not that I’m aware of other than setting up multiple separate searches

    • @ahmed007Jaber
      @ahmed007Jaber 2 ปีที่แล้ว

      @@Riffomonas i see. what I am thinking of and facing is that i have some entries like a code
      D alphabit
      d digits
      DDD-ddd-DDD
      this patter could repeat fully or partially e.g.
      DD-dd-DDD-ddd-dddd
      DDD-ddd-DDD-DDD-ddd-DDD
      the delimiter is always there

    • @Riffomonas
      @Riffomonas  2 ปีที่แล้ว

      @@ahmed007Jaber Ah, maybe try using the separate function. If you look back through past episodes I have one in there on how to use it. Basically, it lets you separate a column by a delimeter into new columns

    • @ahmed007Jaber
      @ahmed007Jaber 2 ปีที่แล้ว

      @@Riffomonas thank you; will check it

    • @ahmed007Jaber
      @ahmed007Jaber 2 ปีที่แล้ว

      @@Riffomonas HI Pat, thank you for the effort you put in replying and sharing knowledge. your comments clicked with me.
      did look quickly at the aforementioned video and tried the following in regex to match another pattern
      str_extract(x, pattern= "(........) | (......)")
      it worked like a charm

  • @EMlSHI
    @EMlSHI 3 ปีที่แล้ว +2

    Could you introduce how to clean up and manipulate taxnomy data output from qiime2?

    • @Riffomonas
      @Riffomonas  3 ปีที่แล้ว +2

      Hah! I likely have a conflict of interest, but I’d strongly encourage you to work with your data in mothur instead

  • @wapsyed
    @wapsyed ปีที่แล้ว

    You look and sound like a smart version of Seth Rogen haha

  • @richardbennett4365
    @richardbennett4365 ปีที่แล้ว

    The presenter appears to know R; however he doesn't seem to know that parentheSIS is the singular for parentheSES. 😮

    • @Riffomonas
      @Riffomonas  9 หลายเดือนก่อน

      Sorry for my midwestern drawl that tends to mangle some words. At least I know that “data” is plural though. #amirite?

    • @richardbennett4365
      @richardbennett4365 9 หลายเดือนก่อน

      It's all good. No problem.