How to Apply PCA before K-means Clustering in R Programming (Example) | Principal Component Analysis

Positron IDE for R & Python | How to Install & Use | Better than RStudio? | Ft. @milos-makes-maps

Insightful Data Visualization Using ggplot2 in R (Ft. @rappa753 ) | Drawing Advanced Plots & Graphs

没有一个挑战成功的 #路飞#海贼王

แกล้งเพื่อนด้วย EL GRAN MAJA ใน มายคราฟ (ตาย!?)

ห้องน้ำบ้านหลังใหม่ #ห้องน้ำ

Fuzzy Matching in R (Example) | Approximate String, Name & Text Search | adist(), agrep() & amatch()

Statistics Globe

มุมมอง 8 426

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 14 พ.ย. 2024

ความคิดเห็น • 26

@loancet1878 2 ปีที่แล้ว ⁺¹
Thank you su much ! What you explain is exactly what I was looking for to deal with my data !
@StatisticsGlobe 2 ปีที่แล้ว
This is great to hear Loancet!
@haraldurkarlsson1147 2 ปีที่แล้ว ⁺¹
Use case.
I may have an actual use case for this. In my courses I give so-called fill-in-the-blank(s) questions. Students frequently misspell words in the most inventive ways possible (not by design of course) and I am pretty flexible in terms of giving full credit for "near misses". however, sometimes I wonder "How close is that answer actually?" This lesson gives me some ideas of how that may be accomplished by calculating the LD distance. The course management program I use (Blackboard) is not nearly good enough to do this however by itself. I would have generate versions I accept based on LD and feed those versions to Blackboard myself. I thought I would share these thoughts of mine.
Thanks for the wonderful videos.
@StatisticsGlobe 2 ปีที่แล้ว ⁺¹
Thank you very much for sharing this use case Haraldur! Indeed, this should be a good example where fuzzy matching is useful.
@jennykim5795 2 ปีที่แล้ว ⁺²
Hi, excellent video!!!! What is the default method for measuring distance for function "stringdist" here? Since you didn't set the method, I was curious.
@StatisticsGlobe 2 ปีที่แล้ว
Hey Jenny, thank you very much for the kind feedback, glad you like the video! The default method of stringdist is oas. You can find more info on this here: www.rdocumentation.org/packages/stringdist/versions/0.9.8/topics/stringdist Regards, Joachim
@haraldurkarlsson1147 2 ปีที่แล้ว ⁺¹
Excellent video. Very interesting stuff!
I do have a request or suggestion. Kerby could you do a video or a series of video on NLP (Natural Language Processing)? It seems to be a field that is gaining steam. My son is a layer and a data scientist who studies NLP for legal docs and I would love to know what he does for a living.
@StatisticsGlobe 2 ปีที่แล้ว
Thanks for the kind words and the great suggestion Haraldur! I'll forward it to Kirby. Regards, Joachim
@robertjl5619 2 ปีที่แล้ว ⁺³
Awesome tutorial. Levenstein distance still doesn't beat speed of fuzzyLookup in excel which is a shame. Neither does fuzzy join package. Frustrating bottleneck for automation but the performance is unquestionable. Tokenized jaccard in fuzzyLookup in excel still the king.
@StatisticsGlobe 2 ปีที่แล้ว ⁺¹
Hey Robert, thanks a lot for the kind words and the additional info!
@robertjl5619 2 ปีที่แล้ว ⁺¹
@@StatisticsGlobe love your vids bud and your no bullshit approach. keep it up!
@StatisticsGlobe 2 ปีที่แล้ว ⁺¹
Thanks mate! :)
@tildawilson1198 ปีที่แล้ว ⁺¹
How are you viewing the actual values ([1] "Bill Clintion" "Barack Obama") rather than just the numbers ([1] 5 3) in this? I see you switch back and forth a bunch of times but I'm not sure how you're doing that.
@cansustatisticsglobe ปีที่แล้ว
Hello Tilda,
You can use the value=TRUE argument in the use of agrep() function. It would give you the exact values or use the amatch() in square brackets to identify the index positions in the pres_df data frame. The script is given below the video. You should click on show more to see it.
Regards,
Cansu
@jelly3388 ปีที่แล้ว ⁺¹
amazing!
@matthias.statisticsglobe ปีที่แล้ว
Hey Jelly, thanks for the positive feedback! Glad you like the video!
@manny1manito2 2 ปีที่แล้ว ⁺²
this is great, would fuzzy_join work with dates?
@StatisticsGlobe 2 ปีที่แล้ว ⁺¹
Thank you! I have never done this myself, but this Stack Overflow thread seems to discuss your question: stackoverflow.com/questions/58718287/fuzzyjoin-with-dates-in-r
@andrea-mj9ce 2 ปีที่แล้ว ⁺¹
So _amatch_ is the most general function here for fuzzy matching
@StatisticsGlobe 2 ปีที่แล้ว
Hey Andrea, sorry for the delayed response, I was on vacation and couldn't reply earlier. Could you please explain your comment in some more detail? I'm afraid I don't get it :) Regards, Joachim
@1453angela 4 หลายเดือนก่อน ⁺¹
Hello! If I want to do an exact match and a fuzzy match at the same time how can I do it? 🥺
@StatisticsGlobe 4 หลายเดือนก่อน
Hey, I'm not sure if I understand your question. How would this work theoretically?
@Tommygun0110 ปีที่แล้ว ⁺¹
nice
@matthias.statisticsglobe ปีที่แล้ว
Hi Olphy, thanks for the comment! Glad you like it!
@paulboutros6093 2 ปีที่แล้ว ⁺¹
What do you suggest for a large data? (About 600,000)
@StatisticsGlobe 2 ปีที่แล้ว
Hey Paul, have you tried the code of this video? Did you get any error messages?

ต่อไป

เล่นอัตโนมัติ

How to Apply PCA before K-means Clustering in R Programming (Example) | Principal Component Analysis

How to Apply PCA before K-means Clustering in R Programming (Example) | Principal Component Analysis

Positron IDE for R & Python | How to Install & Use | Better than RStudio? | Ft. @milos-makes-maps

Positron IDE for R & Python | How to Install & Use | Better than RStudio? | Ft. @milos-makes-maps

Insightful Data Visualization Using ggplot2 in R (Ft. @rappa753 ) | Drawing Advanced Plots & Graphs

Insightful Data Visualization Using ggplot2 in R (Ft. @rappa753 ) | Drawing Advanced Plots & Graphs

没有一个挑战成功的 #路飞#海贼王

没有一个挑战成功的 #路飞#海贼王

แกล้งเพื่อนด้วย EL GRAN MAJA ใน มายคราฟ (ตาย!?)

แกล้งเพื่อนด้วย EL GRAN MAJA ใน มายคราฟ (ตาย!?)

ห้องน้ำบ้านหลังใหม่ #ห้องน้ำ

ห้องน้ำบ้านหลังใหม่ #ห้องน้ำ

When Cucumbers Meet PVC Pipe The Results Are Wild! 🤭

When Cucumbers Meet PVC Pipe The Results Are Wild! 🤭

Draw PCA Biplot & Loading Plot in R (Example) | Apply & Visualize Principal Component Analysis

Draw PCA Biplot & Loading Plot in R (Example) | Apply & Visualize Principal Component Analysis

Row & Column Operations Using dplyr in R | Select, mutate, rename, arrange, slice, filter, count

Row & Column Operations Using dplyr in R | Select, mutate, rename, arrange, slice, filter, count

Read, Row-Bind, Summarize & Visualize Multiple Data Sets in R | tidyverse, readr, dplyr & ggplot2

Read, Row-Bind, Summarize & Visualize Multiple Data Sets in R | tidyverse, readr, dplyr & ggplot2

Analyze & Visualize Country Data in R | tidyverse, dplyr & ggplot2 | Group, Summarize & Draw Bars

Analyze & Visualize Country Data in R | tidyverse, dplyr & ggplot2 | Group, Summarize & Draw Bars

Calculate Grouped Summary Statistics in R | group_by & summarize of dplyr Package | Multiple Columns

Calculate Grouped Summary Statistics in R | group_by & summarize of dplyr Package | Multiple Columns

How to Create a Tree Height Map in R | Example Ft @milos-makes-maps | Visualize Global Forest Canopy

How to Create a Tree Height Map in R | Example Ft @milos-makes-maps | Visualize Global Forest Canopy

Analysis of Variance (ANOVA) in R | Tukey's HSD Test, Visualization, Assumption Check, Normality

Analysis of Variance (ANOVA) in R | Tukey's HSD Test, Visualization, Assumption Check, Normality

ChatGPT Coding Limitations! #ChatGPT #OpenAI #AI #Programming #Coding #DataScience

ChatGPT Coding Limitations! #ChatGPT #OpenAI #AI #Programming #Coding #DataScience

Rey Mysterio kept Kurt Angle guessing

Rey Mysterio kept Kurt Angle guessing

ข่าวชาวนา ไร่ละ1000 14 พ.ย.2567

ข่าวชาวนา ไร่ละ1000 14 พ.ย.2567

จากโรงเรียนสู่โรงบาล! ชวนทั้งบ้านไปส่งบีบีเปิดเทอม แต่ไปจบที่โรงบาลได้ไง | BB Memory

จากโรงเรียนสู่โรงบาล! ชวนทั้งบ้านไปส่งบีบีเปิดเทอม แต่ไปจบที่โรงบาลได้ไง | BB Memory

แคสซี่มาช่วยหมูเด้ง แต่... 😱 | Garena Free Fire

แคสซี่มาช่วยหมูเด้ง แต่... 😱 | Garena Free Fire

Make A List Eat With Alek EP.12 ‘อาเล็ก’ พา ‘หลิง - ออม’ ตะลุยกินทั่วห้างฉลองสิ้นปีแบบจัดเต็ม

Make A List Eat With Alek EP.12 ‘อาเล็ก’ พา ‘หลิง - ออม’ ตะลุยกินทั่วห้างฉลองสิ้นปีแบบจัดเต็ม

เซอร์ไพรส์พี่ชาย พากลับไปเจอทีมงานพม่า(พี่นาย,พี่แหวน,พี่โจ) ที่ไม่เจอนาน2ปี!! ดีใจจนร้องไห้

เซอร์ไพรส์พี่ชาย พากลับไปเจอทีมงานพม่า(พี่นาย,พี่แหวน,พี่โจ) ที่ไม่เจอนาน2ปี!! ดีใจจนร้องไห้

ถ้าครูบามีโปเกม่อน #shorts

ถ้าครูบามีโปเกม่อน #shorts

拿柴房造了一个森林衣帽间~I turned the Woodshed into a Forest-Themed Closet.丨Liziqi Channel

拿柴房造了一个森林衣帽间~I turned the Woodshed into a Forest-Themed Closet.丨Liziqi Channel