A/B Testing in Data Science Interviews by a Google Data Scientist | DataInterview

DataInterview

มุมมอง 107 464

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 25 มิ.ย. 2024
👉 Looking for a comprehensive AB testing course? Visit www.datainterview.com/
👉 Join the Data Scientist Interview Bootcamp: www.datainterview.com/bootcam...
====== ✅ Details ======
🤔 Ever wondered how to ace the A/B testing questions in data science interviews?
Explained by Dan, a former data scientist at Google, explains how to design an A/B test based on a real-life example. The procedure and tips covered in the video are insightful for data science interviews at top companies such as Google and Meta.
Dan, the host, was a data scientist formerly at Google and PayPal. He launched datainterview.com/ to help candidates like you eliminate frustrations about the data science interview process and increase your success.
As an interview coach, Dan helped several clients land their dream jobs as IC and managerial DS roles at top companies such as Google, Meta, Amazon and such. Message him at Dan@DataInterview.com for help!
👍 Make sure to hit the like, and check out datainterview.com/
====== ⏱️ Timestamps ======
0:00 Intro
1:02 7 Steps in AB Testing
3:02 Step 1 - Problem Statement
4:44 User Funnel
6:09 Success Metric
8:06 Step 2 - Hypothesis Testing
9:41 Step 3 - Design the Experiment
11:34 Step 4 - Run the Experiment
12:32 Step 5 - Validity Check
15:13 Step 6 - Interpret Results
16:13 Step 7 - Launch Decision
====== 📚 Other Useful Contents ======
1. Principles and Frameworks of Product Metrics | TH-cam Case Study
Link: / principles-and-framewo...
2. How to Crack the Data Scientist Case Interview
Link: / crack-the-data-scienti...
3. How to Crack the Amazon Data Scientist Interview
Link: / crack-the-amazon-data-...
====== Connect ======
📗 LinkedIn - / danleedata
📘 Medium - / datainterview
วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 101

@SM-gf7ux 9 หลายเดือนก่อน ⁺²⁰
Dan - watching your videos helped me land a data scientist position at a FAANG company. So grateful for your knowledge and your ability to share it with a wide audience :)
@raamstarraamstar5115 2 หลายเดือนก่อน
Can u please share me. What’s way you followed to land a job in FAANG please
@grabngoinfo ปีที่แล้ว ⁺³
This is one of the best videos about AB testing interview prep on TH-cam. Great job and thanks for sharing!
@adazhu8211 11 หลายเดือนก่อน ⁺⁵
This is the best video I’ve seen about A/B testing. This is exactly a day to day DS work in tech companies. I very appreciate your sharing and looking forward for your new videos. Wishes you all the best!
@masonlee8573 2 ปีที่แล้ว ⁺²⁶
This is a great video step-by-step explanation and logics through the A/B testing setup! I am a recent DS bootcamp graduate and was struggling with this topic and this video helped tons! Thanks Dan
@blueberryblue2520 ปีที่แล้ว
Your English pronunciation is the most comfortable and tolerable throughout TH-cam. Thank you so much for that. You saved my life.
@phamvinguyen ปีที่แล้ว ⁺¹
This is the best video about A/B testing in practice I've ever seen. Thank you for sharing
@user-uc2tq4ng6z 10 หลายเดือนก่อน
The best video about A/B testing, definitely!
@alicel6813 2 ปีที่แล้ว ⁺²
Love the content! So much valuable insight into A/B Testing! Keep sharing more content.
@skid-ed2qk 2 หลายเดือนก่อน
Thanks for sharing! This video on A/B testing is hands down the most informative and practical one I've seen on TH-cam.
@ustckewen 2 ปีที่แล้ว ⁺³
Thank you very much, Dan! Very Helpful.
@martingarcia3999 10 หลายเดือนก่อน
your channel is GOLD! THANK YOU
@TuanSihan 2 ปีที่แล้ว
I'm currently preparing for Data Scientist interviews and this video dropped right on time. Thank you so much for such amazing content, Dan!!
@cococnk388 ปีที่แล้ว ⁺¹
How was the interview?
@TheYashRathi 10 หลายเดือนก่อน ⁺¹
This is such a great video. This video is enough when you are preparing for analytics experimentation interviews
@vaibhavgedigeri7289 2 ปีที่แล้ว ⁺⁶
This video literally made me subscribe to you channel Dan! I have gone through countless materials and research papers on experimentation and by this is one of the best summary of the steps! Great work and I am excited to see more videos from you(especially around pitfalls for multiple metrics and more)!
@DataInterview 2 ปีที่แล้ว ⁺¹
Thanks Viabhav! I created the lesson based on what I wish I had when I first learned about AB testing a couple years ago -- Dan
@lightmusicable 2 ปีที่แล้ว ⁺¹
Thank you for the great content, really loved it. It will be great if you could have some videos around most commonly used statistical tests in depth and A/B testing pitfalls. Will keep watching！
@princeok12 ปีที่แล้ว
Extremely helpful. Thank you for this step by guide to A/B Testing.
@TadBitGains 2 ปีที่แล้ว ⁺²
I was asked a similar A/B testing question on a Data Science interview with LinkedIn. Had I seen this video a week prior, I would’ve made it to the 3rd round. Better late than never. Great content! I just subscribed.
@DataInterview 2 ปีที่แล้ว ⁺¹
Best of luck with your prep! -- Dan
@prachihamlai2717 18 วันที่ผ่านมา
Thank you for creating this video. It provides such a clear and concise overview of A/B testing 👏
@hemalitanna ปีที่แล้ว ⁺²
I am a product designer, and this is so great to learn. Thanks for explaining it in detail in a simple to understand way!
@DataInterview ปีที่แล้ว
Thank you!
@rishabagrawal5332 2 ปีที่แล้ว
Great content. Simple concise explanation.
@shounokbanerjee4757 2 ปีที่แล้ว ⁺⁴
please make a video on how the hypothesis is being tested and we get the p value/CI
@madluvx 2 ปีที่แล้ว ⁺²
Thank You for the concise explanation.
@DataInterview 2 ปีที่แล้ว ⁺¹
Of course! -- Dan
@apoorvasrini2196 2 ปีที่แล้ว ⁺¹
This video is awesome and really helpful if anyone wants to learn AB testing but doesn't have the time to finish the entire popular Udacity course. Thank you for making it so clear and interpretable!
@DataInterview 2 ปีที่แล้ว ⁺⁵
And that’s why I made this video. Ran into a lot of frustrations when I was learning AB testing myself.
@rezarafieirad ปีที่แล้ว
great work. i love this course and it helped me alot.
@ryanmitchell2208 ปีที่แล้ว ⁺¹
This video made me subscribe to your channel. Extraordinarily detailed explanation of all of the things that go into experiment design.
@kesemadi8075 ปีที่แล้ว
Great content! Thank you!
@Whateverrrrr-no5em 2 หลายเดือนก่อน
the best a b testing video
@JayAgaskar ปีที่แล้ว ⁺¹
Amazing explaination and super concise, love it
@DataInterview ปีที่แล้ว
Thanks man! - Dan
@iLoveTurtlesHaha 2 หลายเดือนก่อน
Bro I have an interview in the morning and your video is giving me the confidence I need to go through with it. 😅
@aravindkramesh 2 ปีที่แล้ว ⁺¹
*Please upload more A/B testing videos. This topic is too confusing. Could you please upload a video explaining the power of hypothesis test and how to choose the sample size for an A/B test?*
@surekhakatreddy6400 2 ปีที่แล้ว ⁺²
A very detailed explanation to get started on A/B Testing. Thank you so much.
@DataInterview 2 ปีที่แล้ว ⁺¹
Of course!
@surekhakatreddy3531 2 ปีที่แล้ว
@@DataInterview Can you please explain difference between Frequentist and Bayesian approaches in A/B Testing.
@andresfelipehiguera785 7 หลายเดือนก่อน
Excellent video!
@alexandragrishechko6618 ปีที่แล้ว
Clear and helpful, thank you
@DataInterview ปีที่แล้ว
Thank you!
@mingzhouzhu4668 2 ปีที่แล้ว
this is so helpful
@yoggita 2 ปีที่แล้ว
Respect !! ❤️
@navkaransinghbrar4380 4 หลายเดือนก่อน
🎯 Key Takeaways for quick navigation:
00:00 *🎯 AB testing is vital for data science interviews at top companies.*
01:09 *🛠️ Seven steps of AB testing: Understand problem, define hypotheses, design experiment.*
03:00 *🛍️ Example: Designing experiment for an online store, clarify success metrics.*
05:31 *📊 Success metrics must be measurable, sensitive, and timely.*
08:00 *🧪 Establish hypotheses, set significance level, plan experiment.*
09:38 *📐 Design experiment: Determine sample size, run for 1-2 weeks.*
11:30 *🕵️ Check validity: Perform sanity checks, ensure randomization balance.*
14:44 *📈 Interpret results: Analyze metric direction, P-value, and confidence interval.*
17:23 *💡 Make decisions: Evaluate scenarios based on lift and confidence intervals.*
Made with HARPA AI
@kashi1504 3 หลายเดือนก่อน
This was really good
@TheElementFive 2 ปีที่แล้ว ⁺²
Excellent content as always Dan! As an expansion on the topic, could you consider doing a video on variation reduction techinques (e.g., CUPED, MLRATE) ?
@DataInterview 2 ปีที่แล้ว ⁺¹
Yes!
@Y14km 6 หลายเดือนก่อน
hi dan, first of all thanks for all the great work. Couple of questions on my end:
-In another video (spotify case) you mentioned success metric shouldnt be top line such as retention. However, most PMs would like to see improvement in KPIs like retention. If success metric isnt something like retention, how do we guarantee the link between success metric and topline KPI (such as does improvement in CTR also improve retention in long/short term?
-another question is say we saw a signicant change in country a, but no change in country b. Since unified product would be more desirable(it doesnt make sense to roll out different versions of product to different countries) how do we make the final decision here?
@cococnk388 ปีที่แล้ว ⁺³
Very interesting ..well understood except the launch decision part .. the diagram with horizontal lines at different levels.
@samw2066 หลายเดือนก่อน ⁺¹
Excellent video. Should the candidate list the guardrail metrics way before step 7 (i.e., step 1) so those are known & agreed upon before the test starts ?
@user-hw8gx9vh5v 4 หลายเดือนก่อน
Hi Dan! Thanks for this great video. In the end, when you compare practical significance, do you compare the experiment Lift value with the confidence interval? Is this possible even that the formula of confidence interval uses absolute difference?
@kangel0909 9 หลายเดือนก่อน ⁺²
Hmmm, revenue per day per user. That seems much more variable than conversions per user sessions, which is binomial with bounded variance. Especially for clothing on an ecommerce site that isn't Amazon. Can you help me understand why you chose this as the primary success metric?
@aakashdusane 2 หลายเดือนก่อน
oh! I was thinking the same thing!
@yifengwu5893 ปีที่แล้ว
Great video. But I have a question. Isn't Step 5 Validity Check ought to be done before running the real experiment?
@ayushbhandari5771 3 หลายเดือนก่อน
Hey Dan, really great video. One question. What will be the possible metric if we want to test a recommendation system for a fee i.e. for items shown on homepage rather than when search happens.
@shivanshugupta6503 ปีที่แล้ว ⁺¹
Can you explain same concept using jupyter notebook where you are following all the concepts that you talk in this video.?
@friendsplain 11 หลายเดือนก่อน ⁺¹
@datainterview This is an amazing review of A/B testing in interviews! Quick question, if we have a minimal sample size value calculated, why do we also determine a duration of an experiment? I.e. if sample size is calculated to 500 users with searches per group, shouldn't the test duration be how long it takes a business to reach 500 users during the test period? And not an arbitrarily chosen time such as 1 to 2 weeks?
@BryceChudomelka ปีที่แล้ว ⁺¹
This is amazing. Thank you. I will send you a Starbucks gift card if I get the job.
@mengfanying4781 6 หลายเดือนก่อน
Hi Dan, for step 6, how do you get the confidence interval (3.4%, 5.4%)? Thank you.
@alecryan8220 2 หลายเดือนก่อน
Hey Dan, can you talk about the decision to pick revenue-per-user-per-day? Doesn't using this metric require the delta method or bootstrapping if randomizing by user? Thanks!
@ayushsengar4153 ปีที่แล้ว ⁺³
Which statistical test did you use to get the p-value while interpreting the results
@sunstar123 9 หลายเดือนก่อน
Same question. How did he get p value as 0.001
@saeedesmailii 6 หลายเดือนก่อน
Great video and summary. Just note that you've made a subtle mistake in explaining the novelty effect. The novelty effect in interpreting A/B test results refers to the impact that the introduction of a new feature or change may have on user behavior simply because it is new or different, and it's not about differentiating between new and existing users.
@sneharaj2152 2 ปีที่แล้ว ⁺³
Why is this a two sided hypothesis test instead of a straightforward one sided test?
@adithyajob8728 10 หลายเดือนก่อน
awesome
@jashanmalhi9791 2 ปีที่แล้ว
Hey Dan I am currently in my final semester of Data Science undergrad degree and am graduating a year early. I unfornately do not have any internships, but do have some projects. I am a little confused on what do after I graduate. Whether I should start applying to jobs or do a bootcamp. Please any advice will help.
@nothingspecific314 2 ปีที่แล้ว ⁺²
Hi Dan,
Thanks for the video.
1. Isn't it inappropriate to compare the orders per user per day of treatment vs control? What if a big proportion of the treatment group do not use the treatment (don't use the searchbox driven by the new search algorithm)?
2. How do you define orders/user/day? is it = (total orders)/(total users x days run) or do you define the statistic at a user level and generate the distribution of means?
3. How would you assess the influence of position and popularity biases on your results?
@vibzee4118 2 ปีที่แล้ว
Nice questions, now I am also curious to know the answers
On proportion of groups not using search box, my thought is, we may take care of that by checking the stage at which user is entering in funnel
@itsleomarx ปีที่แล้ว
Do you think that I need to get a course on data science first and then your course or I can jump straight into yours?
@kinder458 8 หลายเดือนก่อน
Excellent content, I would appreciate just a slight improvement.
When you've assumed you want an alpha of 5% (in this case 2.5 per side) and you've also determined the beta. There is a different equation that gives you an output of how many people the experiment should be performed according to those assumptions. i don`t now wich one is better but from what i see is that yours doesn`t take into account the d of the experiment
more importantly, you can`t always use the sigma, it depends on several cases like the size of the test, independency of the samples and his unbiased estimator
the variance could be sigma OR S^2 OR sp OR SD
n>=(Sigma * (Z(1-a) + Z(1-Beta) / miu0 - miu1)^2
@hao-chuanhsieh8586 ปีที่แล้ว ⁺¹
Wouldn‘t it violate the independent assumption if we use revenue per user per day for a/b testing? Since a high portion of low engaged users would just keep not buying anything.
@kaiyan9589 หลายเดือนก่อน
what software/ tool is used for the A/B testing example in the video?
@beyondtheclouds95 ปีที่แล้ว
when you increase power (1-beta), you are actually increasing recall not precision
@anirbansen7132 2 ปีที่แล้ว
Thanks for the great video. I have a question w.r.t the sample size that you mentioned. With a 50:50 split on a website, there will be numerous sessions coming in. So, is the sample size the minimum number of sessions we need on each side to run a test. Or do we randomly sample X samples from all the incoming sessions, X being the sample size?
@DataInterview 2 ปีที่แล้ว ⁺³
Randomly sample users, not sessions.
@aakashdusane 2 หลายเดือนก่อน
What if we have same user making multiple purchases across the day? Does it make sense to have metric as revenue-per-session / revenue-per-search
if we have multiple purchases by same user in a day, and metric is revenue per user per day, wont we have to change the randomization unit?
@monicaakannan9461 หลายเดือนก่อน
@datainterview Wouldn't the click through rate or number or products purchased per user per day be a more effective success metric? If it is revenue per user per day, what if the control group purchases only one high priced product whereas treatment group purchases multiple low prices products, the recommendation system works but the revenue would be higher for the control group
@randomancy 7 หลายเดือนก่อน
In an A/B interview, is it common to need to calculate the Z-value or interview endpoints manually?
@minhmuic ปีที่แล้ว
How do you get the p value 0.001 ?
@gemini22581 2 หลายเดือนก่อน
What is lift?
@agustinnegri6273 2 ปีที่แล้ว
Hi. Thanks for the Great video. At the moment I'm stuck with sample size, conversion rate during the test (let's say 2 weeks) and getting a histogram of the data. After 2 weeks I'll get one conversion rate. How can I get an histogram from only one data. "Inside" the CR there is a huge N (let's say 1000 users). I'm confused how can I extract the data from 1000 users to make a CR histogram. Could you help me out? Cheers
@MashiroRedo 2 ปีที่แล้ว
Amplitude or have a data engineer/swe get it for you.
@SumedhaNagpalxyz ปีที่แล้ว
Hi Dan, Very informative video!
Would you have suggestions on how to select randomization units for B2B products like data bricks and slack, which have a many to 1 relationship between users and accounts?
@DataInterview ปีที่แล้ว
Randomize at the account level - sort of like cohort-based randomization. So users within the sample cohort is exposed to the same variation condition.
@ONoesBird 2 ปีที่แล้ว
Why do we use a two sided hypothesis test here? Can we not assume the new algorithm is at least the same as the old one and use the extra power?
@juanmora537 ปีที่แล้ว ⁺¹
You're allowing for the possibility that the new ALGO might actually hurt your KPI goals
@thomasanderson857 2 ปีที่แล้ว ⁺⁵
Can you explain how you got p-value = 0.001 and confidence interval = [3.4, 5.4]?
@monikakhanchandani2742 2 ปีที่แล้ว ⁺¹
I have the same question
@samiyehmahmoudian9673 2 ปีที่แล้ว ⁺¹
I have the same question too
@pal999 2 ปีที่แล้ว ⁺¹
We're all Waiting for the reply @Dan
@yankeephan87 2 ปีที่แล้ว ⁺¹
@datainterview
@abdullahfaizal7739 5 หลายเดือนก่อน ⁺¹
Your explanation is great but the you could avoid the background music. Infuriating!...
@robertwilsoniii2048 10 หลายเดือนก่อน
Welp, these videos tell me I am qualified to work at google in data science. Nevertheless, I don't see job openings in 2023. And I dropped out of school, so I've never been offered an interview despite my ability in statistics.
@robertwilsoniii2048 10 หลายเดือนก่อน
On the other hand, I'm not sure I really care about how effective ads are and getting users to make purchases. I'd rather study other things like whether or not certain types of jobs accelerate biological aging non-linearly.
@davidbrown6612 2 ปีที่แล้ว
??????? ?
@abdibabu177 ปีที่แล้ว
I ni cash
@ilikegeorgiabutiveonlybeen6705 หลายเดือนก่อน
idc about "acing" anything

ต่อไป

เล่นอัตโนมัติ

Learn Product Sense in 9 Minutes | Data Science Interviews | Interview Coach (Ex-Google)