Amazon Business Intelligence Mock Interview: Duplicate Products

  • Published on May 28, 2020
  • Today I'm joined by Shashank, who is a data scientist and business intelligence engineer at Amazon!
    Question page: www.interviewquery.com/questi...
    Want to be featured in the next mock interview video? Apply here: airtable.com/shrdQrwKK7xxGLm6l
    👉 Subscribe to my data science channel: bit.ly/2xYkyUM
    🔥 Get 10% off your next data science interview prep: www.interviewquery.com/pricin...
    ❓ Check out our data science interview course: www.interviewquery.com/course...
    🔑 Get professional coaching here: www.interviewquery.com/coachi...
    🐦 Follow us on Twitter: / interview_query
    Shashank and I work through a business case study problem that is asked pretty frequently of Amazon business intelligence engineers and analysts. It involves breaking a vague database problem down into a workable solution that makes sense and can scale for Amazon's purposes. At the end we go over feedback, tips, and tricks for the next one!
    Quick Links:
    0:35 - Shashank Intro
    2:55 - Business Intelligence Case Question
    11:55 - Feedback and Tips
    More from Jay:
    Follow me on LinkedIn: / jay-feng-ab66b049
    Read my personal blog: datastream.substack.com/
    Find me on Twitter: / datasciencejay
    Read more:
    The Amazon Business Analyst Interview: www.interviewquery.com/blog-a...
    Amazon Business Intelligence Engineer Interview: www.interviewquery.com/blog-t...
    Amazon Machine Learning Questions: www.interviewquery.com/blog-a...
  • Science & Technology

Comments • 55

  • @prasshanthvp1082 3 years ago +51

    My 2 cents: Product similarity is a complex problem to solve, even for the established top 4 companies. Image matching for products requires a data science solution, but that's just one small part. String matching could be handled better with a data science solution, but if all you have is SQL, you can still create a good baseline using string functions such as regex functions for matching. Your product-matching confidence increases as you introduce more parameters to match on. Dimensions like weight, size, and height, as someone pointed out in the comments, are good examples. Reviews are not, simply because of the sheer noise and the overhead involved in parsing long-winded strings. Think of which parameters you would look at when buying a smartphone, a good laptop, or a nice pair of earphones, and retain only those parameters that make sense for matching two products.
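
As a rough illustration of the idea in the comment above (normalize names with regex functions, then let confidence grow with each additional attribute that agrees), here is a minimal Python sketch. The field names `name`, `weight_g`, and `height_mm`, the scoring rule, and the sample rows are all assumptions made for illustration; they are not from the video.

```python
import re

def normalize(name: str) -> str:
    """Lowercase, replace punctuation with spaces, collapse whitespace."""
    name = re.sub(r"[^a-z0-9 ]+", " ", name.lower())
    return re.sub(r"\s+", " ", name).strip()

def match_confidence(a: dict, b: dict, fields=("name", "weight_g", "height_mm")) -> float:
    """Fraction of compared fields that agree.

    Fields missing on either side are skipped, so confidence is based
    only on the attributes both listings actually provide.
    """
    compared = 0
    agreed = 0
    for field in fields:
        left, right = a.get(field), b.get(field)
        if left is None or right is None:
            continue
        if field == "name":
            left, right = normalize(left), normalize(right)
        compared += 1
        if left == right:
            agreed += 1
    return agreed / compared if compared else 0.0

# Hypothetical sample listings.
p1 = {"name": "Apple iPhone X (64GB)", "weight_g": 174, "height_mm": 143.6}
p2 = {"name": "apple  iphone x 64gb", "weight_g": 174, "height_mm": 143.6}
p3 = {"name": "Apple iPhone 8 (64GB)", "weight_g": 148, "height_mm": 138.4}

print(match_confidence(p1, p2))
print(match_confidence(p1, p3))
```

Exact equality per field is the simplest possible rule; a real baseline would probably allow tolerances on numeric fields and weight some attributes more heavily than others.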

  • @dhaneshshirodkar365 3 years ago +22

    This is more like a Data Science interview

  • @upskill9257 2 years ago +11

    I've been looking for realistic data science/analytics case study discussions for some time. This has been by far the most relatable one for me. Please add more content like this.

  • @manoharlakshmana6171 4 years ago +28

    This is what we need!!! Case study mock interviews!!! Thank you so much!!! Hope you will upload more videos like this.

  • @saumya4582 3 years ago +1

    Thanks for this and many other videos, Jay! I was looking for ways to learn which interview questions get asked so I could practice them, and stumbled upon your channel. Super helpful, as I had been rejected after interviewing at my dream company and was starting over, working on my gaps so that I can apply again.

  • @manoharlakshmana6171 4 years ago +2

    Please make a video on this: Suppose an e-commerce website is planning a flash sale. How would you select which products to put on sale? You could add more questions to it, followed by A/B testing: what kinds of statistical tests would be used, and where?

  • @mallikatiwari252 4 years ago +12

    Super helpful! Can you please do a similar video on the Amazon Business Analyst role?

  • @xiaoyangwu9270 4 years ago +3

    Thanks for all the advice on bringing ML/DS algorithms into the data engineering/data warehouse part of the task. I have always wondered how the two can be combined when solving problems together.

  • @nitishnayak6834 3 years ago +13

    Can we use the dimensions and weight of the mobile? Similar products will definitely have similar descriptions and designs.

  • @jaganshemthegay5490 4 years ago +4

    Consider some master data attributes; the combination of these attribute values gives you the uniqueness you need: color, size, iOS version, first available date, and some other features. Then deduplicate and map all product names to one unique artificial key.
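
One way to read the suggestion above is as a surrogate-key mapping: listings whose chosen attributes agree share one artificial key. A minimal Python sketch follows, assuming hypothetical attribute names (`brand`, `model`, `color`, `storage_gb`) and made-up sample rows.

```python
def assign_product_keys(listings, key_fields=("brand", "model", "color", "storage_gb")):
    """Give every listing an artificial integer key.

    Listings whose key_fields agree (case-insensitively, ignoring
    surrounding whitespace) receive the same key.
    """
    key_by_attrs = {}
    assigned = []
    for listing in listings:
        attrs = tuple(str(listing.get(f, "")).strip().lower() for f in key_fields)
        # setdefault returns the existing key, or stores and returns the next unused one.
        key = key_by_attrs.setdefault(attrs, len(key_by_attrs) + 1)
        assigned.append((listing["name"], key))
    return assigned

# Hypothetical sample listings.
listings = [
    {"name": "Apple iPhone X 64GB Silver", "brand": "Apple", "model": "iPhone X",
     "color": "Silver", "storage_gb": 64},
    {"name": "iPhone X - Silver - 64 GB", "brand": "apple", "model": "iphone x",
     "color": "silver", "storage_gb": 64},
    {"name": "Apple iPhone X 256GB Silver", "brand": "Apple", "model": "iPhone X",
     "color": "Silver", "storage_gb": 256},
]

for name, key in assign_product_keys(listings):
    print(key, name)
```

The first two listings end up with the same key even though their names differ, while the 256 GB variant gets its own. Which attributes belong in `key_fields` is exactly the domain question the comment raises.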

  • @jaganshemthegay5490 4 years ago +1

    Deduplication is a great master data quality exercise that needs domain experts' input and a study of the features and attributes.
    Once deduplicated, transactions can be tied to a unique product ID, which gives better analytics.

  • @SoraAICreator 3 years ago +3

    This is really helpful. Thanks a lot. Can you do a mock Product Analyst Interview as well?

  • @tigerbear3038 2 years ago +1

    SKUs often differ from merchant to merchant. You have to look at the model number or UPC

  • @bibiworm 3 years ago +3

    This is a very classic interview question for e-commerce businesses. 👍 But this mock interview only scratched the surface of it. I have been asked the same question, and I was grilled on similarity measures, CNNs for image processing, RNNs for NLP, etc.

    • @upskill9257 2 years ago

      Can you share some additional parameters that were not covered here?

  • @EmWatsonable 4 years ago +1

    Thanks for sharing! This is super helpful :)

  • @ambarishdeshmukh143 3 years ago

    Thank you for the video and the content. Really appreciated.

  • @pushkarajpalnitkar1695 4 years ago +2

    Hi Jay and Shashank, thanks for the video. I have one question though: did he have to do a data structures and algorithms interview for the job?

    • @iqjayfeng 4 years ago +2

      Nope, I don’t believe so.

  • @alexz7032 3 years ago

    Awesome stuff! thanks

  • @yashsolanki069 3 years ago +1

    You are doing really great sir👍

  • @karthikjosyula8002 4 years ago +2

    Wouldn't equivalence classes solve the problem of de-duping, e.g. Apple iPhone 10 = iPhone X? We can store these classes under the same ID and use that ID. We could even track seller details for cases where the same seller has posted the same product multiple times under different labels.
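
The equivalence-class idea in the comment above maps naturally onto a union-find (disjoint-set) structure: every known alias pair is merged, and the class representative serves as the shared ID. This is only a sketch; the alias strings are made up, and where the alias pairs would come from (manual curation, seller data, a model) is left open.

```python
class EquivalenceClasses:
    """Union-find over product names; find() returns the class representative."""

    def __init__(self):
        self.parent = {}

    def find(self, item):
        self.parent.setdefault(item, item)
        while self.parent[item] != item:
            # Path halving: point each visited node at its grandparent.
            self.parent[item] = self.parent[self.parent[item]]
            item = self.parent[item]
        return item

    def union(self, a, b):
        root_a, root_b = self.find(a), self.find(b)
        if root_a != root_b:
            self.parent[root_b] = root_a

# Hypothetical alias pairs.
classes = EquivalenceClasses()
classes.union("apple iphone 10", "iphone x")
classes.union("iphone x", "iphone 10")

print(classes.find("iphone 10") == classes.find("apple iphone 10"))
print(classes.find("iphone 8") == classes.find("iphone x"))
```

Because membership is transitive, "iphone 10" and "apple iphone 10" land in the same class without ever being paired directly, while "iphone 8" stays in a class of its own.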

  • @porwalabhishek5 4 years ago +2

    This is so helpful

  • @kushagrasen1848 3 years ago

    We can also use the product dimensions.

  • @kalpeshsajnani9952 3 years ago

    Thanks a lot for such an informative video.

  • @chitranjantrivedi6008 2 years ago +2

    Bro - Do we need to know Data Structures and Algorithms to crack Business Intelligence Engineer interviews at Amazon?

  • @tonestrada2000 1 year ago +1

    Do you have sample data for this? This seems a good case study for beginners like me

  • @mr.garammasala1398 3 years ago +2

    Fuzzy lookup/fuzzy grouping works here.
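
The comment above does not say which tool it has in mind, so the following is a generic sketch of fuzzy grouping using Python's standard-library `difflib.SequenceMatcher`. The greedy first-match strategy, the 0.85 threshold, and the sample names are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher

def fuzzy_group(names, threshold=0.85):
    """Greedily group names whose similarity to a group's first member
    meets the threshold (case-insensitive)."""
    groups = []
    for name in names:
        key = name.lower()
        for group in groups:
            if SequenceMatcher(None, key, group[0].lower()).ratio() >= threshold:
                group.append(name)
                break
        else:
            # No existing group was close enough, so start a new one.
            groups.append([name])
    return groups

# Hypothetical sample names.
names = ["Apple iPhone X 64GB", "Apple iPhone X 64 GB", "Samsung Galaxy S9 64GB"]
for group in fuzzy_group(names):
    print(group)
```

One caveat: character-level similarity also scores names that differ only in a model number very highly, so fuzzy matching on the name alone can merge distinct models. It works better as one signal among several than as the sole rule.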

  • @asthasrivastava9564 3 years ago

    Quick question - Has anyone faced a question like this for the BIE role (may or may not be Amazon)?

  • @fionaw9942 4 years ago +6

    Does a BIE need to have ML knowledge?

    • @iqjayfeng 4 years ago +3

      Depends on the role but usually no

  • @mitchieyy 2 years ago

    If products have the same unique ID, such as a SKU or UPC, they definitely end up under the same listing when you list them. As an Amazon seller, I know that sellers get products from the same manufacturer and put their own brand name on them. That's why you see the same product over and over again on Amazon, each with its own distinct brand name, like how Kirkland and Huggies get their diapers from the same manufacturer. Big companies have the product manufactured slightly differently and package it differently, but small Amazon sellers don't do all that, and their product photos look the same. In that case I don't think they can be identified as the same product, because they are different brands. So to identify true duplicates with similar images, most importantly you need to check that the brand is the same; only if the key product details, such as number of units, weight, color, size, and version, are also the same should they be considered possible duplicates.
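
The rule described in the comment above (same brand first, then identical key details) can be sketched as a two-step check: bucket listings by brand, then compare key fields only within a bucket. The field names in `KEY_FIELDS`, the `id` column, and the sample rows are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical attribute names for the "key product details".
KEY_FIELDS = ("units", "weight_g", "color", "size")

def possible_duplicates(listings):
    """Return ID pairs that share a brand and agree on every key field."""
    by_brand = defaultdict(list)
    for listing in listings:
        by_brand[listing["brand"].strip().lower()].append(listing)

    pairs = []
    for same_brand in by_brand.values():
        # Only listings of the same brand are ever compared with each other.
        for a, b in combinations(same_brand, 2):
            if all(a.get(f) == b.get(f) for f in KEY_FIELDS):
                pairs.append((a["id"], b["id"]))
    return pairs

# Hypothetical sample listings.
listings = [
    {"id": 1, "brand": "Kirkland", "units": 192, "weight_g": 5400, "color": "white", "size": "4"},
    {"id": 2, "brand": "Huggies", "units": 192, "weight_g": 5400, "color": "white", "size": "4"},
    {"id": 3, "brand": "huggies ", "units": 192, "weight_g": 5400, "color": "white", "size": "4"},
]

print(possible_duplicates(listings))
```

Listings 1 and 2 have identical details but different brands, so they are never paired; only the two Huggies listings are flagged. Restricting comparisons to within a brand also keeps the number of pairwise checks far below comparing every listing with every other.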

  • @FPrimeHD1618 2 years ago +1

    This helps out a lot, much appreciated. I have an interview for a Sr BIE position next week. First time interviewing with Amazon and I'm freaking out lol.

    • @extcresources531 2 years ago

      What was the result of your interviews? What were you asked? If you could share, it would help a soul. Thanks, mate.

    • @FPrimeHD1618 2 years ago

      @extcresources531 I actually was offered a really good role as a data scientist in my current company that I couldn't pass up, so I stopped all interviews.

  • @julialjo7921 2 years ago +5

    “Nobody knew how to solve these tasks” and then Rajesh came all the way from India to save them.

  • @allison-hd1fg 2 years ago +1

    I think this case study is more for a data scientist role. Are you sure this is a case for BIE?

    • @iqjayfeng 2 years ago

      Definitely for BIE roles at Amazon specifically

  • @saniyajaswani4462 2 years ago +1

    His solution is not scalable.
    If we build such an algorithm, how do we scale it to 10,000 products?
    Amazon keeps manual intervention in duplicate product detection to a minimum.

  • @vincentpun123 2 years ago

    nice

  • @toekneema 3 years ago

    what is a DFIDF? 7:40

    • @nanfengbb 3 years ago

      Should be TF-IDF, a technique commonly used in NLP.
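
For readers unfamiliar with the term in the reply above: TF-IDF weights each word by how often it appears in a text and how rare it is across all texts, and two texts can then be compared by cosine similarity. The sketch below uses one common textbook weighting, tf × log(N/df); library implementations often use smoothed variants. The product titles are made up for illustration.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Build one sparse TF-IDF vector (dict of term -> weight) per document."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append({
            term: (count / len(tokens)) * math.log(n_docs / df[term])
            for term, count in tf.items()
        })
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse vectors."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm = math.sqrt(sum(w * w for w in u.values())) * math.sqrt(sum(w * w for w in v.values()))
    return dot / norm if norm else 0.0

# Hypothetical product titles.
titles = [
    "apple iphone x 64gb silver",
    "iphone x silver 64gb apple",
    "samsung galaxy s9 64gb black",
]
vecs = tfidf_vectors(titles)
print(cosine(vecs[0], vecs[1]))
print(cosine(vecs[0], vecs[2]))
```

The first two titles contain the same words in a different order, so their similarity is essentially 1. The third shares only "64gb", which appears in every title and therefore gets zero weight, so its similarity to the first is 0.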

  • @annachan6706 3 years ago +2

    Why not WHERE product name LIKE '%iphone%'?

    • @extcresources531 2 years ago

      Because that would give you the same brand but phones of different models: iPhone 6, iPhone 7, iPhone 8, etc.
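
The point made in the reply above is easy to reproduce with an in-memory SQLite database. The table name `products`, the column name `product_name`, and the rows are hypothetical; note that SQLite's `LIKE` is case-insensitive for ASCII characters by default, which other databases may not be.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE products (id INTEGER PRIMARY KEY, product_name TEXT)")
conn.executemany(
    "INSERT INTO products (product_name) VALUES (?)",
    [
        ("Apple iPhone 6 32GB",),
        ("iPhone 7 128GB",),
        ("Apple iPhone 8",),
        ("Samsung Galaxy S9",),
    ],
)

# The pattern matches every iPhone model, not just duplicates of one product.
matches = [
    row[0]
    for row in conn.execute(
        "SELECT product_name FROM products "
        "WHERE product_name LIKE '%iphone%' ORDER BY id"
    )
]
print(matches)
```

All three iPhone rows come back even though they are different models, so a broad `LIKE` pattern can at best narrow down candidates; it cannot decide which listings are actually the same product.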

  • @iqjayfeng 3 years ago +5

    Thanks for watching! If you want to prepare for Business Intelligence roles, then Interview Query has you covered! We have diverse questions to prepare for your Business Intelligence interview and stay ahead of the curve.
    www.interviewquery.com/p/business-intelligence-interview-questions

  • @nhrakib1572 3 years ago

    hi dear how are you

  • @BasitBhat 3 years ago +4

    Stop moving that chair. Just a quick feedback.👍🏼

  • @romeomonteque 2 years ago

    Um. Uh. Um. Uummmm. Um.. um. Before every sentence. Ummmm

  • @vallr-humans9121 3 years ago +1

    Can't even understand what he is speaking. Sitting ungroomed. 👎