Movie Recommendation System With Python And Pandas: Data Project

  • Published on Oct 6, 2024

Comments • 75

  • @vikasparuchuri
    @vikasparuchuri 1 year ago +7

    Here's all of the code for this video - github.com/dataquestio/project-walkthroughs/blob/master/movie_recs/movie_recommendations.ipynb . And you can download the dataset here - files.grouplens.org/datasets/movielens/ml-25m.zip . Enjoy :)

  • @abdulkareemridwan8762
    @abdulkareemridwan8762 2 years ago +7

    Lost interest in ML earlier this year..your tutorial was really a turnaround..Really appreciate 🙏

    • @Dataquestio
      @Dataquestio 2 years ago

      That's amazing to hear, Abdulkareem! -Vik

    • @SoundofSilence1
      @SoundofSilence1 2 years ago

      @Dataquestio yes, Vik is an amazing teacher.

  • @prityar042
    @prityar042 11 months ago +2

    This project was really amazing and I have to say this video is very underrated. I actually shared this with my batch mates they liked it too.

  • @shachisinghal8856
    @shachisinghal8856 2 months ago

    Finally, someone has used a different dataset. Thank you 😊

  • @meditationhealingmusic6550
    @meditationhealingmusic6550 1 year ago +1

    Thank you so much for walking us through this project. I am very excited to learn every single day with Dataquest.

  • @johndiba1321
    @johndiba1321 2 years ago +1

    attempting to apply this lesson and data scraping to create a book recommendation system for my portfolio. should be able to get a nice dataset from goodreads

    • @Dataquestio
      @Dataquestio 2 years ago

      Hi John - I made a video about book recommendations earlier that might help - th-cam.com/video/x-alwfgQ-cY/w-d-xo.html .

  • @yapwlm913
    @yapwlm913 1 year ago +3

    Hi Vik, that is a great demonstration of building a recommendation system. Thank You! But it might be more interesting if the constructed recommendation system is applied to Streamlit as the framework can be more solid I think?

  • @charlesvictory169
    @charlesvictory169 10 months ago

    You are too good!!! This was very helpful. I had to subscribe immediately. Thanks so much

  • @staniherstaniher9300
    @staniherstaniher9300 2 years ago +3

    Nice video. Please, can you make a video where you evaluate this model using metrics such as ndcg@, diversity, accuracy...?

  • @hanweiz84
    @hanweiz84 2 years ago +2

    Appreciate if you can also show how to host this on a web server. Thanks a lot! This is awesome

    • @Dataquestio
      @Dataquestio 2 years ago +2

      Thanks, Ang! I'll look into doing this for a future video. -Vik

  • @narsimharao8565
    @narsimharao8565 2 years ago

    Fell in love with the tutorial ❤️.

  • @vanesszatoke2977
    @vanesszatoke2977 2 years ago +2

    Hi! Very good demonstration of building a recommendation system. The best which I have found!
    I have a question. Is what you are doing user-based or item-based collaborative filtering? In the other videos I checked, they created a kind of user-item matrix and checked the correlation between users or between movies, depending on the type of recommendation system (user-based or item-based).
    If I had to bet, I would say it is more like an item-based one, but I am not sure! 😄
    Thank you if you answer!

    • @Dataquestio
      @Dataquestio 2 years ago +1

      Hi Vanessa - I would consider this to be a version of user-based collaborative filtering. We have to make some modifications, since we're only passing in a single movie (versus a matrix of our preferences). We're then finding movies that people similar to us liked more than the general population liked.
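
A minimal sketch of the idea in this reply, not the notebook's actual code. It assumes a ratings table in the MovieLens column layout (`userId`, `movieId`, `rating`); the toy data, the `recommend` function name, and the `min_rating` cutoff of 4.0 are all assumptions made for illustration. The score is the share of similar users who liked a movie divided by the share of all users who liked it, so a movie that everyone likes does not outrank one that is liked mainly by people with similar taste.

```python
import pandas as pd

# Toy ratings (hypothetical data) in the MovieLens column layout.
ratings = pd.DataFrame({
    "userId":  [1, 1, 1, 2, 2, 2, 3, 3, 4, 4, 5, 6],
    "movieId": [10, 20, 40, 10, 20, 40, 10, 40, 10, 40, 40, 40],
    "rating":  [5.0, 5.0, 5.0, 5.0, 4.5, 5.0, 4.5, 5.0, 2.0, 5.0, 4.5, 5.0],
})


def recommend(movie_id, ratings, min_rating=4.0):
    """Score movies by how much more similar users liked them than everyone did."""
    # Keep only "liked" ratings (the cutoff is an assumption).
    liked = ratings[ratings["rating"] > min_rating]

    # Similar users: people who liked the input movie.
    similar_users = liked.loc[liked["movieId"] == movie_id, "userId"].unique()

    # Share of similar users who liked each movie.
    similar_liked = liked[liked["userId"].isin(similar_users)]
    similar_pct = similar_liked["movieId"].value_counts() / len(similar_users)

    # Share of all users who liked each movie.
    all_pct = liked["movieId"].value_counts() / ratings["userId"].nunique()

    # Align both shares on movieId and keep movies that similar users liked.
    scores = pd.DataFrame({"similar": similar_pct, "all": all_pct}).dropna()
    scores["score"] = scores["similar"] / scores["all"]

    # Do not recommend the input movie itself.
    scores = scores.drop(index=movie_id, errors="ignore")
    return scores.sort_values("score", ascending=False)


recs = recommend(10, ratings)
print(recs)
```

In this toy data, movie 40 is liked by every user, while movie 20 is liked only by users who also liked movie 10, so movie 20 ranks first even though fewer similar users liked it.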

  • @narsimharao8565
    @narsimharao8565 2 years ago +1

    We could also make recommendations using knn clusters, so that those who like action movies get action movies recommended. But we would have to do more analysis of why users gave those movies the highest ratings. Am I correct, Vik? Please correct me if not, it's just a doubt.

    • @Dataquestio
      @Dataquestio 2 years ago

      Hi - you could use knn to segment users, then base predictions on the clusters. You could also use k-nearest neighbors. You probably won't get much over collaborative filtering (the technique here), since they both use similar techniques to find similar users.

  • @ekeminiben6885
    @ekeminiben6885 5 months ago

    Thank you very much sir for this inspiring tutorial.
    Please I want to build a recommender system, "The aim of this study is to design and implement a Recommender System for clothing styles based on user body type derived from user body measurements."
    Please can you help with this kind of project or how can I go about it from getting the dataset to completion.
    Thank you

  • @gandiyasasri
    @gandiyasasri 4 months ago

    It is very good and most useful in our daily life

  • @rajeevmenon1975
    @rajeevmenon1975 2 years ago

    Real interesting video Vikas. Really engrossing. Keep coming up with such quality stuff.

  • @swapnilchowdhury3957
    @swapnilchowdhury3957 1 year ago +1

    I have written the code in vscode Jupyter notebook. I am facing a problem because the input and output widgets are not displaying. No text box is appearing. How to solve this issue

  • @yadavvishu1869
    @yadavvishu1869 1 year ago +1

    I am running the same code in VS Code, but it only shows the two values we put in the code as input and does not display any button or search bar 😢 How can I tackle this?

  • @ayeshaabbas8696
    @ayeshaabbas8696 1 year ago

    Thank You So Much Sir. lots of respect ..

  • @Han-ve8uh
    @Han-ve8uh 2 years ago

    1. Why at 12:43 "most similar result is last in list"? According to the np.argpartition docs, "The order of all elements in the partitions is undefined". You only provided -5, so we are only certain the -5th position is correct and can draw no conclusions about -4 to -1. If a sequence of ints were passed to the kth parameter instead, then we could be sure of the order of the last 5.
    2. For this demo we always start with a single movie_id, then do the calculations, feels like some work will be repeated if we change the input movie_id and all of the work is done at inference time. Are there opportunities for caching or precomputing anywhere?
    3. Why is a .unique() added at 20:40? That line was focused on movie_id = 1. I assume for each movie, each user will only rate it once? This means given movie_id = 1, all users will already be unique so no need for unique(). If each movie can be rated by a user twice (either with same or different scores), that feels like bad DB design of appending instead of updating. If indeed there were multiple ratings from same user for same movie, we should deduplicate (eg. take latest in time rating) during data preprocessing before any sort of recommendation analysis to prevent hacky fixes like adding .unique() to work around bad data. What do you think?
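
Point 1 above can be checked with a small sketch. The `similarity` array here is made-up data, not output from the video. It uses `np.argpartition` to pick the five best indices, then sorts only those five by score so that their order is guaranteed rather than incidental.

```python
import numpy as np

# Hypothetical similarity scores for 8 titles.
similarity = np.array([0.10, 0.90, 0.30, 0.95, 0.20, 0.70, 0.85, 0.05])

# argpartition guarantees that the last 5 positions hold the indices of the
# 5 largest values, but not the order of those 5 among themselves.
top5 = np.argpartition(similarity, -5)[-5:]

# Sorting just those 5 indices by score gives a guaranteed order, best first.
top5_sorted = top5[np.argsort(similarity[top5])[::-1]]

print(top5_sorted)
```

Sorting five elements is cheap, so this keeps the speed benefit of partitioning a large array while removing the reliance on undocumented ordering.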

    • @Dataquestio
      @Dataquestio 2 years ago

      Hi Han, great questions.
      1. That's a good point about argpartition. In practice, the results appear to be ordered, so I didn't worry too much about it. For example, exact matches are always the top result. As you mentioned, you could pass in a sequence to get 100% guaranteed ordering.
      2. There's a tradeoff between simplicity of the solution (making it easier to teach/demo) and speed of the solution. We're precomputing the tf/idf matrix and other items that are common across all movies. If I was deploying this to a web service, instead of precomputing, I would just cache outputs. So the first generation would be slow, but subsequent searches would hit the cache. This is because precomputing for all of the movies would take a while, and if we wanted to update our algorithm, etc, we'd need to redo all the precomputation.
      3. In this case, I added .unique() as a defensive check. I don't think it was necessary, since the data should be unique on movieId/userId pairs. But yes, if we were doing this in production, we would want to deduplicate upfront instead of on the fly for better performance.
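
The caching approach in point 2 of this reply could look roughly like the sketch below. It uses `functools.lru_cache` from the standard library; `compute_recommendations` is a hypothetical stand-in for the expensive per-movie calculation, and `call_count` exists only to show that the second lookup never reaches it.

```python
from functools import lru_cache

call_count = 0  # counts how often the expensive function really runs


def compute_recommendations(movie_id):
    """Hypothetical placeholder for the slow recommendation calculation."""
    global call_count
    call_count += 1
    # Return a tuple so the cached result cannot be mutated by callers.
    return tuple(movie_id + offset for offset in (1, 2, 3))


@lru_cache(maxsize=1024)
def cached_recommendations(movie_id):
    # The first call per movie_id is slow; repeats are served from the cache.
    return compute_recommendations(movie_id)


first = cached_recommendations(1)   # computed
second = cached_recommendations(1)  # served from the cache
print(call_count, cached_recommendations.cache_info())
```

If the algorithm changes, calling `cached_recommendations.cache_clear()` discards the stored results, which avoids redoing a full precomputation for every movie.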

  • @ayunymoba5974
    @ayunymoba5974 10 months ago +1

    Does this count as a hybrid recommendation system? Because there is TF-IDF cosine similarity (content-based) and also item-based filtering (collaborative)?

    • @domakondajyothi33
      @domakondajyothi33 7 months ago

      Yes, I also want to know this: is this hybrid or collaborative?

  • @shreyam3259
    @shreyam3259 1 year ago

    Hello, I am learning programming Python by myself. I was wondering if you could tell me, in brief, the overall workflow of this project so it would be easier to understand. (Maybe 5-6 points to summarize). And why we choose this particular method.

  • @thehiddenguy655
    @thehiddenguy655 1 year ago

    Thank You sir this helps me a lot

  • @soumyaranjith2951
    @soumyaranjith2951 1 year ago

    Thank you so much Sir😍🙏🙏🙏

  • @kaiserkonok
    @kaiserkonok 1 year ago

    Loved this video🔥

  • @doopao
    @doopao 1 year ago

    Vik u r the very best!

  • @mr.random4960
    @mr.random4960 7 months ago

    Which method is used here? Collaborative or content based?

  • @domakondajyothi33
    @domakondajyothi33 7 months ago

    This really helped me, but at the end I got the error "'list' object has no attribute 'indexing'". What can I do?

  • @johhnykimsey5180
    @johhnykimsey5180 2 years ago

    thank you very much it was a great video

  • @nil-xo4ce
    @nil-xo4ce 2 years ago +1

    sick video 🔥

  • @tanishqshivram9419
    @tanishqshivram9419 1 year ago

    Sir, I am not getting any output, nor am I getting any error. Can you please help me out?

  • @khanhtruongphamngoc2246
    @khanhtruongphamngoc2246 4 months ago

    How do we evaluate the accuracy of the model, sir?

  • @alirezanorouzi8924
    @alirezanorouzi8924 1 year ago

    thanks for sharing , i use it

  • @quizzesya
    @quizzesya 1 year ago

    The type of this recommendation system is content based filtering right?

  • @nanaphiona4462
    @nanaphiona4462 2 years ago

    Thanks for the inspirations

  • @tekinbayrakl7886
    @tekinbayrakl7886 1 year ago

    Hi. When we create the recommendation score, you said we want a big difference between similar and all. Why is that?

  • @aishwaryakolte538
    @aishwaryakolte538 11 months ago

    When building the search box, my recommendations are not changing. It shows the same recommendations as for Toy Story. I made the later change of removing the fixed movie title, yet there was no change in the recommendations.
    Could you please help with this?

  • @soumyaranjith2951
    @soumyaranjith2951 1 year ago

    I can't import the data into Jupyter Notebook. When I try to import it, an error occurs.

  • @LazyLee295
    @LazyLee295 1 year ago

    Hi, this recommender suggests items based on users' ratings, but can you make a recommender for a user based on the items they like? For example, if a user has a list of anime they like, then we recommend based on that list. Thank you for reading my comment.

  • @k-popworldwide3282
    @k-popworldwide3282 1 year ago

    Can someone please explain the part in this built system where the data preprocessing, Train Test Split, Model Fit, and Model Tune have been done?

  • @Sparkss22
    @Sparkss22 1 year ago

    Is this popularity-based or content-based?

  • @Rosh__138
    @Rosh__138 9 months ago

    Which algorithms are used in this video for building the model?
    Can anyone tell me?

  • @chinmoypadhi
    @chinmoypadhi 2 years ago

    How can we add k-fold cross-validation to this collaborative filtering model? Any example would be great.

    • @Dataquestio
      @Dataquestio 2 years ago

      You would need to define an error metric, and then label data. Then you could evaluate against the metric. There's an example here with measuring the ranks of NBA MVPs - th-cam.com/video/3cn1nHlbFVw/w-d-xo.html .
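
As one possible illustration of "define an error metric, then label data", the sketch below computes precision@k. This metric is an assumption chosen for the example; the reply does not name one. The labels are a held-out set of movies a user is known to have liked, and both the `precision_at_k` function and the movie IDs are hypothetical.

```python
def precision_at_k(recommended, relevant, k):
    """Fraction of the top-k recommended movie IDs that the user actually liked."""
    top_k = recommended[:k]
    if not top_k:
        return 0.0
    hits = sum(1 for movie_id in top_k if movie_id in relevant)
    return hits / len(top_k)


# Hypothetical output of the recommender, best first.
recommended = [20, 40, 30, 50]
# Hypothetical labels: movies this user liked in the held-out data.
relevant = {20, 30, 60}

score = precision_at_k(recommended, relevant, k=3)
print(round(score, 3))
```

For k-fold cross-validation, the same function could be averaged over folds, with each fold's held-out ratings serving as the labels.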

  • @Zuthilios
    @Zuthilios 1 year ago

    What environment are you building this in? I was following this tutorial but in gitpod for me, the jupyter widgets aren't behaving. The Text and TextArea widgets don't ever appear, a FloatText widget will sometimes appear, IntSlider and Select widgets will often appear and sometimes none of them will appear. This changes randomly even when making no changes. The output space is always there and working correctly, but I can't seem to find any cause or solution to this issue, it's thrown me off continuing this tutorial.
    Perhaps it's a version issue, I'll try following your code for the versions you used as a last option.

    • @Dataquestio
      @Dataquestio 1 year ago

      That's strange - I used JupyterLab on my own computer.

  • @dayaramd2709
    @dayaramd2709 2 years ago

    very very good job

  • @rahouanitoufik7375
    @rahouanitoufik7375 1 year ago

    How can I build this kind of recommendation system for drugs in Java?

  • @talhajalil8674
    @talhajalil8674 1 year ago

    I have written the same exact code and when I use "Men 1995" as title to look for similarity I get probability of zero for entire array. Why?

  • @sauravchauhan9280
    @sauravchauhan9280 2 years ago

    I am building a web app for this but can't figure out what model to save.

  • @anushkab8867
    @anushkab8867 1 year ago

    How can I make a recommendation system based on genres?

  • @AmIThereYet.
    @AmIThereYet. 1 year ago

    What algorithm is used here?

  • @hiashraful
    @hiashraful 1 year ago

    How can I build this project in vscode?

  • @RajRaj-b5r
    @RajRaj-b5r 1 year ago

    Sir, I want a synopsis of this project ASAP.

  • @maglionejm
    @maglionejm 2 years ago

    It would be very interesting to build a web application with Flask for the search engine. Could you show that in your next video?
    Also, it would be awesome to make an API with the generated model... What do you think? Using pickle perhaps?

    • @Dataquestio
      @Dataquestio 2 years ago +1

      Hi Juan - I'll take a look at doing this as a part 2 video. You could make an API for sure - I would look into this - www.django-rest-framework.org/ .

    • @SoundofSilence1
      @SoundofSilence1 2 years ago

      @Dataquestio it would be really great if you could show us how to build a web app with the same.

    • @devzaks8912
      @devzaks8912 1 year ago

      My own concern is how to make an API with the model. Then we can make requests to it with a movie and get recommendations.
      Please, anyone who has done this, let me know 😔

  • @ManojYadav-ut7ew
    @ManojYadav-ut7ew 1 year ago

    Which model is used to create this?

  • @sanika8866
    @sanika8866 1 year ago

    How do I import the dataset as a CSV?

  • @Shankara018
    @Shankara018 2 years ago

    Can I add this project to my portfolio?

    • @Dataquestio
      @Dataquestio 2 years ago

      You definitely can. I'd recommend following some of the next steps and building the project out a little more on your own, though.

  • @abex8713
    @abex8713 8 months ago

    Does it have a UI?

  • @asishkottakota3920
    @asishkottakota3920 2 years ago

    @Dataquest I am unable to get the widget.

  • @vijayarun_8579
    @vijayarun_8579 8 months ago

    What does similar_user_recs.index give? If anyone knows, please reply.