Are SQL joins bad for performance?

แชร์
ฝัง

ความคิดเห็น • 37

  • @alifawzi8197
    @alifawzi8197 ปีที่แล้ว +2

    for the last two day's i was watching your content,
    i genuinely appreciate the time and the effort that you put into this vidoes, your content is amazing and it's well explained,
    Thank you so much for sharing such a content.

    • @DatabaseStar
      @DatabaseStar  11 หลายเดือนก่อน +1

      Thanks for the comment! I'm glad you like the videos!

  • @sdmagic
    @sdmagic 2 หลายเดือนก่อน +1

    Very well done, sir. I do get asked about the effort made to normalize a database. The insight offered by your very clear explanation will go a long way in helping to answer those queries.
    Your videos and website are a great resource for the database developer community.

    • @DatabaseStar
      @DatabaseStar  2 หลายเดือนก่อน +1

      Thanks for the kind words! I'm glad you like the video and my channel.

  • @dav.R7
    @dav.R7 2 หลายเดือนก่อน +1

    I've always had this question: Does the number of joins affect performance or not? This video answered all my questions.

    • @DatabaseStar
      @DatabaseStar  2 หลายเดือนก่อน +1

      I’m glad it was helpful!

  • @NotMeEitherOfficial
    @NotMeEitherOfficial 5 หลายเดือนก่อน

    In your case separated tables win againts the single table with no indexes, in my case separated tables make more cost since each table has about 40 columns, but the single table on provide the columns that needed for the user, lets said 40x4 is 160 columns but in single table we approximitely only combine 20 columns each. I will try to implement this indexes with my company databases, as we seems need this indexes to be implemented. Working on old databases that been laying around for decades with MyISAM engine with millions of rows and try to make the performance faster as it getting slower every single month. Thanks for the video its really helpful. I will also considering to ask management to migrate to other database engine or even other DBMS like PostgreSQL, using Cache is kinda eating too much memory considering our company budget that run all the apps in one server.

    • @DatabaseStar
      @DatabaseStar  5 หลายเดือนก่อน +1

      Good point, it also depends on how many columns you need to return. Just because the separate tables have 40 columns, doesn't mean you necessarily need to select all 40 columns. But if you need all these columns, then a single table may make more sense like you are using.

  • @navjotsingh2457
    @navjotsingh2457 20 วันที่ผ่านมา

    Ty

    • @DatabaseStar
      @DatabaseStar  20 วันที่ผ่านมา

      You're welcome

  • @higiniofuentes2551
    @higiniofuentes2551 10 หลายเดือนก่อน

    Thank you for this very useful video!

    • @DatabaseStar
      @DatabaseStar  10 หลายเดือนก่อน +1

      Glad it was helpful!

  • @TheCodeConnoisseur
    @TheCodeConnoisseur ปีที่แล้ว +1

    Excellent

  • @Sdirimohamedsalah
    @Sdirimohamedsalah 4 หลายเดือนก่อน

    Thank very much for this constructive demo.
    I have question: when you indexed the columns, it reduced the total cost. But not the total execution time. Why you prefer reducing the total cost over the execution time which is crucial to for an applications in production ?

    • @DatabaseStar
      @DatabaseStar  4 หลายเดือนก่อน

      Thanks! Good question. I believe it's because the data set was so small and the execution time was small that it didn't really impact the time. On a larger data set you may see a bigger difference in execution time.

    • @Sdirimohamedsalah
      @Sdirimohamedsalah 4 หลายเดือนก่อน

      @@DatabaseStarthank you for your response.

  • @ftet1
    @ftet1 2 หลายเดือนก่อน

    Hi, can you offer us the DDL scripts that you used to set up your example database? Then we can recreate it directly. That would be great. Thanks and regards \sdohn

    • @DatabaseStar
      @DatabaseStar  2 หลายเดือนก่อน

      Good idea! The sample database (olympics) is available on GitHub here: github.com/bbrumm/databasestar/tree/main/sample_databases/sample_db_olympics
      The script for the queries used in this video is now on GitHub as well: github.com/bbrumm/databasestar/tree/main/videos/100_joins

  • @u12blue
    @u12blue 26 วันที่ผ่านมา

    Are all these concepts applicable on cloud services? Wouldnt there be a difference?

    • @DatabaseStar
      @DatabaseStar  26 วันที่ผ่านมา

      They should apply regardless of where your database is hosted.

  • @CanRau
    @CanRau 10 หลายเดือนก่อน

    Very insightful thanks a lot ❤ what does noc stand for?

    • @DatabaseStar
      @DatabaseStar  10 หลายเดือนก่อน

      Thanks! NOC stands for National Olympic Committee.

  • @milenfrom
    @milenfrom ปีที่แล้ว +1

    Hi,
    Thank you for the great insight.
    What is the software you use to run the queries and the explain feature?

    • @DatabaseStar
      @DatabaseStar  ปีที่แล้ว +2

      Thanks! I'm using a tool called pgAdmin, which is a common SQL editor for Postgres databases.

  • @tanzimibthesam5861
    @tanzimibthesam5861 10 หลายเดือนก่อน

    Do you think joins can have impact on scalability? Thanks

    • @DatabaseStar
      @DatabaseStar  10 หลายเดือนก่อน

      No I don't think so. However, once your database gets pretty large, you'll be looking into all kinds of techniques to improve performance, and one of them may involve caching or creating summary tables which means fewer joins - but it has tradeoffs.

  • @maxyudin
    @maxyudin ปีที่แล้ว

    👍

  • @CheatingStoriesWithChinwe
    @CheatingStoriesWithChinwe ปีที่แล้ว

    🤔

  • @romanbunshaft8412
    @romanbunshaft8412 7 หลายเดือนก่อน

    now try ordering by something )

    • @DatabaseStar
      @DatabaseStar  7 หลายเดือนก่อน

      We can add an Order By but the point still stands.

    • @romanbunshaft8412
      @romanbunshaft8412 7 หลายเดือนก่อน

      its VERY depends on a query itself. joins are not always faster and better. @@DatabaseStar

  • @luco-games
    @luco-games 9 หลายเดือนก่อน

    This is completely wrong.
    0) Operating in "less than a second" units for 100-200k tables is like saying this car was not expensive, it's was less than a million dollars. You need to show execution times millis from the query plan (EXPLAIN ANALYZE).
    1) You execution time seemed to be 5 times faster for the denormalized query.
    2) Joining will ALWAYS be slower than one tabel if you join big tables (sorry but in 2023 few hundred k rows is nothing, even on a local machine).
    3) You didn't explain how actually JOIN works behind the scenes but I get it because it would ruin the whole video.

    • @DatabaseStar
      @DatabaseStar  9 หลายเดือนก่อน +3

      Thanks for the feedback. I wouldn't say it's "completely wrong" because the video demonstrates the concept step-by-step and shows numbers.
      I can create another video that demostrates this with larger tables, as I think it would be more beneficial.
      0) That's a good point, which is why I didn't refer to the time taken when talking about the query, I referred to the cost from the execution plan. For larger tables & longer queries I would have also used the time taken.
      1) I don't think it was 5 times faster, the cost comparison (after indexes) was 822 vs 1,239, so it's a little faster.
      2) I don't think joining will always be slower than one table if working with big tables. That's the point of this video - joining is not always slower if you have a normalised design and indexes, BUT it depends on the query.
      3) I don't need to explain how join works behind the scenes for this video to be useful.

    • @NotMeEitherOfficial
      @NotMeEitherOfficial 5 หลายเดือนก่อน

      0) they does point that with "cost", as you typed "few k rows is nothing even on local machine", so the execution time will be really thin and can't be use for the actually benchmark since probably other apps consume the gap, as also now days computer has something called multi threading processor.
      1) it does, i guessed the first answer could answer the second point too.
      2) "always" is not the perfect fit, cause in some cases its actually work as he also provided the cost, unindexed tables is one of the problems.
      3) he explain with the flow pgadmin provided, well when people "trying" to explain about something and then it not satisfy you, you will just criticize them?