Databricks, Delta Lake and You

แชร์
ฝัง
  • เผยแพร่เมื่อ 5 ก.ย. 2024
  • Databricks, Lakes & Parquet are a match made in heaven, but explode with extra power when using Delta Lake. This session will dive into the details of how Databricks Delta works and how to make the most of it.
    Speaker: Simon Whiteley SQLbits.com/sp...
    SQLbits.com/Ses...
    Tags: Optimising,Developing,Managing,Cloud,Databricks,Python,Spark,Data Lake,Big data analytics,Modern Analytics,delta lake

ความคิดเห็น • 29

  • @ericbegg8727
    @ericbegg8727 ปีที่แล้ว +6

    Simon - you couldn't be any better at explaining all these concepts. Thanks again

  • @nithints302
    @nithints302 ปีที่แล้ว +1

    Accidentally hopped on to your channel I can listen for the whole day

  • @Markttt5
    @Markttt5 2 ปีที่แล้ว +3

    Hey Simon, if I was still based in the UK, I’d be knocking on your door and handing you a beer. Fantastic video (again) - this is going to help me help my organisation so much. You are by far, the best speaker, most passionate dude about data that I watch on TH-cam. Many thanks.

  • @andycarter9845
    @andycarter9845 2 ปีที่แล้ว

    Deserves many more thousands of views. Fantastically clear.

  • @mdzakariabarbhuiya1608
    @mdzakariabarbhuiya1608 3 ปีที่แล้ว +2

    This is one of the best explanation on Delta Lake!!

  • @sunnysoni88
    @sunnysoni88 3 ปีที่แล้ว +1

    I have never seen such a clear video for Delta Lake, This is just great stuff. My understanding of Delta Lake is so well now, Thanks for sharing your knowledge

  • @chandraxg1
    @chandraxg1 ปีที่แล้ว +1

    Simon... thank you so much for an excellent video...

  • @adityajakka9856
    @adityajakka9856 2 ปีที่แล้ว +2

    Great job explaining the Delta Lake, Simon. I thought you did a fantastic job with your slides and working examples. That's exactly how I look to learn new data concepts. More power to you, mate :)

  • @RodrigoBocanegraCruz
    @RodrigoBocanegraCruz 2 ปีที่แล้ว

    Great video. Thanks Simon!

  • @tj_lee
    @tj_lee 2 ปีที่แล้ว +1

    Great content, help clarifies a lot on delta tables!

  • @nimesharya909
    @nimesharya909 2 ปีที่แล้ว +1

    awesome video, precise , clear and with easy to understand examples

  • @Boompiee
    @Boompiee 2 ปีที่แล้ว +1

    Great video as usual Simon, thank you very much!

  • @jeevanb8623
    @jeevanb8623 2 ปีที่แล้ว

    Beautifully Explained...

  • @manideepatalukdar9201
    @manideepatalukdar9201 2 ปีที่แล้ว

    Thanks you so much! This is such a clear explanation of Delta concepts!

  • @denermoreira15
    @denermoreira15 2 ปีที่แล้ว

    just amazing

  • @simonheath8701
    @simonheath8701 2 ปีที่แล้ว

    I'm new to DataBricks and found this as my first video when searching. What a Gem. Haven't bothered to watch any others as it was such a great journey. As someone who spent over 30 years using SQL and saw all the big data stuff from afar I was thinking they are basically using unindexed flat files with a 16 node server cluster... hmmn, that's not advancement. Seeing how they added SQL, journalling and transaction management - I wonder how long it will take them to add indexes and create a block structured database ;)

    • @SQLBits
      @SQLBits  2 ปีที่แล้ว

      Lovely to hear, thank you simon! I am sure the team at th-cam.com/channels/mRI-X6XoeH2dQE4BShRU9Q.html will love to hear this!

    • @mohammedsafiahmed1639
      @mohammedsafiahmed1639 ปีที่แล้ว

      hey simon, when you say block structured database, you mean in opposition to traditional rdbms like sql server which are page structured db, right?

  • @realblummusic
    @realblummusic 3 ปีที่แล้ว +1

    Quality stuff. Subscribed!

  • @murtazajabalpurwala8124
    @murtazajabalpurwala8124 2 ปีที่แล้ว

    Very nice video. One of the best videos for understanding the data lake related complex issues. One recommendation is sound audibility should be improved. Thanks again for the amazing video

    • @SQLBits
      @SQLBits  2 ปีที่แล้ว

      Thank you for sharing your opinion! All these sessions are recorded LIVE at SQLBits in front of a crowd, so we do apologize if the audio isn't of the best quality!

  • @siddhu1076
    @siddhu1076 2 ปีที่แล้ว

    Wonderful explanation 🙂👍

  • @Knigh7z
    @Knigh7z 2 ปีที่แล้ว

    The warehouse is also generally optimised for concurrent queries over many consumers which lake tools like Spark are not and is where Databricks SQL is closing the gap.

  • @kcbonzer
    @kcbonzer 2 ปีที่แล้ว +1

    Hello Simon, this is one of the most lucid videos I have come across. Thank you conveying the message in a very simple manner.
    I am curious to know what you take is on Snowflake vs Databricks !
    Ingestion, Storage, Architecture, Performance & Cost based comparison. A professional, unbiased & candid opinion, if you will :)

    • @SQLBits
      @SQLBits  2 ปีที่แล้ว +1

      Hey! He has a video out on our channel about 'Databricks VS Synapse Analytics' if that's something your interested in (We'll let him know about Snowflake!) th-cam.com/video/FjsnVueXijQ/w-d-xo.html

  • @RodrigoBocanegraCruz
    @RodrigoBocanegraCruz 2 ปีที่แล้ว

    Hi, is it delta suitable for tracking data changes overtime, like for examples every day? Or is it more suitable for tracking transformation changes in a given dataset? I have read is more about the second but want to check. Thanks!

    • @mohammedsafiahmed1639
      @mohammedsafiahmed1639 ปีที่แล้ว

      deltra records every single transaction that happens to a table in the delta log. Every time a transaction happens like an update, insert delete or merge, it gets recorded as a json in the delta log. And it gets a version number. Updates and deletes do not physically update and delete the files, but just update the transaction log. This gives you the ability to travel back per transaction basis. Its pretty cool.

  • @bobhaffner5902
    @bobhaffner5902 3 ปีที่แล้ว

    Hi Simon, great video! Hey, do you know if updating a delta table via Synapse Spark NB is supported?

    • @SQLBits
      @SQLBits  2 ปีที่แล้ว

      Hey Bob, Simon has his own YouTUbe Channel here if it has any content you are interested in! - th-cam.com/channels/mRI-X6XoeH2dQE4BShRU9Q.html