Delta Live Tables Demo: Modern software engineering for ETL processing

แชร์
ฝัง
  • เผยแพร่เมื่อ 1 มิ.ย. 2024
  • Get started for free: dbricks.co/try
    View the other demos on the Databricks Demo Hub: dbricks.co/demohub
    Watch this demo to learn how to use Databricks Delta Live Tables to build a declarative ETL pipeline for batch and streaming data with SQL.
    Delta Live Tables (DLT) is the first ETL framework that uses a simple declarative approach to building reliable data pipelines and automatically manages your infrastructure at scale so data analysts and engineers can spend less time on tooling and focus on getting value from data.
    Learn more at databricks.com/product/delta-...
    Get the Delta Lake: Up & Running by O’Reilly ebook preview to learn the basics of Delta Lake, the open storage format at the heart of the lakehouse architecture. Download the ebook: dbricks.co/3IEjl5c
    Connect with us:
    Website: databricks.com
    Facebook: / databricksinc
    Twitter: / databricks
    LinkedIn: / databricks
    Instagram: / databricksinc
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 17

  • @amansehgal9917
    @amansehgal9917 2 ปีที่แล้ว +10

    This is great. Can you share notebook for querying the transaction log and presenting it in reDash?

  • @marcinsiara
    @marcinsiara ปีที่แล้ว +2

    Great video, easy to understand, good overview for the Databricks beginners. Thanks!

  • @gekodragon100
    @gekodragon100 ปีที่แล้ว

    Good video thank you. Quick question, is the DLT lineage also auotmatically avaialable and visisble in Unity Catalog?

  • @jolettin6408
    @jolettin6408 ปีที่แล้ว +1

    Do you have a link to show how the queries work for monitoring the data quality. Thanks

  • @simonhu5814
    @simonhu5814 4 หลายเดือนก่อน

    Well explained. Thank you

  • @joegenshlea6827
    @joegenshlea6827 9 หลายเดือนก่อน

    Thank you for this video. I'm a little confused about what the "data.stations' refers to? Is it an array in the source json?

  • @willf7493
    @willf7493 ปีที่แล้ว +1

    Nice demo, but why does the comment for the "cleaned_station_status" table say "partitioned by station_id" when the code actually uses the last_updated_date column? You should update the comment in that notebook. :-)

  • @tanushreenagar3116
    @tanushreenagar3116 10 หลายเดือนก่อน

    GREAT VIDEO

  • @nontapatsumalnop4740
    @nontapatsumalnop4740 ปีที่แล้ว +1

    Anyone knows how to create a live dashboard like this in databricks?

  • @sid0000009
    @sid0000009 ปีที่แล้ว

    Can an API hosted on an App service in anyway fetch Delta live tables data ? thanks

  • @ibozhu
    @ibozhu 2 ปีที่แล้ว +2

    It’s been in gated preview for too long, when will it be made GA?

  • @mohammedsafiahmed1639
    @mohammedsafiahmed1639 ปีที่แล้ว +2

    am I missing something or does the video really doesnt show how he got all those files in the data lake in the first place?

    • @samgreene7961
      @samgreene7961 5 หลายเดือนก่อน

      He mentions the python scripts/notebooks that get the data. Probably using an api and saving results to DBFS. I’m sure you can find how to do that in other videos.

    • @azazmir9340
      @azazmir9340 หลายเดือนก่อน

      Hes using autoloader to load the data probably from an s3 bucket, Azure cloud storage or volume