How to Use Great Expectations for Data Quality Checks with Airflow

  • Published on 14 Oct 2024
  • In this video I'll go through how you can connect to and use Great Expectations for data quality checks as part of your Airflow pipeline!

Comments • 24

  • @joaovitoralmeidaaraujobelc6993 4 months ago +1

    Very simple video with an excellent explanation that doesn't overcomplicate things. Thanks for sharing it!

  • @roopashastri9908 3 months ago +1

    Great explanation! Any thoughts on how we can save the Great Expectations results in a database?

    • @thedataguygeorge 2 months ago

      I would configure the expectation results storage location to be a bucket, and then have a pipeline that takes the expectation results and stores them in a database.
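
A rough sketch of the second half of that idea, storing a results file in a database. It assumes the JSON has already been downloaded from the bucket, and it assumes the result document carries `success`, `statistics`, and a `results` list with an `expectation_config` per entry; check the files your own Great Expectations version writes before relying on these field names. The function name, table names, and sample data are hypothetical, and SQLite stands in for whatever database you use.

```python
import json
import sqlite3


def store_validation_result(conn, result_json, run_id):
    """Flatten one validation result JSON document into two tables."""
    result = json.loads(result_json)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS validation_runs ("
        "run_id TEXT PRIMARY KEY, success INTEGER, evaluated INTEGER, failed INTEGER)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS expectation_results ("
        "run_id TEXT, expectation_type TEXT, column_name TEXT, success INTEGER)"
    )
    stats = result.get("statistics", {})
    # One summary row per validation run.
    conn.execute(
        "INSERT INTO validation_runs VALUES (?, ?, ?, ?)",
        (
            run_id,
            int(result["success"]),
            stats.get("evaluated_expectations"),
            stats.get("unsuccessful_expectations"),
        ),
    )
    # One detail row per expectation that was evaluated.
    for item in result.get("results", []):
        config = item.get("expectation_config", {})
        conn.execute(
            "INSERT INTO expectation_results VALUES (?, ?, ?, ?)",
            (
                run_id,
                config.get("expectation_type"),
                config.get("kwargs", {}).get("column"),
                int(item["success"]),
            ),
        )
    conn.commit()


# Hypothetical result document, shaped per the assumptions above.
sample = json.dumps({
    "success": False,
    "statistics": {"evaluated_expectations": 2, "unsuccessful_expectations": 1},
    "results": [
        {
            "success": True,
            "expectation_config": {
                "expectation_type": "expect_column_values_to_not_be_null",
                "kwargs": {"column": "order_id"},
            },
        },
        {
            "success": False,
            "expectation_config": {
                "expectation_type": "expect_column_values_to_be_between",
                "kwargs": {"column": "amount", "min_value": 0, "max_value": 100},
            },
        },
    ],
})

conn = sqlite3.connect(":memory:")
store_validation_result(conn, sample, run_id="2024-10-14")
failed = conn.execute(
    "SELECT expectation_type FROM expectation_results WHERE success = 0"
).fetchall()
print(failed)
```

Keeping a summary table and a per-expectation table separate makes it easy to chart pass rates over time while still being able to drill into which check failed.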

  • @roopashastri9908 3 months ago +1

    Also, how can we automate threshold changes as business needs change?

    • @thedataguygeorge 2 months ago

      You'd want to have another helper pipeline that checks for changing business requirements and then either alerts you or makes adjustments.
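
One way the "makes adjustments" part could look, as a minimal sketch: a helper that rewrites thresholds in an expectation suite loaded as a plain dict. It assumes the suite has an `expectations` list whose entries hold `expectation_type` and `kwargs`; the function name and the `overrides` mapping are hypothetical, and where the new thresholds come from (a config table, a ticket, a file) is left out.

```python
import copy


def apply_threshold_overrides(suite, overrides):
    """Return a copy of the suite with kwargs updated for matching expectations.

    overrides maps (expectation_type, column) to a dict of kwargs to overwrite.
    """
    updated = copy.deepcopy(suite)  # leave the original suite untouched
    for expectation in updated.get("expectations", []):
        key = (expectation["expectation_type"], expectation["kwargs"].get("column"))
        if key in overrides:
            expectation["kwargs"].update(overrides[key])
    return updated


# Hypothetical suite and a new business rule raising the allowed maximum.
suite = {
    "expectations": [
        {
            "expectation_type": "expect_column_values_to_be_between",
            "kwargs": {"column": "amount", "min_value": 0, "max_value": 100},
        }
    ]
}
overrides = {("expect_column_values_to_be_between", "amount"): {"max_value": 250}}

updated = apply_threshold_overrides(suite, overrides)
print(updated["expectations"][0]["kwargs"])
```

Returning a copy rather than editing in place lets the helper pipeline diff the old and new suite and alert a human instead of writing the change automatically.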

  • @roopashastri9908 3 months ago

    Also, can we include more than one expectation in the expectation file?

  • @roopashastri9908 3 months ago +1

    On failure of a Great Expectations validation, would this raise alerts?

    • @thedataguygeorge 2 months ago

      Yes, as long as you have alerts configured for your Airflow DAG.
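
For illustration, alerting in Airflow is commonly wired through an `on_failure_callback`, which Airflow calls with a context dict when a task fails. The sketch below assumes the validation task is set up to fail when validation fails, and it only prints the message; the function names, DAG id, and task id are hypothetical, and a `SimpleNamespace` stands in for the real task instance so the snippet runs without Airflow installed.

```python
from types import SimpleNamespace


def build_failure_message(context):
    """Build a short alert text from the context Airflow passes to callbacks."""
    ti = context.get("task_instance")
    exception = context.get("exception")
    return f"Task {ti.dag_id}.{ti.task_id} failed: {exception}"


def notify_failure(context):
    # Swap print for a Slack, email, or pager call in a real deployment.
    print(build_failure_message(context))


# Wiring sketch (requires Airflow, so it is left as a comment here):
# default_args = {"on_failure_callback": notify_failure}
# and pass default_args=default_args when constructing the DAG.

# Stand-in context to show what the callback receives.
fake_context = {
    "task_instance": SimpleNamespace(dag_id="gx_quality_checks", task_id="validate_orders"),
    "exception": ValueError("validation failed"),
}
message = build_failure_message(fake_context)
notify_failure(fake_context)
```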

  • @criistiina71 2 months ago

    May I know if we can create our own expectations? If I need an expectation that isn't in the script shown in the documentation, could I create my own? For example, if one column is computed from a formula and comes from a different database, could I create an expectation that checks the math?
    Hi from Colombia :)

    • @thedataguygeorge 2 months ago +1

      You can definitely create your own expectations, honestly one of the best features of Great Expectations!

    • @criistiina71 2 months ago

      @thedataguygeorge Do you have a video tutorial where you teach how to connect GX with Databricks? 😊

  • @BubbaB2323 1 year ago +1

    Very useful bud. Thank you.

    • @thedataguygeorge 1 year ago +1

      No problem, do it all for you!

    • @BubbaB2323 1 year ago +1

      @thedataguygeorge will reach out on the side to talk shop if that's cool, loving your work.

    • @thedataguygeorge 1 year ago

      Always cool!

  • @LucasGomes-q9t 2 months ago

    At minute 3:57, how can I create the default great_expectations file? I created the JSON but I got a blank one.

    • @thedataguygeorge 2 months ago

      You then just fill out that JSON with all the expectation info you want!
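
As a hedged example of what a filled-out file might contain, the sketch below builds a suite with three expectations and prints it as JSON. The top-level field names (`expectation_suite_name`, `expectations`, `meta`) are assumed from the older JSON suite layout and may differ in your Great Expectations version, so compare against a file your installation generates; the suite name and column names are hypothetical.

```python
import json

# Hypothetical suite: several expectations live side by side in one list.
suite = {
    "expectation_suite_name": "orders_suite",
    "expectations": [
        {
            "expectation_type": "expect_column_values_to_not_be_null",
            "kwargs": {"column": "order_id"},
            "meta": {},
        },
        {
            "expectation_type": "expect_column_values_to_be_unique",
            "kwargs": {"column": "order_id"},
            "meta": {},
        },
        {
            "expectation_type": "expect_column_values_to_be_between",
            "kwargs": {"column": "amount", "min_value": 0, "max_value": 10000},
            "meta": {},
        },
    ],
    "meta": {},
}

# Serialize the suite; paste or write this into the blank JSON file.
suite_json = json.dumps(suite, indent=2)
print(suite_json)
```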

  • @karangupta_DE 1 year ago +1

    Hi, do you prefer Soda or Great Expectations?

    • @thedataguygeorge 1 year ago +1

      I've only recently started using Soda, so I'm not sure I have enough experience to form a definitive opinion, but I have definitely enjoyed the UX much more so far. SodaCL is a lot more human-readable than Great Expectations "expectations", imo.

  • @maheshbhatm9998 9 months ago +1

    Thank You

    • @thedataguygeorge 9 months ago

      No worries, let me know if there are any other videos you'd like to see!