Intro To Data Orchestration With Airflow

แชร์
ฝัง

ความคิดเห็น • 8

  • @mahtabkhedri6052
    @mahtabkhedri6052 ปีที่แล้ว +1

    Very useful thanks

    • @Astronomer
      @Astronomer  ปีที่แล้ว

      Thanks for watching!

  • @jugsma6676
    @jugsma6676 2 ปีที่แล้ว

    If possible, Could you share with us the repo (github) link of Dag's

  • @dataecstasy
    @dataecstasy 10 หลายเดือนก่อน +1

    what would be the github integration look like for if companies want to keep the dag code on github

    • @Astronomer
      @Astronomer  9 หลายเดือนก่อน

      There's a few different approaches, check out this link that goes through various ways you can set up CICD to use github to store your DAG code! docs.astronomer.io/astro/ci-cd-templates/github-actions

  • @JavierBeneitoBarquero
    @JavierBeneitoBarquero 2 ปีที่แล้ว +2

    This is Marc Lamberti's webinar about sensors: th-cam.com/video/8J0h-Vlc_44/w-d-xo.html (you mentioned at 11:30)

  • @waterhill
    @waterhill 9 หลายเดือนก่อน

    Can Airflow be used to orchestrate a spark streaming YARN job that pulls data from Kafka and writes to HDFS?.. the idea is if the spark streaming job queues and it can be monitored/alerted/detected and restarted automatically by Airflow?

    • @Astronomer
      @Astronomer  9 หลายเดือนก่อน +1

      Oh definitely they can! Check out this link for the different options you have for managing Spark via Airflow, you'll probably want to use a Spark hook registry.astronomer.io/providers/apache-airflow-providers-apache-spark/versions/4.1.5