What They Don't Tell You About Apache Airflow

  • Published on 14 Nov 2024

Comments • 13

  • @demohub
    1 year ago

    This video has definitely given me a better understanding of Airflow, and I now have some ideas on how to use it more effectively for projects.

    • @MarcLamberti
      1 year ago

      What a wonderful thing to hear! Happy to help

  • @88andreaw
    1 year ago

    Thanks for the video! I have not used Airflow for some years and it is great to see the evolution :)

  • @JeanBzh
    1 year ago +3

    "Airflow is hard" seems strange to me as a misconception :)
    I found it very easy when I started with Airflow a few years ago, and that was one of my selling points for implementing it in my company at the time.

    • @MarcLamberti
      1 year ago

      Agree, but some people think it is 🥹

    • @aborecki
      1 year ago

      It is hard for people used to GUI-based orchestration tools like Informatica PowerCenter, SSIS, etc. The truth is that a lot of developers and analysts involved in data processing are not proficient in Python, and very often not in programming in general. With Airflow you have to like coding and have the proper mindset to appreciate it.

  • @TheAgentOfDeath
    1 year ago +1

    Just to clarify: the modern way to write DAGs is with the [@dag] and [@task] decorators, and the old way is [with DAG() as dag:] and [task_a]. I've taken your course. As great as it is, I still get confused because there seem to be multiple ways to write DAGs and tasks.

    • @MarcLamberti
      1 year ago

      Hi Kevin,
      Correct, the TaskFlow API with its decorators gives you a much easier way to author your DAGs. However, you can still use the [with DAG() as dag:] syntax. It’s a matter of preference 😉
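
To make the two authoring styles discussed in this thread concrete, here is a minimal sketch of the same two-step pipeline written both ways. It assumes Airflow 2.4 or later (for the `schedule` argument). The DAG ids, task names, and the `extract`/`load` helpers are hypothetical and are not taken from the video or the course. The import is guarded only so the snippet can still be loaded where Airflow is not installed.

```python
from datetime import datetime


# Plain Python callables shared by both styles (hypothetical example logic).
def extract() -> int:
    return 42


def load(value: int) -> str:
    return f"loaded {value}"


try:
    from airflow import DAG
    from airflow.decorators import dag, task
    from airflow.operators.python import PythonOperator
except ImportError:
    # Guard so this sketch can be imported where Airflow is not installed.
    DAG = None

if DAG is not None:
    # Decorator (TaskFlow) style: return values are passed between tasks
    # as XComs implicitly, and the call chain defines the dependency.
    @dag(dag_id="taskflow_style", start_date=datetime(2024, 1, 1),
         schedule=None, catchup=False)
    def taskflow_style():
        @task
        def extract_value():
            return extract()

        @task
        def load_value(value):
            return load(value)

        load_value(extract_value())

    taskflow_dag = taskflow_style()

    # Classic style: context manager, explicit operators, explicit XCom pull.
    def _load_from_xcom(ti):
        # "ti" is the task instance that Airflow injects from the run context.
        return load(ti.xcom_pull(task_ids="extract"))

    with DAG(dag_id="classic_style", start_date=datetime(2024, 1, 1),
             schedule=None, catchup=False) as classic_dag:
        extract_task = PythonOperator(task_id="extract", python_callable=extract)
        load_task = PythonOperator(task_id="load", python_callable=_load_from_xcom)
        extract_task >> load_task
```

Both DAGs describe the same pipeline. The decorator version passes the value implicitly through the function call, while the classic version sets the dependency with `>>` and pulls the value from XCom by task id.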

  • @mpianoplayer3
    11 months ago

    We can't use the TaskFlow API for Bash or SQL query execution operators. We are forced to use XCom in those cases.
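
For the operator case raised here, one possible way to mix the two worlds is sketched below, assuming Airflow 2.x: a classic operator exposes an `.output` attribute (an XComArg) that can be handed to a `@task` function. XCom is still the transport underneath, but no explicit `xcom_pull` is written. The DAG id, task ids, and the `summarize` helper are hypothetical, and the import is guarded only so the snippet loads where Airflow is not installed.

```python
from datetime import datetime


def summarize(line: str) -> str:
    # Hypothetical post-processing of the value pushed by the Bash task.
    return f"bash said: {line.strip()}"


try:
    from airflow.decorators import dag, task
    from airflow.operators.bash import BashOperator
except ImportError:
    # Guard so this sketch can be imported where Airflow is not installed.
    dag = None

if dag is not None:
    @dag(dag_id="bash_to_taskflow", start_date=datetime(2024, 1, 1),
         schedule=None, catchup=False)
    def bash_to_taskflow():
        # By default BashOperator pushes the last line of stdout to XCom.
        say_hello = BashOperator(task_id="say_hello", bash_command="echo hello")

        @task
        def report(line):
            return summarize(line)

        # Passing .output both wires the XCom value into the task argument
        # and makes report run after say_hello.
        report(say_hello.output)

    bash_dag = bash_to_taskflow()
```

The same pattern should apply to other operators that push a return value to XCom, although what each operator pushes depends on the operator and its provider version.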

  • @as978
    1 year ago

    Are datasets time-aware? If I load a table in Redshift on 2023-01-01 and 2023-01-02, will the downstream tasks that depend on these datasets start on those given days? Will the downstream run for 2023-01-02 wait for the 2023-01-02 dataset, or will it run even if the upstream only ran for 2023-01-01?

  • @thelifehackerpro9943
    1 year ago

    Limitations of Airflow

  • @EthanShalev
    1 year ago

    This was a "what's new in Airflow" video, but worded as misconceptions.
    Got me to watch through and learn nothing new, sadly.
    Well produced, all in all, but a little misleading.

    • @MarcLamberti
      1 year ago +1

      I’m actually happy that you didn’t learn anything new. That means you are fully aware of what Airflow is today, and I’m glad to know that 🙏 If there is any topic you would like to see, please let me know.