The New Way of Scheduling DAGs in Airflow with Datasets

  • Published Oct 5, 2024

Comments • 31

  • @trench6118
    @trench6118 1 year ago +3

    Airflow has been on fire lately - I love the TaskFlow API and dynamic task mapping. Data-aware scheduling came out at the perfect time and simplified a real problem for me.

    • @MarcLamberti
      @MarcLamberti 1 year ago

      Other great features are coming. Stay tuned ;)

  • @richie.edwards
    @richie.edwards 1 year ago +1

    I started working more with Airflow at my job, and your videos have been very helpful when I want to switch up the learning format and get exposed to concepts without digging through the docs.

  • @practicalgcp2780
    @practicalgcp2780 1 year ago +1

    Amazing video Marc! This is a truly amazing feature! One thing I couldn't seem to find, though, is a way to pass some parameters to the consumer DAG. Is there a way to access the context of what triggered the DAG? Or can extra params be passed in the Dataset? This could carry useful metadata, such as the latest timestamp at which some data was updated, which downstream processes could use when triggered. Thank you!
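
    A sketch of one way this later became possible, assuming Airflow 2.9+ (the release covered in the video only supports a static extra on the Dataset object itself): the producer attaches per-run metadata to the dataset event via the outlet_events accessor, and the consumer reads it back from triggering_dataset_events. The DAG/task names and the URI are placeholders.

        from pendulum import datetime

        from airflow.datasets import Dataset
        from airflow.decorators import dag, task

        my_file = Dataset("s3://my-bucket/my-file.txt")

        @dag(start_date=datetime(2024, 1, 1), schedule="@daily", catchup=False)
        def producer():
            @task(outlets=[my_file])
            def update(*, outlet_events):
                # Attach run-specific metadata to the dataset event (Airflow 2.9+).
                outlet_events[my_file].extra = {"updated_at": "2024-10-05T00:00:00Z"}

            update()

        @dag(start_date=datetime(2024, 1, 1), schedule=[my_file], catchup=False)
        def consumer():
            @task
            def read(*, triggering_dataset_events):
                # Maps each triggering dataset URI to its list of events.
                for uri, events in triggering_dataset_events.items():
                    print(uri, [event.extra for event in events])

            read()

        producer()
        consumer()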

  • @ЕвгенийПрочан
    @ЕвгенийПрочан 1 year ago

    This is awesome. No more ugly Triggers, Sensors, etc. Thanks for the explanation, Marc!

  • @RobsonLanaNarvy
    @RobsonLanaNarvy 1 year ago

    Nice demonstration, I will test MySQL as a dataset to explore this feature.

  • @brunosompreee
    @brunosompreee 10 months ago

    Great content as always Marc!

  • @rohithspal
    @rohithspal 1 year ago +2

    A very utilitarian feature!
    Isn't "task-aware scheduling" a more appropriate name for this feature, since there is no real interaction with the data?

    • @MarcLamberti
      @MarcLamberti 1 year ago +1

      I think there will be real interaction with data at some point 😉

  • @minnieshi2934
    @minnieshi2934 1 year ago +1

    Same here, a great point even though the video doesn't address it directly:
    if the producer DAG's task declares the outlet but never actually accesses the file/folder, or its logic has nothing to do with the content at the URI, what happens? The consumer DAG still runs.
    So it is really just using the URI as a link between the two DAGs (see the sketch after this thread).

    • @VallabhGhodkeB
      @VallabhGhodkeB 10 months ago

      Yeah, exactly, it is just the URI that acts as a bridge; it does not actually have to point to anything.
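
    To make that concrete, here is a minimal sketch (all names are placeholders): the producer task never touches the URI, yet every successful run still emits a dataset event and triggers the consumer, because the URI is only a shared identifier in the metastore, not a real file reference.

        from pendulum import datetime

        from airflow.datasets import Dataset
        from airflow.decorators import dag, task

        # Airflow never reads or writes this URI; it is only an identifier.
        my_file = Dataset("s3://my-bucket/my-file.txt")

        @dag(start_date=datetime(2024, 1, 1), schedule="@daily", catchup=False)
        def producer():
            @task(outlets=[my_file])
            def update():
                pass  # does nothing with the file, yet still emits a dataset event on success

            update()

        @dag(start_date=datetime(2024, 1, 1), schedule=[my_file], catchup=False)
        def consumer():
            @task
            def react():
                print("triggered by the dataset event, not by the file")

            react()

        producer()
        consumer()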

  • @davideairaghi6763
    @davideairaghi6763 1 year ago +1

    Hi Marc, datasets look very useful, but how can they be used to trigger a DAG based on a SQL database update? Is there an example of that? Thanks in advance.
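
    Datasets don't watch external systems, so a common workaround is a small producer DAG that polls the table and emits the dataset event only when something actually changed. A sketch (not from the video) assuming Airflow 2.4+, the Postgres provider, a pg_conn connection, and a placeholder freshness query; it relies on a skipped task emitting no dataset event:

        from pendulum import datetime

        from airflow.datasets import Dataset
        from airflow.decorators import dag, task
        from airflow.exceptions import AirflowSkipException
        from airflow.providers.postgres.hooks.postgres import PostgresHook

        orders = Dataset("postgres://mydb/public/orders")  # the URI is just a label

        @dag(start_date=datetime(2024, 1, 1), schedule="*/10 * * * *", catchup=False)
        def watch_orders():
            @task(outlets=[orders])
            def check_for_updates():
                hook = PostgresHook(postgres_conn_id="pg_conn")
                row = hook.get_first(
                    "SELECT count(*) FROM orders "
                    "WHERE updated_at > now() - interval '10 minutes'"
                )
                if row[0] == 0:
                    # Skipping emits no dataset event, so consumer DAGs stay idle.
                    raise AirflowSkipException("no new rows")

            check_for_updates()

        watch_orders()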

  • @askmuhsin
    @askmuhsin 1 year ago +1

    Hi Marc, this is indeed a truly amazing feature.
    Just wondering whether an instance of the consumer DAG is always triggered for every file (URI) change.
    I.e., if the producer DAG creates a new file change while the consumer DAG is running, will the new change cause a new consumer DAG instance to run on the new data (while the previous consumer instance is still running), if that makes sense?
    As always, thank you for the content.

  • @tiankun4450
    @tiankun4450 1 year ago +1

    Can I use a template var (like ds_nodash) in a Dataset URI?

  • @alfahatasi
    @alfahatasi 4 months ago

    How do you use a table from a Postgres database instead of a txt file as the dataset? Is there an example video for this?

  • @ady3949
    @ady3949 10 months ago

    Hi Marc, this is indeed an amazing feature.
    I tried to use dataset scheduling, but when the job finished or failed, it didn't trigger my on_success_callback/on_failure_callback. With normal scheduling (e.g. @hourly), it does trigger them. Is there any config I missed, or is it a bug?

  • @lifeofindians1695
    @lifeofindians1695 1 year ago

    Hi Marc,
    I am watching your Airflow architecture video.
    In a single-node setup, the executor updates the metastore; in a multi-node architecture, the executor puts the data in a queue.
    So who updates the metastore in a multi-node setup after the job is done, the queue or the executor?

  • @Empusas1
    @Empusas1 1 year ago

    You mentioned that the consumer DAG triggered by the dataset always runs when the producer DAG has run successfully, not when the dataset has actually changed. Let's say the producer has a compare task and only changes the dataset when necessary; in that case the consumer would still always run. Any way to solve that?
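
    One way around this (a sketch, relying on the assumption that a skipped task emits no dataset event; the compare logic is a placeholder): put the comparison in a ShortCircuitOperator in front of the task that owns the outlet, so the outlet task is skipped, and no event is emitted, whenever nothing changed.

        from pendulum import datetime

        from airflow.datasets import Dataset
        from airflow.decorators import dag, task
        from airflow.operators.python import ShortCircuitOperator

        report = Dataset("s3://my-bucket/report.csv")

        def _has_changed():
            # Placeholder: compare checksums, row counts, timestamps, etc.
            return True

        @dag(start_date=datetime(2024, 1, 1), schedule="@daily", catchup=False)
        def producer():
            compare = ShortCircuitOperator(task_id="compare", python_callable=_has_changed)

            @task(outlets=[report])
            def update_report():
                ...  # rewrite the report; runs (and emits the event) only if compare passed

            compare >> update_report()

        producer()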

  • @Jeoffrey54
    @Jeoffrey54 1 year ago

    Amazing 👀

  • @RalfredoSauce
    @RalfredoSauce 1 year ago

    How do we trigger it off a SQL table update rather than a file? He mentions it's possible, but I can't seem to find documentation for it anywhere.

  • @minnieshi2934
    @minnieshi2934 1 year ago

    Very good that you point out that an external system updating the dataset file will NOT make the consumer DAG run.

    • @MarcLamberti
      @MarcLamberti 1 year ago

      Not yet. But it will be possible very soon
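
      For what it's worth, this did land later: Airflow 2.9 added a REST endpoint for creating dataset events, so an external system can mark a dataset as updated itself. A sketch with a placeholder host, credentials, and URI (check the endpoint details against your Airflow version):

        import requests

        # Emit a dataset event from outside Airflow (Airflow 2.9+ REST API);
        # any DAG scheduled on this dataset URI is then triggered.
        resp = requests.post(
            "http://localhost:8080/api/v1/datasets/events",
            auth=("admin", "admin"),  # placeholder credentials
            json={"dataset_uri": "s3://my-bucket/my-file.txt"},
        )
        resp.raise_for_status()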

  • @bettatheexplorer1480
    @bettatheexplorer1480 1 year ago

    Is this only available on Airflow >= 2.4?

  • @imosolar
    @imosolar 1 year ago

    Please update the Udemy course with datasets.