Code along - build an ELT Pipeline in 1 Hour (dbt, Snowflake, Airflow)

แชร์
ฝัง
  • เผยแพร่เมื่อ 14 พ.ย. 2024

ความคิดเห็น • 179

  • @twickAttack
    @twickAttack 8 หลายเดือนก่อน +45

    Hey Jay, thank you for the video. I'd be happy to see you doing more ELT pipelines and focus on your thought's process ( I can watch longer format 1-2 hours) - why you do things in that way, why is it important and whatnot; and you can throw some explainers to anything else you do and the reason behind it. I think senior DE and others with experience do things bit automatically and it takes time for the newbies to pick up on those skills. So, your thought process for doing things instead of just doing the things is priceless for anyone watching, including me. Appreciate your video, dude :)

    • @jayzern
      @jayzern  8 หลายเดือนก่อน +9

      Thank you! Will try to create more useful content

    • @miguelhermar
      @miguelhermar 7 หลายเดือนก่อน

      Completely agree 😊

  • @Dev_Zyoom
    @Dev_Zyoom 8 หลายเดือนก่อน +13

    honestly never knew about dbt and glad to learn it here thank you

  • @ChrisUK70
    @ChrisUK70 12 วันที่ผ่านมา +1

    When ETL came about the Cloud did not exist, I was writing shell scripts and SQL almost 30 years ago to do ETL. Useful video thanks!

  • @PhiNguyen-iz9go
    @PhiNguyen-iz9go 8 หลายเดือนก่อน +5

    This code along session (starting from scratch with environment setup, codebase structure ...) is soooooooo helpful. Hope to see more examples like this. Keep up the work my man

    • @PhiNguyen-iz9go
      @PhiNguyen-iz9go 8 หลายเดือนก่อน

      I watched the video "How you start learning Data Engineering..." and wondering that can you do a live coding that step through all those aspects (from SQL, command lines... to Kafka...) in 1 project? I think it would help a lot...

    • @jayzern
      @jayzern  8 หลายเดือนก่อน +1

      Glad to hear it's helpful! 👍
      It's great to hear feedback on what type of live coding videos you find insightful. Will keep note on Kafka and Command lines

  • @aritra1414
    @aritra1414 15 วันที่ผ่านมา

    Concise and to the point. It was very helpful. Thanks, please show more end to end complex projects like this

  • @diaconescutiberiu7535
    @diaconescutiberiu7535 7 หลายเดือนก่อน +5

    Awesome video! I already recommended this to my entire team. Please make more like this, they are extremely helpful.
    Idea for next video: dbt for Snowflake (again) but with Data Vault 2.0 modeling. I would love to see the logic behind creating dim and fact tables, how you define the stg files for creating the hubs/satellites/links.

    • @jayzern
      @jayzern  7 หลายเดือนก่อน

      Oof yea I did consider doing a Data Vault model where we showcase how hubs, satellites and links work but didn't think ppl would be interested. Thanks for raising 👍

  • @EddieVanWilder
    @EddieVanWilder 6 วันที่ผ่านมา

    This video has the exact answer to my questions as I'm diving into data modeling for analytics. I guaranteee to everyone who is doing this for their first time that they will find this video super helpful.
    Would be cool to see dbt with Cosmos for smoother operation 👌
    EDIT: I was literally just getting into the Deployment part of the video, and there you introduce using Cosmos for Airflow. Kudos!!

  • @kmonish9119
    @kmonish9119 6 หลายเดือนก่อน +6

    I have been struggling with dbt and airflow for a long time. For some reason I could not connect the dots. Having some mixture of knowledge - I landed on this tutorial and it just glued all my scattered dots well. Thanks Jayzern!!! Really appreciate the efforts :)

  • @nothan_nah
    @nothan_nah 8 หลายเดือนก่อน +4

    Thanks for sharing this dbt tutorial! It’s definitely super hot rn and useful to learn. 🎉

  • @rileylee2866
    @rileylee2866 18 วันที่ผ่านมา

    very good session, helped me get a much more concrete idea about how those tools look like and how they work together

  • @anikethdeshpande8336
    @anikethdeshpande8336 5 หลายเดือนก่อน +5

    i'm new to snowflake, dbt and airflow,
    this is awesome tutorial, got to learn a lot
    thank you jayzern

  • @mohitupadhayay1439
    @mohitupadhayay1439 3 หลายเดือนก่อน +8

    AN ABSOLUTE GOLDMINE OF AN INFORMATION WHICH NOT AY UDEMY OR TH-cam TUTOR HAS PROVIDED YET!

  • @peekknuf
    @peekknuf 8 หลายเดือนก่อน +2

    Extremely useful content, i especially liked live googling and debugging parts

    • @jayzern
      @jayzern  8 หลายเดือนก่อน +1

      Thank you for the support! Hope other people find it useful too.

  • @PhanNguyen_010
    @PhanNguyen_010 8 หลายเดือนก่อน +3

    thank you so much for this tutorial. hope you have more videos in the future

    • @jayzern
      @jayzern  7 หลายเดือนก่อน

      Thanks man!

  • @heck0782
    @heck0782 8 หลายเดือนก่อน +2

    Great video and explanation. we need more videos from you.

  • @DivineSam-w6m
    @DivineSam-w6m 3 หลายเดือนก่อน +2

    This video is like a gold mine for building a portfolio especially for someone starting out as a Data Engineer like me!... Manny Thanks and Kudos to you!.. Love from India

    • @adityakulkarni3798
      @adityakulkarni3798 หลายเดือนก่อน +1

      Hey how did you use snowflake? Did you buy it because it shows me that it is a paid software

    • @leabensoussan4178
      @leabensoussan4178 21 วันที่ผ่านมา

      @@adityakulkarni3798 I am wondering the same thing

  • @dominicaleung7329
    @dominicaleung7329 หลายเดือนก่อน +1

    Thank you very much. This is very nice and concise tutorial, exactly what I need.

  • @GeorgeNyamao
    @GeorgeNyamao 3 หลายเดือนก่อน +2

    Thanks @jayzern. This tutorial is awesome. I will be recommending it to folks who struggle with connecting dbt with any database engine.

  • @christophercampo9099
    @christophercampo9099 2 หลายเดือนก่อน

    Thank you, thank you THANK YOU! This was so helpful, easy to follow and made perfect sense.

  • @popalex
    @popalex 7 หลายเดือนก่อน +1

    Great video.
    I would love to see a complex ETL pipelines.

  • @giovannimaia9652
    @giovannimaia9652 5 หลายเดือนก่อน +1

    Please post more videos, your videos are awesome and very instructive

  • @rakshavishwakarma1811
    @rakshavishwakarma1811 7 หลายเดือนก่อน +3

    thank you so much it is 100% worth and useful... expect some more videos in detail.... like prod deployment through git and git interation with airflow

  • @CosmicNomad
    @CosmicNomad 2 หลายเดือนก่อน

    This is such an amazing video @jayzern! The project taken was not overly complex but also not barebones and covered a lot of important stuff! Thanks for being thoughtful and including the code along link (else some of formatting issues would have bugged many newbies)!
    I think you should keep creating more videos as you are a good teacher. Only suggestion I have is may be include a bit more explanation, which will help beginners even more! Kudos!

  • @fun2badult
    @fun2badult 7 หลายเดือนก่อน +1

    Can you please post more videos like this? Really appreciate it. Helps me understand the Dbt/Snowflake/Airflow a lot

    • @jayzern
      @jayzern  7 หลายเดือนก่อน

      Yes sir am working on future videos right now!

  • @dogenature4801
    @dogenature4801 17 วันที่ผ่านมา

    Hi! really enjoy your tutorial, would like to see a tutorial how to create data CI/CD pipeline starting from pulling latest branch, running data test on staging, and deploy changes to production after test is complete since not lot of youtuber explaining this

    • @jayzern
      @jayzern  16 วันที่ผ่านมา

      This is actually a brilliant idea, thanks for the rec!

  • @anduamlaktadesse9284
    @anduamlaktadesse9284 6 หลายเดือนก่อน +2

    so supportive and completing the project .

  • @pavankumard5276
    @pavankumard5276 7 หลายเดือนก่อน

    Need more content like this!!! Really amazing video. Just one suggestion I would like to make before diving into the coding part it would be better if you could provide a real world scenario and reference that while writing you code. Thanks

    • @jayzern
      @jayzern  7 หลายเดือนก่อน +1

      Appreciate the feedback man 🙏 will try to incorporate more real-world context before and during the live coding part, that's a great idea

    • @pavankumard5276
      @pavankumard5276 7 หลายเดือนก่อน

      @@jayzern thanks a lot, waiting for some more tutorials😃

  • @Lhtokbgkmvfknv
    @Lhtokbgkmvfknv 26 วันที่ผ่านมา

    It's beautiful! Thx man!

  • @JohnS-er7jh
    @JohnS-er7jh 4 หลายเดือนก่อน

    Thanks very much for posting this! Definately earned another subscriber/viewer

  • @oludelehalleluyah6723
    @oludelehalleluyah6723 5 หลายเดือนก่อน

    I haven't a lot from this tutorial...
    Thank you

  • @AkashKandarkar
    @AkashKandarkar 5 หลายเดือนก่อน +1

    Amazingly explained 👌

  •  หลายเดือนก่อน +2

    Thank you for the video jayzern. When I push code into Git, should I push code of dbt only, or I need to push all code of dbt-dag ?

  • @ahmednasr3811
    @ahmednasr3811 6 หลายเดือนก่อน +2

    Thanks bro for your efforts ❤

  • @AwakenByMe
    @AwakenByMe 4 หลายเดือนก่อน +1

    WOW!! ,Thank you so much for this wonderful video, Please keep making dbt + airflow videos,
    I have one doubt, I can see that one task in airflow which is stg_tpch_orders have run + test in your dag, But it is not showing up in mine,
    Have you added any tests on stg_tpch_orders ? but maybe missed to show it into the video ?

    • @jayzern
      @jayzern  3 หลายเดือนก่อน

      Hmm it's hard to tell without looking at ur code, but there is a generic test for stg_tpch_orders that looks at the relationship between fct_orders and stg_tpch_orders. Check your generic_tests.yml file to confirm
      Thanks for the support man!

  • @DagStylez
    @DagStylez 5 หลายเดือนก่อน +1

    Excellent tutorial!!!

  • @kenneth1691
    @kenneth1691 7 หลายเดือนก่อน +1

    Thank you so much for this, I've been trying to learn how to do this and you helped me solve this
    Do you have trainings!!

    • @jayzern
      @jayzern  7 หลายเดือนก่อน

      Thanks man! Yea I'm working on live trainings too so stay tuned 🙌

  • @maikerodrigo4249
    @maikerodrigo4249 4 หลายเดือนก่อน

    Great tutorial, i've learning a lot thanks!

  • @梁喬萍
    @梁喬萍 4 หลายเดือนก่อน

    love this! thanks for sharing this tutorial, very useful

  • @thainguyenbalamquang386
    @thainguyenbalamquang386 4 หลายเดือนก่อน +1

    Thank you, love your work

  • @melvin9993
    @melvin9993 5 หลายเดือนก่อน +1

    Dude this is so good :)

  • @reneporto-ai
    @reneporto-ai 6 หลายเดือนก่อน

    WOW! That's is amazing tutorial, thanks a lot.

  • @wilcity
    @wilcity 2 วันที่ผ่านมา

    Great video! What text editor are you using?

  • @southafricangamer7174
    @southafricangamer7174 หลายเดือนก่อน

    So to my understanding, the singular tests really mean to check if nothing is the result of the query been tested.
    If the test is true, then nothing equates to the query been tested - Great your data is fine.
    If false, you should run that query to see what exactly are those rows.
    Confusing at first but makes sense now.

  • @i-see-right-through-you
    @i-see-right-through-you 4 หลายเดือนก่อน

    well done! great tutorial!

  • @mohammedvahid5099
    @mohammedvahid5099 7 หลายเดือนก่อน

    Pleas make complete videos on DBT WITH snowflake migration project with real time scenario videos bro thnk u❤ nice explaind

    • @jayzern
      @jayzern  7 หลายเดือนก่อน +1

      Thank you man! Will take that into consideration

  • @montoyescoful
    @montoyescoful 5 หลายเดือนก่อน +1

    Hi Jay. Question: Once you have created the Fact table, how does this process work if I run it again? Is it going to append new records and update the existing ones? Or is it going to drop and create the Fact table over again?

  • @prasadatluri
    @prasadatluri 8 หลายเดือนก่อน

    Great video Jay

    • @RohithPatelKanchukatla
      @RohithPatelKanchukatla 8 หลายเดือนก่อน

      Hii mr.prasad garu are you data engineer too?

    • @prasadatluri
      @prasadatluri 8 หลายเดือนก่อน +1

      @@RohithPatelKanchukatla Hi there. I am a Data Scientist

  • @saurabhpandey1882
    @saurabhpandey1882 4 หลายเดือนก่อน

    Thanks Jayzern,! if I can be of some help for your next video let me know!

  • @nadhasthirundhitan
    @nadhasthirundhitan 6 หลายเดือนก่อน

    excellent video, thank you

  • @corbanb
    @corbanb 2 หลายเดือนก่อน

    Jay! Thanks for the video and content very cool to see. Curious why Airflow over something like FiveTran besides the ability to self host? Any gotchas?

    • @jayzern
      @jayzern  2 หลายเดือนก่อน

      FiveTran is not really an orchestration tool - it's really meant for the "Extract Load" part only. It's great because of Unix philosophy, i.e. "do one thing, do one thing well only", whereas Airflow is more of a generalist, task-based orchestrator. Another thing is FiveTran is super expensive, unless you're working on something enterprise-y

  • @miguelhermar
    @miguelhermar 8 หลายเดือนก่อน +1

    Thanks Jay! Could you also upload into the Notion document the code for the dbt_dag.py file for the Airflow deployment? That's still missing 🙏🏻

    • @jayzern
      @jayzern  8 หลายเดือนก่อน +1

      Totally forgot about that, thanks for the reminder!

    • @miguelhermar
      @miguelhermar 8 หลายเดือนก่อน

      No worries, I realized you used it from the Cosmos github repo so I managed to find it there and finally was able to wire up everyhing and deploy it. 🤓 Thanks Jay. It's a super helpful tutorial. @@jayzern

  • @uppinder
    @uppinder 2 หลายเดือนก่อน

    26:00 item_discount_amount is supposed to be negative because the macro defined it as such. I also checked the data on snowflake and they're all negative amounts. Did I miss something?

  • @JoseR-ui9vn
    @JoseR-ui9vn 5 หลายเดือนก่อน

    Thanks Jayzern

  • @hasnaouiwafae6031
    @hasnaouiwafae6031 หลายเดือนก่อน

    I cannot run my dbt project. I’m still a beginner but I do not understand why this happens, considering that my macros directory is empty except for a .gitkeep file:
    Compilation Error
    dbt found two macros named "materialization_table_default" in the project
    "dbt".
    To fix this error, rename or remove one of the following macros:
    - macros/materializations/models/table/table.sql
    - macros/materializations/models/table.sql

  • @AaronAsherRandall
    @AaronAsherRandall 4 หลายเดือนก่อน

    This is great! At what point would you need to dockerize the files though? Sorry, new to data engineering. Thank you!

    • @jayzern
      @jayzern  3 หลายเดือนก่อน

      You can Dockerize it at the beginning, or once you have a baseline model working. I've seen cases where Data engineers start with Docker, or Dockerize it halfway! I personally prefer the latter

  • @StanleySI
    @StanleySI 5 หลายเดือนก่อน +1

    Just wonder in the real world scenario, where are all raw data stored? In AWS s3?

  • @dataengineermatheusbudin7011
    @dataengineermatheusbudin7011 4 หลายเดือนก่อน

    Hey, thanks for the project tutorial. i was wondering if there is the best way to deploy airflow on a cloud enviroment... I see a lot of Ec2 or EKS (kubernetes). But maybe i could work on ECS + Fargate? Which deploy method would you please recomend regarding a production scenario? (like beyond studies, thinking about a daily job task). Thank you mate

    • @jayzern
      @jayzern  3 หลายเดือนก่อน

      Airflow + EKS is probably the most common in the industry because of cost reasons and vertical scaling. You could use ECS + Fargate too, but fargate is really expensive!
      I don't have any recs atm, but will try to create more examples on production DAGs next time. Check out th-cam.com/video/Xe8wYYC2gWQ/w-d-xo.html in the meantime!

  • @truongnguyen813
    @truongnguyen813 6 หลายเดือนก่อน +1

    I'm struggling within the step to load dbt data_pipeline, it did not show in the airflow dag. How could I be wrong, can you support?

  • @AakashKumarDhal
    @AakashKumarDhal 3 หลายเดือนก่อน +4

    Error solved!!!!
    for anyone facing this error:
    Runtime Error
    Database error while listing schemas in database "dbt_db"
    Database Error
    250001: Could not connect to Snowflake backend after 2 attempt(s).Aborting
    Try the second method to update account name for your project inside profile.yml file.
    account_locator-account_name

    • @Abhishek-jp6ib
      @Abhishek-jp6ib 3 หลายเดือนก่อน

      Thank you !

    • @albertcampillo
      @albertcampillo 2 หลายเดือนก่อน

      Thank you!

    • @juventuskid2
      @juventuskid2 หลายเดือนก่อน

      Hey! How did you go about updating the account name(or resolving the error)? I can't find the profile.yml file.

  • @thetrangia1091
    @thetrangia1091 8 หลายเดือนก่อน

    thank you very much

  • @saikoundinya9913
    @saikoundinya9913 4 หลายเดือนก่อน

    Dude, where did you even mention about dbt_project.yml file, in part 2 of the video, you directly jump to vscode
    what are the details ??

  • @BishalKarki-pe8hs
    @BishalKarki-pe8hs 6 หลายเดือนก่อน

    100% worth it

  • @MalvinSiew
    @MalvinSiew 6 หลายเดือนก่อน +2

    Hi Jay, thanks for the video. I'm having an issue connecting to Snowflake backend at the stage you first perform 'dbt run' @ 14:50 .
    This is the error I get:
    15:17:54 Encountered an error:
    Runtime Error
    Database error while listing schemas in database "dbt_db"
    Database Error
    250001: Could not connect to Snowflake backend after 2 attempt(s).Aborting
    I've checked the profiles.yml file and all details are correct. Please help!

    • @parag2039
      @parag2039 6 หลายเดือนก่อน +1

      facing the same issue!!!!!! can anyone please help I've restarted and tried everything possible to figure out but failed

    • @parag2039
      @parag2039 6 หลายเดือนก่อน

      @MalvinSiew
      I solved one of the two errors I was facing. I did not have Git installed in my system. You can simply ask AI for prompts to guide you through the installation process.

    • @vitorcavalcante8495
      @vitorcavalcante8495 5 หลายเดือนก่อน

      Had the same problem, when passing the account_value with 'dbt init' I wasn't able to connecto using the ccount url value, only with the second option which was the - value

    • @oreschz
      @oreschz 3 หลายเดือนก่อน

      did u solve it? I have the same problem. what is the solution?

    • @AakashKumarDhal
      @AakashKumarDhal 3 หลายเดือนก่อน

      @@oreschz could you solve it?

  • @ShreyasSureshDhamore
    @ShreyasSureshDhamore 11 วันที่ผ่านมา

    Hi I am trying your proect and got stuk here can you here
    21:32:24 Unable to do partial parsing because saved manifest not found. Starting full parse.
    21:32:25 Encountered an error:
    Compilation Error
    Model 'model.DATA_PIPELINE.stg_tpch_orders' (models/staging/stg_tpch_orders.sql) depends on a source named 'tpch.orders' which was not found

  • @jeevankumarkondasingu34
    @jeevankumarkondasingu34 6 หลายเดือนก่อน

    Nice Explaination

  • @pythonmathui3057
    @pythonmathui3057 3 หลายเดือนก่อน

    I'm struggling with airflow connection to snowflake, can you make another video to elaborate it more?

    • @jayzern
      @jayzern  3 หลายเดือนก่อน

      For sure, I didn't explain the airflow integration with snowflake as much as I wanted to

  • @ozland7172
    @ozland7172 6 หลายเดือนก่อน

    Hello.. thanks for the tutorial.
    I know airflow runs the tasks/dags however I cannot follow one thing; how do we determine the order of the action items at 35:36 within dbt (I believe it is determined on dbt side) since we have only one dag running on this example? I appreciate if anyone replies.

  • @abdullahsiddique7787
    @abdullahsiddique7787 หลายเดือนก่อน

    Is data engineering dead with advent of AI ? What is the future of data engineering careers in your opinion ?

  • @maryam4071
    @maryam4071 7 หลายเดือนก่อน

    hi, I would like to know about singular test, we want to check negative value in test, why we use the condition as positive?

  • @ThiagoSilva-vh9fy
    @ThiagoSilva-vh9fy 3 หลายเดือนก่อน

    couldn't run int_order_items.sql because it ruturns a strange error: it says: "The selection criterion 'int_order_items.sql' does not match any enabled nodes". And if aI run "dbt run" it says: " unexpected '.' in line 1" at 20:22

  • @srikantaghosh2386
    @srikantaghosh2386 4 หลายเดือนก่อน

    At 32:21, how did you copy the dbt folders to airflow project?

  • @bazi15
    @bazi15 9 วันที่ผ่านมา

    make video related star and dimension modeling

  • @SakshiGowda-vl1ke
    @SakshiGowda-vl1ke หลายเดือนก่อน

    hi ! I'm having trouble connecting to snowflake. can someone please help me resolve it . I just started learning dbt and snowflake .
    Runtime Error
    Database error while listing schemas in database "dbt_db"
    Database Error
    250001: Could not connect to Snowflake backend after 2 attempt(s).Aborting

  • @hiteshmohite7677
    @hiteshmohite7677 8 หลายเดือนก่อน

    hey, I have a small request
    can you please make a video on how to make use of pyspark efficiently in low spec system with huge amount of data

    • @jayzern
      @jayzern  8 หลายเดือนก่อน

      Low compute Spark + high volumes of data is challenging but will take note. Thx for the suggestion

  • @stephenarnold2343
    @stephenarnold2343 4 หลายเดือนก่อน

    I materialized marts as tables but int_order_items, int_order_items_summary and fct_orders are created as views instead of tables. How do I convert these views to tables?

  • @shloktalhar3981
    @shloktalhar3981 10 วันที่ผ่านมา

    Tell me one thing , is data engineering good job profile for freshers

  • @aiviet5497
    @aiviet5497 4 หลายเดือนก่อน

    I need a longer video. Please give me.

  • @anggipermanaharianja6122
    @anggipermanaharianja6122 หลายเดือนก่อน

    nice

  • @SteynGun-n2u
    @SteynGun-n2u 3 หลายเดือนก่อน

    hi guys kindly help me out, does only snowflakes and dbt is enought are i have to learn hadoop, spark etc i am working as data analyst for last 1 year and planning to switch to de

  • @khushikunte254
    @khushikunte254 หลายเดือนก่อน

    any prerequisites for this

  • @CSK-Studios
    @CSK-Studios 8 หลายเดือนก่อน

    Hi Jay, good one..am trying same way but getting below error " 1 of 1 ERROR creating view model dbt_schema.stg_tpch_line_items................. [ERROR in 0.04s]
    06:17:33
    06:17:33 Finished running 1 view model in 2.02s.
    06:17:33
    06:17:33 Completed with 1 error and 0 warnings:
    06:17:33
    06:17:33 Compilation Error in model stg_tpch_line_items (models\staging\stg_tpch_line_items.sql)
    06:17:33 'dict object' has no attribute 'type_string'
    06:17:33
    06:17:33 > in macro generate_surrogate_key (macros\sql\generate_surrogate_key.sql)
    06:17:33 > called by macro default__generate_surrogate_key (macros\sql\generate_surrogate_key.sql)
    06:17:33 > called by model stg_tpch_line_items (models\staging\stg_tpch_line_items.sql)"

    • @jayzern
      @jayzern  8 หลายเดือนก่อน

      Try checking if your dbt_utils version is correct. There seems to be a compile time error with calling generate surrogate key. The code is available in notion page.

    • @szhao2864
      @szhao2864 8 หลายเดือนก่อน

      I got the same error. How did you solve it?

  • @neosmith009
    @neosmith009 7 หลายเดือนก่อน

    Overall great, the airflow orchestration felt a bit clunky especially given that the source code had to be kept in the same directory.

    • @jayzern
      @jayzern  7 หลายเดือนก่อน

      Thx for the feedback 👍 ideally should wrap this in a container image, but for simplicity decided to keep it as code

    • @neosmith009
      @neosmith009 7 หลายเดือนก่อน

      @@jayzern Makes sense, any good resources on self hosting dbt core?

  • @prabhatgupta6415
    @prabhatgupta6415 4 หลายเดือนก่อน

    can u tell why have we used airflow since dbt cloud has feature to schedule the jobs?

    • @jayzern
      @jayzern  3 หลายเดือนก่อน

      If your company only uses dbt and no other tooling, dbt cloud works too
      However in the real world, it's hard to control your CRON schedule when you have many tools in your stack. Orchestrators job is to focus on scheduling. Linux philosophy of do one thing, do one thing well TLDR

  • @rvmnet2112
    @rvmnet2112 6 หลายเดือนก่อน

    One question here, As we have dbt jobs feature available in dbt cloud and it is very easy to create job here then why it is need to use airflow?

    • @jayzern
      @jayzern  6 หลายเดือนก่อน

      Yea that's great question! In theory dbt cloud can trigger jobs too, but in practice you'd want to decouple your orchestration tool away from your transformation tool for a myriad of reasons: ability to orchestrate other tools together with dbt, avoid vendor lock from dbt, many companies are comfortable with Airflow etc. It really depends on your tech stack

  • @albertcampillo
    @albertcampillo 2 หลายเดือนก่อน

    Hi @jayzern, thanks a lot for your video, really valuable content!

  • @aliceschirina8191
    @aliceschirina8191 หลายเดือนก่อน +1

    Hello, thanks for this tutorial. At the very beginning, when trying to run the "dbt deps" command I'm getting this error : "Encountered an error loading local configuration: dbt_cloud.yml credentials file for dbt Cloud not found. Download your credentials file from dbt Cloud to `C:\Users\a.schirina\.dbt`". I'm using dbt command locally and my profiles.yml in the .dbt folder is
    data_pipeline:
    target: dev
    outputs:
    dev:
    type: snowflake
    account: jpb45436
    # User/password auth
    user: alices
    password: mypassword
    role: dbt_role
    database: dbt_db
    warehouse: dbt_wh
    schema: dbt_schema
    threads: 4
    client_session_keep_alive: False
    Does anyone know the problem?

  • @OmerNadler
    @OmerNadler 7 หลายเดือนก่อน

    is do i need to pay on astro ? if i want to use this for prod env

  • @StephenRayner
    @StephenRayner 4 หลายเดือนก่อน

    You should check out Meltano

    • @jayzern
      @jayzern  3 หลายเดือนก่อน

      I've heard great things about Meltano!

  • @CybersecYT
    @CybersecYT 6 หลายเดือนก่อน

    How could i get the project folder structure?

  • @sanaomar2182
    @sanaomar2182 6 หลายเดือนก่อน

    How did he stat? did he create a wroksheet? I tried it but it di not work, the very first steps ?? what arethey?

    • @Rajdeep6452
      @Rajdeep6452 5 หลายเดือนก่อน

      yes you need to write the queries in a worksheet

  • @kirankumar1290
    @kirankumar1290 7 หลายเดือนก่อน

    Good

  • @goabc-u2o
    @goabc-u2o 24 วันที่ผ่านมา

    (venv) PS C:\Users\hsrak\Desktop\DataManagemet2
    ew\data_pipeline\dbt-dag> brew install astro
    brew : The term 'brew' is not recognized as the name of a cmdlet, function, script file, or operable program. Check the
    spelling of the name, or if a path was included, verify that the path is correct and try again.
    At line:1 char:1
    + brew install astro
    + ~~~~
    + CategoryInfo : ObjectNotFound: (brew:String) [], CommandNotFoundException
    + FullyQualifiedErrorId : CommandNotFoundException
    Please help me

  • @okkwok1753
    @okkwok1753 8 หลายเดือนก่อน

    I am not sure why I cannot open the notes, can anyone help?

    • @jayzern
      @jayzern  8 หลายเดือนก่อน +2

      I double checked the link and it's working, try this
      bittersweet-mall-f00.notion.site/Code-along-build-an-ELT-Pipeline-in-1-Hour-dbt-Snowflake-Airflow-cffab118a21b40b8acd3d595a4db7c15?pvs=74
      Let me know what error you see

  • @tianhockwoo3025
    @tianhockwoo3025 3 หลายเดือนก่อน

    Hello did anyone else face this error at Airflow after @32:50
    Broken DAG: [/usr/local/airflow/dags/dbt-dag.py]
    Traceback (most recent call last):
    File "/usr/local/lib/python3.12/site-packages/cosmos/operators/base.py", line 361, in __init__
    self.full_refresh = full_refresh
    ^^^^^^^^^^^^^^^^^
    File "/usr/local/lib/python3.12/site-packages/airflow/models/baseoperator.py", line 1198, in __setattr__
    if key in self.__init_kwargs:
    ^^^^^^^^^^^^^^^^^^
    AttributeError: 'DbtRunLocalOperator' object has no attribute '_BaseOperator__init_kwargs'. Did you mean: '_BaseOperator__instantiated'?
    please send help

    • @CosmicNomad
      @CosmicNomad 2 หลายเดือนก่อน +1

      I am facing the exact same error. Please post a reply, if you were able to figure out the fix. I'll do the same if I find a solution.

    • @CosmicNomad
      @CosmicNomad 2 หลายเดือนก่อน

      Ok, so I think I was able to find the thread related to this issue.. Its still open as of 8/18/2024 11pm PT..
      github.com/astronomer/astronomer-cosmos/issues/1161

  • @RockefellerBurgess
    @RockefellerBurgess หลายเดือนก่อน

    19395 Bryana Station

  • @abdelaliamghar5123
    @abdelaliamghar5123 3 หลายเดือนก่อน

    90' it's a long Time

    • @jayzern
      @jayzern  3 หลายเดือนก่อน

      1.5 hours?

  • @sanaomar2182
    @sanaomar2182 6 หลายเดือนก่อน

    how do you know your username>? jayzer? like I went back to my profile but it did not work. where can I find the name of my user?

    • @quintonflorence6492
      @quintonflorence6492 6 หลายเดือนก่อน

      In Snowflake you should see your user name in the bottom left corner. It'll be the top bolded value

  • @GarrettSchwarzenbach-u9v
    @GarrettSchwarzenbach-u9v 2 หลายเดือนก่อน

    Roberts Isle

  • @AlbertLorraine-t1y
    @AlbertLorraine-t1y หลายเดือนก่อน

    Ziemann Bypass

  • @RyanSpurr-k5v
    @RyanSpurr-k5v หลายเดือนก่อน

    Good vid but move your facecam out of the terminal