Airflow Data Pipeline with AWS and Snowflake for Beginners | Project

  • Published on Nov 2, 2024

Comments • 50

  • @mellownun9220 1 year ago +8

    Is there a benefit to using Airflow instead of Snowpipe for this purpose?

    • @alejandroflorian9574 8 months ago +2

      Imagine needing to consume and migrate not just a single table, but over 100. You'd have to create 100 pipes for inserting the data. Now, with Airflow, it's easier to customize and scale this process.
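
The scaling point above can be sketched in Python. This is a minimal illustration, not code from the video: the table names, the stage name `my_s3_stage`, and the CSV file format options are all assumptions. It generates one COPY INTO statement per table from a single list; in an Airflow DAG, the same loop could create one SQL task per table.

```python
# Hypothetical table and stage names, used only for illustration.
TABLES = ["customers", "orders", "payments"]
STAGE = "my_s3_stage"


def build_copy_statement(table: str, stage: str = STAGE) -> str:
    """Return a Snowflake COPY INTO statement that loads one table from a stage path."""
    return (
        f"COPY INTO {table} "
        f"FROM @{stage}/{table}/ "
        "FILE_FORMAT = (TYPE = 'CSV' SKIP_HEADER = 1)"
    )


def build_all_statements(tables: list[str]) -> dict[str, str]:
    """Map each table name to its COPY INTO statement."""
    return {table: build_copy_statement(table) for table in tables}


if __name__ == "__main__":
    for table, sql in build_all_statements(TABLES).items():
        print(f"{table}: {sql}")
```

Adding a table then means adding one entry to the list rather than defining another pipe.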

  • @MarcLamberti 1 year ago +1

    For those who don't see the host anymore, in the account field, make sure you add:
    youraccountnumber.yourregion.yourcloud
    For example: nb71231.eu-west-3.aws
    Basically, take everything between https:// and .snowflakecomputing.com
    and leave the region field empty.
    Enjoy
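
The rule in this comment can be sketched as a small helper. This is an illustration under assumptions: the function name `account_from_url` is hypothetical, and the sample URL is built from the example account in the comment. It returns the part of the Snowflake URL host that comes before `.snowflakecomputing.com`, which is the value the comment says to put in the account field.

```python
from urllib.parse import urlparse

SNOWFLAKE_SUFFIX = ".snowflakecomputing.com"


def account_from_url(url: str) -> str:
    """Extract the account value from a Snowflake URL (hypothetical helper)."""
    # urlparse only fills in the hostname when a scheme is present.
    if "://" not in url:
        url = "https://" + url
    host = urlparse(url).hostname or ""
    if not host.endswith(SNOWFLAKE_SUFFIX):
        raise ValueError(f"Not a Snowflake URL: {url!r}")
    # Drop the shared suffix; what remains is account.region.cloud.
    return host[: -len(SNOWFLAKE_SUFFIX)]


if __name__ == "__main__":
    print(account_from_url("https://nb71231.eu-west-3.aws.snowflakecomputing.com"))
```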

  • @AnselJanson-v5v 5 months ago

    Still works 😄. Really cool pipeline.

    • @MarcLamberti 5 months ago +1

      Good to know 🥹

  • @datalearningsihan 1 year ago +1

    I was struggling with the Airflow installation, so I purchased your Udemy course, hoping to get better support.

    • @MarcLamberti 1 year ago

      Keep me posted ;)

    • @datalearningsihan 1 year ago

      @MarcLamberti It did not really help. I asked Udemy for a refund. I had issues installing Airflow your way: my CPU was maxing out, and nothing really worked after I installed Airflow the way you recommended. It was a bad first impression of the course, so I had to ask for a refund. Sorry.

    • @MarcLamberti 1 year ago

      @datalearningsihan You don't have to be sorry. I believe your issue is more related to Docker than to Airflow or the course. Check that you have enough memory. Otherwise, you can still install Airflow manually with pip install.

  • @AlDamara-x8j 1 year ago

    Thanks Marc! Great Tutorial!

  • @steffot8468 1 year ago

    Thanks man, very much appreciated.

  • @YEM_ 8 months ago

    How do we manage connection credentials without the UI? I mean deploying them as code with a reference to a secrets manager.
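
One possible approach, sketched here as an assumption rather than as the video author's method: Airflow reads connections from environment variables named `AIRFLOW_CONN_<CONN_ID>`, and since Airflow 2.3 the value can be a JSON document. The helper name, the connection id `snowflake_default`, the `extra` keys, and the `SNOWFLAKE_PASSWORD` variable below are all hypothetical; the password is expected to be injected from whatever secret store the deployment uses. Airflow also supports pluggable secrets backends, which this sketch does not cover.

```python
import json
import os


def connection_env_var(conn_id: str, conn_type: str, login: str,
                       password: str, extra: dict) -> tuple[str, str]:
    """Build the (name, value) pair for an Airflow connection environment variable."""
    name = f"AIRFLOW_CONN_{conn_id.upper()}"
    value = json.dumps(
        {
            "conn_type": conn_type,
            "login": login,
            "password": password,
            "extra": extra,
        }
    )
    return name, value


if __name__ == "__main__":
    # The password comes from the environment, never from source control.
    secret = os.environ.get("SNOWFLAKE_PASSWORD", "")
    name, value = connection_env_var(
        "snowflake_default",
        "snowflake",
        "my_user",  # hypothetical login
        secret,
        {"account": "nb71231.eu-west-3.aws", "warehouse": "my_wh"},  # assumed keys
    )
    # Print only the variable name so the secret is not echoed.
    print(name)
```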

  • @黄钦钦-p6b 2 years ago +1

    Learned the easiest way to build a dev environment for an Airflow data pipeline. Great!!

  • @ornachshon1 1 year ago

    What is the best way to pass a CSV between tasks?
    For example: one function parses JSON to CSV,
    and a second function uploads the CSV to an S3 bucket.
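
A common pattern, offered here as a sketch and not as an answer from the video author, is to write the CSV to storage that both tasks can reach and pass only its path between them, since Airflow's XCom is meant for small values. The function names and the file name `export.csv` are hypothetical, and the second function only reads the file back where a real task would upload it to S3.

```python
import csv
import json
import tempfile
from pathlib import Path


def json_to_csv(json_text: str, out_dir: str) -> str:
    """Parse a JSON array of flat objects, write it as CSV, and return the file path."""
    rows = json.loads(json_text)
    out_path = Path(out_dir) / "export.csv"
    with out_path.open("w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=list(rows[0].keys()))
        writer.writeheader()
        writer.writerows(rows)
    # Only this small string would travel between tasks.
    return str(out_path)


def read_for_upload(csv_path: str) -> bytes:
    """Stand-in for the upload task: load the CSV bytes found at the given path."""
    return Path(csv_path).read_bytes()


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        path = json_to_csv('[{"id": 1, "name": "a"}]', tmp)
        print(len(read_for_upload(path)), "bytes ready for upload")
```

With workers on separate machines, the intermediate path would need to point at shared storage rather than a local temporary directory.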

  • @Aman-lv2ee 6 months ago

    Thanks Marc. I am facing this error when connecting to Snowflake from Airflow. Airflow is running in Docker Compose (the file you provided in the Udemy course): ERROR- 250001: 250001: Could not connect to Snowflake backend after 2 attempt(s).Aborting
    I checked all the parameters but I'm still facing this issue (Airflow version v2.8.1).

  • @aminemaasri2622 1 year ago

    Hi Marc, do I need to run astro dev start again when I create a new DAG in the dags folder?

  • @kurihama4629 1 year ago +4

    Anyone else having issues with the Snowflake connection? I followed everything but it doesn't seem to work. I'm not even sure how to find out what went wrong.

    • @aldoaguirre9864 1 year ago

      Yeah, same problem for me:
      250001: 250001: Could not connect to Snowflake backend after 0 attempt(s).Aborting

  • @karantatariya1303 months ago

    Still facing connection issues. I'm getting this error: snowflake.connector.errors.OperationalError: 250001: 250001: Could not connect to Snowflake backend after 2 attempt(s).Aborting
    Please assist.

  • @ruchipandey9721 1 year ago +2

    I can't see Amazon S3 in Airflow on localhost. Can you please help me with that?

    • @MarcLamberti 1 year ago

      Did you install the Amazon provider?

    • @Yonatanx3 1 year ago

      Hi Ruchi, I'm facing the same issue.
      Did you manage to solve this?
      Thanks

    • @MarcLamberti 1 year ago +3

      @Yonatanx3 Use Amazon Web Services for the connection type ;)

  • @awallaustin 1 year ago +1

    Can you check on creating the connection from Airflow to Snowflake? The interface has changed slightly and now I'm unable to create a connection. I've verified that all parameters are correct, and yet the test is still failing.

    • @isaachernandez3094 1 year ago

      Yes, I have the same issue.

    • @MarcLamberti 1 year ago

      I’ve just released a new video that shows how to make that connection th-cam.com/video/YZTcIi5o7FI/w-d-xo.htmlsi=8-8-Q8LUasYfz2V0

  • @NardeepML 1 year ago

    Hi, when creating connections in Airflow, the Test button is greyed out and says 'Testing connections is disabled in Airflow configuration. Contact your deployment admin to enable it'. Can you please help me enable the test? I can see in the config that it's set to disabled; I just need to know how to switch it. Thanks

    • @MarcLamberti 1 year ago

      Yes. That was introduced in 2.7. Change the configuration setting AIRFLOW__CORE__TEST_CONNECTION to Enabled.
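
The variable name in this reply follows Airflow's convention for overriding configuration through the environment: `AIRFLOW__<SECTION>__<KEY>`. The sketch below builds that name for the `[core] test_connection` option; the helper name `airflow_env_name` is hypothetical. Setting the variable inside a Python process only affects that process, so in a Docker-based setup the name and value would normally go into the environment file or compose file that Airflow starts with.

```python
import os


def airflow_env_name(section: str, key: str) -> str:
    """Build the environment variable name that overrides an Airflow config option."""
    return f"AIRFLOW__{section.upper()}__{key.upper()}"


if __name__ == "__main__":
    name = airflow_env_name("core", "test_connection")
    # Affects only this process; shown to make the name/value pair concrete.
    os.environ[name] = "Enabled"
    print(f"{name}={os.environ[name]}")
```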

  • @salilmarathponmadom7255 2 years ago +3

    At the SQL Requests step, I had to execute the queries that create the data warehouse and the schema separately, because I ran into a "No active warehouse selected in the current session" error later when trying to insert values into the table.
    Also, in the Airflow UI, I don't have the Amazon S3 option under Connections!

    • @MarcLamberti 2 years ago +2

      Use the AWS option instead of the Amazon S3 connection type. Thanks for sharing.

  • @kkampassi4820 1 year ago

    For me, there is no option to add the host URL when Snowflake is the connection type. Please suggest something.

    • @MarcLamberti 1 year ago

      You need to install the apache-airflow-providers-snowflake==4.4.0 provider

    • @kkampassi4820 1 year ago

      @MarcLamberti I tried, but it is still not working. Could you please share the Git repo for the entire process? It would be a great help for us.

    • @MarcLamberti 1 year ago

      @kkampassi4820 Look at the pinned comment :) I will release a video tomorrow that also uses Snowflake, with the updated way.

  • @alex45688 7 months ago

    I can't see the Amazon S3 connection type in the Airflow web UI.

    • @MarcLamberti 7 months ago

      It’s AWS now

    • @alex45688 7 months ago

      @MarcLamberti ok

  • @abdelhakemabdelhak4624 2 months ago +1

    Thank you, but I'm sorry, English with a French accent is terrible

    • @MarcLamberti 2 months ago

      Eating a 🥐 while listening makes it better ❤️

  • @konnen4518 1 year ago +2

    I just can't stand that accent

    • @MarcLamberti 1 year ago +2

      Me too 🤢

    • @akj3344 1 year ago +2

      @MarcLamberti I love your accent. Don't listen to ungrateful morons.

    • @MarcLamberti 1 year ago +1

      @akj3344 Thank you 🙏

    • @konnen4518 1 year ago

      @akj3344 eat deek

  • @sampyism 9 months ago

    I couldn't find the "Amazon S3 connection" in the Airflow UI. What's going on?

    • @sampyism 9 months ago

      Can someone explain how I can install the S3 provider package?