How to build and automate a ETL pipeline with AWS airflow | AWS End-To-End Data Engineering Project

แชร์
ฝัง
  • เผยแพร่เมื่อ 15 ก.ย. 2024

ความคิดเห็น • 34

  • @boldbellearts
    @boldbellearts 11 หลายเดือนก่อน +1

    Please post more content like this, its very useful for me to understand a project on the whole and how each tool/product work together! great work!

  • @hassanessam375
    @hassanessam375 9 หลายเดือนก่อน

    Thanks for the explanation, More of these videos please

  • @diegoalejandrorobledofigue1377
    @diegoalejandrorobledofigue1377 3 หลายเดือนก่อน

    Good work mate

  • @user-zg1kg5yt8q
    @user-zg1kg5yt8q 11 หลายเดือนก่อน

    Hi Nice informative and knowledgible video..

  • @dataengineering4438
    @dataengineering4438 5 หลายเดือนก่อน

    You rocked it Bro

  • @FallenJakarta
    @FallenJakarta หลายเดือนก่อน

    Thank you very much

  • @ForeverThankful-r3b
    @ForeverThankful-r3b 11 หลายเดือนก่อน

    Thanks man, this is helpful👍

  • @PoojaSharma-jo6hn
    @PoojaSharma-jo6hn ปีที่แล้ว

    Well explained sir 👍🏻

  • @NikitaLalwani-q7w
    @NikitaLalwani-q7w หลายเดือนก่อน

    hello that yaml file is giving error on AWS

  • @rahulteja4849
    @rahulteja4849 8 หลายเดือนก่อน

    I am facing an issue near max worker, from the set up it is set as 4, but while i am trying to link the requirements file then it is throwing an error as the number of max worker is not in th range (range 1-0) , i could not resolve this. I tried to create the env couple of times but still the same error is showing up.

  • @groundingtiming
    @groundingtiming 8 หลายเดือนก่อน

    hello, what if i do not wish to use redshift here? how can i deploy it

  • @raghudubba4427
    @raghudubba4427 3 หลายเดือนก่อน

    Even if i configure requirements.txt correctly , still getting import pandas as pd modulenotfounderror: no module named 'pandas' in airflow and dag is broken error . Thanks

    • @raghudubba4427
      @raghudubba4427 3 หลายเดือนก่อน

      Broken DAG: [/usr/local/airflow/dags/openweather_api.py] Traceback (most recent call last):
      File "", line 241, in _call_with_frames_removed
      File "/usr/local/airflow/dags/openweather_api.py", line 8, in
      import pandas as pd
      ModuleNotFoundError: No module named 'pandas'

    • @DataTechByAnandKumar
      @DataTechByAnandKumar  3 หลายเดือนก่อน

      Just redeploy the airflow instance .

  • @user-zg1kg5yt8q
    @user-zg1kg5yt8q 11 หลายเดือนก่อน

    is it necessary to create the bucket (airflowoututtos3bucket) manually
    ?

  • @avinash7003
    @avinash7003 8 หลายเดือนก่อน

    can you make project on EMR?

  • @yitianhou8706
    @yitianhou8706 3 หลายเดือนก่อน

    Hi, very nice project and video!!!
    I am having some problems, when I upload airflow-redshift-template.yaml to cloudformation -- Create stack, it shows the following error, could you tell me how to fix it?
    Template format error
    Follow the standard JSON or YAML spec to format your template.Learn more
    Parser error
    duplicated mapping key (323:3)
    320 | WebserverLogs:
    321 | LogLevel: !Ref Webserve ...
    322 | Enabled: true
    323 | SecurityGroup:
    ---------^
    324 | Type: AWS::EC2::SecurityGroup
    325 | Properties:
    Thank you very much, sir!

    • @abhishekmote7250
      @abhishekmote7250 3 หลายเดือนก่อน

      I agree, I am having same issue. Did you get it resolved?

    • @VORSTIENER
      @VORSTIENER หลายเดือนก่อน

      The issue is as a result of a duplicated mapping key - the key "SecurityGroup" is defined twice. Just rename the second SecurityGroup to SecurityGroup1.
      "SecurityGroupIngress" is also duplicated so you'll need to do the same.

  • @DhaneshAkolu
    @DhaneshAkolu 4 หลายเดือนก่อน

    Is this service free to use? If not can I work on this project and disable the service?

    • @DataTechByAnandKumar
      @DataTechByAnandKumar  4 หลายเดือนก่อน

      Yeah, you can work and disable the services .

  • @Chelovik_NZ
    @Chelovik_NZ 11 หลายเดือนก่อน +5

    WARNING ... I created a FREE AWS account and completed this training exercise during my free time, which I completed over two days. I was shocked to receive a bill from AWS for nearly $60.00. and an estimate of $600 for a month, even though the system was not running anything. I contacted AWS and it is taken another two days of correspondence to manually shut down all the services, and disestablish connections, and I still am arguing over the charges. I am very unlikely to recommend AWS to any future client

    • @MrSanjuuk
      @MrSanjuuk 8 หลายเดือนก่อน

      The services used in this demo are not covered under free tier.. so it's the user responsibility to check before hand..however aws is usually wave out this kind of billing easily

  • @ChetanSharma-oy4ge
    @ChetanSharma-oy4ge ปีที่แล้ว

    can we use any other service instead of airflow in aws ?

    • @DataTechByAnandKumar
      @DataTechByAnandKumar  ปีที่แล้ว

      We can use step functions.

    • @ChetanSharma-oy4ge
      @ChetanSharma-oy4ge ปีที่แล้ว

      @@DataTechByAnandKumarmwaa will not be required right?

    • @DataTechByAnandKumar
      @DataTechByAnandKumar  ปีที่แล้ว

      There are alternative ways to do things but if you don't wanna use airflow then sure.

  • @user-zg1kg5yt8q
    @user-zg1kg5yt8q 11 หลายเดือนก่อน

    Im got this error
    Broken DAG: [/usr/local/airflow/dags/openweather_api.py] Traceback (most recent call last):
    File "/usr/local/airflow/dags/openweather_api.py", line 26, in
    "appid": Variable.get("key")
    File "/usr/local/airflow/.local/lib/python3.10/site-packages/airflow/models/variable.py", line 142, in get
    raise KeyError(f"Variable {key} does not exist")
    KeyError: 'Variable key does not exist'

    • @ahmeddadjio2003
      @ahmeddadjio2003 9 หลายเดือนก่อน

      Me too!! Have you resolved your problem anyway?