Installing and Running Airflow on a small AWS EC2 Instance

แชร์
ฝัง
  • เผยแพร่เมื่อ 3 ต.ค. 2024
  • Running Apache Airflow on a small AWS EC2 Instance
    Many people think Airflow is resource-consuming, but you can install and run it on a machine with only 1 CPU and 2 GB or RAM.
    🏆 BECOME A PRO: www.udemy.com/...
    👍 Smash the like button to become an Airflow Super Hero!
    ❤️ Subscribe to my channel to become a master of Airflow
    🚨 My Patreon: / marclamberti
    The materials: robust-dinosau...
    Enjoy 🫶

ความคิดเห็น • 59

  • @Orionalpha48
    @Orionalpha48 ปีที่แล้ว +3

    Fantastic video. Got it up and running in no time. Simple yet comprehensive video that came out just in time to help me with my project.

  • @LoveisHell85
    @LoveisHell85 ปีที่แล้ว +2

    Airflow on Fargate next please! Thank you so much for your work.

  • @noahfisher1955
    @noahfisher1955 ปีที่แล้ว +1

    This video was such a tremendous help, thank you so much Marc :)

  • @siddhisail7105
    @siddhisail7105 11 หลายเดือนก่อน

    This video is a blessing for me. Thank you so much for making this.

    • @MarcLamberti
      @MarcLamberti  11 หลายเดือนก่อน

      Happy to help 🙌

  • @RAVIKIRANKANUR
    @RAVIKIRANKANUR 8 หลายเดือนก่อน

    Simple and great explanation!!! Thanks a lot

  • @AlexHitchon
    @AlexHitchon 6 หลายเดือนก่อน

    Hey great vid Marc was able to follow along just surprised we need postgres. I found this useful to get me started on my personal project of taking some data and writing to a SQL server db daily. Thanks

  • @ragibshahriar7959
    @ragibshahriar7959 9 หลายเดือนก่อน

    Great work Marc! Please keep it up

  • @anirudhnegi4523
    @anirudhnegi4523 ปีที่แล้ว

    Great work Marc. Helpful content!

  • @ccc_ccc789
    @ccc_ccc789 4 หลายเดือนก่อน

    Fantastic! Thanks Marc!

  • @xmagcx1
    @xmagcx1 ปีที่แล้ว

    excellent example

  • @thghtfl
    @thghtfl ปีที่แล้ว

    Thanks for the video, Marc. It was unclear though how it differs from running on 4GB or 8GB instances because I didn't see any specific points on memory allocation. Also, I tried to run Airflow in a Docker container on a 2GB ec2 instance, but at some point, the instance stopped responding for some reason. I wasn't able to find out whether Airflow or Docker caused the issue, but for me, it didn't work.

  • @alexanderpotts8425
    @alexanderpotts8425 ปีที่แล้ว

    Thanks for this. I once got Airflow to start on a Micro instance with an external database 😂but once I installed the Databricks operators it was game over from a disk space POV

  • @ChinmaySathe-bx4ck
    @ChinmaySathe-bx4ck 5 หลายเดือนก่อน

    You are a life save

  • @kaushalkrishna5249
    @kaushalkrishna5249 11 หลายเดือนก่อน +1

    Will webserver run even if we close terminal ?
    Also how will we run both webserver and scheduler in background even after closing terminal ?

  • @ronakshah6336
    @ronakshah6336 5 หลายเดือนก่อน

    Can you make video how should we do airflow for windows rather than ubantu

  • @hananenadi3744
    @hananenadi3744 15 วันที่ผ่านมา

    Hey great vid thank you very much, just i have a question
    The t2.small instance in EC2 is free ? Or should i pay sometimes at the end of usage ?

  • @abdelhakemabdelhak4624
    @abdelhakemabdelhak4624 หลายเดือนก่อน

    merci

  • @dipeshg8581
    @dipeshg8581 9 หลายเดือนก่อน

    Sir please can you tell how to stop all of airflow components and restart them in one single command. so that changes can be made easily

  • @AlDamara-x8j
    @AlDamara-x8j ปีที่แล้ว +1

    Hi Marc, again, thanks for your videos. After running airflow on aws ec2 instance, can I still follow your "Apache Airflow: The Hands-On Guide" Course on Udemy without any problems? What happens when I stop the ec2 instance for later resume? Do I need to recreate everything again from scratch? How can I avoid that?

  • @danielmawdsley157
    @danielmawdsley157 ปีที่แล้ว +1

    Great video! Have you considered making a terraform script so all of these steps could be deployed together?

    • @MarcLamberti
      @MarcLamberti  ปีที่แล้ว +1

      Nop, but if you wanna partner on that I would be happy 🫶

  • @jimaustin3608
    @jimaustin3608 ปีที่แล้ว

    Anyone notice that the written command sequence is different from the tutorial verbal sequence?
    For example (4:25), ‘python3 -m venv venv’ is left out of the written seq; necessary to create the venv directory with the virtual environment.
    Then the lib install (sudo apt-get install libpq-dev) comes before the virtual environment activation (source venv/bin/activate). (don’t know if this is a problem)
    Since the EC2 instance works in the tutorial, might be a good idea to use it to ‘fix’ the written sequence … or just follow.

  • @hjop010
    @hjop010 ปีที่แล้ว +1

    Hi Marc. Can you give an idea of montlhy costs for this setup? Thanks!

    • @JeanBzh
      @JeanBzh ปีที่แล้ว

      $0.023 per hour, so $201 per year ($16.80 per month)
      Prices vary depending on the region you select

  • @ML-bu7tu
    @ML-bu7tu ปีที่แล้ว

    Hello ! Your video helped me a lot! What should I do if I want to modify the port number from 8080 to 8081 ?

  • @findsarfaraz
    @findsarfaraz 2 หลายเดือนก่อน

    hey do you teach sqllite db in airflow in your udemy course. ?

  • @parashualamel6104
    @parashualamel6104 4 หลายเดือนก่อน

    Iam facing while mapping sql alchemy connection with postgres when i tried with what you said its not reflecting to correspong variable

  • @harishmahale1
    @harishmahale1 6 หลายเดือนก่อน

    How do we disable the example dags and reduce the clutter in web UI?

  • @HarishMahale-y8n
    @HarishMahale-y8n 6 หลายเดือนก่อน

    In cases where you are installing certain "latest versions" (e.g. airflow 2.5.0, python 3.10, etc.), can you mention if we should continue with those versions in the video or find out the actual latest version while installing. Will installing another highest latest version cause any issues with rest of the steps?

  • @harishmahale1
    @harishmahale1 6 หลายเดือนก่อน

    How do we keep the scheduler always running?

  • @GuidoPerez-z8f
    @GuidoPerez-z8f 8 หลายเดือนก่อน

    Hello Marc,
    i setup Airflow on ec2, using your video. I was able to successfully install it but I keep getting this error "The scheduler does not appear to be running. Last heartbeat was received 20 hours ago."
    How can i fix it.
    thank you

  • @IvanRmz27
    @IvanRmz27 ปีที่แล้ว

    Do you have an azure equivalent video? Thanks!

  • @GMedchess
    @GMedchess ปีที่แล้ว

    The tutorial works but I have a problem, The scheduler does not appear to be running. ... This happened after I close the shh connection, but the webserver is still active.
    I need use this airflow to schedule a periodic task what should I do ? ... How do I keep the scheduler active

  • @cesarofuchi
    @cesarofuchi ปีที่แล้ว

    python3 -m venv venv is missing from the notion page

  • @OmerNadler
    @OmerNadler 6 หลายเดือนก่อน

    can you make it with docker ? and kub8 ?

  • @0kazaki
    @0kazaki ปีที่แล้ว

    If I want to deploy airflow in a production environment with multiple workers for scalability, how should I do it ? I assume with kubernetes?

  • @javiercavieres2843
    @javiercavieres2843 ปีที่แล้ว

    Why when i hit the command airflow db init that don't create de airflow folder?

  • @guruprasadbs
    @guruprasadbs ปีที่แล้ว

    Hi, where are those dag files stored

  • @findsarfaraz
    @findsarfaraz 2 หลายเดือนก่อน

    in your udemy course have covered sparksubmit operator?

  • @user-c3p0-zk1mr
    @user-c3p0-zk1mr ปีที่แล้ว

    Awesome video. Is it possible to install airflow via cloud9 (it runs on ec2 and can be stopped anytime)?

  • @kikecastor
    @kikecastor ปีที่แล้ว

    Beautiful video, thank you!
    Two questions:
    If I shut down my computer, will the dags still run in the cloud?
    Can anyone enter with the public ipv4 dns 12:42 or can only I enter?

    • @MarcLamberti
      @MarcLamberti  ปีที่แล้ว

      correct

    • @kikecastor
      @kikecastor ปีที่แล้ว

      @@MarcLamberti Awesome, thank you!

    • @MrMadmaggot
      @MrMadmaggot ปีที่แล้ว

      @@MarcLamberti Correct what?

    • @UsrU-y8c
      @UsrU-y8c 7 หลายเดือนก่อน

      @@MrMadmaggot shutdown pc - resources still run in the cloud.
      Can anyone enter... -Security is not my best friend, but this does not look secure at all to me. Anyone with the link can enter for sure.

    • @MrMadmaggot
      @MrMadmaggot 7 หลายเดือนก่อน

      How is it possible? How the dags can run if the Instance is shutdown?@@UsrU-y8c

  • @EliaMartali
    @EliaMartali ปีที่แล้ว

    Tried this with azure VMs and got some errors... I had to give up lol

  • @jeanmarieabengzoa2600
    @jeanmarieabengzoa2600 7 หลายเดือนก่อน

    I have question. It is important to install postgresql before?

  • @duynghiavo7915
    @duynghiavo7915 หลายเดือนก่อน

    Need many dollar to create airflow on aws

    • @MarcLamberti
      @MarcLamberti  หลายเดือนก่อน

      Good thing you can have a free trial on Astronomer

  • @nathanielsolomon917
    @nathanielsolomon917 4 หลายเดือนก่อน +1

    Encountered and issue with 10:16 airflow init db. Error
    Error : psycopg2.errors.InsufficientPrivilege: permission denied for schema public
    Solution that worked for me: ALTER DATABASE airflow_db OWNER TO airflow_user;
    Note I am using the latest airflow 2.9.1 as of this point

    • @alpacajin6868
      @alpacajin6868 3 หลายเดือนก่อน

      tysm! ran into the same issue here

    • @ember2658
      @ember2658 3 วันที่ผ่านมา

      In version 16, I had to add:
      GRANT ALL ON SCHEMA public TO airflow;
      ALTER DATABASE airflow OWNER TO airflow;
      GRANT CONNECT ON DATABASE airflow TO airflow;
      GRANT USAGE ON SCHEMA public TO airflow;
      GRANT CREATE ON SCHEMA public TO airflow;
      GRANT ALL PRIVILEGES ON ALL TABLES IN SCHEMA public TO airflow;
      GRANT ALL PRIVILEGES ON ALL SEQUENCES IN SCHEMA public TO airflow;
      to the SQL he created, its overkill, but its just a test