Amazon EMR - When to use EMR on EC2, EKS, and Serverless

Comments • 6

  • @mertsevenz
    @mertsevenz 5 months ago +1

    Hey @dacort, Thanks for the great video.
    - What about Glue? Can we say that Glue and EMR Serverless do more or less the same thing?
    - Let's say we only have Spark jobs to run based on some triggers. Since these are transient jobs, I should run them on EMR Serverless. On the other hand, if I need a long-running cluster, I should go with EMR on EC2/EKS. Can I sum up the rule like this? :)

  • @user-zf8se9uu4u
    @user-zf8se9uu4u 9 months ago

    The video talks about the advantages of using EMR on EC2 and EMR Serverless, so what is the benefit of using EMR on EKS?

    • @dacort
      @dacort 8 months ago

      EKS (Kubernetes) is great when you want to share your compute/memory resources across different, variable workloads. Many orgs are adopting k8s, so EMR on EKS makes it easier to run EMR workloads (like Spark and Flink) on top of EKS.

    • @nathanbenton2051
      @nathanbenton2051 5 months ago

      Indeed, @dacort. But one of the catches is that without quota or limit thresholds set at the k8s level, it's very easy for various teams/apps to exhaust the resources in the "emr" namespace that the EMR containers need. Anyway, great vid and thanks for the content!

  • @jenjayhsu1671
    @jenjayhsu1671 11 months ago

    Why doesn't EMR Serverless support Flink? And why doesn't EMR on EKS support Hive?

    • @dacort
      @dacort 8 months ago

      Each deployment model of EMR has different use cases and customer bases. In other words, "folks that tend to run a modern k8s environment also run modern workloads like Spark or Flink, but not Hive."