GCP - Dataproc, PySpark - Shell, PySpark | Job Submit through Script

  • Published on 16 Oct 2024
  • GCP - Dataproc, PySpark - Shell, PySpark - Job Submit through Script

Comments • 6

  • @frozensquash8098, 1 year ago

    Thank you so much.

  • @madhavtrading1917, 2 years ago

    PySpark and Apache Beam: which one is better in the current market and for the future?

    • @anjangcpdataengineering5209, 2 years ago

      If the workloads are already running on-premises on Spark and you have to migrate them to GCP, then PySpark with Dataproc is useful. Otherwise, if you are developing a workload (ETL) from scratch on GCP, Apache Beam with Dataflow is recommended. So it is difficult to say which one is better; it all depends on the use case. As a data engineer, it's better to have both skills.

    • @madhavtrading1917, 2 years ago

      @anjangcpdataengineering5209 Thanks, sir.

  • @j.franciscohernandezhernan1258, 1 year ago

    The video is too long.