32 Spark Memory Management | Why OOM Errors in Spark | Spark Unified Memory | Storage/Execution Mem

แชร์
ฝัง
  • เผยแพร่เมื่อ 14 ม.ค. 2025

ความคิดเห็น • 13

  • @PraveenKumarBN
    @PraveenKumarBN วันที่ผ่านมา

    Amazing. You deserve to get subscribed... Keep it coming! 😀

    • @easewithdata
      @easewithdata  วันที่ผ่านมา

      Thank you! Don't forget to share this with your network over LinkedIn♻️

  • @bharathansbs
    @bharathansbs 2 วันที่ผ่านมา +2

    Hi, I am able to understand the content except this part. Based on the example, a single line of 11 mb file when exploded we get multiple rows of 11 mb so the size is getting huge. But still we have around 60 mb of execution memory (around 90mb minus 30 mb of cached memory). So even the data size is getting bigger it can spill to disk right ? why we are receiving OOM .
    Can you please explain this part.

    • @easewithdata
      @easewithdata  วันที่ผ่านมา +1

      Hello,
      To keep it simple, runtime computations are stored in memory. Since runtime object gets multiplied because of explode it will not be able to fit in memory, running into OOM. In other case, when we try to read a bigger partition, spark knows that it will not fit in memory thus it spills it to disk before running computations on top of it. But once a data is brought in memory and then it explodes because of computation, that can cause issue.
      I tried to make this simple, as this is not so simple to understand in first time.
      Don't forget to Like and shar with your network over LinkedIn 💓

    • @bharathansbs
      @bharathansbs วันที่ผ่านมา

      @@easewithdata Thanks for your reply.

  • @rakeshpanigrahi577
    @rakeshpanigrahi577 วันที่ผ่านมา

    Bhai shab, Awsm video ❤

  • @RiskyAsh
    @RiskyAsh 2 วันที่ผ่านมา

    Subscribed🎉

    • @easewithdata
      @easewithdata  วันที่ผ่านมา +1

      Don't forget to repost this with your friends as well on LinkedIn ♻️

  • @moyeenshaikh4378
    @moyeenshaikh4378 วันที่ผ่านมา

    Bhai Databricks series complete ho gaya kya

    • @easewithdata
      @easewithdata  วันที่ผ่านมา

      Nhi that is in progress. Both Spark and Databricks will run in parallel.

    • @moyeenshaikh4378
      @moyeenshaikh4378 วันที่ผ่านมา

      @easewithdata still how many videos left?

    • @easewithdata
      @easewithdata  วันที่ผ่านมา

      @@moyeenshaikh4378 for databricks? Around 10

  • @funnyvideo8677
    @funnyvideo8677 วันที่ผ่านมา