(Re-upload) What is Shuffle | How to minimize shuffle in Spark | Spark Interview Questions

แชร์
ฝัง
  • เผยแพร่เมื่อ 5 ก.พ. 2025
  • Hi Friends,
    In this video, I have explained about Shuffle and given the ways to minimize the shuffle.
    github.com/sra...
    Please subscribe to my channel for more interesting learnings.

ความคิดเห็น • 9

  • @sravanalakshmipisupati6533
    @sravanalakshmipisupati6533  2 ปีที่แล้ว +1

    Hi Friends, Reuploaded the video with volume enhancements. Please check and let me know if this is fine. Thanks.

  • @rahulpandit9082
    @rahulpandit9082 2 ปีที่แล้ว +1

    Nice content, Thanks a lot Sarvana..

  • @udaypratapsingh2245
    @udaypratapsingh2245 2 ปีที่แล้ว +1

    Thanks for this wonderful video! If you explain Lil more about clusterbykey point .

    • @rahulpandit9082
      @rahulpandit9082 2 ปีที่แล้ว

      Read about Bucketing in Hive..

    • @sravanalakshmipisupati6533
      @sravanalakshmipisupati6533  2 ปีที่แล้ว

      @Uday Pratap Singh - Please checkout - th-cam.com/video/pZMTv0090rI/w-d-xo.html for cluster by key in Hive.

  • @satishmajji481
    @satishmajji481 2 ปีที่แล้ว

    Can you explain how to pick the correct/optimal number of partitions dynamically while processing data?

    • @sravanalakshmipisupati6533
      @sravanalakshmipisupati6533  2 ปีที่แล้ว

      Spark has its own mechanism to have dynamic partitioning. If you want to overwrite then you can analyse the data and then give appropriate partition number.

    • @satishmajji481
      @satishmajji481 2 ปีที่แล้ว

      @@sravanalakshmipisupati6533 How to analyse and decide the appropriate partition number? Can you explain please?