66. Databricks | Pyspark | Delta: Z-Order Command

แชร์
ฝัง
  • เผยแพร่เมื่อ 28 มิ.ย. 2022
  • Azure Databricks Learning: Delta Lake - Z-Order Command
    ========================================================
    What is Z-order Command in delta table and how to apply in delta lake development?
    Z-order one of the performance optimization techinique used in delta lake. It is used along with optimize command and used to compact small files into optimal size and at the same time relevant data is co-located to improve the performance.
    This video gives complete understanding of Z-order command
    #DeltaZorder, #DatabricksZorder, #PerformanceOptimization, #Zorder,#Z-order, #Z-Ordering, #DeltaOptimize, #DeltaOptimizeZorder #DeltaCompactFiles, #DeltaSmallFileIssue, #DeltalakePerformance, #DeltaPerformanceImprovement ,#DeltalakeIntro, #IntroductionToDeltaLake, #Deltalake, #DeltaTable, #DatabricksDelta, #DeltaTableCreate, #DatawarehouseVsDataLakevsDeltaLake, #PysparkDeltaLake, #DeltalakevsDatalake, #SQLDeltaTable, #DataframeDeltaTable,#DeltaFormat ,#DatabricksRealtime, #SparkRealTime, #DatabricksInterviewQuestion, #DatabricksInterview, #SparkInterviewQuestion, #SparkInterview, #PysparkInterviewQuestion, #PysparkInterview, #BigdataInterviewQuestion, #BigdataInterviewQuestion, #BigDataInterview, #PysparkPerformanceTuning, #PysparkPerformanceOptimization, #PysparkPerformance, #PysparkOptimization, #PysparkTuning, #DatabricksTutorial, #AzureDatabricks, #Databricks, #Pyspark, #Spark, #AzureDatabricks, #AzureADF, #Databricks, #LearnPyspark, #LearnDataBRicks, #DataBricksTutorial, #azuredatabricks, #notebook, #Databricksforbeginners
  • วิทยาศาสตร์และเทคโนโลยี

ความคิดเห็น • 71

  • @shreeyashransubhe2537
    @shreeyashransubhe2537 ปีที่แล้ว +8

    Sir, I have gone through lots of videos but never understood the concepts so simple yet very detailed manner. Thank you very much. I have shared your playlist with my colleagues too. They also liked it very much.

  • @pratikparbhane8677
    @pratikparbhane8677 4 หลายเดือนก่อน +2

    Great Explain , Understood OPTIMISE , VACCUM() AND Z-ORDERING in One Video

  • @rohitdanda
    @rohitdanda ปีที่แล้ว +2

    Your videos are so simple that a kid can also understand. Thanks and salute sir🖖 for putting so much effort and making videos and helping us!

  • @SumitAmbatkar
    @SumitAmbatkar หลายเดือนก่อน +1

    i watched your nearly all playlist i loved your teching style, how to ogrip on concept, your explaination are fabulous keeping doing sir, best of luck. we are always here for you
    Thank you..:)

  • @RanjeetkumarYadav
    @RanjeetkumarYadav หลายเดือนก่อน +1

    Amazing and very intuitive example. Thank You!!

  • @YogeshBiguvu2208
    @YogeshBiguvu2208 7 หลายเดือนก่อน +2

    Excellent explanation with Examples.....Thank you so mcuh sir..

  • @mukilanlakshmanan8968
    @mukilanlakshmanan8968 7 หลายเดือนก่อน +1

    Sir, I love your teaching method, you have explained it in detail.

  • @user-hj2nv8gt4o
    @user-hj2nv8gt4o 4 หลายเดือนก่อน +1

    Sir, Thanks for explaining in a very simple manner.

  • @ajaykiranchundi9979
    @ajaykiranchundi9979 ปีที่แล้ว +1

    A very well explained . The way you broke down the data to explain the same is amazing. I am sure it would have taken good time to put it together. Indebted to you brother.

  • @saurav0777
    @saurav0777 ปีที่แล้ว +1

    Thanks for uploading . Very nice explanation

  • @sraoarjun
    @sraoarjun 2 หลายเดือนก่อน +1

    Indeed an awesome video !! Great explanation !!

  • @3a8saisamireddi61
    @3a8saisamireddi61 หลายเดือนก่อน +1

    detailed explanation👏

  • @vivek05117gece
    @vivek05117gece 11 หลายเดือนก่อน +1

    very well explained. Kudos to you.

  • @shwetac2929
    @shwetac2929 ปีที่แล้ว +1

    you teaching methos is very good ....this video clear my all doubt

  • @terrificmenace
    @terrificmenace ปีที่แล้ว +2

    Thank you sir 🙏🏻 I went through many udemy courses but never understood these concepts. Ur explanation is very good and easy to understand many many thanks sir 🙏🏻 🙏🏻

  • @FreakONcW1
    @FreakONcW1 6 หลายเดือนก่อน +1

    Extremely helpful video.

  • @dineshwaditake5248
    @dineshwaditake5248 9 หลายเดือนก่อน +2

    Nicely explained !!

  • @tanushreenagar3116
    @tanushreenagar3116 ปีที่แล้ว +1

    Very nice sir 👌 cleared my concept now

  • @ravulapallivenkatagurnadha9605
    @ravulapallivenkatagurnadha9605 ปีที่แล้ว +1

    Please continue this videos

  • @AFSARAHMED4
    @AFSARAHMED4 ปีที่แล้ว +1

    Excellent Explaination Sir

  • @gil.0007
    @gil.0007 6 หลายเดือนก่อน +1

    Very nicely explained 🎉

  • @omprakashreddy4230
    @omprakashreddy4230 ปีที่แล้ว +3

    Your videos are definitely creating great impact. Thank you for that.
    Can you also please explain df.explain() command in great detail with examples.

    • @rajasdataengineering7585
      @rajasdataengineering7585  ปีที่แล้ว

      Happy to hear that it's creating impact on data engineers. Thank you
      Sure, will post a video on explain plan

    • @rajasdataengineering7585
      @rajasdataengineering7585  ปีที่แล้ว

      Hi Omprakash, created a video on explain plan as per your request. Hope it helps you - th-cam.com/video/6NrVQTbkndU/w-d-xo.html

  • @viniciusguimaraessantana5455
    @viniciusguimaraessantana5455 10 หลายเดือนก่อน +1

    thank you very much.

  • @manjit_singhh
    @manjit_singhh ปีที่แล้ว +1

    Very nice explanation 🙂

  • @mohitupadhayay1439
    @mohitupadhayay1439 21 วันที่ผ่านมา +1

    Raja please try to create a full project where all these optimizations can be shown at full scale.

  • @tanushreenagar3116
    @tanushreenagar3116 11 หลายเดือนก่อน +1

    PERFECT CONTENT SIR

  • @venkatasai4293
    @venkatasai4293 ปีที่แล้ว +2

    Thanks for the great explanation Raja. So are the statistics collected on all the columns ? What if we want to query on other columns ? Will it work ?

    • @rajasdataengineering7585
      @rajasdataengineering7585  ปีที่แล้ว

      Yes Venkata, it will work first 32 columns. If your table contains more than 32 columns and you want to collect statistics for those columns, we can configure that separately

    • @venkatasai4293
      @venkatasai4293 ปีที่แล้ว

      @@rajasdataengineering7585 ok . So zorder is similar to bucketing right ? Colocating the data into same set of files ? If two tables contains same key and if we zorder them on the key While joining the data it will fetch only required files into the executor ?

  • @sathyahisto
    @sathyahisto 11 หลายเดือนก่อน +1

    good Explaination, liked it when you demonstrated with excel. Just one suggestion syntax for zorder seems to be changed to "Zorder by ()"

  • @sumiransinha3707
    @sumiransinha3707 ปีที่แล้ว +1

    Great!

  • @shankar1556
    @shankar1556 ปีที่แล้ว +1

    Hi Azar,
    Thank you for explanation.
    I have a dought. in this example it shows that z-order create new partitions with sorting emp_id. Does z-order really create new partitions?

    • @rajasdataengineering7585
      @rajasdataengineering7585  ปีที่แล้ว

      Hi Shankar, this is Raja.
      When we perform z-order, data is being co-located within same set of files. It is not shuffling the data, nor creating new partitions

  • @TheDataArchitect
    @TheDataArchitect หลายเดือนก่อน

    What about using multiple columns in z-order?

  • @purnimasharma9734
    @purnimasharma9734 ปีที่แล้ว +1

    Hi Raja, how is the partition column determined e.g. how does it know that you have to use emp_id here? Is it based on the predicate column?

  • @ravulapallivenkatagurnadha9605
    @ravulapallivenkatagurnadha9605 ปีที่แล้ว +1

    Please do video on how to convert pandas data pipilines to spark data pipiy

  • @aswaniyettapu9992
    @aswaniyettapu9992 ปีที่แล้ว +1

    Can u do one video on lead and lag in pyspark..?