Null handling in pySpark DataFrame

แชร์
ฝัง
  • เผยแพร่เมื่อ 26 ธ.ค. 2024

ความคิดเห็น • 13

  • @saikumar-it1gl
    @saikumar-it1gl 4 ปีที่แล้ว +1

    Excellent sir. You cleared all my doubts . Thank you

  • @arpithasp7500
    @arpithasp7500 ปีที่แล้ว +1

    Thank you for this content

  • @apurvsingh5541
    @apurvsingh5541 4 ปีที่แล้ว +1

    3:49, "all" drops the row when all the values in the row are null. It has nothing to do with columns.

  • @sushmamc8904
    @sushmamc8904 2 ปีที่แล้ว

    Hi Sir,what if I want to collect the data which have null values ?

  • @gayathrilakshmi6087
    @gayathrilakshmi6087 3 ปีที่แล้ว

    Neat explanation

  • @gauravpathak7017
    @gauravpathak7017 3 ปีที่แล้ว

    My data frame is having null values in some columns I want to drop that null column and show other columns .

  • @harshaviswanath7697
    @harshaviswanath7697 2 ปีที่แล้ว

    Hello, Nice Tutorial! Not sure if you can help with a scenario I am facing, but I wanted to check. My CSV has integer values in a date column, I created a schema with DateType() for this column and tried to load the file into a data frame, unfortunately, the dataframe throws null values. May be I should ingest timestamp data as integer. Please let me know if you can help? I am also happy to get into a call to show the pseudo code

  • @gauravpathak7017
    @gauravpathak7017 3 ปีที่แล้ว

    If I need to remove null in any of the columns and display other columns results then??

    • @KnowledgeSharingjkb
      @KnowledgeSharingjkb  3 ปีที่แล้ว

      hi Gaurav, please see this video th-cam.com/video/oPtCbVG65oU/w-d-xo.html

  • @Suriya_MSM
    @Suriya_MSM ปีที่แล้ว

    Hi sir , what if i want to fill the null columns in salary with the average of preceding and successive values ?

    • @Suriya_MSM
      @Suriya_MSM ปีที่แล้ว

      and if there are continuous null values then first populate the average for the first null values with the average and then .. with that updated value and the next successive value calculate the average for the 2nd null value

    • @KnowledgeSharingjkb
      @KnowledgeSharingjkb  ปีที่แล้ว

      @@Suriya_MSM I think I am not clear. Can you please paste the example

  • @gauravpathak7017
    @gauravpathak7017 3 ปีที่แล้ว

    Let’s say in row no 1 . Name ,I’d,Gen, Sal in which I’d and gen are null but name and Sal have values . Row no2 . Name-null,Sal -null but gen ,I’d are having values . How to display the result . I want op with null removed wherever data is available I want to display