Data Cleaning PRO Tip: Dealing with text & numbers in same column

แชร์
ฝัง
  • เผยแพร่เมื่อ 16 พ.ย. 2024

ความคิดเห็น • 122

  • @chandoo_
    @chandoo_  2 ปีที่แล้ว +27

    Just checking... Do you want to see a Pandas tutorial on how to clean this data? Like the comment if you want to.

    • @Azhar_Khan383
      @Azhar_Khan383 2 ปีที่แล้ว

      That's awesome

    • @namy1557
      @namy1557 2 ปีที่แล้ว

      Looking forward 👌🏻

    • @hiteshallakki1740
      @hiteshallakki1740 2 ปีที่แล้ว

      YES, definitely

    • @armanwirawan7099
      @armanwirawan7099 2 ปีที่แล้ว

      yes, this is some next level for data analytics videos well done

    • @natynt227
      @natynt227 2 ปีที่แล้ว

      Yes please

  • @Hopspowered
    @Hopspowered 11 หลายเดือนก่อน +2

    Small token of appreciation. Several views lately and almost always save to my excel playlist. And you’re so entertaining as well. Even a rugby reference in a video!

    • @chandoo_
      @chandoo_  11 หลายเดือนก่อน

      Hey @Hopspowered... thanks for that lovely gesture and appreciation of my work. 😍

  • @muhammadjawad8422
    @muhammadjawad8422 2 ปีที่แล้ว +1

    I am lucky to have a teacher like you. keep up the good work.

  • @vivekchakraborty6123
    @vivekchakraborty6123 2 ปีที่แล้ว +4

    I never comment on any YT video, but your videos are really interesting and informative. Kudos to such content and Thanks for this

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +1

      I appreciate that!

  • @canirmalchoudhary8173
    @canirmalchoudhary8173 2 ปีที่แล้ว +7

    2nd solution is awesome, I was thinking how data can be retained while it is being removed 😀.

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +1

      I am glad you liked to Nirmal :)

  • @Azhar_Khan383
    @Azhar_Khan383 2 ปีที่แล้ว +1

    I was familiar with the first technique but today I learned the second technique. I'm becoming awesome. Thanks Sir

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      Excellent

  • @MobinulHaq
    @MobinulHaq 2 ปีที่แล้ว

    THANK YOU!! So many accounting softwares extract ledgers like this (with account heads between dates) and it has been so frustrating until now. This is REALLY helpful!

  • @warriorppr
    @warriorppr 2 ปีที่แล้ว

    Hi Chandoo, Thanks a lot. i was looking for this solution and found this video by God's grace. Excellent, Appreciate it Bro.

  • @binishthomas2675
    @binishthomas2675 2 ปีที่แล้ว

    I might have not done this using power query [might have used fiters and delete]......a really nice application. 👍

  • @tpnphoonsiri
    @tpnphoonsiri ปีที่แล้ว

    Hi Chandoo,I love 2nd technique .This will my life easy than before.Thank you very much.

  • @theanalystkr3255
    @theanalystkr3255 2 ปีที่แล้ว

    Just started my journey into data analytics, thanks for these really interesting/intuitive videos!!

  • @arun.kumar.s
    @arun.kumar.s 2 ปีที่แล้ว

    Learned power query from you and will keep learning from you. Boom

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      More power to you Arun ⚡

  • @Excelambda
    @Excelambda 2 ปีที่แล้ว +1

    Great video!!✌
    To solve with a formula:
    =LET(ar,A2:D196,h,A1:D1,d,A2:A196,f,D2:D196,
    a,IF(ar="","",ar),s,SEQUENCE(ROWS(a)),
    x,f="",
    k,XLOOKUP(s,FILTER(s,x),FILTER(d,x),,1),
    VSTACK(h,FILTER(HSTACK(a,k),NOT(x))))
    And dates are dates (numeric), and room numbers , numbers, so we can do other filtering etc

    • @Excelambda
      @Excelambda 2 ปีที่แล้ว

      Forgot to name the last column:
      =LET(ar,A2:D196,h,A1:D1,d,A2:A196,f,D2:D196,t,"Text Value",
      a,IF(ar="","",ar),s,SEQUENCE(ROWS(a)),
      x,f="",
      k,XLOOKUP(s,FILTER(s,x),FILTER(d,x),,1),
      IFNA(VSTACK(h,FILTER(HSTACK(a,k),NOT(x))),t))

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      Good one... 😎

  • @Seftehandle
    @Seftehandle 2 ปีที่แล้ว

    Great approach for people trying to transition to power query and later on dax. I loved how you gave the1st option for excel beginners as well as 2nd one for intermediate learners. Cant wait to see how will you transition to simple dax!!!!!

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +1

      You are welcome Tina. The rule when it comes to DAX is simple. Try to solve data problems first with SQL / Power Query or other source systems. If you try to develop DAX solution for this problem, you will do a lot of unnecessary stuff.

    • @Seftehandle
      @Seftehandle 2 ปีที่แล้ว

      @@chandoo_ thank you Chandoo!!! I agree regarding power query - dont know how to start w swl really. For dax - maybe you could create sime simple intro to creating formula and writing code here - if you see the need. Not necessarily for this case but maybe other use cases.

  • @khallodah1
    @khallodah1 2 ปีที่แล้ว +1

    easy but yet very valuable. thannk you

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      You are welcome Khalid.

  • @nethikarnarendra4884
    @nethikarnarendra4884 2 ปีที่แล้ว

    Amazing solution!! How to text.combine for only single column e.g. QnA in multiple rows separated by ? Delimiter (Q ends with?)

  • @namy1557
    @namy1557 2 ปีที่แล้ว +1

    good tips on data cleaning, Can't wait to apply this knowledge practically.. 👌🏻

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      You are welcome :)

  • @chrism9037
    @chrism9037 2 ปีที่แล้ว +1

    Great video as always! Thanks Chandoo!

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      Glad you enjoyed it Chris.

  • @vishalbhati912
    @vishalbhati912 2 ปีที่แล้ว +1

    Sir u don't need go to special. Just simply apply filter and choose blanks and press ctrl+ - boom.

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +1

      😍 Love it.

    • @vishalbhati912
      @vishalbhati912 2 ปีที่แล้ว

      @@chandoo_ is this sarcastic reply 😃😃

  • @carpediemwithLesli
    @carpediemwithLesli 2 ปีที่แล้ว +2

    This is great!!
    I'm also looking for a formula for the following:
    This is my issue: I need to keep my employees at 40 hours a week. So if on Thursday I do the time cards and let's say Monday thru Wednesday they are at 30 hours already. Assuming that on Thursday they will do 8 hours and they will only have 2 hours left for Friday; at what time on Friday they will go home when they get to 40 hours?
    So the formula that I need is:
    Accumulated hours ➕️ today's 8 hours; hours left for Friday equals at what time he will go home on friday..
    Help!

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +2

      You are welcome CDWU.
      You can use a formula like this to see how many hours they will have for Friday.
      =MIN(8, MAX(32-hourssofar, 0))
      You can then add that to starting time to see what time the employee can leave on Friday.

  • @Sekoleyte
    @Sekoleyte 2 ปีที่แล้ว +1

    Hi. Chando. Let's do your 2nd technique with an another simply solution.
    Firstly please add a column at the beginning of the page. you will scroll your columns. i mean your A column will be your B column. i don't know but CTRL+R didn't work while our single values in column A. i can reference cells better with this :)
    1) filter blank cells from C,D or E columns.
    2) choose your values from B column, and choose cells from F column in the same range.
    3) Then press CTRL + R.
    4)Then remove the filter. and refilter your values instead of blank cells from column C, D or E.
    5) write "=F3", in cell F2 and scroll it down.
    And ta taaa :)

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      Good one. 😎

  • @Luciano_mp
    @Luciano_mp 2 ปีที่แล้ว

    Great!!! Thank you Chandoo!

  • @sunilkumar-jl7uf
    @sunilkumar-jl7uf 2 ปีที่แล้ว

    एक्सीलेंट tricks 👍👍

  • @WiFiJeremy
    @WiFiJeremy 2 ปีที่แล้ว

    Great stuff, Chandoo!

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      You are welcome Jeremy :)

  • @muraliiyer7850
    @muraliiyer7850 2 ปีที่แล้ว +1

    Nice to learn 🙏

  • @talknow2859
    @talknow2859 7 หลายเดือนก่อน

    Amazing stuff 🎉!

  • @savissm
    @savissm 2 ปีที่แล้ว

    Thank you Chandoo 🙏

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      You are welcome Saviss.

  • @surajchavan6447
    @surajchavan6447 2 ปีที่แล้ว

    As always awesome!!

  • @basicinfoforall7306
    @basicinfoforall7306 2 ปีที่แล้ว

    your content is good always.thanks

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      Glad you think so!

  • @miguelgomez9078
    @miguelgomez9078 2 ปีที่แล้ว +2

    Hey Chandoo,
    Thank you for the great work you do and for your videos that are beyond helpful. I have a problem that maybe you can solve. If I wanted to sort quarterly dates in the format of 1Q22 (quarter, quarter letter indicator, year) from earliest to latest, how would you go about it?

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +1

      You are welcome Miguel. If you just have text values of 1Q22, 2Q22, 1Q21, 3Q20, etc. in a column, you can use this formula (Excel 365).
      =SORTBY(data, RIGHT(data,2)+0, LEFT(data,1)+0)
      But a better option would be to extract the Q & Year and then use Sort options and set up two level sorting if it is a one off thing.

  • @bc4198
    @bc4198 2 ปีที่แล้ว

    Thanks, Chandoo! Question: HowWouldYouAddSpacesAtCapitalLetters?

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      In Power Query, you can split text when the case changes. You can then "merge" these columns with space as delimiter.

    • @Excelambda
      @Excelambda 2 ปีที่แล้ว +1

      A formula, that works for any array/range:
      =REDUCE(range,CHAR(SEQUENCE(26,,65)),LAMBDA(v,x,TRIM(SUBSTITUTE(v,x," "&x))))

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +1

      Good one. Donut for you 🍩
      One more I come up with...
      =TRIM(REDUCE("", MID(B2,SEQUENCE(LEN(B2)),1),LAMBDA(a,v,a&IF(EXACT(v,UPPER(v))," "&v,v))))

    • @Excelambda
      @Excelambda 2 ปีที่แล้ว

      @@chandoo_ Super cool!! But what is "range" ??
      If in B2 we set the "sentence" we want to "split" then range should be an empty string "" like in:
      =TRIM(REDUCE("", MID(B2,SEQUENCE(LEN(B2)),1),LAMBDA(a,v,a&IF(EXACT(v,UPPER(v))," "&v,v))))
      This works fine for a single cell.
      My "range" can be column/row vectors or 2D arrays 😉
      Your idea with EXACT is to be used if we have unusual capital letters like Ñ or others that are not found in an usual alfabet. So, embracing your idea with EXACT, a bullet proof formula that will work with any chars and also handling 2D arrays can be:
      =MAP(range,LAMBDA(x,LET(m,MID(x,SEQUENCE(LEN(x)),1),TRIM(TEXTJOIN("",,IF(EXACT(m,UPPER(m))," "&m,m))))))
      For a single cell, everything that is after LAMBDA can work as a formula, no REDUCE needed
      =LET(x,B2,m,MID(x,SEQUENCE(LEN(x)),1),TRIM(TEXTJOIN("",,IF(EXACT(m,UPPER(m))," "&m,m))))
      So , team work won. Half of donut to you. ✌😉

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +1

      My bad. Should have been REDUCE("" instead of REDUCE(range...
      and yeah, I never eat half a donut. 😂

  • @bimantoro17
    @bimantoro17 2 ปีที่แล้ว

    Ty chandoo, love u

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      😍😍😍

  • @Riri-qi2fu
    @Riri-qi2fu 2 ปีที่แล้ว

    Omg thank u! This is so useful!

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +1

      Glad it was helpful!

    • @Riri-qi2fu
      @Riri-qi2fu 2 ปีที่แล้ว

      @@chandoo_ - I use various computing codes prepare reports but nothing can compete with excel when it comes to user friendly funtions! Excel team has done a great work in last 2-3 years!

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      YES!!!

  • @ExcelWithChris
    @ExcelWithChris 2 ปีที่แล้ว

    Nice one. Thanks!!

  • @yuvrajyewale4980
    @yuvrajyewale4980 2 ปีที่แล้ว

    Very valuable content on your you tube.
    I have an data of 1000 rows and 500 column and want to modify the data of random 200 rows ..i am filtering 200 rows using normal filter manually...is there any alternative to select 200 rows quickly ?

  • @elfridhasman4181
    @elfridhasman4181 2 ปีที่แล้ว

    Wow, Your video help me a lot :)

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      Thanks Elfrid.

  • @naiduvikas202
    @naiduvikas202 2 ปีที่แล้ว +1

    Crazy tip

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      IT IS!!!

  • @nothinghere4520
    @nothinghere4520 2 ปีที่แล้ว

    Love this!!

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      😎😎😎

  • @Munichandra_Reddy
    @Munichandra_Reddy 2 ปีที่แล้ว +1

    Hi Chandoo,
    I want to Your help, in my current role, I am facing lack of store produce issue, please tell me, where will I learn more Advanced Store produce ? Swift response

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      Not sure what you mean by "store produce". If you meant stored procedures, I am not the person to help. I don't use them.

  • @emtyas
    @emtyas 2 ปีที่แล้ว

    Hey Chandoo,
    Hope you're doing well...
    I have around 1K of Google Maps hyperlinks in my excel file from which I must extract the latitude and longitude of each. Considering that there are thousands, it can be quite challenging to do it manually...
    In any future video or in the comments, if you would be so kind as to share any formulas or tricks, that would be great.
    Thanks...

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +1

      You can use the TEXTSPLIT() function (new in Excel 365) for these kinds of tasks or Power Query. I suggest learning a bit more about PQ using this video - th-cam.com/video/PiFAa_jjaEI/w-d-xo.html

  • @peace_321
    @peace_321 ปีที่แล้ว

    What if there are dates (text datatype), instead of some words in the date column .. and there might not be any blank cells..
    How to solve this .. ????
    Is it must to use SQL for this or can we do it by using EXCEL OR POWER BI

  • @RamKumar-ye3gg
    @RamKumar-ye3gg 2 ปีที่แล้ว

    Hi Chandoo !!
    I need help in creating a macro which can combine data from different tabs in one. However the header keep changing for files.
    Sometime it will have header in 2 rows sometime in 3.
    I want to write a macro which will check how many rows are header and from next row it will copy the data and paste in the main sheet with same header format.
    Please help.

  • @arbazahmad7177
    @arbazahmad7177 2 ปีที่แล้ว

    Awesome 👌

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      🔥🔥🔥

  • @stephaniemorris9143
    @stephaniemorris9143 9 หลายเดือนก่อน

    Chandoo, in your conditional column, what does it mean when you select Output of Date? I am confused by this since date isn't the actual output in the conditional column. I hope you see this and respond...many thanks!

    • @chandoo_
      @chandoo_  9 หลายเดือนก่อน

      Thanks for your question. The "Date" is just a placeholder for us to know which rows have dates and which ones don't.

  • @amitverma1778
    @amitverma1778 2 ปีที่แล้ว

    Hi Chandoo , seen your so many videos and it's very inspiring and informative for new commers .... It will be helpful if you could help me out with one problem or any experts viewers most welcome. I'm facing the weird issue , a client sent me file which has a its own theme but when I try to open same in my pc themes is getting changed to default automatically.
    Any suggestions how to stop default theme and keep the theme to client one🙇🙏

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +1

      You are welcome Amit. I have never noticed this kind of issue. I suggest posting about it on a forum like the official Microsoft help site or stackoverflow.

    • @amitverma1778
      @amitverma1778 2 ปีที่แล้ว

      @@chandoo_ Thank you very much for reply ❤️😊

  • @saimourya579
    @saimourya579 2 ปีที่แล้ว

    I am French teacher , how I use French mix up with Ms Excel to get freelancing job
    will you please give a suggestion for me

  • @TheBossSuperstar
    @TheBossSuperstar 2 ปีที่แล้ว

    How fill up works. Will it reads the unique value as null and fills up. Can it be blank instead of null and will it works still.

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      I haven't tried it with blanks. But as far as I know, I think it just reads nulls and replaces them with the value from below. Kind of like bfill and ffill in Python Pandas.

  • @taizoondean689
    @taizoondean689 2 ปีที่แล้ว

    Thank you 🙏

  • @benndii
    @benndii 2 ปีที่แล้ว +1

    Danke!

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      😍😍😍

  • @amc1520
    @amc1520 2 ปีที่แล้ว

    Genuis! 👏

  • @omkarhulsurkar7518
    @omkarhulsurkar7518 2 ปีที่แล้ว

    Hello Chandoo. I follow your YT videos these are very informative. I have a unique problem in power bi. In which I want to calculate date from a date column by user input parameter. But that column doesn't get updated when I change user input parameter. Can you tell me how I can resolve this.

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      Hi Omkar.. the column calculations don't update in runtime. They are calculated and set in model at the time of creation.

  • @deshn21
    @deshn21 10 หลายเดือนก่อน

    Superb. Learned something new!

  • @leehueyyun630
    @leehueyyun630 2 ปีที่แล้ว

    It is awesome.

  • @chandrasekarsankar102
    @chandrasekarsankar102 2 ปีที่แล้ว

    Great!

  • @sakeobulesu7130
    @sakeobulesu7130 ปีที่แล้ว

    Awesome

  • @ManoharVideos.
    @ManoharVideos. 2 ปีที่แล้ว

    Super

  • @aravindr8034
    @aravindr8034 2 ปีที่แล้ว

    But company row are still get some blank value

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +1

      That is expected as not all hotel bookings will be for companies.

    • @aravindr8034
      @aravindr8034 2 ปีที่แล้ว

      @@chandoo_ can you offer any assignments because my passion to become a data analyst and now I am work on it but I don't no how to implement all that I have learned so please help me to give any assignments like that chandoo bro

  • @artadme7448
    @artadme7448 2 ปีที่แล้ว

    Hello chandooo

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      Hello Art

  • @SRCOMPUTEREDUCATION
    @SRCOMPUTEREDUCATION 2 ปีที่แล้ว

    You are awesome in excel , but your courses are too costly :D :D

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      Thanks SR. That is ok. My courses are not for everyone. I believe knowledge should be free. But my bank says free videos are not an acceptable form of payment for mortgage. So I charge for some of my content.

    • @SRCOMPUTEREDUCATION
      @SRCOMPUTEREDUCATION 2 ปีที่แล้ว

      @@chandoo_ yes you should charge for your expertise

  • @rezabaig
    @rezabaig ปีที่แล้ว

    Can you help me with cleaning data. Where can I ping you???

    • @chandoo_
      @chandoo_  ปีที่แล้ว

      I am not available for any 1:1 help Reza. Please post your questions in a forum.

  • @EDM179
    @EDM179 ปีที่แล้ว

    🤯

  • @sirisoj
    @sirisoj 2 ปีที่แล้ว

    Chandoo please don't ruin your videos with memes 😥

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว +1

      Hmm.. what is so ruinous about a joke or two? This is how most of my videos (and even articles) have been.

  • @frankbradford9616
    @frankbradford9616 2 ปีที่แล้ว

    I can’t believe he used personally identifiable data here. These are real people with real hotel stay dates. This has to be a violation of privacy or law. You should take this video (which is an excellent video btw) down and change the names in the data and reupload a new video

    • @Excelambda
      @Excelambda 2 ปีที่แล้ว

      He is a professional, definitely he used fake data. There are a lot of random list generators engines online for names and almost any type of data. Hint: When names and surnames of same person look like they are from distinct cultures roots it is obvious that are random generated names.

    • @chandoo_
      @chandoo_  2 ปีที่แล้ว

      Thanks for your concern. This data is randomly made up. Why would you think these are real people.