Excel Magic Trick 1336: Power Query: Import Big Data Text Files: Connection Only or Data Model?

แชร์
ฝัง
  • เผยแพร่เมื่อ 6 พ.ย. 2024

ความคิดเห็น • 104

  • @ElTigreFIME
    @ElTigreFIME 2 ปีที่แล้ว +2

    It’s magical!
    Wow, I didn’t know this was even possible. Now, this approach has unlocked so many doors!
    Thanks Mike!

    • @excelisfun
      @excelisfun  2 ปีที่แล้ว

      You are welcome for the magic!!!

  • @krismaly6300
    @krismaly6300 8 ปีที่แล้ว +2

    I enjoyed watching this video and recommend others to watch.
    Host seem to be extremely knowledgeable and expert in Excel
    He has designed the course I feel extremely good. Worth paying to buy the course.
    With this sample video I learned a bunch.
    Thanks for educating the community and appreciate your volunteership.
    Keep posting some more videos.
    Thanks a bunch

    • @excelisfun
      @excelisfun  8 ปีที่แล้ว

      You are welcome!

    • @wmfexcel
      @wmfexcel 7 ปีที่แล้ว +1

      I agree with you except the word "seem". WITHOUT any doubt, Mike IS extremely knowledgeable. :)

  • @harveyhirst4367
    @harveyhirst4367 7 ปีที่แล้ว +1

    My plant manager is going to be so happy our OEE data will take seconds to update and less time to pivot! Awesomee, thank you

  • @cdsheffer001
    @cdsheffer001 8 ปีที่แล้ว +4

    Thanks ExcelIsFun. I had a 167 MB txt file I was having issues with even loading. This helped a TON!

    • @excelisfun
      @excelisfun  8 ปีที่แล้ว

      You are welcome!

  • @brendamartinez6955
    @brendamartinez6955 2 ปีที่แล้ว +1

    Thanks ExcelIsFun.

  • @mjprom
    @mjprom 5 ปีที่แล้ว +2

    You saved me so much googling! Finally someone explaining these functions in a useful and easy to understand way. Thank you!

    • @excelisfun
      @excelisfun  5 ปีที่แล้ว

      You are welcome, Matt!! Thanks for your support : )

  • @CaribouDataScience
    @CaribouDataScience 2 ปีที่แล้ว

    What database is the data imported into when using data model?

  • @maksim0933
    @maksim0933 2 ปีที่แล้ว +1

    Thanks a lot for Excel is fun channel! 👍

  • @IlaPatel811
    @IlaPatel811 4 ปีที่แล้ว +1

    Thanks Mike! Love to watch all your videos, always learning so much about Excel and Power Query!

    • @excelisfun
      @excelisfun  4 ปีที่แล้ว

      Glad you can learn so much : )

  • @MGHxxvi
    @MGHxxvi 2 ปีที่แล้ว +2

    This is so helpful for me! Thank you!

    • @excelisfun
      @excelisfun  2 ปีที่แล้ว

      You are welcome, Monica!

  • @MrErolyucel
    @MrErolyucel 8 ปีที่แล้ว +1

    Clear explanation for the big data problem. Appreciate your existence...

    • @excelisfun
      @excelisfun  8 ปีที่แล้ว

      Glad that the existence of the videos helps!

  • @MaiTran-xf3op
    @MaiTran-xf3op 3 ปีที่แล้ว

    Thanks for sharin. I wonder if I just want to choose some in 10 file to refresh, how can I do?
    For example for 10 months of data, i just want to fix 7 months and update 3 months. How can I do to save time when refreshing

  • @hrithikzgirl83
    @hrithikzgirl83 ปีที่แล้ว +2

    Super clear and exactly what I wanted to know. Thanks so much!

    • @excelisfun
      @excelisfun  ปีที่แล้ว

      You are welcome so much!!!!

  • @aks1008
    @aks1008 4 ปีที่แล้ว +1

    Data Model is a role model

  • @jasonm8098
    @jasonm8098 8 ปีที่แล้ว +1

    Wow, that's a big size difference. I like comparisons like that. Thanks.

    • @excelisfun
      @excelisfun  8 ปีที่แล้ว

      I like comparisons like that too, helps me with the why. But don't forget, small data sets, Connection Only is quite nice so we only have one refresh. You are welcome as always!

  • @fokhrulislam2633
    @fokhrulislam2633 2 ปีที่แล้ว +1

    Yup... That was such a real magic. :) Can't thank you enough or appreciate enough for your work Mike. My Excel journey was started with your tutorials.. :D :)

  • @zt.5677
    @zt.5677 ปีที่แล้ว +1

    I am getting on with hard core PQ, Power Pivot activities. The relevant Magic Tricks are great help. (and the entire series on data analysis, of course )Thank you. Am I correct to assume that, with every new version, Excel has intentionally moved towards database management - SQL and the like - to become a greater tool for data management, and go beyond conventional spreadsheet services. (with Lambda, Excel has also reached out to programming - again, unconventional for a simple spreadsheet app.)

    • @excelisfun
      @excelisfun  ปีที่แล้ว +1

      I hope you are using this playlist: th-cam.com/play/PLrRPvpgDmw0ks5W7U5NmDCU2ydSnNZA_1.html
      It has the full MSPTDA Data Analysis class, plus many other Magic Tricks for DAX and Power Query.
      Microsoft has moved Power Query towards SQL, in that when you connect to an SQL database, Power Query builds SQL that it sends back to the SQL database for more efficient execution. But I don't think Excel or Power BI is "database management", both tools are built to perform analytics, rather than database management. You are right about LET and LAMBDA in that they move Excel closer to a programming language : )

    • @zt.5677
      @zt.5677 ปีที่แล้ว +1

      @@excelisfun Yes, you are right, analytics is the better term, not DB management. Yes, I use MSPTDA and the E DAB videos. Both of them are saved in my Playlist, together with some single videos from the Dragon series, Magic Tricks etc. Thank you.

    • @excelisfun
      @excelisfun  ปีที่แล้ว +1

      @@zt.5677 You are welcome, Z T!!!!!

  • @janardhanvenkatarama6134
    @janardhanvenkatarama6134 2 ปีที่แล้ว

    Hi Mike, I a big fan of you...you are absolutely great in explaining this stuff so everyone (even me) is able to understand it. 🙂
    Nevertheless, is there no way to set a parameter in the first step already to import certain columns only BEFORE loading the data into the power query editor itself?
    I know for CSV files at least you can select certain number of columns in the Csv.Document command, but is there a similar option to do with Excel Files with Excel.Workbook command which you load from a folder?
    I was searching for it since a long time but w/o success at all yet. My issue is that I want to keep this Master Excel file as small as possible as data from the Excel file in the folder is growing everyday I am reading from. Apart from that refresh takes too long time if I first load all columns and then remove columns I dont need in second step.
    For instance, I can do that in KNIME very easily but my management does not know KNIME at all... 🙂
    Thank you for you feedback.
    Regards, Janni

  • @Sal_A
    @Sal_A 8 ปีที่แล้ว +1

    Loved it! So if we get new data in the original file source and your Pivot Table is based off the Data Model then you just refresh the Pivot Table to get the updated data? Also, what's the Fast Data Load option in Power Query? Will that increase speed for the Connection Only method? Thanks!

    • @excelisfun
      @excelisfun  8 ปีที่แล้ว

      Yes, Just one refresh with Data Model. I don't know what "Fast Data Load option" is, where did you see it?

    • @pabeader1941
      @pabeader1941 6 ปีที่แล้ว

      It's a click box near the bottom of certain Query Properties dialog boxes. It's called Fast Load Data (It says by enabling this your data will load faster but Excel might become unresponsive for long periods of time).

  • @sajidsherif5707
    @sajidsherif5707 4 ปีที่แล้ว

    Hi Mike. Thanks so much for the content you put out there - your efforts have helped me big time in my work and I have developed a love for Excel (I knew zilch before watching your videos). I do have a question though. I used the Data Model method as illustrated in the video, but my file size is just over 100MB. I have 132 columns and 1.1M rows. The Pivot Table is working fine with minimal lag, but for my understanding, I am wondering why my file size is so large (can't seem to get an answer with standard googling). Thanks in advance!

  • @muhammadsyazwanfathullahfa8569
    @muhammadsyazwanfathullahfa8569 4 ปีที่แล้ว

    Hello, i'd like to ask, you example here has about 7mil lines. My data is only 3mil, but i could not do the pivot table. I will always get this alert "Microsoft Excel cannot make this change because there are too many row or column items..............". May you help me to address this?

  • @mowgli536
    @mowgli536 4 ปีที่แล้ว

    Hi! I'm very excited to start using Power Pivot! I have some very large files that I would like to like to add to a Data Model via a connect because I am building some dashboards. I want to get the data from a folder, so all I have to do is dump the files in the folder and refresh my dashboard workbook. However, I have one worksheet with a dashboard for Active Employees and another worksheet within the same workbook for Termed Employees. I cannot have these datasets merged because they are different. I can't seem to create 2 queries from one folder.

  • @Sal_A
    @Sal_A 8 ปีที่แล้ว

    Cool. I'm using Excel 2010 with PQ as an add on. Click Power Query tab > Options > Data Load > Fast Data Load (you can check/uncheck it). You can also view this option by hovering over one of your Connections in your Power Query Workbook Queries > select the ellipses (......) > Properties > Fast Data Load

    • @excelisfun
      @excelisfun  8 ปีที่แล้ว

      Got it! Thanks for the hot tip! : )

  • @DigitalCraig
    @DigitalCraig 8 ปีที่แล้ว

    Good Example in difference in file size.

    • @excelisfun
      @excelisfun  8 ปีที่แล้ว

      Glad it is helpful! : )

  • @entertainmentgalaxy971
    @entertainmentgalaxy971 7 ปีที่แล้ว

    Superb. Many thanks for sharing such kind of knowledge. Major difference between data model and connection only, one is faster and compress file size and other dont.. Please correct if i m wrong.....

    • @excelisfun
      @excelisfun  7 ปีที่แล้ว +1

      You are correct. Yes, I have done a number of videos on this topic. Connection Only can be slower sometimes in refreshing, but file size should be smaller.

  • @babon3153
    @babon3153 5 ปีที่แล้ว

    Great material, thanks! :)
    The only difference that I have got is on the importing phase: while selected proper folder and clicking on the two arrows to see the fields it opens a view per each file and then when I have connection it is not a one query, but actually a set of five queries... Could you please comment what I am doing wrong?

    • @fabien3512
      @fabien3512 2 ปีที่แล้ว

      I actually have the same problem when in every video I saw, everyone has only one query..

  • @sharlok4150
    @sharlok4150 3 ปีที่แล้ว

    Many Thanks for your help

  • @donovannewton9507
    @donovannewton9507 4 ปีที่แล้ว

    has power query learnt to use a.docx file yet without having to convert it to text?

  • @gabormathe823
    @gabormathe823 2 ปีที่แล้ว

    Excellent! Thank you!

  • @saikrishna3951
    @saikrishna3951 2 ปีที่แล้ว

    I cannot choose only create connection, please help]

  • @amrizzato
    @amrizzato 5 ปีที่แล้ว

    I have a question, let's suppose you came back a month later and now have an additional text file, you drop it in the folder, do you just refresh the query or do you have to recreate from scratch?

    • @faustdw88
      @faustdw88 5 ปีที่แล้ว

      just refresh

    • @amrizzato
      @amrizzato 5 ปีที่แล้ว

      Fauststein thank you!

  • @blackrush1234
    @blackrush1234 ปีที่แล้ว

    Amazing work!

  • @monadm5697
    @monadm5697 7 ปีที่แล้ว

    Hello, I have a table with 6 colums but with 21420 rows , couldnt past it into Excel, do you have a method for doing that? I have MS Excel 2007.
    Thanks ..

  • @gillynova
    @gillynova 4 ปีที่แล้ว

    Thanks for this! Very helpful

  • @orta02
    @orta02 5 ปีที่แล้ว

    Awesome Video - helped me save so much time - thank you

  • @arunkhanal7208
    @arunkhanal7208 2 ปีที่แล้ว

    Instead of Sum of values kindly show the COUNT so that one can realise the number of rows.

  • @kamranb1369
    @kamranb1369 6 ปีที่แล้ว

    I learn something new everyday with your help.Thanks Mike :-)

    • @excelisfun
      @excelisfun  6 ปีที่แล้ว

      Glad the videos help and you learn something each day, K B : )

  • @pmsocho
    @pmsocho 8 ปีที่แล้ว

    Great comparison! Thanks Mike!

    • @excelisfun
      @excelisfun  7 ปีที่แล้ว

      Glad yoy like it, Awesome Online Excel Teammate!

  • @Victor-ol1lo
    @Victor-ol1lo 8 ปีที่แล้ว

    Hi Mike ! Great video. Looking forward to see more PQ-Stuff... Thumbs up !!

    • @excelisfun
      @excelisfun  8 ปีที่แล้ว

      Lots more is coming! Thanks for the Thumbs Up and Sub! : )

  • @p.narasimamurthy5353
    @p.narasimamurthy5353 5 ปีที่แล้ว

    How to import data in power query if empty cells have between cell

  • @hueyleenwong6908
    @hueyleenwong6908 5 ปีที่แล้ว +1

    Hii, I downloaded my data into txt. Version. But when I import my data, some of the column runs causing it to hit error that show unable to convert to number. I get to append a few data together but I did not get the same error as my big files. Is there any solution for ignoring the error, and let my data appear?

    • @RuntyKnows
      @RuntyKnows 4 ปีที่แล้ว +1

      Power BI Error - Correcting the error: We couldn't parse the input provided as a Date value. - th-cam.com/video/iD7gNBcYjEE/w-d-xo.html

  • @ishaq79
    @ishaq79 6 ปีที่แล้ว

    when i change the data type, some of my dates of data show error, although before changing data type i can see as a date. why is showing this don't get it. Could you please advise.

  • @s1ngularityxd64
    @s1ngularityxd64 4 ปีที่แล้ว

    Awesome, great video

  • @luisgaspar3278
    @luisgaspar3278 6 ปีที่แล้ว

    Stunning!

  • @knikl
    @knikl 4 ปีที่แล้ว

    Great video. Thank you Mike

  • @Sal_A
    @Sal_A 8 ปีที่แล้ว +1

    BTW, "add this data to the Data Model" feature is not in Excel 2010 even if you download the PQ add on.

    • @excelisfun
      @excelisfun  8 ปีที่แล้ว

      Yes, that is correct. It was added in Excel 2013.

  • @onnseanbeats1012
    @onnseanbeats1012 3 ปีที่แล้ว

    This is great, thank you!

    • @excelisfun
      @excelisfun  3 ปีที่แล้ว

      You are welcome!

  • @ContentedSoul
    @ContentedSoul 7 ปีที่แล้ว

    For anyone having errors with this problem, there is an error in the "header" (first-) row of the text files 2017-01 and 2017-03.
    This begins:
    Date Time{tab space}Web Site{tab space}...
    In all the other files this is:
    Date{tab space}Web Site{tab space}...
    Replace "Date Time" with "Date".
    Excel/Power Query should now work fine, if a little slowly, with all these records.

    • @excelisfun
      @excelisfun  7 ปีที่แล้ว

      Thank you for tracking down this error. I have changed the two files :2017-01 and 2017-03. For each I changed "Date Time" to "Date". I hope this fixes the problem.
      Thanks, ContendSoul for being a great Online Excel Teammate and helping me out : )

  • @נירמימון-ז4ז
    @נירמימון-ז4ז 7 ปีที่แล้ว

    hey,
    good Explanation, but it didn't work for me...
    i use several excel files that located in 2-3 folders. The total files size are almost 200MB. After importing and manipulating them a little bit, i loaded the final query to the Data Model and saved it.
    At the end of the process the file size (with the model) was 70MB which is very big. The dimentions are 600K rows X 26 Cols.
    Any idea what went wrong in process ?

    • @excelisfun
      @excelisfun  7 ปีที่แล้ว

      Mot sure, but maybe: Since the Data Model in-memory Columnar Database only stores unique values, if your data set had many different values (not-unique or distinct), maybe file size was not reduced as much.

    • @נירמימון-ז4ז
      @נירמימון-ז4ז 7 ปีที่แล้ว

      Worked like a magic !!!
      Thank you !

    • @excelisfun
      @excelisfun  7 ปีที่แล้ว

      I have fixed the source data sets, so I hope after you download the text files again and re-try, everything will work.

  • @himanshudalai1028
    @himanshudalai1028 6 ปีที่แล้ว

    Thank you Mike for this great video.

  • @CraigHatmakerBXL
    @CraigHatmakerBXL 8 ปีที่แล้ว

    Nice presentation.

    • @excelisfun
      @excelisfun  8 ปีที่แล้ว

      Glad you like it!

  • @yuliya3294
    @yuliya3294 3 ปีที่แล้ว

    I have tried so many times, the file with data model + Pivot is always much bigger than connection only + Pivot. Who knows why? Data are extracted from XLXS files. There are about 700 thousand rows and 30 columns, when data are loaded pure through Power Query to Data Model, the file size more 27M.

    • @excelisfun
      @excelisfun  3 ปีที่แล้ว

      Becasue Connection Only does not load all the raw data, all other methods do : )

  • @kabradg
    @kabradg 8 ปีที่แล้ว

    My work's database management system is exporting data tables just fine but the first 9 rows contain information that disrupts power query because the first 9 rows have a small amount of information data that take up only two the first two columns. Power query then creates its table and thinks I only have 2 columns of data and doesn't display the remaining 20 columns that start in row 10. Is there a way to fix this in power query?

    • @excelisfun
      @excelisfun  8 ปีที่แล้ว

      It sounds like a specific issue with this database. I have not encountered this issue before. The key with Power Query (and all data cleaning, querying and importing) is to recognize a consistent pattern in the data that can be used to get the output you want. Maybe, if the consistent pattern is always that the top 9 rows are faulty, then remove top 9 rows with Remove Top Rows feature. This feature is in Home Ribbon Tab, Reduce Rows group, Remove Rows drop-down, then Remove Top Rows.

  • @jamesdiaz9740
    @jamesdiaz9740 6 ปีที่แล้ว

    Thanks man!! I've learned a lot from you.

    • @excelisfun
      @excelisfun  6 ปีที่แล้ว +1

      Awesome! Glad it all helps! Thanks for the continued support : ) Tell all your friends too : )

  • @michalroesler
    @michalroesler 3 ปีที่แล้ว

    Yes.

  • @rizkyramdhany4549
    @rizkyramdhany4549 5 ปีที่แล้ว

    Its easy to understand, thanks :) Subscribed

    • @excelisfun
      @excelisfun  5 ปีที่แล้ว

      Thank you for your Sub : )

  • @antoniojhuerta
    @antoniojhuerta 7 ปีที่แล้ว

    Not working fine for me, text files returns null columns in power query .. I don't know why..

    • @excelisfun
      @excelisfun  7 ปีที่แล้ว

      I have fixed the source data sets, so I hope after you download the text files again and re-try, everything will work.

  • @mohamedchakroun4973
    @mohamedchakroun4973 8 ปีที่แล้ว

    thank you

    • @excelisfun
      @excelisfun  8 ปีที่แล้ว

      You are welcome!

  • @LesiavanderWoman
    @LesiavanderWoman 4 ปีที่แล้ว

    that was interesting, thanks

    • @excelisfun
      @excelisfun  4 ปีที่แล้ว

      You are welcome, Lesia!!!

  • @zaighamuddinfarooqui1705
    @zaighamuddinfarooqui1705 8 ปีที่แล้ว

    Cool!! :)

  • @jamesdiaz9740
    @jamesdiaz9740 6 ปีที่แล้ว

    Very helpful, thanks Mike.

  • @kumshan1407
    @kumshan1407 7 ปีที่แล้ว

    Hi I am not able to reduce the size of the file. I tried to import a text file using data query before making a Pivot table report. I have posted the query in below forum: chandoo.org/forum/threads/excel-power-query-get-and-transform-import-big-data-text-files-connection-only-or-data-model.35668/