Scrape Data from Multiple Web Pages with Power Query

แชร์
ฝัง
  • เผยแพร่เมื่อ 22 พ.ย. 2024

ความคิดเห็น • 474

  • @robertbartlett3757
    @robertbartlett3757 3 ปีที่แล้ว +24

    That is absolutely brilliant!!! I have spent the last two days trying to figure out how the do it in Python and within 8 minutes you showed me a much easier straight forward way.

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  3 ปีที่แล้ว +1

      :-) so pleased it was helpful, Robert!

    • @abhinandanaams2613
      @abhinandanaams2613 2 ปีที่แล้ว

      @@MyOnlineTrainingHub can i download epaper into pdf without coding?

    • @JayPatel-hc8dq
      @JayPatel-hc8dq ปีที่แล้ว +1

      lol... literally me too.. i got quite for until python was reading arabic webpages in hex and then i thew my laptop out the window!

  • @fentian
    @fentian 20 วันที่ผ่านมา

    Wow, what an astonishing concept and how wonderfully well you've explained it.
    I've just applied it in Excel PQ to call an API over and over with a number of variables including a date that changes for each iteration, returning JSON data that is then transformed and presented in a pivot table. Thank you Mynda, xxx

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  19 วันที่ผ่านมา

      Thank you! So pleased this video was helpful.

  • @obinnaduru3815
    @obinnaduru3815 3 ปีที่แล้ว +5

    Thank you so much for this video. Very practical for my Data Analyts journey. I followed the steps and didn't tun into any errors.

  • @prameelar1753
    @prameelar1753 3 ปีที่แล้ว +2

    I watched this video on this teachers day, and I believe you are one of the best teacher could help me on web scraping... 🤗

  • @abdulhaseeb8027
    @abdulhaseeb8027 4 ปีที่แล้ว +3

    It's like you have read my mind because I was looking to scrape data from web like this currently. Thanks for the tutorial it's really helpful.

  • @Secret구이구이
    @Secret구이구이 4 ปีที่แล้ว

    Thank you!
    It is hard to study in Korea because there is not much data about powerquery.
    Thanks to this, I integrated several post api into a single query.

  • @malaniebanney1634
    @malaniebanney1634 ปีที่แล้ว

    I slightly adjusted this to scrape data from a folder full of PDF files. Excellent thanks!

  • @geoffreyzziwambazza7862
    @geoffreyzziwambazza7862 2 ปีที่แล้ว +1

    To think I was doing this manually 🤦🏽‍♂️. Thank you, this is a huge time saver!

  • @markhooper279
    @markhooper279 4 ปีที่แล้ว

    That's remarkable; this is like the limit of most peoples Python learning, and most co-workers would consider them "dangerous" with those Python abilities. (in the most professional and excellent way of course!)

  • @awesh1986
    @awesh1986 4 ปีที่แล้ว

    This is an amazing way of working with web pages. I have seen people write lengthy macros and Python code for this.

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว

      Yes, Power Query is super easy to use. I wish more people knew of it's powers ;-)

  • @davidstevens4064
    @davidstevens4064 2 ปีที่แล้ว

    Wow...Easily used this tutorial to query printer settings from every Zebra printer on my LAN. Very helpful!

  • @davegoodo3603
    @davegoodo3603 4 ปีที่แล้ว +3

    A bit beyond me at this point Mynda, Power Query is on my "to learn" list. Well presented.

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว +1

      Thanks, Dave! Power Query is amazing...I'm confident you'll think so too :-)

  • @sushicatsan
    @sushicatsan 3 ปีที่แล้ว +1

    I knew this was possible, but ran into some errors while trying to do it on my own. Thank you very much for the great tutorial. Now to let Power Bi Spin!

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  3 ปีที่แล้ว

      Glad it helped!

    • @omenaokoro4693
      @omenaokoro4693 ปีที่แล้ว

      spot on. I was only able to do the first page. This gives me the ability to do an entire site.

  • @jamessawyer8565
    @jamessawyer8565 4 ปีที่แล้ว +8

    I wasn't even aware that M/Power Query can be used to such extent. Thank you for the great insight!

  • @awesh1986
    @awesh1986 4 ปีที่แล้ว +1

    Thanks Mynda, there is no way that I would not like this video. It's awesome.

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว

      Thanks so much, Awesh! And thanks for sharing it on LinkedIn :-)

  • @biswajeetswaro7831
    @biswajeetswaro7831 4 ปีที่แล้ว +1

    Great video mam!!! I was doing this before python then saved into csv then importing to PBI. Now I can do with PBI directly 👏👏👏

  • @naotoaguilarmorita7079
    @naotoaguilarmorita7079 3 ปีที่แล้ว

    Thanks a lot for this tutorial! I could get mutiple api call in single query, best solution ever!

  • @StephanOnisick
    @StephanOnisick ปีที่แล้ว

    Awesome use of M for us tiptoeing into the M Script!

  • @prashantmanshrestha
    @prashantmanshrestha 3 ปีที่แล้ว

    Clear Voice, Beautifully Explained Super-woman.

  • @merbouni
    @merbouni 4 ปีที่แล้ว

    I have never tried this, but I frequently convert data from the csv file to the html Datatable, Thanks Mynda.

  • @khalidessaadi8915
    @khalidessaadi8915 15 วันที่ผ่านมา

    Wonderful job ! So clear and perfectly explained, thank you so much !

  • @vincasvosylius6045
    @vincasvosylius6045 4 ปีที่แล้ว

    You are the legend! Helped me to solve this greyed out "change data source "button

  • @MichaelBrown-lw9kz
    @MichaelBrown-lw9kz ปีที่แล้ว

    This is simply awesome, now I have to practice this technique.

  • @victorgabrielcamargo6384
    @victorgabrielcamargo6384 8 หลายเดือนก่อน

    Wooww thank you so much, took me months to find this function. I will try it in a more complicated webpage. thank you

  • @deepakd-w5h
    @deepakd-w5h 28 วันที่ผ่านมา

    Merci Beaucoup madame. You made my work very easier

  • @MichaelHendersonMHC
    @MichaelHendersonMHC 4 ปีที่แล้ว +1

    Brilliantly framed and well communicated. Thank you again Mynda.

  • @julianstarkey9301
    @julianstarkey9301 4 ปีที่แล้ว

    Very helpful, a lot less complicated excel formulas in my life now, shame that challenge has gone but I had to think a lot about my queries.

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว

      Don't be sad that the challenges have gone...there are plenty of new challenges awaiting; M code, DAX, dynamic array functions :-)

  • @CEYLAN64
    @CEYLAN64 4 ปีที่แล้ว +2

    Thank you very much. I'm from Turkey. Have a nice day.

  • @machadolopes
    @machadolopes 2 ปีที่แล้ว

    Amazing how it is easy to scrape web pages. Thanks for this excellent tutorial.

  • @valentecg8518
    @valentecg8518 หลายเดือนก่อน

    I really appreciate your tutorial! monysaver! Most data extraction tools are costly.

  • @仁です
    @仁です ปีที่แล้ว

    It's usefull. Thanks you. I am looking for silimilar data scraper software. Do you mind to show me how to work with power BI in the case with differences website please.

  • @iankr
    @iankr 7 หลายเดือนก่อน

    Brilliant! Many thanks, Mynda.

  • @stephencross4978
    @stephencross4978 ปีที่แล้ว

    Wow, this is clever and exactly what I needed. My mind is blown !!

  • @fabio.s.barbosa
    @fabio.s.barbosa 3 ปีที่แล้ว

    Wonderfull tutorial! that was exactly what I Looking for. I was duplicating datasources for each week to scrap some web data. Thanks a lot!

  • @michalvydrzel
    @michalvydrzel 8 หลายเดือนก่อน

    YOU ARE THE BEST!! Saved me so much work!

  • @StephenMattison66
    @StephenMattison66 3 ปีที่แล้ว

    Great info, easy to understand. TYVM! I'd love to learn how to do all of this in Google Sheets. Power Query sounds cool!

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  3 ปีที่แล้ว

      Glad you liked it, Stephen! Sheets doesn't have Power Query.

  • @powerb_i
    @powerb_i 2 ปีที่แล้ว

    Great video thanks this makes web scraping a lot easier. Thank you.

  • @peimanhosseini37
    @peimanhosseini37 ปีที่แล้ว

    thank a lot, that was really really useful. you solve my very big problem. 🙏🙏🙏🙏🙏🙏

  • @nazaarshadir
    @nazaarshadir 3 ปีที่แล้ว +1

    Another great lesson. I have a website with unstructured data for many items. I need specific values for each item from the site. Please, how may I do it automatically and quickly. cftc .gov/dea/futures/deacmesf . htm
    I only need LONG and SHORT value for each code. Thanks.

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  3 ปีที่แล้ว +1

      Great to hear, Nazaar! The URL provided isn't right. Please post your question and sample Excel file on our forum where we can help you further: www.myonlinetraininghub.com/excel-forum

    • @nazaarshadir
      @nazaarshadir 3 ปีที่แล้ว +1

      @@MyOnlineTrainingHub thanks for the quick reply. I just joined the forum. Your forum is clean and organized. Looking forward to learning more. Thanks.

  • @03mariadelmar
    @03mariadelmar 2 ปีที่แล้ว

    Hi! Your tutorial is very clear. However, what if the web page you are trying to access needs your credentials first? Do you know how I can go around that? Thank you!

  • @naveedkhowaja4089
    @naveedkhowaja4089 ปีที่แล้ว

    Excellent tutorial, super easy to follow. That’s brilliant 👍

  • @ramakumarguntamadugu1299
    @ramakumarguntamadugu1299 2 ปีที่แล้ว

    Great Video... Thanks for the efforts and sharing it. this will be very useful for many tasks...

  • @jbjs5820
    @jbjs5820 2 ปีที่แล้ว

    Excellent work. just a question, when i try to refresh it in the system it doesn´t allow. indicates "This dataset includes a dynamic data source. Since dynamic data sources aren't refreshed in the Power BI service, this dataset won't be refreshed", any workaround?

  • @youse3
    @youse3 2 หลายเดือนก่อน

    Thank you so much for this video. what if we have "read more" instead of page numbers ?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  2 หลายเดือนก่อน +1

      Power Query typically can't see 'read more' information unless it's already in the page HTML. If it's generated using JavaScript, then you can't scrape it.

    • @webscrapingseniors
      @webscrapingseniors หลายเดือนก่อน

      Power Query can struggle with data generated by JavaScript after the initial page load. In such cases, consider using a web scraping tool like Selenium, which can handle JavaScript and interact with 'read more' buttons to load additional content. This way, you can extract all the necessary information from the page. Let me know if you need more guidance!

  • @ssomtom
    @ssomtom 2 ปีที่แล้ว

    Beautiful. It's solved my actual problem. Thx. :)

  • @iliyatsekov6044
    @iliyatsekov6044 2 ปีที่แล้ว

    Many thanks for the video! What if I have two variable names? My URL includes both a year and a quarter. I created the two variable names but how do I invoke the function to take all quarters from every year?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  2 ปีที่แล้ว +1

      Make a table containing the string made up of the quarter and year components and whatever other characters form that section of the URL, and feed that into a single variable.

  • @eslamfahmy87
    @eslamfahmy87 9 หลายเดือนก่อน

    Thank you, one more thing if my pages contain PDF files and I need to add another column which contains that PDF and I need to be accessible by link

  • @jayli3291
    @jayli3291 ปีที่แล้ว

    Great resource! I am curious about replacing 1 with "&PageStart&". Can you explain why we use the double quotes coupled with the double ampersand? Which language/grammar are we following here, M or HTML or something else? I just wanted to learn more coding rules so I can crack the query more freely. I would appreciate any help you could provide.

    • @jayli3291
      @jayli3291 ปีที่แล้ว

      I guess I figured it out. We are just concatenating the opening " with PageStart and then with the closing "; the & works as the concatenation operator. And because PageStart is a text variable, we need to put it inside the double quotation marks.

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  ปีที่แล้ว

      You got it 👍

  • @shakiraasfoor7599
    @shakiraasfoor7599 4 ปีที่แล้ว

    Well Done Mynda
    All Your Videos Are Useful

  • @lindalai1406
    @lindalai1406 3 ปีที่แล้ว

    Thank you very much for bringing this brilliant video. I do have a question, if I am not used to using Power BI and still want to use excel to extract web data like you do in this video, how do I do that?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  3 ปีที่แล้ว +2

      Hi Linda, Power Query in Excel doesn't have 'from web by example'. Your best option is to use 'From Web', but that will require the data in the web page to be stored in a HTML table. You'll know if it is, because you'll be able to see the table in the preview in Power Query.

    • @lindalai1406
      @lindalai1406 3 ปีที่แล้ว +1

      @@MyOnlineTrainingHub Thank you very much for your prompt response.

  • @ritvikbolugudde8688
    @ritvikbolugudde8688 2 ปีที่แล้ว

    Thanks a lott!! I was wondering if the web page is updated would the loaded data in power bi update too (so basically if it's real time or not)

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  2 ปีที่แล้ว

      Only direct query datasets can refresh real time, however, you can schedule refreshes at set intervals.

  • @NadeemShafiqueButt
    @NadeemShafiqueButt ปีที่แล้ว

    As always, an excellent tutorial

  • @Ismail-Yahya
    @Ismail-Yahya 4 ปีที่แล้ว +2

    Web scrapping, oh I love it 😊

  • @gest4mp
    @gest4mp ปีที่แล้ว

    I don´t know you, but I love you. thanks!

  • @AnonymousHunYaar
    @AnonymousHunYaar 2 ปีที่แล้ว

    Marvelous ! You make it so easier, Thanks a lot

  • @chrism9037
    @chrism9037 4 ปีที่แล้ว

    Super cool video, thanks Mynda

  • @mariaalcala5159
    @mariaalcala5159 3 ปีที่แล้ว

    Wow amazing what you can do! Thanks a lot mynda I’m always learning from you!

  • @reng7777
    @reng7777 4 ปีที่แล้ว

    Dear Somthing that i would say iis that t is important to mention is that UsingPower Query in excel give a lots of problem since consumes a lot of Ram memory , i had a really bad experiance with Pq tool in excel ,even though i reduced to the minimun as i could the steps and the volumen of imported data.. so thta is something that MS got to improve for sure.

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว

      Hi Rene, occasionally you will come up against limits like you describe, but I would say this is the exception rather than the norm. Sometimes there are settings and alternate approaches to your query's structure that can alleviate performance issues like you describe. Sometimes it's an issue with your PC's specifications e.g. not enough RAM, not 64-bit Excel etc.

  • @adamsteele44
    @adamsteele44 2 ปีที่แล้ว

    Wow. Amazing video, thank you!

  • @darrylmorgan
    @darrylmorgan 4 ปีที่แล้ว

    Hi Mynda!Great Tutorial,Just Learnt Something New So I Can Have More Fun With POWER BI..Thank You :)

  • @wrandyrice5447
    @wrandyrice5447 3 ปีที่แล้ว

    Mind blown. This is awesome. Thank you.

  • @Chriiichriii
    @Chriiichriii 2 ปีที่แล้ว

    Exactly what I was looking for, thanks ! great video

  • @carltonquine9277
    @carltonquine9277 4 ปีที่แล้ว

    Wow you're amazing! Can't believe this information is free! Thank you so much!

  • @sadinenim5360
    @sadinenim5360 2 ปีที่แล้ว

    Can do a video on how we can scrap the data from after login into portal with our credentials and then fetch the data

  • @petermcallister908
    @petermcallister908 2 ปีที่แล้ว

    Great tutorial! Helped me a lot. But do you have any idea, why "Add Table Using Examples" won't work and throws this message: "This Stencil app is disabled for this browser"?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  2 ปีที่แล้ว

      Never heard of that before, Peter. It sounds like you're trying to use Power Query online because there's reference to a browser.

  • @ДенисДементьев-т3о
    @ДенисДементьев-т3о 2 ปีที่แล้ว

    Great video! Extremely useful
    It works in my case, but only for first 19 sheets out of 89.
    Starting from 20th sheet i get a blank page without any data, however i can see pages from range 20 to 89 via browser.
    I would appreciate if you show how many pages could be exported in your exact example

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  2 ปีที่แล้ว

      Sounds like the web site is throttling the feed so you can't get the data. Not much you can do about this, other than try splitting the task into multiple queries and run them one at a time.

    • @gzfraud
      @gzfraud ปีที่แล้ว

      @@MyOnlineTrainingHub
      Goods News 1 ..... Solving the throttle problem. When PQ and BI won't work, I use Instant Data Scrapper. It's a free Chrome extension and works 95% of the time. It let's you set a time delay to go to next page. I usually start at 12 seconds then decrease the delay 1 second every 100 pages or so to about 4 or 5 seconds. Most I've ever done it scrapped more than 40,000 pages on a website.
      It scrapes only when the webpage is active. So if you navigate to a different webpage tab it pauses. To restart scrapping simply make that page active, ie displaying, and click Start Scrapping. To prevent pausing, simply drag the webpage to be stand alone before starting IDS.
      Goods News 2 ..... it does something that PQ and BI don't do. It extracts embedded URLs. Say email addresses are embedded in people's names. PQ and BI will import the names (as plain text) but I've never figured out how to get them to extract the embedded email address. IDS does extract the embedded URL.
      Bad News .... IDS doesn't connect to the website so you can "refresh" the query like you can with PQ and BI.

  • @harigokul4450
    @harigokul4450 4 ปีที่แล้ว

    Thank you, Madam, for this useful info! But I need to know "How can I scrape pages which have infinite scrolling using power bi?". looking forward to your suggestion!

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว

      Hi Hari, if the pages are loading the data on the fly and the data isn't in an html table or visible on the page, then I'm not aware of a way to do that, sorry.

  • @eo4922
    @eo4922 2 ปีที่แล้ว

    Incredible overview, thank you so much! Is it possible to do this if you have a site with multiple pages that uses the same URL? I'm trying to scrape data from a public site with multiple pages, but all of them use the same URL - there are no unique identifiers (e.g. page numbers). Any assistance would be greatly appreciated.

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  2 ปีที่แล้ว +1

      Glad it was helpful. Unfortunately, if the site's URL doesn't change, then you can't scrape the data with Power Query.

    • @eo4922
      @eo4922 2 ปีที่แล้ว

      @@MyOnlineTrainingHub Understood. Could you recommend any other options that may be helpful? Thank you in advance.

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  2 ปีที่แล้ว

      Only to say that if you know JavaScript (I don't) you can write some code to change the 'page' displayed so you can get the data.

  • @m_shakes
    @m_shakes 3 ปีที่แล้ว +1

    Amazing video and awesome ideas that I incorporated instantly! Quick question, how would you go about making each "page" into a separate query (each page a query on its own)?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  3 ปีที่แล้ว +2

      Glad you liked it, Mohammed! To make each page a separate query, you'd have to create them one by one by pasting in the URL for each page, or copying the query and modifying the URL to point to a different page.

    • @m_shakes
      @m_shakes 3 ปีที่แล้ว +1

      @@MyOnlineTrainingHub Thanks for your prompt reply!

  • @charlesmcdermott282
    @charlesmcdermott282 4 ปีที่แล้ว

    Awesome! I managed to import a table for 1 page from a URL. It is a list of books unfortunately the number of books per web page varies. Is there a way to handle the issue of generating each page number in this case? As a backup is there a method of exporting all pages to a csv file and Load & Transform the csv back into PBI or PQ?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว

      Glad it was useful, Charles. In terms of figuring out the number of items on a page, I'm not sure there's any way to do that in advance of accessing the pages. Whether there's a way to export the pages to a csv file would be down to that website and whether it offers that as an option. It's not something Power Query can do.

  • @wayneedmondson1065
    @wayneedmondson1065 4 ปีที่แล้ว +1

    Hi Mynda.. another great example and technique. Thanks for sharing it :)) Thumbs up!!
    PS - Any idea when the Add Table Using Examples feature will come to Power Query in Excel in Microsoft 365?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว +1

      Thanks, Wayne! No idea when Excel will get Add Table Using Examples :-( it has been available in Power BI for quite a while now, but that doesn't seem to mean anything.

  • @nurezzati9888
    @nurezzati9888 2 ปีที่แล้ว

    Hi Mynda. Thank you for sharing it. Very useful. However, Is there any way to get the actual URL since the position keeps changing whenever I refresh data in Power BI.

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  2 ปีที่แล้ว

      You can use the actual URL if it's more suitable for your scenario.

  • @peterh7842
    @peterh7842 2 ปีที่แล้ว

    This is great - can you show how to do this with multiple parameters though - cant find anything understandable on the web!!

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  2 ปีที่แล้ว

      Please post your question and sample Excel file on our forum where we can help you further: www.myonlinetraininghub.com/excel-forum

    • @peterh7842
      @peterh7842 2 ปีที่แล้ว

      @@MyOnlineTrainingHub thanks - will do :)

  • @JasonAngWeiLung
    @JasonAngWeiLung 3 ปีที่แล้ว

    Hi thanks for your video, may I know how to extract data from web pages in multiple tabs (same layout)?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  3 ปีที่แล้ว +1

      Not sure what you mean by multiple tabs. If the URL follows a pattern then you can use the technique described in this video. If not, then each URL will have to be a separate query, which you can then append if required.

  • @rodrigomoro8047
    @rodrigomoro8047 3 ปีที่แล้ว

    Dear! Thank you so much for this video.
    Could you please share with us how can we do the following:
    I have a web based database that is constantly fed. Today it has 300 itens and 15 itens per page, so: 20 pages. But next week, this database may have 600 itens, and due that, 40 pages. How can I automate the function to identify the total number of the pages each time it acesses the web data source?
    Thank you!

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  3 ปีที่แล้ว +1

      Good question, Rodrigo. I see you've also posted this question on our blog post. We'll answer it there as that will be more helpful to others.

    • @rodrigomoro8047
      @rodrigomoro8047 3 ปีที่แล้ว +1

      @@MyOnlineTrainingHub thank you so much!

    • @iwcik
      @iwcik 2 ปีที่แล้ว

      @@rodrigomoro8047 Hi Rodrigo, could you please share the link with the reply to your question?

  • @austinbright-j3o
    @austinbright-j3o 3 หลายเดือนก่อน

    Can you get around captchas for more advanced stuff?

  • @arturodimas6988
    @arturodimas6988 4 ปีที่แล้ว

    Thank a lotu it was terrific, I'am from México.

  • @WeKnowIt100
    @WeKnowIt100 3 ปีที่แล้ว

    Hi! I have encountered a login page before the page that i need to scrap. Anyway can i bypass the page or key in the credentials?

  • @shrikantbadge3978
    @shrikantbadge3978 ปีที่แล้ว

    I still need to watch this video a few times. Our entire organization dont know this i bet

  • @bryandadiz5677
    @bryandadiz5677 ปีที่แล้ว +1

    The website is not anymore updated

  • @athilfaizaan8558
    @athilfaizaan8558 ปีที่แล้ว

    Absolutely useful video! Thank you for this. Also I have a doubt, I have to scrape 560 pages and each page has 25 number of items that I need. I'm a little confused on the modulo part. In 5:28 of the video you say the starting number of pages that u need are 1, 11, 22 etc. But in my case the pages the I require are 1, 2, 3 etc and 25 records in each page. So do I use modulo same as you with 1, 26, 51 etc or avoid the modulo part and continue?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  ปีที่แล้ว +1

      Great to hear! Yes, for the modulo enter the number of records in a page. Try it and see if you get the results you expect and adjust as necessary.

    • @athilfaizaan8558
      @athilfaizaan8558 ปีที่แล้ว

      @@MyOnlineTrainingHub Thank you so much for answering my question and yes it worked, except I got one small problem. Let's say I'm trying to get the rental data from a website and I require three columns; Address, Price and Area (sqft). I chose it correctly from the select from example option. But after seeing the preview of the tables, I see that it has gotten a different category of data (additional info like gym/swimming pool instead getting the address that I wanted). I thought I selected it wrongly and did it again but I got the same results. I'm tackling the problem by now getting only the address and merging with the two tables together. But do you know why this is happening or a workaround for it?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  ปีที่แล้ว +1

      Not sure as I'm not familiar with the site. You're welcome to post your question and sample Excel file on our forum where someone can help you further: www.myonlinetraininghub.com/excel-forum

  • @louielouie9502
    @louielouie9502 3 ปีที่แล้ว

    I'm currently new to this stuff. I see that you might be able to customize queries for specific data scraping tasks. I'm interested in learning ethical data scraping techniques. How would it be possible to create custom scraping software? What computing language would you recommend learning in that case?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  3 ปีที่แล้ว +1

      I can't answer that question, Louie. I know nothing about creating custom software.

    • @louielouie9502
      @louielouie9502 3 ปีที่แล้ว

      @@MyOnlineTrainingHub Thanks Anyway

  • @ngoduyvu
    @ngoduyvu ปีที่แล้ว

    Great example

  • @DLHSuper
    @DLHSuper 4 ปีที่แล้ว

    Hi Mynda, you’re videos are teaching me so much... is there a way to scrape a website that only works in google chrome or Firefox? Unfortunately the website I need to scrape doesn’t work in IE.

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว +1

      Pleased to hear that! If the website doesn't work in IE then you should still be able to scrape it, but you might not be able to see the preview and use the 'from example' tool. Power Query is looking for HTML tags in the web page source code, so as long as your data is stored in these, Power Query can find it. If it's tables generated using JavaScript, then you can't easily get the data using Power Query unless you know how to write JavaScript!

    • @DLHSuper
      @DLHSuper 4 ปีที่แล้ว

      MyOnlineTrainingHub thank you for explaining this, I’ll have another go toddy 😊

  • @abcVegeBreads
    @abcVegeBreads ปีที่แล้ว

    Is it possible to scrape the URL of each individual book? If yes, how can't it be done?

  • @gideonvisser2989
    @gideonvisser2989 4 ปีที่แล้ว

    Thanks for this great video. What if you want to loop through dates?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว

      You can use the same technique to loop through dates.

  • @MrOktovan
    @MrOktovan 4 ปีที่แล้ว

    What a great tutorial.. I've tried your tutorial and it works!
    However, when I upload the app to the Power Bi service and I set the automatic refresh schedule. There is a failure notification for automatic refresh for dynamic data. did you also experience this?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว

      Currently scheduled refreshes for queries where the data source is part of a function aren't supported. You should be able to manually refresh though.

  • @silvanoborien9777
    @silvanoborien9777 4 ปีที่แล้ว

    Thank you for the video, very informative...
    How long does it take to scrape the data from the 21000 pages?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว

      I never let it run to find out how long it takes. I'd expect the website would think it was a bot and throttle the query anyway.

    • @gauravagarwal3056
      @gauravagarwal3056 3 ปีที่แล้ว

      @@MyOnlineTrainingHub - Hi, on this point only- i have a query when I tried doing this for 2000 pages, data is coming only for few pages and rest is showing null after the Invoke function where as data is there on the web for that page.

  • @DougHExcel
    @DougHExcel 4 ปีที่แล้ว

    Scraping with PowerBI, hopefully it'll be fully enabled in Excel!

  • @bali501
    @bali501 2 ปีที่แล้ว

    Thank you soooo much! You changed my life this weekend. Been struggling with Excel's limitations for years, and lost countless hours of my life sometimes without even accomplishing my goal. I only discovered the existence of Power Query last night with your video, and you blew my mind. A brilliantly well presented and comprehensive video on it too! It got me partway through my current problem, but now I'm stuck again if you can help?
    I've created Query1 to gets multiple tables from each webpage with 10 records each , and includes a record ID. But each record has a link to a details page for more info for that record. The record ID is used within the URL string to get those details. Can I create a single query that collects the list of records and uses the ID to also collect the details for each record all in one go?
    Also, with 30,000 records in total, it takes hours to refresh. However, as the historic records don't change, and have a historic date of filing, is there any way for future updates to only get and append the latest records (with a filing date after the last date of the previous dataset, whilst removing any duplicates, and append it to the list?
    Finally, it would be great if a timestamp could be added in an additional column to denote the date when that query was run, so that I can easily see which data has been added and when. Is any of this possible with PowerQuery?

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  2 ปีที่แล้ว +2

      So pleased that my video was helpful! Please post your questions and sample Excel file on our forum where we can help you further: www.myonlinetraininghub.com/excel-forum

  • @parvez301
    @parvez301 4 ปีที่แล้ว +1

    First comment. thanks for the video

  • @arisekobain6400
    @arisekobain6400 4 ปีที่แล้ว

    Thank so much for your all playlist videos tutorial..awesome...please add post tutorial for making network or system monitoring with excel..many thks..:-)

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  4 ปีที่แล้ว +1

      My pleasure :-) I don't have any data I can use on network or system monitoring, sorry.

    • @arisekobain6400
      @arisekobain6400 4 ปีที่แล้ว

      @@MyOnlineTrainingHub tysm for your feedback, nope with many thank..

  • @jamesflieder8164
    @jamesflieder8164 3 ปีที่แล้ว

    Great video and so clear with the explanation! My researching will be much easier now!

  • @gabapritam
    @gabapritam ปีที่แล้ว

    Thus is AWESOME!!

  • @mohamedadjal8502
    @mohamedadjal8502 3 ปีที่แล้ว +1

    Hi, Professor, you have provide in a lot of effort for these videos, thank you, I have a question in excel, if we have for example in cell "a1" the number 10.00 m, how to have this number with the same format in cell "b1 "using a text function or some other function, thank you very much.😃👍

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  3 ปีที่แล้ว

      Thanks, Mohamed! I'm presuming the value in A1 is a number, in which case you can use this formula:
      =TEXT(A1,"##.00,,\M")

    • @mohamedadjal8502
      @mohamedadjal8502 3 ปีที่แล้ว

      Good evening Professor, I thank you for the answers that you sent me on the Internet. May God protect you. Suppose we have in cell a1 = "excel", in cell a2 = "is", in cell a3 = "fun", in cell b1 = 12.00m, in cell b2 = 10.00gr, in cell b3 = 15.00kg, use the vlookup function: vlookup = ("is", $ a $ 1: $ b $ 3,2, false), the result is 10, which means that This function didn't give me the full format of the number in cell b2 (b2 = 10.00gr), but my goal is to get b2 = 10.00gr and not b2 = 10. thank you so much.😃👍

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  3 ปีที่แล้ว

      Please post your question and sample Excel file on our forum where we can help you further: www.myonlinetraininghub.com/excel-forum

    • @mohamedadjal8502
      @mohamedadjal8502 3 ปีที่แล้ว

      @@MyOnlineTrainingHub Good morning Professor, I have emailed you an excel file containing a question and comment on the question, thanks a lot for the help.

  • @samuelsuarez8304
    @samuelsuarez8304 2 ปีที่แล้ว

    Hi! Nice video. I just want to know if there is a way to extract the paragraph that is inside some products using power query also. I was trying but power only extract information in the outside, that is visible, is there a code or a formula to do this? Thanks!

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  2 ปีที่แล้ว

      Power Query can get data that's in a table. If it's outside the table, then you can try querying the 'document' rather than one of the tables.

  • @rakkesh85
    @rakkesh85 4 ปีที่แล้ว

    Nicely explained, loved it.

  • @johng5295
    @johng5295 ปีที่แล้ว

    Thanks in a million.

  • @stitch6410
    @stitch6410 หลายเดือนก่อน

    works great, thanks!

  • @StephenMattison66
    @StephenMattison66 3 ปีที่แล้ว

    I need to scrape data from a map page that shows thousands of map-pins that each lead to the contact data that I need. Do you have a video already showing that? Any suggestions? TYVM!!

    • @MyOnlineTrainingHub
      @MyOnlineTrainingHub  3 ปีที่แล้ว

      No examples of that. Unless the map data is stored in a table in the web page HTML then you won't be able to scrape it with Excel. You could try Power BI to get data by example: www.myonlinetraininghub.com/power-query-get-data-from-web-by-example