Getting Started with Dataflow in Microsoft Fabric Data Factory

  • Published on 30 Jul 2024
  • Dataflow in Microsoft Fabric is a component for getting data from a source, transforming it, and loading it into a destination. In this article and video, we go through what Dataflow is and how it works, using a simple example.
    Learn more from my article here:
    radacad.com/getting-started-w...
  • Science & Technology

Comments • 33

  • @shafa7668 1 year ago +1

    I wanted to get started with Fabric literally from day one of the announcement. So thank you for starting this series. You have given us a head start!! Cheers

    • @RADACAD 1 year ago

      Always glad to help :)

  • @user-xx6gf2mi7j 1 year ago

    Very good video, easy to understand, and a good starting point for beginners to explore further...

  • @raviv5109 1 year ago

    Good Video, Thanks for creating and sharing. It would be interesting to know how it performs on real world large datasets.

  • @debasisrana6437 3 months ago

    Thanks for the video

  • @ruru1419 1 year ago

    Thanks Reza, great video as usual!
    We're trying a PoC with Fabric Warehouse (not Lakehouse) for our SQL user community. I have no issues loading small files with Dataflow Gen2, but when I try to load on-premises data through our Gateway (which works fine for refreshing Power BI datasets), I always get this error:
    "An exception occurred: Microsoft SQL: This function doesn't support the query option 'EnableCrossDatabaseFolding' with value 'true'."
    I cannot find anything related to this... any clue? I wonder whether many people have tried to implement a "true" business scenario and not just some Excel samples... for that we need to pull data through the Gateway. Thanks!

  • @JorgeSantos-zx6gg 5 months ago

    First of all thanks for the video. Suggestion : It would be great to have the links for your other videos appearing as you speak or in the description below.

  • @AbhishekYadav-rb4bi 10 months ago

    Thank you🙌

    • @RADACAD 10 months ago

      You're welcome 😊

  • @tea0819 1 year ago

    Excellent video. Thank you for sharing. I am new to your channel but enjoying all of the content. I recently started a YT channel as well focused on Azure Data and I was just curious what software are you using for drawing red boxes around items and zooming in on your video?

    • @RADACAD 1 year ago

      Best of luck, and thanks!
      I use ZoomIt.

  • @mounikajuttiga3936 a month ago

    Can we refresh the dataset every 15 minutes in Fabric (scheduled refresh)?

  • @adamsabourin9416 1 year ago

    Reza, if we choose Append instead of Replace, is it going to keep duplicates? If so, how can we save as "append and remove duplicates"?

  • @yoismelperez2744 1 year ago

    Thanks for sharing, Reza. I like how you are taking the lead in going over the Microsoft Fabric products. One question I may have missed: will Replace update existing records and insert new ones, or just replace the entire dataset? Being familiar with PBI Dataflows, I think the answer is that it will replace everything, but I just want to confirm.

    • @yoismelperez2744 1 year ago

      Reza, confirmed: you mentioned it in this video, th-cam.com/video/qNoOQzMjrfk/w-d-xo.html. It will replace whatever exists 👍

    • @RADACAD 1 year ago

      Thanks :)
      Replace wipes out the existing data and loads the new data, whereas Append adds the new data to the existing data.
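
The difference between the two update methods described in the reply above, and the "append and remove duplicates" behavior asked about earlier in the comments, can be sketched in plain Python. This is only a model of the semantics, not Fabric or Dataflow code: the function names, the list-of-dicts "table", and the `key` column are all hypothetical, and the de-duplication step is an assumption about what such an option might do (keep the most recent row per key), not a documented Dataflow feature.

```python
from typing import Dict, List

Row = Dict[str, object]


def replace_load(existing: List[Row], incoming: List[Row]) -> List[Row]:
    """Replace: discard everything in the destination and keep only the new rows."""
    return list(incoming)


def append_load(existing: List[Row], incoming: List[Row]) -> List[Row]:
    """Append: keep the existing rows and add the new rows after them.
    Rows that already exist in the destination are NOT removed, so duplicates remain."""
    return list(existing) + list(incoming)


def append_dedupe(existing: List[Row], incoming: List[Row], key: str) -> List[Row]:
    """Hypothetical 'append and remove duplicates': append, then keep one row
    per key value, letting the most recently loaded row win."""
    latest: Dict[object, Row] = {}
    for row in append_load(existing, incoming):
        latest[row[key]] = row  # a later row with the same key overwrites the earlier one
    return list(latest.values())


if __name__ == "__main__":
    existing = [{"id": 1, "v": "a"}, {"id": 2, "v": "b"}]
    incoming = [{"id": 2, "v": "c"}, {"id": 3, "v": "d"}]
    print(replace_load(existing, incoming))
    print(append_load(existing, incoming))
    print(append_dedupe(existing, incoming, "id"))
```

With Append alone, the row with `id` 2 appears twice in the result, which is why any duplicate handling would need an explicit extra step like the one sketched in `append_dedupe`.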

  • @barttrudeau9237 1 year ago

    Reza, Your videos are amazing. You stay razor focused and on subject. I'm really enthused about Fabric but concerned about licensing. I don't want to try a bunch of new things for a month only to find out I can't afford them once the trial period is over. We have E5 licensing and I'm not sure what that's going to cover when the trial period is over. Any chance you could update the licensing video you did a while back to help us understand the cost implications of using Fabric?

    • @RADACAD 1 year ago

      Thanks, Bart.
      I will have a new video on Microsoft Fabric licensing soon. It is slightly different from how Power BI licensing works, but it follows similar principles.

  • @kapiljadaun7264 1 year ago

    Hi
    Your way of explaining is great.
    I would request that you make a video covering everything from getting started through to building reports in Power BI, with a demo. It would be very helpful.
    Thank you

    • @RADACAD 1 year ago

      We are glad it is helpful

  • @Milhouse77BS 1 year ago

    Thanks. Seems like there should be a "Publish & Refresh" option?

    • @RADACAD 1 year ago

      I agree :) would be helpful

  • @mjbah 1 year ago

    Hi Reza.
    Many thanks for the video. As always, your videos are helping a lot.
    I have a question about adding a data destination. I was wondering whether you must add the destination for each table separately. If you have many tables and want to load them all into the same destination, can't you do it all at once?

    • @RADACAD 1 year ago

      Hi Mohamed
      That is exactly my question too: why shouldn't I be able to add one destination for multiple queries? Let's hope that when the preview is done and the product is generally available, we have a feature like that :)

  • @antonyliokaizer 1 year ago

    I'm wondering why the public preview doesn't have the "Add data destination" button shown at 10:16 after I upload a CSV file as a table. Thank you.

    • @antonyliokaizer 1 year ago

      Without the button, I cannot send data to either a Lakehouse or a Warehouse....

    • @RADACAD 1 year ago +1

      Are you creating a Dataflow Gen2? Gen1 doesn't have this option.

    • @antonyliokaizer 1 year ago

      @RADACAD In the public preview, I don't see any entry for creating a Gen1 dataflow...
      Thanks, let me double-check.

    • @antonyliokaizer 1 year ago

      @RADACAD I checked, and you're correct. Thank you.
      I guess the Gen1 dataflow was created in the pipeline.
      On the Data Factory page, there's only "Dataflow Gen2", not "Gen1".
      Thank you again.

  • @decentmendreams 1 year ago

    Hi Reza, these are all good, but what has dawned on me is that if you are on Premium Per User licensing, Fabric means squat. It feels like a rich man has moved into your neighborhood and you are watching all his fancy toys as the movers unload. I actually went ahead and turned off the trial version, as it seems to overcrowd my Service page. Am I far off here?

    • @barttrudeau9237 1 year ago +1

      I share similar concerns

    • @RADACAD 1 year ago +1

      I understand your concerns.
      And to be honest, if you just want to use Power BI on its own, you won't need Fabric.
      For example, a small business with one data analyst and a few users analyzing data from some Excel files works best as a pure Power BI solution.
      However, for larger scenarios you get more done with the other elements. In large organizations, you need storage for structured and unstructured data, a staging environment for the data, a data warehouse, a fully automated ETL mechanism to load the data in, and then you model it, visualize it, and so on. Power BI is only part of the picture. Fabric enables organizations to achieve more in the data analytics space.
      It might look like a very huge product (which it is), but remember how you eat an elephant? One bite at a time :D

    • @decentmendreams 1 year ago

      @RADACAD Hi Reza, you are right, Fabric will be overkill for most of my needs except for the DirectLake connector and, if I understood it correctly, its blazingly fast data refreshes. My files are large (>100 MB per day) and I need to keep as many of them as I can.
      One bright spot about the introduction of Fabric is that it has made me curious about file compression. For example, I learned that if I convert my CSV files to Parquet files (I never knew about the format until this week), I can reduce their size by 75%, which is so awesome.
      Thank you for everything.
      A person in Phoenix, Arizona.