Get started with SPARK in Azure Synapse Analytics

แชร์
ฝัง
  • เผยแพร่เมื่อ 7 ม.ค. 2025

ความคิดเห็น • 18

  • @bluedreamerssso
    @bluedreamerssso 2 ปีที่แล้ว +2

    need more of this, moving from AWS and Databricks to Azure Synapse Studio

    • @ryanbowns1517
      @ryanbowns1517 2 ปีที่แล้ว

      Can u tell me the benefits of moving away from Databricks go Synapse ?

    • @bluedreamerssso
      @bluedreamerssso 2 ปีที่แล้ว

      @@ryanbowns1517 cost, unfamiliarity with Scala/python, data flow model.

    • @sakeeta6498
      @sakeeta6498 10 หลายเดือนก่อน

      @@bluedreamerssso I can understand that Scala/python issue, but is cost really a thing? I think you could make it with a reasonable cost in Azure using Data Factory pipelines that call (Azure) Databricks' notebooks and do all the ELT and data modeling you would need that way (getting data flow model) without using Synapse. -> Hard to believe that would be more expensive solution than using Synapse, but of course needs Scala/Python expertise.

  • @keen8five
    @keen8five 2 ปีที่แล้ว +1

    Concerning Spark Pool settings: I think they mixed up "auto scale" and "dynamic allocation"; these are two different things

  • @nilsbuer
    @nilsbuer ปีที่แล้ว

    Very good and easy explained thanks

  • @megapixelphotos338
    @megapixelphotos338 2 ปีที่แล้ว

    Started using this today, but noted that there wasn't so much in term of documentation. Can you recommend a good source of further reading?

  • @MsVikramaditya
    @MsVikramaditya 2 ปีที่แล้ว

    Can you give some guidance on choosing optimal sizing of spark pools
    and understanding DAG

  • @jgowrri
    @jgowrri ปีที่แล้ว

    Is spark is good for DW? How this differ from sql dedicate ?

  • @natfind4724
    @natfind4724 2 ปีที่แล้ว

    Great episode!

  • @piraviperumal2544
    @piraviperumal2544 2 ปีที่แล้ว

    Loved it!

  • @germanareta7267
    @germanareta7267 2 ปีที่แล้ว

    Thanks for the video.

  • @kishlayamourya3141
    @kishlayamourya3141 2 ปีที่แล้ว

    was that pyspark?

    • @the_invisible__
      @the_invisible__ 2 ปีที่แล้ว

      Python+ spark = Pyspark... The language that used in Spark cluster for data processing, transformation etc

  • @ryanbowns1517
    @ryanbowns1517 2 ปีที่แล้ว +1

    Note sure how this is better than using Databricks in Azure. Can anyone shed some light here.

    • @benjamincarter6095
      @benjamincarter6095 2 ปีที่แล้ว +2

      Both Synapse and Databricks use Spark. Synapse uses an open source version of Spark with built in support for .NET, while Databricks uses an optimized version of Spark that improves performance and allows for GPU-enabled clusters with higher data concurrency which improve processing performance. Synapse may be easier for a BI team to pick up, and Power BI can be used inside Synapse Studio. At this time, Databricks outperforms Synapse for ML. Microsoft is pumping R&D into Synapse, so it is worth watching.

    • @sakeeta6498
      @sakeeta6498 10 หลายเดือนก่อน

      @@benjamincarter6095thanks for nice clarification. What do you think now, is it same thing in your opinion? MS putting a lot of effort on Fabric side, and on the other hand Databricks is constantly evolving...

  • @baiganil5203
    @baiganil5203 2 ปีที่แล้ว +1

    Im 314 th person to view this