ความคิดเห็น •

  • @LukeBarousse
    @LukeBarousse 2 ปีที่แล้ว +36

    If this intro doesn't convince you that Data Engineers are going to be one of the top most needed jobs for the foreseeable future... I don't know what will!
    Great content, Ben!

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +5

      It's a wild world right now. So many tools, heres to hoping I can make some sense of it all.

    • @letechnicaljames
      @letechnicaljames 2 ปีที่แล้ว +2

      True.

    • @DataProfessor
      @DataProfessor 2 ปีที่แล้ว +5

      Exactly and this channel is the place to be for learning about this exciting area 😆

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +2

      @@DataProfessor You're too kind!

    • @sndselecta
      @sndselecta 2 ปีที่แล้ว +1

      @@SeattleDataGuy It's funny you say that, because so many companies hiring DEs these days, put on this act as though their data stack is the holy grail, when in reality they have just evolved from vendor lock in to vendor pathing. It's good to be humble and take a step back to actually realize the on slaught of too many options or ways to do the same thing, it is a good thing (competitive pricing, anti-vendor lock in etc...) but also bad thing (focus, overwhelmed on where to start, endless learning different vendor paths). I think your 3 layers is a great start for the base without getting overwhelmed in vendor marketing BS. Hats off to trying the hold the raging bull by its horns. Looking forward to more material.

  • @career-calling
    @career-calling 9 หลายเดือนก่อน +2

    Happy to finally find a channel that talks about data 360 degrees. Thank you for your effort to bring this to the audience.

    • @SeattleDataGuy
      @SeattleDataGuy 8 หลายเดือนก่อน

      thank you! hopefully you're finding it helpful

  • @ekta_r7417
    @ekta_r7417 2 ปีที่แล้ว +2

    Really looking forward to this series.
    Would love if you would discuss an end to end data Infrastructure design and walk us through the thought process while selecting the tools for each of them with diff use cases.
    Thank you again for guiding through your videos. They are a big help!☺️

  • @antonkostov1691
    @antonkostov1691 2 ปีที่แล้ว +5

    Hello again, brother. I want to brag again. After successfully snagging a DE job inspired by you . Now I obtained the DP-203 Microsoft Certified DE. Thanks again, brother for helping me make the big step a year and a half ago.

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      Congrats. That's so exciting. I am glad you're continuing to grow. How is the new DE job going?

  • @GuyThompsonFWTX
    @GuyThompsonFWTX 2 ปีที่แล้ว +2

    Great video! Can't wait to see where this series goes.

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +2

      Yeah, so many tools to go over. VCs got to chill

  • @eth6706
    @eth6706 2 ปีที่แล้ว +1

    Perfect choice for a series! Looking forward to it

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      Me too! Thanks for your support.

  • @Alez101010
    @Alez101010 2 ปีที่แล้ว +1

    So useful video! I’m so excited to see what’s next on this series of videos! Thanks for your work

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      Yup, I am very excited for this series

  • @tethadam4929
    @tethadam4929 2 ปีที่แล้ว +2

    Another fantastic video. Thanks in advance for all your hard work!

  • @rachelzhang5709
    @rachelzhang5709 2 ปีที่แล้ว +3

    finally!!!! been looking forward to this series to have a more concrete and contextual understanding of data infrastructure.

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      Yeah, I am very excited! I actually think the next video might be something like "x types of data stacks" and then the EL video. We shall see.

  • @elis8185
    @elis8185 2 ปีที่แล้ว +1

    Great info! Thanks for the steady stream of super useful videos!!

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      I am doing my best to keep a steady stream going

  • @letechnicaljames
    @letechnicaljames 2 ปีที่แล้ว +1

    Insightful video. Looking forward to the rest of this series!

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +1

      Thanks, I look forward to it!

  • @kapuriaritik
    @kapuriaritik 2 ปีที่แล้ว +1

    Amazing video! Excited for the next part!

  • @joseangelmedinacornejo6362
    @joseangelmedinacornejo6362 2 ปีที่แล้ว +3

    Great video, Ben! It is nice to hear this in simple words, so it helps me transmit such ideas to my team and the stakeholders, hopefully in a way the latter will understand.

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      I am glad you found it helpful! I hope your stakeholders can also understand the value of building a reliable data stack.

  • @andi93007
    @andi93007 2 ปีที่แล้ว +4

    This is amazing sir. Thank you ahead for the series!
    Definitely help all of us to keep up with the changing landscape.

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +2

      Watch half the videos I make be out of date 3 weeks after I put them out.😅

  • @shatandv
    @shatandv 2 ปีที่แล้ว +3

    Excited for this. Thanks!

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      Me too! I love this high level stuff.

    • @shatandv
      @shatandv 2 ปีที่แล้ว +1

      @@SeattleDataGuy I'm especially interested in all of this as a startup founder. We're just starting out, but already feel the need for a structured approach to our data

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +1

      That makes sense. I am also kind of going through a similar walk through in my newsletter and it actually has links to the images seattledataguy.substack.com/p/the-baseline-datastack-going-beyond. It might be helpful

    • @shatandv
      @shatandv 2 ปีที่แล้ว

      @@SeattleDataGuy Sounds interesting, thanks! I’ll give it a read

  • @matiaspirovanovarela1241
    @matiaspirovanovarela1241 2 ปีที่แล้ว +13

    Thanks a lot for the video, the series has a lot of potential. May I request if you could talk about cost effective tools? Maybe a sample stack for companies of different sizes or maturity (like the Analysts chart that you showed).

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +5

      Yeah, I love this. I do find that there is the open source data stack for example. If I hear postgres, metabase and airflow. I know the company is trying to keep costs low or like engineering.

  • @reneeh9132
    @reneeh9132 ปีที่แล้ว +1

    Super userful chart in describing the datastack!!

  • @anathanholland
    @anathanholland 2 ปีที่แล้ว +1

    Looking forward to this series! We're implementing a MDS in my org right now and I think it will greatly improve our efficiency and ability to make data-driven decisions.

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      That's exciting! What tools are you using?

    • @anathanholland
      @anathanholland 2 ปีที่แล้ว +2

      @@SeattleDataGuy Our main tools are snowflake, fivefran, dbt, and looker!

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      Solid stack. Where are you guys at thus far?

  • @virginiopancadao
    @virginiopancadao 2 ปีที่แล้ว +2

    More Videos like this!!

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +1

      On a trip this week and I actually have 2 other videos in front of this getting edited...but I will be back next week and filming the next part. You can also check out my newsletter where I am going over similar topics seattledataguy.substack.com/

  • @pushpanthkumar9028
    @pushpanthkumar9028 2 ปีที่แล้ว +3

    I Love this.. Ben please consider making videos how you are monitoring the data applications & Best practices to ensure Data Quality.

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      Yup! I have a few tools I like in that space.

  • @GiasoneP
    @GiasoneP 2 ปีที่แล้ว +1

    Great video. Looking forward to you finishing this series…and hopefully you finish the DE project series too 🙃

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +1

      Hahaha,, the project series I need to restart. However, the next video in this series is completed I will be posting it towards the end of next week (most likely).

  • @obiradaniel
    @obiradaniel 2 ปีที่แล้ว +1

    Thank you very much, very insightful.

  • @N77b44
    @N77b44 2 ปีที่แล้ว +3

    It would be great to hear more about the testing part of the process. I feel like this gets talked about a lot for more core software engineering but I think data engineering presents unique challenges that make adapting something akin to Test Driven Development far from straightforward (many external dependencies, rapidly changing state, etc).

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +2

      Hmm, I might need to think about this for sure!

  • @theravitshow
    @theravitshow 2 ปีที่แล้ว +2

    Love it!

  • @Goku-br7yt
    @Goku-br7yt 2 ปีที่แล้ว +2

    If you could cover something on data
    observability , & how big techs implement data quality Audit Frameworks.

  • @Marcos-yg2vi
    @Marcos-yg2vi 2 ปีที่แล้ว +1

    Very nice! This series will help me about I've wrote in the previous video! thanks! I don`t know if it is the purpose of series, but if you can show tools for big files, high (distributed and parallel) processing, professional environments I appreciate! I see a lot youtube channels that give just simple (educational) examples but when you try to apply in the pipelines, in hard life (lol), nothing works!!

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +1

      Yeah, its always hard to actually create situations where you are processing lots of files. Mostly because it starts becoming expensive to store that much data. Overall, its just always easier to show a, here is a hello world vs here is a difficult configuration/scaling issue.

  • @datawitharslan
    @datawitharslan 2 ปีที่แล้ว +1

    Your videos are always very informative and Valuable for Data Lovers. Can you please tell me what you thing have better future , Jobs Modern Data Stack or Cloud Data Stack.

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      I think both will have their place. A lot is shifting currently due to funding. It will take 2-3 years for it all to shake out.

  • @shawnteo3837
    @shawnteo3837 ปีที่แล้ว +1

    great video! what type of data infrastructure do you recommend to use for image data?

    • @SeattleDataGuy
      @SeattleDataGuy ปีที่แล้ว

      Image data is generally stored in like S3 or a similar solution. Then you store the url in the database with metadata.

  • @dnn1982
    @dnn1982 2 ปีที่แล้ว +1

    Very valuable video. Do you have plans to make part 2 or continuing the series ?

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      There are parts 2 and 3 th-cam.com/video/lSoEI8jia3Q/w-d-xo.html

  • @jlm89jlm
    @jlm89jlm 2 ปีที่แล้ว +1

    Do any of your videos cover managing expectations and timelines as a data engineering consultant? Would love to check that out if it exists!

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      Not currently, but it might have to go on the list

  • @jester667
    @jester667 2 ปีที่แล้ว +1

    What kind of data observability tools (ideally open source) would you recommend to monitor the data pipelines?

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +1

      I should probably make a video on that!

  • @ridhwaans
    @ridhwaans 2 ปีที่แล้ว +1

    where do services like infra-as-code, configuration management, IAM, SSO live in this system?

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +1

      Hopefully with a different team...cries in reality

  • @bog7485
    @bog7485 2 ปีที่แล้ว +1

    What does "Core Data" mean? Is it a central data system like EDW?

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      Something of that nature. I generally mean the most granular form of your production tables. So essentially, what everyone else builds their KPIs, reporting, etc off of.

  • @caseypdx503
    @caseypdx503 2 ปีที่แล้ว +1

    Given that the Analytics Engineering role is mostly based on the advent of DBT--do you see that role continuing to be viable? Or in other words, is DBT and the like just a fad? or should it be considered in the long-term.

    • @loner007
      @loner007 2 ปีที่แล้ว +3

      I am interested in this question as well. I have heard that big companies that already have solid data infrastructure, the data engineers are actually analytics engineers. Whereas companies that don't have a solid data infrastructure, really need data engineers.

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +3

      I actually plan to respond to a post I read on the data engineering subreddit on some of this.

  • @muni7561
    @muni7561 2 ปีที่แล้ว +1

    Hi! I usually just watch your videos and not really comment (LOVE ur contents btw). Not really related to this video I just wanted to know your opinion about Data Engineering Bootcamps when transitioning from DA to DE. Do you consider that an option for the transition? Thanks!

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +1

      It's hard to say. Any specific bootcamps? They can help. But also, if as a data analyst, you can take data engineering projects. That can also bee a great way of shifting careers.

    • @muni7561
      @muni7561 2 ปีที่แล้ว

      @@SeattleDataGuy Thanks for the reply Ben! ☺️I found LearningFuze’s Data Science Bootcamp to be appealing since I live in Orange County and could work in person. Im a recent grad from UCSB with math major and I am trying to break into data field as a data analyst and hopefully be a data engineer later in my career. Do you think the bootcamp program would help me break into the field? Thank you for your time and thoughts!! I found your videos to be so inspiring and helpful guiding me through my tough time!!

  • @caseypdx503
    @caseypdx503 2 ปีที่แล้ว +2

    Would it not be true that a business just starting out (low maturity) could just get sources like Fivetran and a cloud data warehouse managed by an analytics engineer and if needed an analyst?

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +2

      There are a whole bunch of ways people can set up their data infrastructure depending on budget, head count, priorities, etc. I do plan to talk through some of these in my next video. I was going to do EL. But I think it would be interesting to discuss different stack based on priorities, preferences, etc. The example I often give is the "open source stack". This is usually some combo like Postgres, Airflow, Metabase, re_data or datahub...etc vs the analytics engineer data stack. This tends to be Fivetran, Snowflake and Looker. But there are so many other different versions.

    • @caseypdx503
      @caseypdx503 2 ปีที่แล้ว +1

      @@SeattleDataGuy Thanks for the reply! If you wouldn't mind one more question? :)
      Given that the Analytics Engineering role is mostly based on the advent of DBT--do you see that role continuing to be viable? Or in other words, is DBT and the like just a fad? or should it be considered in the long-term.
      Thanks again!

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +1

      I think there has always been a natural split for data engineers. Some data engineers tend to be more technical and software focused where others are more analytical. So even if the actual title of analytical engineer goes away..I think the role itself will always exist.

    • @caseypdx503
      @caseypdx503 2 ปีที่แล้ว

      @@SeattleDataGuy Right yeah, I guess a lot of the job descriptions I see are software focused, but I am definitely more analytical. I want to work with the data using technical skills, but I will always prefer to be closer to the data and not so much down the software engineer side.

  • @donchichiumelo2762
    @donchichiumelo2762 ปีที่แล้ว

    is that a pink quartz crystal.... its huge!!!!

  • @andrewdecotiis-mauro3709
    @andrewdecotiis-mauro3709 2 ปีที่แล้ว +2

    Do you have any resources for data lineage and governance?

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +1

      Do you mean like, which tools are solid? Also for data governance, there is the lightsondata channel.

    • @andrewdecotiis-mauro3709
      @andrewdecotiis-mauro3709 2 ปีที่แล้ว +1

      @@SeattleDataGuy Yeah what tools or any sort of books, articles. I'm going to check out lightsondata now

  • @kjdkmcvkdm
    @kjdkmcvkdm 2 ปีที่แล้ว +1

    Is the series out yet?

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว

      Here is part 2 th-cam.com/video/lSoEI8jia3Q/w-d-xo.html

  • @sylwiaanna2423
    @sylwiaanna2423 ปีที่แล้ว

    I haven't seen a single comment about the Naruto reference, so here it is!

  • @ankush_chatterjee
    @ankush_chatterjee 2 ปีที่แล้ว +1

    Also, along with the new guys, Informatica went IPO with around $10B valuation

    • @SeattleDataGuy
      @SeattleDataGuy 2 ปีที่แล้ว +1

      Yeah! For real. And I haven't even ever worked with it. There is so much money going into the space.