What To Consider When Building Data Pipelines - Intro To Data Infrastructure Part 2

แชร์
ฝัง
  • เผยแพร่เมื่อ 10 ก.ค. 2024
  • When Building Pipelines What Should You Consider
    Tools and technology are just that.
    🛠️ Tools.
    They won’t actually drive any form of impact on their own.
    They won’t develop processes that are connected to dashboards that in turn drive actions without people. Nor are the numbers they are creating going to magically jump off the screen and fix a business.
    So before building any data pipeline it’s important to consider a few things.
    If you enjoyed this video, check out some of my other top videos.
    Top Courses To Become A Data Engineer In 2022
    • Top Courses To Become ...
    What Is The Modern Data Stack - Intro To Data Infrastructure Part 1
    • What Is The Modern Dat...
    If you're looking to study for your SQL and data science interviews, then check out InterviewQuery:
    www.interviewquery.com/?ref=sdg
    If you'd like to read up on my updates about the data field, then you can sign up for our newsletter here.
    seattledataguy.substack.com/​​
    Or check out my blog
    www.theseattledataguy.com/
    And if you want to support the channel, then you can become a paid member of my newsletter
    seattledataguy.substack.com/s...
    Tags: Data engineering projects, Data engineer project ideas, data project sources, data analytics project sources, data project portfolio
    _____________________________________________________________
    Subscribe: / @seattledataguy
    _____________________________________________________________
    About me:
    I have spent my career focused on all forms of data. I have focused on developing algorithms to detect fraud, reduce patient readmission and redesign insurance provider policy to help reduce the overall cost of healthcare. I have also helped develop analytics for marketing and IT operations in order to optimize limited resources such as employees and budget. I privately consult on data science and engineering problems both solo as well as with a company called Acheron Analytics. I have experience both working hands-on with technical problems as well as helping leadership teams develop strategies to maximize their data.
    *I do participate in affiliate programs, if a link has an "*" by it, then I may receive a small portion of the proceeds at no extra cost to you.

ความคิดเห็น • 34

  • @SeattleDataGuy
    @SeattleDataGuy  7 หลายเดือนก่อน

    If you guys want to learn more about data engineering, then sign up for my newsletter here seattledataguy.substack.com/ or join the discord here discord.gg/2yRJq7Eg3k

  • @DarshilParmar
    @DarshilParmar 2 ปีที่แล้ว +3

    Thanks for this video, always learning something new

    • @SeattleDataGuy
      @SeattleDataGuy  2 ปีที่แล้ว

      Thank you! And I am always learning from you as well!

  • @peteintania
    @peteintania 2 ปีที่แล้ว +2

    Very informative, as always. Thank you!

  • @makster92
    @makster92 2 ปีที่แล้ว

    Great content! Looking forward to upcoming videos

    • @SeattleDataGuy
      @SeattleDataGuy  2 ปีที่แล้ว

      Glad you enjoyed it! Plenty more videos coming.

  • @pengamax6500
    @pengamax6500 2 ปีที่แล้ว +1

    Really useful! Thank you x

  • @vedanthasm2659
    @vedanthasm2659 2 ปีที่แล้ว +1

    Granular and informative video. Thanks Bro!

  • @lecryptojames
    @lecryptojames 2 ปีที่แล้ว +1

    Great video! Thanks.

  • @adalke2
    @adalke2 2 ปีที่แล้ว +1

    Terrific overview of key, universal tenets!

  • @harsha2375
    @harsha2375 2 ปีที่แล้ว +4

    Thanks. After watching ur videos I decided my career to be a data engineer. 💯💯

    • @SeattleDataGuy
      @SeattleDataGuy  2 ปีที่แล้ว +1

      Wow! I hope you find a job you enjoy!

  • @AbhishekUpperwal
    @AbhishekUpperwal 2 ปีที่แล้ว +2

    Hey! Amazing video. Very informative and useful. Which tool is it in the video at 9:00 with all the stats for data quality?

  • @ralphpatrice1676
    @ralphpatrice1676 2 ปีที่แล้ว +2

    I never been the first in the comments section before 🤪. Thank you for the great information

    • @SeattleDataGuy
      @SeattleDataGuy  2 ปีที่แล้ว

      Thanks for being the first commenter!!

  • @kreedur
    @kreedur ปีที่แล้ว

    What QCs are you performing? Is there a generally-accepted list?

  • @TA-vf8yi
    @TA-vf8yi 2 ปีที่แล้ว +3

    Hi Seattle Data Guy, since you've worked at Meta, I was thinking if you could do a video on Data Modeling? I keep reading all these blog articles that say that 'dimensional modeling is dead' and so on, so I'm wondering how it is done in top tech companies? I'm trying to learn more about data engineering, so thinking if Kimballs books and STAR schemas are still relevant or is there a new paradigm shift and if so what books would you recommend?

    • @SeattleDataGuy
      @SeattleDataGuy  2 ปีที่แล้ว +2

      There are aspects of star schema and data warehousing that still exists at some companies and some companies still strongly rely on it. I think its important to have as a base because as you go forward you will see a lot of different ways people have morphed how to model data. In addition, not everyone can be Google and Meta. These places have a lot of other tools in place that allow them to break some of the rules of data modeling.

  • @flyffreak93
    @flyffreak93 2 ปีที่แล้ว

    Any good tool for schema analysis for rdms?

  • @filibertogarced
    @filibertogarced 2 ปีที่แล้ว +1

    Great video!
    E is for extract lol 🍪

    • @SeattleDataGuy
      @SeattleDataGuy  2 ปีที่แล้ว

      Thanks! Just got to bring it back to the ABCs

  • @vinanguyen2583
    @vinanguyen2583 2 ปีที่แล้ว +1

    Thanks! Can you link to the I T K Funde video?

    • @SeattleDataGuy
      @SeattleDataGuy  2 ปีที่แล้ว

      Whoops I meant to do that. Thanks for the reminder. Here is the link btw. th-cam.com/video/VtzvF17ysbc/w-d-xo.html

  • @hamsansari2111
    @hamsansari2111 2 ปีที่แล้ว +2

    Hey Ben
    In which env are working mostly with ETL or ELT.

    • @SeattleDataGuy
      @SeattleDataGuy  2 ปีที่แล้ว +1

      I have worked in both. Currently I would say its a 40/60 split between ETL and ELT work.

  • @JoeG2324
    @JoeG2324 2 ปีที่แล้ว +1

    SSIS handles all our ETL/ELT

    • @SeattleDataGuy
      @SeattleDataGuy  2 ปีที่แล้ว

      It's a classic. It's where I started

  • @navarrodba
    @navarrodba 6 หลายเดือนก่อน

    From 20 years beign a DBA to DE