Chapter #9 - How to design data pipeline on gcp (Google Cloud Platform) ?

แชร์
ฝัง
  • เผยแพร่เมื่อ 30 มี.ค. 2021
  • Chapter #9 - Designing data pipeline solution on GCP
    #datapipeline #googlecloud #gcp
    What is Data Pipeline? - • Chapter #8 - Cloud IAM...
    **Google Cloud Platform Beginner Series**
    Google Cloud Platform Beginner Series (2020) - • Google Cloud Platform ...
    **Networking Basics Playlist**
    Networking and Infra Concepts - • Networking & Infra Con...
    ** Other Popular Playlists**
    Crunching Data Series (2020) - • Learn - Data Engineeri...
    Latest technology tutorial (2020) -
    • What is a Data Vault ?...
    Hi Friends, I am Anshul Tiwari, and welcome to our youtube channel IT k Funde.
    More about this video -
    In this video, we will learn something different, instead of discussing any specific product on google cloud platform, we will learn how to design a data pipeline solution on google cloud platform.
    This will help us in 2 ways -
    1. This will helps us improve our design thinking
    2. We will get an overview of all google cloud products that can be used in developing a data pipeline.
    We will start with understanding a typical on premise data pipeline and enterprise data warehouse (edw) design. We will then take this same solution and try to implement it on google cloud platform.
    On-Prem solution design contains below products -
    #Streaming data - Apache Kafka
    #ETL - Informatica, IBM Datastage, SAP Data Services
    #EDW - Teradata
    #Dashbaords & reporting - Tableau and SAP Business Objects
    GCP cloud design contains below products -
    #Streaming data - Cloud PubSub
    #ETL - Cloud Data Fusion
    #Storage - Google Cloud Storage
    #EDW - Google Big Query
    #Dashboards - Looker, Google Data Studio
    #ML - AutoML
    #Metadata Management - Data Catalog
    PLEASE WATCH ALL THE VIDEOS IN ANY PLAYLIST FOR BETTER UNDERSTANDING !!
    Credits & Resources for further study -
    GCP resources documentation for detailed study -
    #Cloud PubSub - cloud.google.com/pubsub/docs/...
    #Cloud Data Fusion - cloud.google.com/data-fusion/...
    #Google Cloud Storage - cloud.google.com/storage/docs...
    #Google Big Query- cloud.google.com/bigquery/doc...
    #Data Studio - cloud.google.com/bi-engine/do...
    #Looker - docs.looker.com/
    #AutoML - cloud.google.com/automl/docs
    #Data Catalog - cloud.google.com/data-catalog...
    images - pixabay.com
    **Social Links**
    TH-cam - / itkfunde
    Facebook - / itkfunde
    Linkedin - / ansh9685
    Twitter - / ansh9685
    Instagram - / itkfunde
    **About This Channel**
    Friends ITkFUNDE channel wants to bring I.T. industry knowledge, information, career advice, and much more to every individual regardless of whether he or she belongs to I.T or not. This channel is for everyone interested in learning something new!

ความคิดเห็น • 148

  • @metaocloudstudio2221
    @metaocloudstudio2221 2 ปีที่แล้ว +27

    As a Data Engineer who worked on Cloud solutions for 7 years, I have never seen such a correct and clean solution. I am interested to see the same solution on AWS as well.

  • @sanjayroberts3453
    @sanjayroberts3453 2 ปีที่แล้ว +3

    Incredibly helpful -- interviewing for senior level data engineering positions and many have data pipeline design sections. This is perfect for that

  • @antonioflores4240
    @antonioflores4240 2 ปีที่แล้ว +5

    I'm trying to build a data lake for an undergrad cloud class and honestly I was feeling pretty hopeless but after watching your video I feel way more prepared to tackle this!!!! Thank you!!!

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Thanks Antonio I am so happy it helped in some little way 😊

  • @ankursahu4731
    @ankursahu4731 ปีที่แล้ว

    This is one of the great video to make me understand how Project architecture is there in GCP, I would request to please create a series to build end to end demo project in GCP utilizing all these services.

  • @AmeyChittar
    @AmeyChittar 3 ปีที่แล้ว +3

    Hi. So glad for posting this. I was going through the coursera course for this and I never understood the big picture. It's so clear now. Thank you so much.

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Thanks Amey so happy it helped 🙏

  • @hellmutmatheus2626
    @hellmutmatheus2626 2 ปีที่แล้ว

    I really learned a lot through your videos. Thank you very much, cheers from Brazil!

  • @ChoandCho
    @ChoandCho 2 ปีที่แล้ว

    Thank you so much for the hard work you have taken for making these concepts very easy to understand..
    Great work.. Keep posting such informative videos

  • @hareeshgunda8371
    @hareeshgunda8371 2 ปีที่แล้ว +1

    Great work.
    We expect some realtime project design on GCP.

  • @jackleung2519
    @jackleung2519 2 ปีที่แล้ว +1

    really enjoy watching your video, very clear and well-explained.

  • @chehz
    @chehz 3 ปีที่แล้ว +1

    Thank you very much for all your hard work and the nice lecture.

  • @hemanth1262
    @hemanth1262 2 หลายเดือนก่อน

    Very informative to the beginners of cloud but helped with on-prem solutions, thanks tons!!!

  • @naginafathima4562
    @naginafathima4562 2 ปีที่แล้ว +10

    Great explanation. Can you also do a video on how the traditional DW system is used in AWS and AZURE?

  • @rushabhparmar7519
    @rushabhparmar7519 2 ปีที่แล้ว

    Very informative and detailed! Thank you!

  • @pankajmanik7789
    @pankajmanik7789 2 ปีที่แล้ว +3

    This video is too good !! So many topics covered crisply in 1 video ... Thank you very much Sir :)

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Thanks Pankaj ☺️🙏

  • @maayanyacobi8665
    @maayanyacobi8665 2 ปีที่แล้ว +1

    great lecture, well explained - Thank you

  • @joannwatu7603
    @joannwatu7603 2 ปีที่แล้ว +1

    Thank you for sharing. Your explanation of the data lake structure for both on-prem and cloud solutions was very clear and easy to follow.

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Thanks Joan ☺️🙏

  • @gauravjha7249
    @gauravjha7249 2 ปีที่แล้ว +1

    Very nice ......thanks for enlightning with so many information

  • @victorganguly4230
    @victorganguly4230 2 ปีที่แล้ว

    Nice explaination. 👍 Want to have some more videos like this for real time data processing with Google cloud (for example IOT data,sensor data)

  • @anandakumarsanthinathan4740
    @anandakumarsanthinathan4740 2 ปีที่แล้ว +1

    Absolutely, absolutely wonderful and insightful presentation. Thank you very much.
    Just an observation. With BigQuery as your data warehouse, companies need not go for datamarts any more. BigQuery being a scalable solution, personnel from the various departments such as Sales, HR, Marketing, Finance can run their queries against a single database and still get results back without compromising on performance. Security can be implemented at the user/department level.

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว +2

      Thanks Ananda I cant agree more, BQ could be a one stop solution for vatious diff use cases

  • @prasann26
    @prasann26 2 ปีที่แล้ว

    Very nicely explained. All the best. thank you!

  • @Poornima_life
    @Poornima_life ปีที่แล้ว

    It’s perfect explanation….you are genius man

  • @manuelbolivar9791
    @manuelbolivar9791 3 ปีที่แล้ว +7

    Thanks for posting and thanks for your hard working!! I'd like to see, whenever possible, similar videos for AWS and Azure cloud offerings. Using the same left side starting point. Cheers!

  • @LeonidAndrianov
    @LeonidAndrianov ปีที่แล้ว +1

    Thank you, mate! I like your optimism as well as the explanation.

    • @ITkFunde
      @ITkFunde  ปีที่แล้ว +1

      Thanks Leonid🙏

  • @prithvinoojibail3734
    @prithvinoojibail3734 3 ปีที่แล้ว +1

    I am glad you have tutorials in English (since I don't speak or understand Hindi). Thanks for this video!

    • @ITkFunde
      @ITkFunde  3 ปีที่แล้ว

      Thanks Prithvi

  • @aneyewitness
    @aneyewitness ปีที่แล้ว +1

    Awesome video - thank you! 👍
    Would love to see an equivalent video for AWS and Azure also.

  • @SamvithDevi
    @SamvithDevi 3 ปีที่แล้ว +2

    This has been very helpful. Thanks a lot for the video, and your detailing.

    • @ITkFunde
      @ITkFunde  3 ปีที่แล้ว

      Thanks a lot🙏☺️

  • @gauthamvijayan
    @gauthamvijayan 2 ปีที่แล้ว +1

    Phenomenal Content.

  • @vanib6123
    @vanib6123 6 หลายเดือนก่อน

    Very Informative !! Thank you for the great efforts !!

  • @pankajchaudhari95
    @pankajchaudhari95 3 ปีที่แล้ว

    Thanks, Amazing video and very helpful

  • @rohanasnani4768
    @rohanasnani4768 2 หลายเดือนก่อน

    Loved it, Thank you soo much for this!

  • @mainajnabee
    @mainajnabee 2 ปีที่แล้ว

    Profound knowledge you are sharing with us. Thank you so much! One request: whenever you refer your other videos, can you please display them in those bubbles in a corner so that we can click on them and go to those videos? You know those small bubbles with link that pop up..

  • @siddeshsharma968
    @siddeshsharma968 ปีที่แล้ว

    Really Great video, been looking for such explaination

  • @sanaayakurup5453
    @sanaayakurup5453 ปีที่แล้ว

    brilliant brilliant explaination!keep it up!

  • @RizalIsmail
    @RizalIsmail 2 ปีที่แล้ว +1

    Really great video. So humble you are and you give good first impressions at 1:25

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Thanks Rizal ☺️🙏

  • @learner9670
    @learner9670 ปีที่แล้ว

    Too good learning material!

  • @siamakfarjami2116
    @siamakfarjami2116 ปีที่แล้ว

    Thank you very much. Great video.

  • @MubeenEbrahim-gk5xh
    @MubeenEbrahim-gk5xh ปีที่แล้ว

    Thank you. Very useful video.

  • @thammayagupta
    @thammayagupta ปีที่แล้ว

    Awesome Explanation.....

  • @59600muslim
    @59600muslim ปีที่แล้ว

    Très intéressant merci

  • @SeafireCH
    @SeafireCH ปีที่แล้ว +1

    Very nice video, great overview that is very helpful in navigating the jungle of google tools. Thanks a lot!

    • @ITkFunde
      @ITkFunde  ปีที่แล้ว

      Thanks Pesche

  • @vamsikri1234
    @vamsikri1234 2 ปีที่แล้ว

    Really good video and good explanation learned a lot. I just wanted to ask one thing ,the cloud model you showed is it conventional ETL flow or its more of ELT with subset of ETL , because here we are using both the flow or its some kind of new approach which we can use, if would be really good if you could share your thoughts on this

  • @swatitraveltales
    @swatitraveltales 3 ปีที่แล้ว +1

    Reallly Helpful videos! An aadvanced video on Bigquery & one on Cloud data fusion too:). Thanks for sharing these videos

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Thanks Swati sure will try ☺️

  • @swarupdeshpande4759
    @swarupdeshpande4759 4 หลายเดือนก่อน

    Thanks. it was good presentation. To the point.

  • @ambar752
    @ambar752 3 ปีที่แล้ว +1

    Very well explained, loved it !!

    • @ITkFunde
      @ITkFunde  3 ปีที่แล้ว

      Thanks Ambar🙏🙏☺️☺️

  • @meeral8703
    @meeral8703 2 ปีที่แล้ว +1

    You are amazing!! Thanks for the clear video

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว +1

      Thanks Meera ☺️

  • @samirchandra2637
    @samirchandra2637 2 ปีที่แล้ว

    Very informative boss, keep going

  • @lifeofpayal972
    @lifeofpayal972 2 ปีที่แล้ว

    Thanks for the video..!

  • @shameemferdous
    @shameemferdous ปีที่แล้ว +1

    Amazing !

  • @priteshraka6408
    @priteshraka6408 3 ปีที่แล้ว +2

    Thanks for this, I was waiting for one such video from long time.

  • @hailstorm7868
    @hailstorm7868 ปีที่แล้ว

    3rd party -> gcs stage can also be done via arbitrary code inside a container on "cloud run jobs". I did that, and downstream what is shown in the video almost exactly on one of my freelance projects.

  • @ramkumarsundaravel8301
    @ramkumarsundaravel8301 5 หลายเดือนก่อน

    Really good video and explained very well.

  • @sreetenap
    @sreetenap 2 ปีที่แล้ว +1

    Thx a bunch! Excellent explanation.

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Thanks Sreedhar☺️

  • @shubhamsinnarkar2923
    @shubhamsinnarkar2923 3 ปีที่แล้ว

    AMAZING VIDEO!

  • @moushmidas2584
    @moushmidas2584 3 ปีที่แล้ว +3

    Thanku sir As i requested you really created a video❤️❤️🙏 so nice video...

    • @ITkFunde
      @ITkFunde  3 ปีที่แล้ว +1

      Thanks Moushmi hope you learnt something new from it pls keep sharing ur suggestions

  • @yshah9042
    @yshah9042 ปีที่แล้ว +1

    Excellent.

  • @informationsatellite5155
    @informationsatellite5155 ปีที่แล้ว

    great content..thank you so much...

  • @Pkmafffy
    @Pkmafffy ปีที่แล้ว

    Great video! I really appreciate the effort you put into this video and it really helped me out. Thanks a lot1

  • @syedaliuddin7046
    @syedaliuddin7046 3 ปีที่แล้ว +2

    Thank you for sharing the information really helpful

    • @ITkFunde
      @ITkFunde  3 ปีที่แล้ว

      Thanks Syed

  • @JOHNSMITH-ve3rq
    @JOHNSMITH-ve3rq 3 ปีที่แล้ว +1

    absolutely amazing video!!!!!!!!!!!!!!!!!!

    • @ITkFunde
      @ITkFunde  3 ปีที่แล้ว

      Thanks John

  • @MoisesTrelles
    @MoisesTrelles 2 ปีที่แล้ว +1

    Great explanation, very well presented

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Thanks Moises ☺️🙏

  • @vardannegi
    @vardannegi 3 ปีที่แล้ว +4

    For data transformation and enrichment in GCP, Dataprep is the best tool

    • @ITkFunde
      @ITkFunde  3 ปีที่แล้ว

      Yes Vardan Dataprep is good tool

  • @TradeWithCodeOfficial
    @TradeWithCodeOfficial 3 ปีที่แล้ว

    very helpful video.. one question.. do we need to create a VM instance if we go by this approach for GCP. Where do we trigger the gsutil command ? How do we go to the backend of fusion instance?

  • @sanjayg2686
    @sanjayg2686 ปีที่แล้ว +1

    Sooooper Simple, you made sir, Thanks a lot

    • @ITkFunde
      @ITkFunde  11 หลายเดือนก่อน

      You are most welcome

  • @yuvabhagvatkathakarshriyag9305
    @yuvabhagvatkathakarshriyag9305 3 ปีที่แล้ว +3

    this is indeed a helpful video

    • @ITkFunde
      @ITkFunde  3 ปีที่แล้ว

      Thanks a lot

  • @bhalachandrapatil
    @bhalachandrapatil ปีที่แล้ว +1

    excellent

  • @dr.kumaraswamybattula8428
    @dr.kumaraswamybattula8428 5 หลายเดือนก่อน

    Thank you ❤

  • @praveengarg6090
    @praveengarg6090 ปีที่แล้ว

    Great

  • @Rise_Citizens
    @Rise_Citizens 2 ปีที่แล้ว +1

    You might make my next job a bigger paycheck,nice video

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Thanks buddy

  • @emmanuelihetu9848
    @emmanuelihetu9848 2 หลายเดือนก่อน

    This is soo good

  • @oguzhangunes
    @oguzhangunes 2 ปีที่แล้ว

    thank you !

  • @manpreetsohal11
    @manpreetsohal11 2 ปีที่แล้ว +1

    Informative video. Why haven't you used cloud dataflow for ETL? Can cloud data fusion perform better ETL tasks than cloud dataflow?

  • @venkatanithish7757
    @venkatanithish7757 2 ปีที่แล้ว +1

    Good one :) :D

  • @user-kk4yp8vd4g
    @user-kk4yp8vd4g 8 หลายเดือนก่อน

    In your architecture where is the pub sub messages are saving is in cloud storage and then etl pupeline pick and load in to big query?

  • @pradeepdewani6267
    @pradeepdewani6267 3 ปีที่แล้ว +1

    This is really helpful ..

    • @ITkFunde
      @ITkFunde  3 ปีที่แล้ว

      Thanks Pradeep

  • @CPat-yt6dm
    @CPat-yt6dm 10 หลายเดือนก่อน

    Thanks

  • @meeral8703
    @meeral8703 2 ปีที่แล้ว +1

    Question sir: on the Google solution, why do you have two separate Cloud Storage components?

  • @aamirsuleman9815
    @aamirsuleman9815 10 หลายเดือนก่อน

    Dataform is another amazing tool for ELT jobs

  • @rdhundare
    @rdhundare 3 ปีที่แล้ว +1

    Gud explanation

    • @ITkFunde
      @ITkFunde  3 ปีที่แล้ว

      Thanks Rahul

  • @capooti
    @capooti 2 ปีที่แล้ว +1

    Nice intro! well done

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Thanks a lot Roland

  • @ankitajain6817
    @ankitajain6817 3 ปีที่แล้ว +3

    Very very informative and useful video.Thanks for sharing !

    • @ITkFunde
      @ITkFunde  3 ปีที่แล้ว

      Thanks Ankita 😊

  • @VinodKumar-wc6bu
    @VinodKumar-wc6bu ปีที่แล้ว

    It's really excellent with kind of explanation .
    Sir do you have gcp pde paid course as well,plz comment

  • @hemantfegde460
    @hemantfegde460 ปีที่แล้ว

    Good explanation Sir, could you please make video on Streamsets pipeline?

  • @sanjayg2686
    @sanjayg2686 ปีที่แล้ว +1

    Thanks a lot

    • @ITkFunde
      @ITkFunde  11 หลายเดือนก่อน

      Most welcome

  • @kushagrak4903
    @kushagrak4903 3 ปีที่แล้ว +1

    Sir really amazing video ! I was wondering where can I learn basics of the software which you mentioned ( SAP , IBM ) ?

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Thanks youtube has everything for free or else try udemy for reasonable courses

  • @anandnerurkar8482
    @anandnerurkar8482 ปีที่แล้ว

    good one. what is datamart, for datamart you have been using data studio,automl,looker? can u pls explain why??

  • @sathishgowda4272
    @sathishgowda4272 2 ปีที่แล้ว +1

    Very good explanation, please make some videos related to complete data engineer project flow with pyspark or SQL

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Thanks Sathish ☺

    • @pawnyogi
      @pawnyogi ปีที่แล้ว

      @@ITkFunde hi , is this is roadmap for data engineer using google cloud ?

  • @RakeshGupta23
    @RakeshGupta23 2 ปีที่แล้ว +1

    Great explanation..one request .. please use mic 🎙️ as voice is Little low as compared to other educational videos.... great job..

    • @ITkFunde
      @ITkFunde  2 ปีที่แล้ว

      Sure Rakesh thx for feedback and support 🙏

  • @ArjunSingh-fm9sm
    @ArjunSingh-fm9sm ปีที่แล้ว

    Hey Thanks for this video, I have 1 question. If we want to do performance test for this type on msging queue application, what should be our approach.?? and how we can do this.. plz

  • @zudt
    @zudt 7 หลายเดือนก่อน

    just G.R.E.A.T.

  • @amitjaiswal781
    @amitjaiswal781 ปีที่แล้ว +1

    Please make a video real time projects in gcp

  • @sunilsahoo5199
    @sunilsahoo5199 ปีที่แล้ว +1

    can you do an end to end project in google cloud using this services.Then it will good for us.A lot of thanks to you for giving this type of content.

    • @ITkFunde
      @ITkFunde  ปีที่แล้ว

      thanks for suggestion

  • @rushipuneet1270
    @rushipuneet1270 2 ปีที่แล้ว

    Is there any practical video creating a data pipeline?

  • @pankajsangle2752
    @pankajsangle2752 6 หลายเดือนก่อน

    om shree ganpatye namo:

  • @SancheeKaushik
    @SancheeKaushik ปีที่แล้ว

    what can be the most cost efficient way to ingest a few tables from snowflake to GCP bigQuery

    • @SancheeKaushik
      @SancheeKaushik ปีที่แล้ว

      also is it possible to do Export and then transform before saving to bigQuery instead to do ELT as explained in the video. if yes then will that be a good option which is cost efficient.

  • @Kunal4980
    @Kunal4980 2 ปีที่แล้ว

    Hi I see that your videos are getting older and platforms like GCP changes their platforms very frequently, could you please update your videos to sync up or upload delta videos to complement older ones up to the latest ones...

  • @PradeepSinghRajputOfficial
    @PradeepSinghRajputOfficial 2 ปีที่แล้ว +2

    The cloud storage in the middle is wrong. You can't store structured data in Cloud Storage.

    • @anandakumarsanthinathan4740
      @anandakumarsanthinathan4740 2 ปีที่แล้ว

      @Pradeep Singh, can't we store the raw data as text files? Acts as a datalake and also as a backup of our original, untouched, unedited data.

  • @nilavasen8631
    @nilavasen8631 ปีที่แล้ว +1

    Dear Anshul, how are you ? I have been working as Data Engineer in on-prem solution of our company for more than 10 years and now wish to move to Cloud Based solution, like GCP Data Engineer. Can you please do help me with the below queries I have :-
    1. I am having total 15+ years of IT Experience. Will it be a good decision to switch to Cloud Data Engineer Roles now ? It seems to me bit late to start journey in this domain now. Please do correct me , if I am wrong.
    2. In order to start / work with the GCP Data Engineering Role, can you kindly suggest me what are the topics / tools / GCP Services I need to learn ? I have basic ETL Concept and also working with PLSQL , Python stuff.
    Thanks in advance !!

    • @ITkFunde
      @ITkFunde  ปีที่แล้ว +1

      Hi Nilava, 15+ years of exp is quite a lot, in todays age tech roles are defying age and exp boundaries but in indian IT your total exp and respective role does put an image on your career profile thus i would suggest you move towards cloud data architect sort of roles or solution architects. Also if you starting from a clean slate then you should also think of becoming a Data Lead or Manager and manage a team of Data engineers under you. Think big !

    • @nilavasen8631
      @nilavasen8631 ปีที่แล้ว

      @@ITkFunde yes very true Anshul.. thats what I was thinking also. 🙂

  • @V_sharmaji
    @V_sharmaji 3 ปีที่แล้ว +1

    Is Big Query DataLake or Cloud Storage?

    • @anandakumarsanthinathan4740
      @anandakumarsanthinathan4740 2 ปีที่แล้ว

      @vaibhav Sharma, it could act as a datalake as well as storage, but primarily it is a data warehouse for analytic purposes.

  • @krish_telugu
    @krish_telugu 2 ปีที่แล้ว

    We are facing problems with almost all services that we have been using with GCP, very limited plugins and features serve to basic use cases. even connectivity between services is also troublesome. long way to go

  • @rincymathew7716
    @rincymathew7716 3 ปีที่แล้ว +1

    Could you help me with dataProc, data flow and data prep please

    • @ITkFunde
      @ITkFunde  3 ปีที่แล้ว

      Thanks Rincy will try to make for sure