Streaming ETL With AWS Glue | ETL | AWS Glue | Kinesis Data Stream | Glue Crawler | Glue ETL Job

แชร์
ฝัง
  • เผยแพร่เมื่อ 4 พ.ย. 2024

ความคิดเห็น • 20

  • @neelkanthbk
    @neelkanthbk ปีที่แล้ว +1

    Thank you Bro, your videos are very helpful. I was stuck in one issue, through your video I got the solution :)

    • @cloudquicklabs
      @cloudquicklabs  ปีที่แล้ว

      Thank you for watching my videos.
      Glad that it helped you.

  • @HoustonPillay
    @HoustonPillay 6 หลายเดือนก่อน +1

    Thank you so much. Perfectly reproducible. Awesome video.

    • @cloudquicklabs
      @cloudquicklabs  6 หลายเดือนก่อน

      Thank youfor watching my videos.
      Glad that it helped you.

  • @bibinkunjumon
    @bibinkunjumon หลายเดือนก่อน +1

    how to automate athena query from etl job completions?

    • @cloudquicklabs
      @cloudquicklabs  หลายเดือนก่อน

      Thank you for watching my videos.
      Do you mean you want to run some query on your dataset in etl pipelines

  • @kumaru5796
    @kumaru5796 7 หลายเดือนก่อน +1

    thanq nicely explained.

    • @cloudquicklabs
      @cloudquicklabs  7 หลายเดือนก่อน

      Thank you for watching my videos.
      Glad that it helped you.

  • @noufalrijal9811
    @noufalrijal9811 7 หลายเดือนก่อน +1

    What will be process if i need to write transformations on the data, by comparing the existing data (previously processed data).

    • @cloudquicklabs
      @cloudquicklabs  7 หลายเดือนก่อน

      Thank you for watching my videos.
      You might need to have two branches one branch taking care of current data , while on another sourcs you are just crawling. And then you are comparing at a single Transform task like "custom transform". You can try many other approaches as well. I shall create video on your scenario if you can explain bit more.

    • @noufalrijal9811
      @noufalrijal9811 7 หลายเดือนก่อน +1

      Thanks for the quick response 🙂
      My scenario is -
      1. The source will be generating some ticketing information via kinesis stream
      2. I am creating a report which is an aggregated table from almost 8 other tables
      3. We are pushing data to an s3 data lake
      4. So I need to to perform all the aggregated transformations related to the report on the flight within the stream

    • @cloudquicklabs
      @cloudquicklabs  7 หลายเดือนก่อน

      Again when you say aggregate from stream data + table stored data ( May be rds) , are you merging or joining data from stream with table an then storing s3 bucket data lake

    • @noufalrijal9811
      @noufalrijal9811 7 หลายเดือนก่อน +1

      Data in kinesis stream will be a CDC from RDS and the tables to join meanse we can say tables from data lake via data catalogues

    • @cloudquicklabs
      @cloudquicklabs  7 หลายเดือนก่อน

      Okay.. and where is target to store the merge of CDC RDS + Table from Datalake catalog?

  • @surya-z9e
    @surya-z9e 11 หลายเดือนก่อน +1

    you told like will say the iam role cofiguration setting in final. but you did'nt

    • @cloudquicklabs
      @cloudquicklabs  11 หลายเดือนก่อน

      Thank you for watching my videos.
      Apologies if I have not covered but let me tell you that it full admi access with required trust definition. You can watch other videos on ETL I have shown it.