What Is S3 And How Can You Query It With AWS Athena - AWS Data Engineering 101

แชร์
ฝัง
  • เผยแพร่เมื่อ 25 ส.ค. 2024

ความคิดเห็น • 18

  • @AndrewAlarcon17
    @AndrewAlarcon17 4 หลายเดือนก่อน +3

    This is was super insightful. Would love more stuff like this!

    • @SeattleDataGuy
      @SeattleDataGuy  4 หลายเดือนก่อน

      glad you enjoyed it!

  • @PrinciplesOrDie
    @PrinciplesOrDie 4 หลายเดือนก่อน +4

    You could've used Glue - Crawler to create the tables faster you can just alter the DDL code in Athena later if you didn't like the way it was put together

    • @SeattleDataGuy
      @SeattleDataGuy  4 หลายเดือนก่อน +3

      100%! I just wanted to go through the CSV S3 bucket option this time. But I am planning to go over AWS Glue and some of the various glue concepts(the etl, catalog, etc) in the future video. This is meant to be a series so I am trying to only add so much per video.

  • @hansmandler7284
    @hansmandler7284 4 หลายเดือนก่อน +1

    Yeah,
    That's what I literally did last weekend:)
    Good to see that the professionals do it the same way I did it.

    • @SeattleDataGuy
      @SeattleDataGuy  4 หลายเดือนก่อน

      What were you doing? Reading from an S3 bucket

  • @ansonnn_
    @ansonnn_ 4 หลายเดือนก่อน +2

    Thanks for the amazing video again as always. We are using Athena as our main "engine" (not sure if that's the right term) to directly connect with Apache Superset for our dashboarding purposes. Our datasets are mostly in Hudi format and very few in parquet format. We are always querying our datasets from S3 using PySpark. I don't think using another huge data warehouse solution like Snowflake or BigQuery makes sense. Or are we missing out something crucial here? Just some thoughts...

  • @SeattleDataGuy
    @SeattleDataGuy  4 หลายเดือนก่อน

    If you guys want to learn more about data engineering, then sign up for my newsletter here seattledataguy.substack.com/ or join the discord here discord.gg/2yRJq7Eg3k

  • @richardduncan3403
    @richardduncan3403 4 หลายเดือนก่อน +1

    I now know why it is called S3. nICE:)

    • @SeattleDataGuy
      @SeattleDataGuy  2 หลายเดือนก่อน

      Glad you found the video helpful