This video is a very good intro to AWS Glue, very clearly explained.
This is a false statement. It is an introduction to Glue Studio, not to Glue. Only people who have worked with Glue before can follow this. What is a crawler? What is a Catalog, a table, etc.? What are they in Glue, I mean.
You mentioned a crawler, but gave no description of what it is or the steps to set one up. It would be good to have a pointer on how to work with the crawler.
A crawler is the schema detection component of Glue. It reads a sample of the data you want a schema for (supported sources include JDBC databases such as RDS, DynamoDB, and S3). It then uses built-in classifiers to infer the schema (table structure) of the data, or pulls the schema from sources that already have one, and writes it to the Glue Data Catalog. You can then reference the tables from your job. For an S3 data source, you can query the same tables from Athena/Spectrum/EMR, and from there you can even view the data in Power BI.
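To make that concrete, here is a minimal sketch in Python with boto3 of pointing a crawler at an S3 path and then reading back what it wrote to the catalog. It is only an illustration, not the video's method: the crawler name, role ARN, database and bucket are placeholders you would supply, and the helper functions are hypothetical names of my own. The role is assumed to already exist with permission to read the data.

```python
from typing import Any, Dict, List


def build_crawler_request(name: str, role_arn: str, database: str,
                          s3_path: str) -> Dict[str, Any]:
    """Build the keyword arguments for the Glue CreateCrawler call."""
    return {
        "Name": name,
        "Role": role_arn,          # IAM role the crawler assumes to read the data
        "DatabaseName": database,  # catalog database that receives the tables
        "Targets": {"S3Targets": [{"Path": s3_path}]},
    }


def summarize_tables(get_tables_response: Dict[str, Any]) -> Dict[str, List[str]]:
    """Map each table name to its column names from a GetTables response."""
    summary: Dict[str, List[str]] = {}
    for table in get_tables_response.get("TableList", []):
        columns = table.get("StorageDescriptor", {}).get("Columns", [])
        summary[table["Name"]] = [col["Name"] for col in columns]
    return summary


def run_crawler(request: Dict[str, Any]) -> None:
    """Create and start the crawler (needs AWS credentials and boto3)."""
    import boto3  # imported here so the helpers above work without AWS access

    glue = boto3.client("glue")
    glue.create_crawler(**request)
    glue.start_crawler(Name=request["Name"])


def list_catalog_tables(database: str) -> Dict[str, List[str]]:
    """Read back the tables the crawler wrote to the catalog database."""
    import boto3

    glue = boto3.client("glue")
    return summarize_tables(glue.get_tables(DatabaseName=database))
```

The crawler runs asynchronously, so the tables only show up in the catalog after the run finishes. Once they are there, a job (or Athena) can refer to them by database and table name.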
Thanks for the amazing video. Is there any way to get the customer and customer review data sets for practice?
But what about testing?
Nicely explained 👍
Thank you for the video.
It needs a better interface. Calling every object a node is not needed. The interface could be a little better. It looks like SSIS; I prefer an Informatica-style interface, which is more user friendly.
I hate the way this is set up. We use dbt and it is so cool.
Calling every monitoring UI a "single pane of glass" is absurd. If you have more than one such UI, by definition you don't have a "single pane". You have a lot of pain. Otherwise, a helpful overview.
This was completely useless, as it skips the IAM roles and permissions, so jobs fail with access denied.
lmao
It assumes you already know all this basic AWS IAM stuff, so it can focus on the real meat: Glue.
Ah yes, any demo that doesn't go all the way back to teaching you how to turn on your computer and access the internet is useless.
@DodaGarcia lmao
Sundar Pichai in AWS
😂😂