How to Use AWS Glue with Snowflake | PySpark-Snowflake Connectivity

Knowledge Amplifier

มุมมอง 18 110

เพิ่มลงใน
- เพลย์ลิสต์ของฉัน
- ดูภายหลัง
แชร์

แชร์

ฝัง

ขนาดวิดีโอ:

แสดงแผงควบคุมโปรแกรมเล่น

เล่นอัตโนมัติ

เล่นใหม่

เผยแพร่เมื่อ 19 ม.ค. 2025

ความคิดเห็น • 59

@nayanroy13 3 ปีที่แล้ว ⁺⁴
Exactly what I was looking for. Crisp, clear and to the point!
@KnowledgeAmplifier1 3 ปีที่แล้ว
Thank You IamDocxy😊Happy Learning :-)
@manishvishvkarma8030 2 ปีที่แล้ว ⁺¹
@@KnowledgeAmplifier1 hii sir can u pls create one vedio on glue job like it will read data from s3 and load it into snowflake table
@yamuna-p4u ปีที่แล้ว ⁺¹
sir thank you for this video this video helped me a lot ,your explaination is awesome, please keep doing this we will definitely support you sir
@sreejithsurendran6632 3 ปีที่แล้ว ⁺³
Thanks a lot bro..lot of use cases for snowflake and aws learners…
@yadi4diamond ปีที่แล้ว ⁺¹
You are simply awesome, Thank you for the knowledge share!!
@KnowledgeAmplifier1 ปีที่แล้ว
Thank you for your kind words , Yadi! Happy Learning
@puremjlee 2 ปีที่แล้ว ⁺²
4:10 says the glue job is executed by lambda but there was no lambda setting in the video. do we need to use lambda to call glue job?
@KnowledgeAmplifier1 2 ปีที่แล้ว
Hello MJ Lee, I was explaining that we can trigger the glue job from Lambda based on certain event occurrence if required , if you want to run Glue Job from Lambda trigger , then you can check this video --
th-cam.com/video/1tIM1jBmwD4/w-d-xo.html
Hope this will be helpful! Happy Learning :-)
@kunnunhs1 2 ปีที่แล้ว ⁺¹
daarun very good explanation.. one video full clarity
@KnowledgeAmplifier1 2 ปีที่แล้ว
অনুপ্রেরণামূলক মন্তব্যের জন্য আপনাকে ধন্যবাদ Desi Bhasa Main😊হ্যাপি লার্নিং✌
@praveenyadam2617 2 ปีที่แล้ว ⁺¹
You are a wonder and this is what I was looking for...thanks much
@KnowledgeAmplifier1 2 ปีที่แล้ว
Glad to know the video was helpful to you praveen yadam! Happy Learning :-)
@keshavamugulursrinivasiyen5502 2 ปีที่แล้ว ⁺²
Very well presented and nice job
@KnowledgeAmplifier1 2 ปีที่แล้ว
Thank You Keshava Mugulur Srinivas Iyengar! Happy Learning :-)
@MahendraSingh-sw1th 2 ปีที่แล้ว ⁺¹
That was awesome ! Precise !
@KnowledgeAmplifier1 2 ปีที่แล้ว
Thank You Mahendra Singh! Happy Learning :-)
@yamuna-p4u ปีที่แล้ว ⁺¹
crystal clear explanation thank you bro
@KnowledgeAmplifier1 ปีที่แล้ว
You are welcome!
@amitprasad6982 ปีที่แล้ว ⁺¹
sirji in initial architecture you said glue will read data from s3 and apply some transformation and write it to snowflake , but later in the video you pulled data from snowflake and write back to snowflake and s3 .
@KnowledgeAmplifier1 ปีที่แล้ว ⁺¹
Hello Amit Prasad, at 4:15-4:37 , I have mentioned that in this video the focus is integration between AWS Glue (or PySpark) & Snowflake as s3 to lambda and then lambda to glue part already covered in separate video , as the primary focus of this video is Glue & Snowflake , so I explained the possible scenarios around this -- pulling data from snowflake and write back to snowflake & pulling data from snowflake and write to s3. If you want to explore s3 to lambda and then lambda to glue, then you can refer this video--th-cam.com/video/1tIM1jBmwD4/w-d-xo.htmlsi=dYoD7GHeG3hhWAei Hope this answers your doubt , if you have any doubt , please feel free to comment , will try to help as much as possible
@AliMirfaisal ปีที่แล้ว ⁺¹
Nice video. please share same for EMR without airflow.
@KnowledgeAmplifier1 ปีที่แล้ว
Hello Ali Mir faisal, you can refer this video -- th-cam.com/video/oJ6TvZu6DqQ/w-d-xo.html Happy Learning
@swarajmehta3011 3 หลายเดือนก่อน
can we do same, to read csv data from s3 and write it as a table into snowflake
@sumeetsawant3398 ปีที่แล้ว
Hi How do i do this for EMR on EKS . How do I add the jar files in that case ?
@vaibhavverma1340 ปีที่แล้ว
Hello bhaiya, I am getting errror following each step still getting error .. "py4j.protocol.py4jjavaerror: an error occurred while calling o90.load snowflake" ??? please help me out
@laterlname7865 2 ปีที่แล้ว ⁺¹
Is it mandatory to have Spark to connect to Snowflake? Can’t we directly access data in Snowflake tables using SQL in AWS Glue’s python program? The reason I am asking this question is Spark is a big data analytics tool and not every application is meant for data analytics. Most business applications are Insert, Update, Select, Delete type SQL based programs. So can I embed these SQLs in AWS Glue’s Python scripts without using Spark in the code?
@KnowledgeAmplifier1 2 ปีที่แล้ว
Hello Later Lname, you asked a very good question , I created this separate video to give the answer of your question --
th-cam.com/video/OJM2IkcIW_o/w-d-xo.html
Hope this will be helpful! Happy Learning :-)
@yamunau.yamuna5189 ปีที่แล้ว ⁺¹
Thanks a lot Bro your video is awesome
@KnowledgeAmplifier1 ปีที่แล้ว
Most welcome!
@madhubhardwaj4512 ปีที่แล้ว
How can we find the compatible version for the jar files with the current spark version? Please reply.
@krishnasanagavarapu4858 ปีที่แล้ว
can we create reverse integration, i.e. to fetch huge data (80 million rows) from snowflake to S3 without using stage. We have only "read only access to snowflake ?
@swarnadeepchowdhury563 2 ปีที่แล้ว ⁺¹
Is aws glue mandatory for running spark jobs on Snowflake?
@KnowledgeAmplifier1 2 ปีที่แล้ว
Hello Swarnadeep Chowdhury, no it's not mandetory , you can use other services where spark can run like emr etc. Here is a reference video -- th-cam.com/video/oJ6TvZu6DqQ/w-d-xo.html
Happy Learning
@vikinist 2 ปีที่แล้ว ⁺¹
can you share the video link to s3 and Lamdbda trigger
@KnowledgeAmplifier1 2 ปีที่แล้ว ⁺¹
Hello Vikram , if you want to trigger AWS Glue Job whenever some file lands in s3 (s3 to Lambda and then Lambda to AWS Glue Job) , you can refer this video --
th-cam.com/video/1tIM1jBmwD4/w-d-xo.html
Hope this will be helpful! Happy Learning :-)
@kittu1010 2 ปีที่แล้ว
Hello Sir
I am trying to perform many spark operations once i read the table ( just not group by ) . I used the same jars but i am getting the following error - "An error occurred while calling o94.load. scala/Product$class" . Do u know using which jar will solve this issue . thanks in advance.
@vikinist 2 ปีที่แล้ว ⁺¹
one doubt can you please answer when we will go for snowpipe and when we can go for glue ?
@KnowledgeAmplifier1 2 ปีที่แล้ว ⁺¹
Hello Vikram , snowpipe is used for real-time data ingestion from datalake to snowflake using SQS or SNS kind off services .... AWS Glue you can use for any batch processing purpose , batch ingestion or for transforming your data , you can use AWS Glue / EMR
@vikinist 2 ปีที่แล้ว ⁺¹
@@KnowledgeAmplifier1 thanks for the quick reply
@KnowledgeAmplifier1 2 ปีที่แล้ว
@@vikinist no problem .. Happy Learning
@ketank344 2 ปีที่แล้ว
hello, i am getting connection refused error. any idea what could be the reason
@MrRajat769 ปีที่แล้ว
Plz make video to answer what u r doing in snowflake.....
@ravikreddy7470 2 ปีที่แล้ว
Can you post one video with S3 -> Glue -> RS pipeline (not using pyspark)
@anhdo7704 ปีที่แล้ว
may I ask what exactly is the username for snowflake this time? because I don't know where to find the user name
@KnowledgeAmplifier1 ปีที่แล้ว
Hello Anh Do, username is what you use to login in the Snowflake Web console , you might have setup while sign up or your admin team can confirm on this , if using OAuth , then , mostly there will be a dedicated user to connect with Python , PySpark etc. the admin team in your project can confirm on the same ...
@yogeshbharadwaj6200 3 ปีที่แล้ว
Tks a lot brother....very helpful...very good easy, clear explanation.... If I have a need to join 2 tables, can I specify table names as comma separated in "source_table_name" and perform the join in ".option("query","********")", pls help to suggest. Thanks.
@adithyabulusu8812 2 ปีที่แล้ว ⁺¹
Thanks a lot bro. Can you also please share the video to load the data from S3 to snowflake by using lambda and glue
@KnowledgeAmplifier1 2 ปีที่แล้ว
Hello Adithya , here I have explained AWS Glue and Snowflake integration and in the below video I have explained s3 , Lambda , Glue integration , you can club these together & customize as per your requirements --
th-cam.com/video/1tIM1jBmwD4/w-d-xo.html
Happy Learning :-)
@codewithsharath5988 2 ปีที่แล้ว
Awesome. can you make video on how to connect redshift using pyspark in similar way ?
@pachappagarimohanvamsi4641 2 ปีที่แล้ว ⁺¹
Hello .. this approach is not so useful it seems .. here we are processing the snowflake table and processing in spark and storing the data in snowflake again if I am right.. for we can use snowflake itself.. aws glue is extra cost 😅
@KnowledgeAmplifier1 2 ปีที่แล้ว ⁺¹
Hello PACHAPPAGARI MOHAN VAMSI, yes your are right that this transformations can be done using compute power of Snowflake only , actually , this video fundamentally explains how to integrate Snowflake with Spark in AWS Glue platform , and to explain that I took a dummy transformation , the concept can be used for any other workloads which is not possible by snowflake only , for example , if the data is available in mysql rds (source) , then we can use spark to read the data from mysql and then write in snowflake(destination) , in that case , if we want to use AWS Glue as execution env, this video concepts can be useful for someone in that case ...
@pachappagarimohanvamsi4641 2 ปีที่แล้ว ⁺¹
@@KnowledgeAmplifier1 👍
@krishnashukla3638 2 ปีที่แล้ว
Hi Friend, How can I read data from RDS and ingest the same to snowflake using glue. Do you have any example for that, It will be really helpful for me. Thanks.
@rajeevranjan5913 ปีที่แล้ว
Hi ,
I am having exactly same requirement. Could you please help with the process if you have achieved the same.
@AY1986R ปีที่แล้ว
Thank you very much for this video
Please could you do an exemple with Oracle and Python ?
@bishnupriyamukherjee4746 3 ปีที่แล้ว ⁺²
👌👌👌👌👌👌👌👌
@KnowledgeAmplifier1 3 ปีที่แล้ว
Thank You 😄
@krishnasanagavarapu4858 3 ปีที่แล้ว ⁺²

ต่อไป

เล่นอัตโนมัติ

Connecting Snowflake with Python Shell Jobs in AWS Glue