How to Use AWS Glue with Snowflake | PySpark-Snowflake Connectivity

Views: 18,117

Knowledge Amplifier


1 day ago

Comments: 59
@nayanroy13 3 years ago
Exactly what I was looking for. Crisp, clear and to the point!
@KnowledgeAmplifier1 3 years ago
Thank you, IamDocxy 😊 Happy Learning :-)
@manishvishvkarma8030 2 years ago
@KnowledgeAmplifier1 Hi sir, can you please create a video on a Glue job that reads data from S3 and loads it into a Snowflake table?
@yamuna-p4u 1 year ago
Sir, thank you for this video. It helped me a lot and your explanation is awesome. Please keep doing this, we will definitely support you, sir.
@sreejithsurendran6632 3 years ago
Thanks a lot bro. Lots of use cases for Snowflake and AWS learners.
@yadi4diamond 1 year ago
You are simply awesome. Thank you for the knowledge share!!
@KnowledgeAmplifier1 1 year ago
Thank you for your kind words, Yadi! Happy Learning
@puremjlee 2 years ago
At 4:10 the video says the Glue job is executed by Lambda, but there was no Lambda setup in the video. Do we need to use Lambda to call the Glue job?
@KnowledgeAmplifier1 2 years ago
Hello MJ Lee, I was explaining that we can trigger the Glue job from Lambda when a certain event occurs, if required. If you want to run a Glue job from a Lambda trigger, you can check this video: kzbin.info/www/bejne/Z6WsfmSgd9KqepY Hope this will be helpful! Happy Learning :-)
@kunnunhs1 2 years ago
Daarun, very good explanation. One video, full clarity.
@KnowledgeAmplifier1 2 years ago
Thank you for the encouraging comment, Desi Bhasa Main 😊 Happy Learning ✌
@praveenyadam2617 2 years ago
You are a wonder and this is what I was looking for. Thanks much.
@KnowledgeAmplifier1 2 years ago
Glad to know the video was helpful to you, praveen yadam! Happy Learning :-)
@keshavamugulursrinivasiyen5502 2 years ago
Very well presented and nice job
@KnowledgeAmplifier1 2 years ago
Thank you, Keshava Mugulur Srinivas Iyengar! Happy Learning :-)
@MahendraSingh-sw1th 2 years ago
That was awesome! Precise!
@KnowledgeAmplifier1 2 years ago
Thank you, Mahendra Singh! Happy Learning :-)
@yamuna-p4u 1 year ago
Crystal clear explanation, thank you bro
@KnowledgeAmplifier1 1 year ago
You are welcome!
@amitprasad6982 1 year ago
Sirji, in the initial architecture you said Glue will read data from S3, apply some transformation, and write it to Snowflake, but later in the video you pulled data from Snowflake and wrote it back to Snowflake and S3.
@KnowledgeAmplifier1 1 year ago
Hello Amit Prasad, at 4:15-4:37 I mentioned that the focus of this video is the integration between AWS Glue (or PySpark) and Snowflake, because the S3-to-Lambda and Lambda-to-Glue part is already covered in a separate video. Since the primary focus here is Glue and Snowflake, I explained the possible scenarios around it: pulling data from Snowflake and writing it back to Snowflake, and pulling data from Snowflake and writing it to S3. If you want to explore S3 to Lambda and then Lambda to Glue, you can refer to this video: kzbin.info/www/bejne/Z6WsfmSgd9KqepY?si=dYoD7GHeG3hhWAei Hope this answers your doubt. If you have any other doubt, please feel free to comment and I will try to help as much as possible.
@AliMirfaisal 1 year ago
Nice video. Please share the same for EMR without Airflow.
@KnowledgeAmplifier1 1 year ago
Hello Ali Mir faisal, you can refer to this video: kzbin.info/www/bejne/pXuZhamQqpt3p7M Happy Learning
@swarajmehta3011 3 months ago
Can we do the same to read CSV data from S3 and write it as a table into Snowflake?
@sumeetsawant3398 1 year ago
Hi, how do I do this for EMR on EKS? How do I add the jar files in that case?
@vaibhavverma1340 1 year ago
Hello bhaiya, I followed each step but I am still getting this error: "py4j.protocol.py4jjavaerror: an error occurred while calling o90.load snowflake". Please help me out.
@laterlname7865 2 years ago
Is it mandatory to have Spark to connect to Snowflake? Can't we directly access data in Snowflake tables using SQL in AWS Glue's Python program? The reason I am asking is that Spark is a big data analytics tool and not every application is meant for data analytics. Most business applications are SQL-based programs of the Insert, Update, Select, Delete type. So can I embed these SQL statements in AWS Glue's Python scripts without using Spark in the code?
@KnowledgeAmplifier1 2 years ago
Hello Later Lname, you asked a very good question. I created this separate video to answer it: kzbin.info/www/bejne/hXuwY3yhmK6KldE Hope this will be helpful! Happy Learning :-)
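For readers who want a feel for the Spark-free route asked about above, here is a minimal sketch. It is not the code from the linked video. It assumes the `snowflake-connector-python` package is available to the Glue Python shell job; the helper name `run_statements` is hypothetical, and it only relies on the standard Python DB-API calls (`cursor`, `execute`, `fetchall`, `commit`) that the Snowflake connector also provides.

```python
def run_statements(conn, statements):
    """Run SQL statements in order on a DB-API connection.

    Returns one list of rows per statement (empty when the driver
    reports no result set for that statement).
    """
    cur = conn.cursor()
    try:
        results = []
        for sql in statements:
            cur.execute(sql)
            # cursor.description is None when a statement returns no rows
            results.append(cur.fetchall() if cur.description else [])
        conn.commit()
        return results
    finally:
        cur.close()


# Assumed usage inside a Glue Python shell job (all values are placeholders):
#
#   import snowflake.connector
#   conn = snowflake.connector.connect(
#       user="MY_USER", password="MY_PASSWORD", account="MY_ACCOUNT",
#       warehouse="MY_WH", database="MY_DB", schema="PUBLIC",
#   )
#   run_statements(conn, ["UPDATE my_table SET status = 'done' WHERE id = 1"])
```

Because the helper only depends on the DB-API interface, it can be tried locally against any DB-API driver before pointing it at Snowflake.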
@yamunau.yamuna5189 1 year ago
Thanks a lot bro, your video is awesome
@KnowledgeAmplifier1 1 year ago
Most welcome!
@madhubhardwaj4512 1 year ago
How can we find the jar file versions that are compatible with the current Spark version? Please reply.
@krishnasanagavarapu4858 1 year ago
Can we create a reverse integration, i.e. fetch huge data (80 million rows) from Snowflake to S3 without using a stage? We have only read-only access to Snowflake.
@swarnadeepchowdhury563 2 years ago
Is AWS Glue mandatory for running Spark jobs on Snowflake?
@KnowledgeAmplifier1 2 years ago
Hello Swarnadeep Chowdhury, no, it's not mandatory. You can use other services where Spark can run, such as EMR. Here is a reference video: kzbin.info/www/bejne/pXuZhamQqpt3p7M Happy Learning
@vikinist 2 years ago
Can you share the link to the video on S3 and the Lambda trigger?
@KnowledgeAmplifier1 2 years ago
Hello Vikram, if you want to trigger an AWS Glue job whenever a file lands in S3 (S3 to Lambda and then Lambda to AWS Glue job), you can refer to this video: kzbin.info/www/bejne/Z6WsfmSgd9KqepY Hope this will be helpful! Happy Learning :-)
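The S3 to Lambda to Glue flow described in this reply could look roughly like the sketch below. It is not taken from the linked video. The job name `snowflake-load-job` and the argument names `--s3_bucket` and `--s3_key` are hypothetical; `start_job_run` is the boto3 Glue client call used to start a job run. The `glue_client` parameter exists only so the handler can be exercised without AWS access.

```python
import urllib.parse

GLUE_JOB_NAME = "snowflake-load-job"  # hypothetical Glue job name


def handler(event, context, glue_client=None):
    """Start one Glue job run for each S3 object listed in the event."""
    if glue_client is None:
        import boto3  # imported lazily so the handler can be tested offline

        glue_client = boto3.client("glue")

    run_ids = []
    for record in event.get("Records", []):
        bucket = record["s3"]["bucket"]["name"]
        # Object keys in S3 event notifications are URL-encoded
        key = urllib.parse.unquote_plus(record["s3"]["object"]["key"])
        response = glue_client.start_job_run(
            JobName=GLUE_JOB_NAME,
            Arguments={"--s3_bucket": bucket, "--s3_key": key},
        )
        run_ids.append(response["JobRunId"])
    return run_ids
```

Inside the Glue script, the two arguments would then be read with `getResolvedOptions` from `awsglue.utils`, assuming the job is set up to expect them.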
@kittu1010 2 years ago
Hello Sir, I am trying to perform several Spark operations (not just group by) once I read the table. I used the same jars but I am getting the following error: "An error occurred while calling o94.load. scala/Product$class". Do you know which jar will solve this issue? Thanks in advance.
@vikinist 2 years ago
One doubt, can you please answer: when should we go for Snowpipe and when should we go for Glue?
@KnowledgeAmplifier1 2 years ago
Hello Vikram, Snowpipe is used for real-time data ingestion from a data lake into Snowflake using services such as SQS or SNS. AWS Glue can be used for any batch processing: for batch ingestion or for transforming your data, you can use AWS Glue or EMR.
@vikinist 2 years ago
@KnowledgeAmplifier1 thanks for the quick reply
@KnowledgeAmplifier1 2 years ago
@vikinist no problem. Happy Learning
@ketank344 2 years ago
Hello, I am getting a connection refused error. Any idea what could be the reason?
@MrRajat769 1 year ago
Please make a video to explain what you are doing in Snowflake.
@ravikreddy7470 2 years ago
Can you post a video with an S3 -> Glue -> RS pipeline (not using PySpark)?
@anhdo7704 1 year ago
May I ask what exactly the username for Snowflake is here? I don't know where to find the username.
@KnowledgeAmplifier1 1 year ago
Hello Anh Do, the username is what you use to log in to the Snowflake web console. You might have set it up during sign-up, or your admin team can confirm it. If you are using OAuth, there will usually be a dedicated user for connecting from Python, PySpark, etc.; the admin team in your project can confirm that as well.
@yogeshbharadwaj6200 3 years ago
Thanks a lot brother, very helpful. Very good, easy, clear explanation. If I need to join 2 tables, can I specify the table names comma-separated in "source_table_name" and perform the join in ".option("query","********")"? Please suggest. Thanks.
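One way to approach the join question above, sketched under assumptions rather than taken from the video: the Snowflake Spark connector accepts a `query` option in place of `dbtable`, so the join can be written as a single SQL statement and passed as that query. The helper names, table names, and column names below are hypothetical, and `sf_opts` stands for the usual connection options (`sfURL`, `sfUser`, and so on).

```python
SNOWFLAKE_SOURCE = "net.snowflake.spark.snowflake"


def join_query(left, right, key, columns):
    """Build a two-table join; columns are given with l./r. prefixes."""
    select_list = ", ".join(columns)
    return (
        f"SELECT {select_list} FROM {left} l "
        f"JOIN {right} r ON l.{key} = r.{key}"
    )


def read_join(spark, sf_opts, left, right, key, columns):
    """Load the join result as a Spark DataFrame via the query option."""
    return (
        spark.read.format(SNOWFLAKE_SOURCE)
        .options(**sf_opts)
        .option("query", join_query(left, right, key, columns))
        .load()
    )
```

An alternative is to read each table separately with `dbtable` and join the two DataFrames in Spark; which is preferable depends on where the join should run.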
@adithyabulusu8812 2 years ago
Thanks a lot bro. Can you also please share a video on loading data from S3 to Snowflake using Lambda and Glue?
@KnowledgeAmplifier1 2 years ago
Hello Adithya, here I have explained the AWS Glue and Snowflake integration, and in the video below I have explained the S3, Lambda and Glue integration. You can combine these and customize them as per your requirements: kzbin.info/www/bejne/Z6WsfmSgd9KqepY Happy Learning :-)
@codewithsharath5988 2 years ago
Awesome. Can you make a video on how to connect to Redshift using PySpark in a similar way?
@pachappagarimohanvamsi4641 2 years ago
Hello. This approach does not seem very useful. If I am right, here we are reading the Snowflake table, processing it in Spark, and storing the data in Snowflake again. For that we can use Snowflake itself; AWS Glue is an extra cost 😅
@KnowledgeAmplifier1 2 years ago
Hello PACHAPPAGARI MOHAN VAMSI, yes, you are right that these transformations can be done using Snowflake's own compute power. This video fundamentally explains how to integrate Snowflake with Spark on the AWS Glue platform, and to explain that I used a dummy transformation. The concept can be used for other workloads that are not possible with Snowflake alone. For example, if the data is available in MySQL RDS (source), we can use Spark to read the data from MySQL and then write it to Snowflake (destination). If we want to use AWS Glue as the execution environment in that case, the concepts in this video can be useful.
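The MySQL-to-Snowflake scenario mentioned in this reply could be sketched as follows. This is an illustration under assumptions, not code from the video: it assumes the MySQL JDBC driver jar and the Snowflake Spark connector jars are attached to the Glue job, and all host, database, and table names are placeholders. The option keys (`sfURL`, `sfUser`, `sfPassword`, `sfDatabase`, `sfSchema`, `sfWarehouse`, `dbtable`) are the ones the Snowflake Spark connector uses; the helper function names are hypothetical.

```python
SNOWFLAKE_SOURCE = "net.snowflake.spark.snowflake"


def mysql_jdbc_options(host, port, database, table, user, password):
    """Options for Spark's built-in JDBC reader against MySQL."""
    return {
        "url": f"jdbc:mysql://{host}:{port}/{database}",
        "dbtable": table,
        "user": user,
        "password": password,
        "driver": "com.mysql.cj.jdbc.Driver",
    }


def snowflake_options(url, user, password, database, schema, warehouse):
    """Connection options for the Snowflake Spark connector."""
    return {
        "sfURL": url,
        "sfUser": user,
        "sfPassword": password,
        "sfDatabase": database,
        "sfSchema": schema,
        "sfWarehouse": warehouse,
    }


def copy_table(spark, jdbc_opts, sf_opts, target_table):
    """Read a MySQL table with Spark and append it to a Snowflake table."""
    df = spark.read.format("jdbc").options(**jdbc_opts).load()
    (
        df.write.format(SNOWFLAKE_SOURCE)
        .options(**sf_opts)
        .option("dbtable", target_table)
        .mode("append")
        .save()
    )
```

In a real job the credentials would more likely come from a secrets store than from literals in the script; that part is left out of the sketch.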
@pachappagarimohanvamsi4641 2 years ago
@KnowledgeAmplifier1 👍
@krishnashukla3638 2 years ago
Hi friend, how can I read data from RDS and ingest it into Snowflake using Glue? Do you have any example for that? It would be really helpful for me. Thanks.
@rajeevranjan5913 1 year ago
Hi, I have exactly the same requirement. Could you please help with the process if you have achieved it?
@AY1986R 1 year ago
Thank you very much for this video. Could you please do an example with Oracle and Python?
@bishnupriyamukherjee4746 3 years ago
👌👌👌👌👌👌👌👌
@KnowledgeAmplifier1 3 years ago
Thank You 😄
@krishnasanagavarapu4858 3 years ago