AWS Tutorials - Introduction to AWS Glue DataBrew

4,982 views

AWS Tutorials

Day ago

Comments: 16
@4niceguy 2 years ago
Ye... I really appreciate all your wonderful classes.
@enidaguja6655 2 years ago
Thank you for all the tutorials. They are great and helpful! I am wondering, could one project have many recipes? Best regards
@AWSTutorialsOnline 2 years ago
It is one recipe per project. However, one recipe can be used in many projects.
@bobbrandt3043 2 years ago
Is it possible to merge the columns WITHOUT deleting the original columns?
@AWSTutorialsOnline 2 years ago
Yes. You can create a new column with concatenation.
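To illustrate the idea in the reply above outside of DataBrew, here is a minimal Python sketch of a non-destructive merge: the source columns stay in place and the concatenated value goes into a new column. The helper name `add_concat_column` and the column names are hypothetical and are not part of DataBrew.

```python
def add_concat_column(rows, sources, target, sep=" "):
    """Return new rows with `target` set to the joined values of `sources`.

    The source columns are kept, and the input rows are not modified.
    """
    return [
        {**row, target: sep.join(str(row[col]) for col in sources)}
        for row in rows
    ]


# Hypothetical sample data, for illustration only.
rows = [{"first_name": "Ann", "last_name": "Lee"}]
result = add_concat_column(rows, ["first_name", "last_name"], "full_name")
```

Each row in `result` still carries `first_name` and `last_name` alongside the new `full_name` column.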
@charliehunter6387 3 years ago
Awesome tutorial! When I run my DataBrew job I get multiple CSVs called 'XXXX_part00001', 'XXXX_part00002', etc. Is there a way I can make it output one CSV with all the parts?
@AWSTutorialsOnline 3 years ago
Out of the box, the files are partitioned based on size and you cannot control it. The idea is to avoid files that are too big or too small, as both degrade performance if you access the data using Athena. You can also configure column-based partitioning. Why do you want to avoid partitioning?
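If a single CSV is still needed downstream, one possible workaround is to merge the part files after the job finishes. The sketch below is only an illustration using the Python standard library. It assumes every part file carries the same header row, which should be checked against the actual job output, and it takes the part contents as strings (in practice they would first be downloaded from the job's output location). The function name `merge_csv_parts` is hypothetical.

```python
import csv
import io


def merge_csv_parts(parts):
    """Merge CSV parts (as strings) into one CSV string with a single header.

    Assumption: each part starts with the same header row.
    """
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    header = None
    for text in parts:
        rows = list(csv.reader(io.StringIO(text)))
        if not rows:
            continue  # skip empty part files
        if header is None:
            header = rows[0]
            writer.writerow(header)
        elif rows[0] != header:
            raise ValueError("part files have different headers")
        writer.writerows(rows[1:])
    return out.getvalue()


# Hypothetical contents of two part files.
parts = ["id,name\n1,Ann\n", "id,name\n2,Bob\n"]
merged = merge_csv_parts(parts)
```

Here `merged` contains the header once, followed by the data rows of both parts in order.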
@sridharvuligonda319 3 years ago
Can we use this DataBrew as an ETL tool instead of Glue studio?
@AWSTutorialsOnline 3 years ago
You can. You can convert the recipe into a job. But keep in mind that you don't have any control over the code generated by DataBrew and you cannot change it.
@sancho709 4 years ago
Very, very good
@AWSTutorialsOnline 4 years ago
Thank you very much
@krishnaprasadas8566 3 years ago
What is the underlying processing engine for DataBrew? I mean, where are the jobs, such as data quality jobs, running?
@AWSTutorialsOnline 3 years ago
It uses managed servers for this purpose, but I am not sure about the processing engine configuration. It could be Apache Spark. Sometimes I get errors that mention a DAG, which makes me think it might be using Apache Airflow, but I am not 100% sure.
@krishnaprasadas8566 3 years ago
@AWSTutorialsOnline Okay, but I don't know whether they really support big data cost-effectively.
@katiushkaflores 3 years ago
They even used the same vocabulary as Trifacta!
@AWSTutorialsOnline 3 years ago
Sorry, I did not understand your feedback.