Ye... I really appreciate all your wonderful classes.
@enidaguja6655 2 years ago
Thank you for all the tutorials. They are great and helpful! I am wondering whether one project can have many recipes. Best regards
@AWSTutorialsOnline 2 years ago
It is one recipe per project. However, one recipe can be used in many projects.
@bobbrandt3043 2 years ago
Is it possible to merge the columns WITHOUT deleting the original columns?
@AWSTutorialsOnline 2 years ago
Yes. You can create a new column with concatenation, which leaves the original columns in place.
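To illustrate the idea outside DataBrew, here is a minimal sketch in plain Python. It is not a DataBrew recipe step; the function name `add_merged_column` and the column names are made up for the example. It only shows that writing the concatenated value to a new column keeps the source columns.

```python
def add_merged_column(rows, source_columns, target_column, delimiter=" "):
    """Return new rows with target_column added; the source columns are kept."""
    merged = []
    for row in rows:
        new_row = dict(row)  # copy so the input rows are not modified
        new_row[target_column] = delimiter.join(str(row[c]) for c in source_columns)
        merged.append(new_row)
    return merged


# Hypothetical sample data, for illustration only.
rows = [{"first_name": "Ada", "last_name": "Lovelace"}]
result = add_merged_column(rows, ["first_name", "last_name"], "full_name")
print(result)
```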
@charliehunter6387 3 years ago
Awesome tutorial! When I run my DataBrew job, I get multiple CSVs called 'XXXX_part00001', 'XXXX_part00002', etc. Is there a way I can make it output one CSV with all the parts?
@AWSTutorialsOnline 3 years ago
Out of the box, the output files are partitioned by size, and you cannot control that. The idea is to avoid files that are too big or too small, since both degrade performance when you query the data with Athena. You can also configure column-based partitioning. Why do you want to avoid partitioning?
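If a single file is still needed downstream, one option is to combine the parts after the job finishes. The following is a minimal sketch, not a DataBrew feature. It assumes the part files have already been downloaded to a local folder, that every part starts with the same header row, and that the names match a pattern like `*_part*`. The function name `merge_csv_parts` is hypothetical.

```python
import csv
from pathlib import Path


def merge_csv_parts(parts_dir, output_path, pattern="*_part*"):
    """Combine CSV part files into one file, keeping a single header row.

    Returns the number of data rows written.
    """
    # Sort by name so part00001 comes before part00002, and so on.
    part_files = sorted(Path(parts_dir).glob(pattern))
    header = None
    rows_written = 0
    with open(output_path, "w", newline="") as out:
        writer = csv.writer(out)
        for part in part_files:
            with open(part, newline="") as f:
                reader = csv.reader(f)
                part_header = next(reader, None)
                if part_header is None:
                    continue  # skip empty part files
                if header is None:
                    header = part_header
                    writer.writerow(header)
                elif part_header != header:
                    raise ValueError(f"{part.name} has a different header")
                for row in reader:
                    writer.writerow(row)
                    rows_written += 1
    return rows_written
```

Usage would look like `merge_csv_parts("downloads/", "merged.csv")`, where both paths are placeholders. Write the merged file under a name that does not match the pattern, so a later run does not pick it up as a part.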
@sridharvuligonda319 3 years ago
Can we use DataBrew as an ETL tool instead of Glue Studio?
@AWSTutorialsOnline 3 years ago
You can. You can convert the recipe into a job. But keep in mind that you have no control over the code DataBrew generates, and you cannot change it.
@sancho709 4 years ago
Very, very good
@AWSTutorialsOnline 4 years ago
Thank you very much
@krishnaprasadas8566 3 years ago
What is the underlying processing engine for DataBrew? I mean, where do the jobs and data quality jobs actually run?
@AWSTutorialsOnline 3 years ago
It runs on managed servers, but I am not sure how the processing engine is configured. It could be Apache Spark. Sometimes I get errors that mention a DAG, which makes me think it might be using Apache Airflow, but I am not 100% sure.
@krishnaprasadas8566 3 years ago
@@AWSTutorialsOnline Okay, but I don't know whether it really supports big data cost-effectively.