This video walks through creating a simple Hadoop cluster on Google Dataproc, including the settings needed to (a) connect to the cluster from a Jupyter Notebook, (b) save the notebook to a Google Cloud Storage bucket, and (c) connect to the cluster's master node through a terminal. This is part 1 of a two-part series. (In part 2, I walk through importing a CSV file into the Hadoop cluster and analyzing it with PySpark in a Jupyter Notebook.)
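
For readers who prefer the command line to the Cloud Console, the three goals above can be roughly sketched with the gcloud CLI. This is not necessarily the route the video takes; it is a minimal sketch that only builds and prints the commands, without running them. The cluster name, region, zone, and bucket name are hypothetical placeholders, the worker count is an assumed default, and the script assumes a single-master cluster, whose master VM is named after the cluster with an "-m" suffix.

```python
import shlex


def create_cluster_command(cluster, region, bucket, num_workers=2):
    """Build the gcloud argument list for a Dataproc cluster with Jupyter enabled.

    All argument values are supplied by the caller; nothing here is taken
    from the video itself.
    """
    return [
        "gcloud", "dataproc", "clusters", "create", cluster,
        f"--region={region}",
        f"--num-workers={num_workers}",
        # Install the Jupyter optional component on the cluster.
        "--optional-components=JUPYTER",
        # Expose web interfaces such as Jupyter through the Component Gateway.
        "--enable-component-gateway",
        # Staging bucket for the cluster; the Jupyter component keeps
        # notebooks in Cloud Storage rather than on the master's disk.
        f"--bucket={bucket}",
    ]


def ssh_master_command(cluster, zone):
    """Build the gcloud argument list for opening a terminal on the master node.

    Assumes a single-master cluster, whose master VM is named "<cluster>-m".
    """
    return ["gcloud", "compute", "ssh", f"{cluster}-m", f"--zone={zone}"]


if __name__ == "__main__":
    # Hypothetical example values; replace with your own project settings.
    create_cmd = create_cluster_command(
        "demo-cluster", "us-central1", "my-notebook-bucket"
    )
    ssh_cmd = ssh_master_command("demo-cluster", "us-central1-a")

    # Print the commands for review instead of executing them.
    print(shlex.join(create_cmd))
    print(shlex.join(ssh_cmd))
```

Printing the commands rather than executing them keeps the sketch safe to run anywhere; paste the output into a shell where gcloud is installed and authenticated once the values match your project.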