Рет қаралды 5,153
Provides an overview of Dataplex Explore for executing some Spark SQL against BigQuery internal tables, external tables and Hive tables. The demo also shows how you can use a notebook along with scheduling and sharing your artifacts. Everything is provisioned via an Airflow DAG using Terraform to setup the data lakes, Dataproc Metastore and the Dataplex environment so you can create your own queries.
All code is on GitHub: goo.gle/dagd