Рет қаралды 1,116
In this video I show how to profile data in Azure Synapse Dedicated SQL Pools.
Hello Sweeties,
Profiling data in tables is hard and slow, but worth doing.
There are a couple of good reasons for this;
a) Everything runs faster when you use the right data types!!! Pick the smallest data type for each column.
b) Picking the best column for HASH distributions ; select a column with a large amount of distinct values. If the number of distinct values is close to the number of rows, then you might as well pick round robin.
But WARNING this script looks at every row in every column, making it very slow. So be careful, test this on small tables (small number of columns and rows) first.
The script is here; gist.github.co...
Enjoy,
Love you!!
#AzureSynapse #DedicatedSQLPool