97. Databricks | Pyspark | Data Security: Enforcing Column Level Encryption

  Views: 9,422

Raja's Data Engineering


Comments: 26
@tanushreenagar3116 1 year ago
Great explanation, sir.
@rajasdataengineering7585 1 year ago
Thanks, keep watching!
@sravankumar1767 1 year ago
Nice explanation 👌 👍 👏
@rajasdataengineering7585 1 year ago
Thank you 🙂
@SravanthiP-v3o 11 months ago
Sir, after creating the encrypted and decrypted columns, how do we hide the original ssn table? And when we run the command select ssn from dimeployee, how do we ensure that we get the encrypted format?
@sanjayr3597 10 months ago
Good video. I have a question regarding the KEY: how is that value stored? Can we use the same function in another notebook on the same cluster? Are there any drawbacks to this method?
@rmrz2225 1 year ago
Good job, but I have a question: when we encrypt information, aren't we supposed to be unable to decrypt it?
@rajasdataengineering7585 1 year ago
Thanks. Yes, we need to decrypt it when we want to use it later.
@NagarjunaSunguluru 1 year ago
Can we apply those functions to nested JSON data as well?
@rajasdataengineering7585 1 year ago
Yes, we can. We can apply the encryption at the DataFrame level and then write the result in JSON format, which keeps the encrypted data inside the JSON file.
@lavanijavidalikhan3844 1 year ago
Can we apply the same approach to encrypt a CSV file?
@rajasdataengineering7585 1 year ago
Yes, we can.
@ramreddy1138 1 year ago
Good one. But how do we filter the data and apply comparisons?
@rajasdataengineering7585 1 year ago
To filter and compare, we need to decrypt the data on the fly using the decrypt method.
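Decrypting on the fly means every row has to be decrypted each time a filter runs. For equality filters only, one commonly used alternative, which is not the method shown in the video, is to store a deterministic keyed hash of the value next to the encrypted column and filter on that hash. The sketch below uses plain Python with the standard library; `INDEX_KEY`, `blind_index`, `find_by_ssn`, and the row layout are hypothetical names chosen for illustration, and the ciphertext values are placeholders.

```python
import hashlib
import hmac

# Hypothetical key for illustration only; a real key would come from a secret store.
INDEX_KEY = b"demo-only-index-key"


def blind_index(value: str, key: bytes = INDEX_KEY) -> str:
    """Return a deterministic keyed hash (HMAC-SHA256) used only for equality lookups."""
    return hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()


# Rows as they might be stored: the ciphertext stays opaque, ssn_idx is the lookup column.
rows = [
    {"name": "A", "ssn_encrypted": "<ciphertext-1>", "ssn_idx": blind_index("111-11-1111")},
    {"name": "B", "ssn_encrypted": "<ciphertext-2>", "ssn_idx": blind_index("222-22-2222")},
]


def find_by_ssn(stored_rows, ssn):
    """Filter by comparing keyed hashes, so no row has to be decrypted."""
    target = blind_index(ssn)
    return [r for r in stored_rows if hmac.compare_digest(r["ssn_idx"], target)]


matches = find_by_ssn(rows, "222-22-2222")
```

This only supports exact-match filters, not range comparisons, and because the hash is deterministic, equal plaintext values produce equal index values.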
@ramreddy1138 1 year ago
@rajasdataengineering7585 That will impact performance too much.
@revjr1284 1 year ago
I am getting the error below while encrypting the data: 'TypeError: encoding without a string argument'. Kindly help.
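Python raises this message when `bytes(value, 'utf-8')` receives something other than a string, for example an integer or `None`. Assuming the encryption function converts the column value to bytes that way (the thread does not show the code), a likely cause is a numeric or null column. The sketch below reproduces the error and shows one possible guard; `to_bytes` is a hypothetical helper, not part of the video's notebook.

```python
def to_bytes(value):
    """Convert a column value to UTF-8 bytes before encryption (hypothetical helper)."""
    if value is None:
        return None  # leave nulls untouched instead of failing
    return str(value).encode("utf-8")  # cast numbers and other types to str first


# Reproduce the error: an integer is passed where a string is expected.
error_message = None
try:
    bytes(123456789, "utf-8")
except TypeError as exc:
    error_message = str(exc)

# The guarded conversion works for integers, strings, and nulls.
encoded_int = to_bytes(123456789)
encoded_str = to_bytes("123-45-6789")
encoded_null = to_bytes(None)
```

In PySpark, casting the column to string before passing it to the encryption UDF would have a similar effect for non-null values.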
@phanisrikrishna 1 year ago
Hi Raja, this video is great. I have one question: will the size of the DataFrame increase when we encrypt some of the columns? How do we take care of memory while designing? Thanks in advance.
@sravankumar1767 1 year ago
What is MD5? Could you please explain it?
@rajasdataengineering7585 1 year ago
MD5 is a hashing function.
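As a small illustration of what a hashing function does, the sketch below computes MD5 digests with Python's standard `hashlib` module. `md5_hex` is a hypothetical helper name. Unlike encryption, hashing is one-way: there is no key and no decrypt step, and the same input always yields the same fixed-length digest.

```python
import hashlib


def md5_hex(value: str) -> str:
    """Return the MD5 digest of a string as 32 hexadecimal characters."""
    return hashlib.md5(value.encode("utf-8")).hexdigest()


digest_a = md5_hex("123-45-6789")
digest_b = md5_hex("123-45-6789")  # same input, same digest
digest_c = md5_hex("123-45-6780")  # a one-character change gives a different digest
```

MD5 is no longer considered collision resistant, so for security-sensitive uses a stronger function such as SHA-256 is generally preferred.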
@nagamanickam6604 8 months ago
Thank you
@rajasdataengineering7585 8 months ago
You're welcome
@sabesanj5509 1 year ago
Raja sir, will they ask these kinds of questions in Spark interviews?
@rajasdataengineering7585 1 year ago
Yes, data security is a must-know topic in interviews.
@azureadi-q3y 1 year ago
Good explanation. Could you please explain Delta Live Tables? Today I loaded a table with 20 columns, and the next day the same table arrives with 2 extra columns. How do we handle this in delta loads in ADB, and how do we manage the Delta merge command in production?
@rajasdataengineering7585 1 year ago
Sure, I will create a video series on Delta Live Tables.
@gattureddy3796 9 months ago
Hi Raja, could you please share a notebook link or DBC file?