Join Strategies in Apache Spark
12:18
Azure DevOps - Introduction
4:42
4 ай бұрын
Пікірлер
@chandrasingh1184
@chandrasingh1184 23 күн бұрын
Very descriptive.
@dataenggstudies
@dataenggstudies 23 күн бұрын
Glad you found it helpful!
@chandrasingh1184
@chandrasingh1184 25 күн бұрын
Nice
@dataenggstudies
@dataenggstudies 25 күн бұрын
Thanks, glad you liked it!
@Александр-ъ3о1щ
@Александр-ъ3о1щ 2 ай бұрын
is it possible to add XML extension to pyspark-jupyter installation or is this extension only available for databrics?
@dataenggstudies
@dataenggstudies Ай бұрын
To add XML extension to your PySpark-Jupyter installation, you'll need to install the spark-xml library. Open your Jupyter Notebook and run the following command !pip install spark-xml Now, you can use the library in your PySpark code
@serdalaslantas
@serdalaslantas 2 ай бұрын
Hi, installling failed due to pemisssion deniel. It says permission denied! What am I doing wrong? Appreciate if you can help!
@dataenggstudies
@dataenggstudies 2 ай бұрын
Run docker compose with sudo if you don't have permission sudo docker compose up
@Александр-ъ3о1щ
@Александр-ъ3о1щ 2 ай бұрын
Hi! I can't to save file (csv from example) through dataframe.write into local docker folder. How I can deal with it?
@dataenggstudies
@dataenggstudies 2 ай бұрын
I will get back on this shortly.
@Александр-ъ3о1щ
@Александр-ъ3о1щ 2 ай бұрын
@@dataenggstudies thnx, I will be waiting!
@dataenggstudies
@dataenggstudies 2 ай бұрын
Step 1: Mount a Volume When you run your Docker container, you need to mount a volume that will act as a bridge between the container's filesystem and your local filesystem. Step 2: Write the CSV File in Your Code Within your Python code running inside the Docker container, use the DataFrame's to_csv() method to write the CSV file to the mounted volume. df.to_csv("/path/in/container/your_file.csv", index=False) I will create a video on this steps.
@user-qv6dz8un5q
@user-qv6dz8un5q 2 ай бұрын
the goat
@thanushsivasai9909
@thanushsivasai9909 3 ай бұрын
Thanks!
@dataenggstudies
@dataenggstudies 3 ай бұрын
Welcome!
@hasanrazashaikh421
@hasanrazashaikh421 3 ай бұрын
thanks nice explanation
@dataenggstudies
@dataenggstudies 3 ай бұрын
Glad you liked it
@chandrasingh1184
@chandrasingh1184 3 ай бұрын
Useful
@dataenggstudies
@dataenggstudies 3 ай бұрын
Glad it helped!
@midhunskani
@midhunskani 3 ай бұрын
This is a very good and easy solution
@dataenggstudies
@dataenggstudies 3 ай бұрын
Thank you. Glad it helped.
@vinoda3480
@vinoda3480 3 ай бұрын
Need end to end project demo. Video
@dataenggstudies
@dataenggstudies 3 ай бұрын
Glad it helped. Sure,thank you for the suggestion. I will work on end to end project demo. 👍
@hanpanBR
@hanpanBR 3 ай бұрын
Thanks for the video! It was so simple and yet I was very confused till I got here.
@dataenggstudies
@dataenggstudies 3 ай бұрын
Glad it helped!
@maureenwaitherero8373
@maureenwaitherero8373 4 ай бұрын
Could you create tables in dedicated sql pool fro these files dynamically
@dataenggstudies
@dataenggstudies 4 ай бұрын
Sure I will create a video for this scenario.
@HannekeDebie
@HannekeDebie 4 ай бұрын
This was very helpful in getting my csv files into dataframes!
@dataenggstudies
@dataenggstudies 4 ай бұрын
Glad it helped! Keep learning.
@saddamhussain9975
@saddamhussain9975 4 ай бұрын
learn many things from this video...waiting for more videos
@dataenggstudies
@dataenggstudies 4 ай бұрын
Good. Keep learning! I am glad it helped!
@julietruong2059
@julietruong2059 4 ай бұрын
Really good explanation! Thanks for this!
@dataenggstudies
@dataenggstudies 4 ай бұрын
You are welcome! Glad it helped.
@vinoda3480
@vinoda3480 4 ай бұрын
Nice explanation. Expecting more videos like this. Requesting to do video how to load both files and SQL into azure folder using single dedicated pipeline.
@dataenggstudies
@dataenggstudies 4 ай бұрын
Thanks for the idea! Noted. I will upload soon.
@RajashekharKumbar-gj8wz
@RajashekharKumbar-gj8wz 3 ай бұрын
It can be one more approach for this senario using copy behavior
@dataenggstudies
@dataenggstudies 3 ай бұрын
@@RajashekharKumbar-gj8wz you are correct.
@alayareddymeka7721
@alayareddymeka7721 5 ай бұрын
Hello sir... Thank you for the solution.. can you please explain the same solution if XML file has varying nested data types
@dataenggstudies
@dataenggstudies 5 ай бұрын
Sure. I will!
@alayareddymeka7721
@alayareddymeka7721 5 ай бұрын
​@@dataenggstudiesThank you...Also the nested data types may contain various depth level. So, flattening should be a dynamic logic. Is that possible?
@dataenggstudies
@dataenggstudies 4 ай бұрын
Based on what I research dynamic flattening is not possible. I will share if I find any details
@Ajaybchaudhari
@Ajaybchaudhari 5 ай бұрын
These videos are so helpful, simple yet so informative
@dataenggstudies
@dataenggstudies 5 ай бұрын
Happy to hear that!
@varmauppalapati7556
@varmauppalapati7556 5 ай бұрын
thanks for the video bro, but local host 4040 is not working
@dataenggstudies
@dataenggstudies 5 ай бұрын
Local host 4040 might have been already used by another application in your device. Try using a different port.
@varmauppalapati7556
@varmauppalapati7556 5 ай бұрын
@@dataenggstudies have tried using different ports, it didn't work
@dataenggstudies
@dataenggstudies 5 ай бұрын
@@varmauppalapati7556 Could you please share the error you are getting.
@aswathis5902
@aswathis5902 5 ай бұрын
Tqqqq so much bro❤
@dataenggstudies
@dataenggstudies 5 ай бұрын
Glad it helped you.
@QuantizedFields
@QuantizedFields 5 ай бұрын
I think you misspelled, as it should be a square of a number, not square-root of a number. But this is a very smart approach, thanks for sharing this work.
@dataenggstudies
@dataenggstudies 5 ай бұрын
That is correct. Thank you for the correction. I am glad it help ❤
@supriyakushwaha21
@supriyakushwaha21 5 ай бұрын
NICE EXPLAINATION.
@dataenggstudies
@dataenggstudies 5 ай бұрын
Glad it helped.
@chandrasingh1184
@chandrasingh1184 6 ай бұрын
Great
@dataenggstudies
@dataenggstudies 6 ай бұрын
Glad it helped!
@MrMaiio123
@MrMaiio123 6 ай бұрын
Great explanation! Thank you
@dataenggstudies
@dataenggstudies 6 ай бұрын
Glad it helped!
@mnikhil8491
@mnikhil8491 6 ай бұрын
thanks for the explanation .... it helped a lot..
@dataenggstudies
@dataenggstudies 6 ай бұрын
Glad it helped!
@shatirdimag1011
@shatirdimag1011 7 ай бұрын
Best solution thanks💯💯💯
@dataenggstudies
@dataenggstudies 7 ай бұрын
Glad it helped!
@nitishrawat9391
@nitishrawat9391 7 ай бұрын
Great to see clear instructions and simple approach
@dataenggstudies
@dataenggstudies 7 ай бұрын
Glad it was helpful!
@manderb1684
@manderb1684 8 ай бұрын
Amazing content , Thanks
@dataenggstudies
@dataenggstudies 8 ай бұрын
Glad you liked it
@avilateresa1486
@avilateresa1486 8 ай бұрын
Well explained, very interesting scenario!
@dataenggstudies
@dataenggstudies 8 ай бұрын
Thank you.
@vinoda3480
@vinoda3480 8 ай бұрын
Good Explanation. Expecting more scenarios from you
@dataenggstudies
@dataenggstudies 8 ай бұрын
Thank you. I am glad it helped you!
@peterodedeyi3366
@peterodedeyi3366 8 ай бұрын
Very straightforward and clear tutorial. Thank you, Joseph.
@dataenggstudies
@dataenggstudies 8 ай бұрын
Glad it was helpful!
@dhp106
@dhp106 9 ай бұрын
Can you demonstrate how to pgp encrypt a file in azure storage blob, using synapse notebooks where the public key is also in storage blob
@dataenggstudies
@dataenggstudies 9 ай бұрын
Sure, I will, thank you for letting me know.
@prabhatgupta6415
@prabhatgupta6415 10 ай бұрын
u can use unpivot functions
@dataenggstudies
@dataenggstudies 10 ай бұрын
Yes looks like it is a new feature in spark version 3.4.0. Thank you for sharing. When recording this video this functionality was not available.
@tinashechinyati6823
@tinashechinyati6823 10 ай бұрын
great content
@dataenggstudies
@dataenggstudies 10 ай бұрын
Thank you. I am glad you liked.
@nikhilrothe3419
@nikhilrothe3419 10 ай бұрын
Very helpful
@dataenggstudies
@dataenggstudies 10 ай бұрын
Thank you. I am glad that it helped.
@vinoda3480
@vinoda3480 10 ай бұрын
Good Content very much helpful. But please parameterize the target folder and Input folder
@dataenggstudies
@dataenggstudies 10 ай бұрын
Thank you for the suggestion. I will incorporate in the upcoming videos.
@abduljaweed8131
@abduljaweed8131 10 ай бұрын
Hi bro I have one scenario like i have a documents in cosmosdb for nosql and i want to create a pipeline to triggered it if certain value is updated in cosmosdb document like age=21then trigger the event and then perform some transformation using python and then send that changes to new cosmosdb container If you make one video on that scenario that could be great helpful
@dataenggstudies
@dataenggstudies 10 ай бұрын
Sure, I will create a video for this scenario. Thank you for sharing
@lutherlusk2975
@lutherlusk2975 11 ай бұрын
youre writing while inside for , doesnt that incrase the time complexity from n to m * n or something like that ?
@dataenggstudies
@dataenggstudies 11 ай бұрын
This solution looks like O(n*m) but it is actually not because we are iterating over the while loop only for the numbers that are 'first' numbers, in other words the numbers that satisfy the condition (if n-1 not in nums:) not for all the numbers. Hence the time complexity is O(n).
@mrunknown69692
@mrunknown69692 11 ай бұрын
bhai thanks for the python solution. there are not much resources available for dsa with python. keep going
@dataenggstudies
@dataenggstudies 11 ай бұрын
Sure, I will upload more. Glad it helped.
@JosephGeorge-pt3hd
@JosephGeorge-pt3hd 11 ай бұрын
Well done.thank you
@dataenggstudies
@dataenggstudies 11 ай бұрын
Thank you.
@JosephGeorge-pt3hd
@JosephGeorge-pt3hd 11 ай бұрын
Good informative. Keep it up
@dataenggstudies
@dataenggstudies 11 ай бұрын
Thanks you.
@JosephGeorge-pt3hd
@JosephGeorge-pt3hd 11 ай бұрын
Well explained😊
@dataenggstudies
@dataenggstudies 11 ай бұрын
Thank you 🙂
@vinoda3480
@vinoda3480 Жыл бұрын
Good Start !! Expecting more videos from you.
@dataenggstudies
@dataenggstudies Жыл бұрын
Thank you! Glad it helped.