I had been looking for a wildcard to pick up all the files in a folder for 3 hours; you have made my life easy. I went through all the complex code available online and nothing worked. Your code is simple and does the job.
@DataMites 3 years ago
Thank you for your comment.
@jobaute3984 4 years ago
I was looking for this and didn't realise it was that simple! Thanks
@DataMites 4 years ago
Thank you and welcome
@allieubisse470 4 years ago
I need more of this in my data science life. You make everything light and easy. One concept per video. Short and sweet.
@ThankGodItsToday 4 years ago
Glad it helped :-)
@bhayj 2 years ago
I checked many web pages on Google and nothing helped, but your video solved my issue. Thanks a lot!!!😀
@DataMites 2 years ago
Glad to hear that
@ATHARVA89 3 years ago
Just what I was looking for in my work.
@VivianHarbison 1 year ago
Thank you for such a succinct, and thorough video. Just what I needed. I followed along and got my files to all come together with one viewing of your video.
@DataMites 1 year ago
Glad it helped!
@darkphysics4561 2 years ago
Wow...you made it look so simple and saved me tons of time with this video. Many thanks.
@DataMites 2 years ago
You're welcome!
@odeyemitemitope53 1 year ago
This just helped solve the issue I am having. Thank you!
@DataMites 1 year ago
Glad to hear it! Keep Watching
@MultiKasuku 3 years ago
You Sir deserve 1 billion subscribers
@DataMites 3 years ago
Thank you. Please do subscribe and share with friends.
@karinabrouwer2733 3 years ago
Thank you for explaining this. I've watched many tutorials on YT, but nothing worked until yours.
@DataMites 3 years ago
Hi Karina Brouwer, thank you for your comment.
@yichienchong7474 2 years ago
Thank you sir! Your video saved me lots of time.
@DataMites 2 years ago
You're welcome!
@judytsai9013 3 years ago
Hello Sir! Thank you very much for sharing this. It helps a lot with the obstacles I'm facing. I used this video as a reference to load multiple .JSON files. Would you mind sharing suggestions on how to flatten the nested nodes before concatenating the files? Thank you!
@DataMites 3 years ago
Glad it was helpful!
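On the question of flattening nested JSON before concatenating: one possible approach, sketched here rather than taken from the video, is to pass each file's records through pandas' `json_normalize` and then concatenate as usual. The file names and the `user` nesting below are made-up sample data, and the sketch assumes each file holds a list of records.

```python
import glob
import json
import os
import tempfile

import pandas as pd

# Hypothetical sample data: two JSON files, each holding a list of nested records.
folder = tempfile.mkdtemp()
samples = {
    "a.json": [{"id": 1, "user": {"name": "Ann", "city": "Oslo"}}],
    "b.json": [{"id": 2, "user": {"name": "Bo", "city": "Rome"}}],
}
for name, records in samples.items():
    with open(os.path.join(folder, name), "w", encoding="utf-8") as fh:
        json.dump(records, fh)

frames = []
for path in sorted(glob.glob(os.path.join(folder, "*.json"))):
    with open(path, encoding="utf-8") as fh:
        records = json.load(fh)
    # json_normalize expands nested keys into dotted column names such as "user.name".
    frames.append(pd.json_normalize(records))

# Stack the flattened frames into one dataframe with a fresh index.
df = pd.concat(frames, ignore_index=True)
print(df)
```

If the nesting differs from file to file, the resulting columns will differ too, and `pd.concat` will fill the gaps with NaN.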
@Abel_The_ 4 years ago
thank you. you are a life saver
@DataMites 4 years ago
Glad it helped!
@timothyhanson3694 1 month ago
Is there a way to include a column which shows what file the data has been pulled from?
@nikitamaru4198 1 year ago
Thank you sir
@harithameka8328 2 years ago
Thank you for the video, sir. I merged my multiple CSV files into an Excel file, but the output leaves a blank row each time it starts writing a new file; by the time it writes 10.csv it leaves 10 blank rows before the data. Please reply, sir.
@manojkarthicksriramulu8639 3 years ago
Thanks for the concept. You explained the logic in a way that is very simple for beginners to understand.
@DataMites 3 years ago
Glad it was helpful!
@manojkarthicksriramulu8639 3 years ago
Can you explain in detail the concepts of data cleaning: null values, NaN, replacing values, and how to set data types?
@multitaskprueba1 2 years ago
You are a genius! Thank you! What if the files share some columns but we do not know which ones?
@DataMites 2 years ago
Can you please rephrase your question?
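If the question is about files whose column sets only partly overlap, a minimal sketch of how `pd.concat` behaves may help. The two small dataframes are invented for illustration; in practice they would come from `read_csv`.

```python
import pandas as pd

# Hypothetical frames that share only the "id" column.
df1 = pd.DataFrame({"id": [1, 2], "price": [10.0, 20.0]})
df2 = pd.DataFrame({"id": [3], "qty": [5]})

# Default join="outer": keep the union of columns, fill missing cells with NaN.
outer = pd.concat([df1, df2], ignore_index=True)

# join="inner": keep only the columns present in every frame.
inner = pd.concat([df1, df2], ignore_index=True, join="inner")

# The shared columns can also be listed explicitly.
shared = sorted(set(df1.columns) & set(df2.columns))

print(outer)
print(inner)
print(shared)
```

So the shared columns do not need to be known in advance: pandas aligns by column name, and `join` decides whether the non-shared columns are kept.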
@GodX36999 3 years ago
Thanks for your great videos. But dear sir, what if a folder has many files and we just want to pick some of them and concatenate them horizontally? How can we do that, sir?
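One way this could be done, offered as a sketch and not as the video's method: build the list of file names by hand instead of using a wildcard, and pass `axis=1` to `pd.concat` so the frames are placed side by side. The folder, file names, and columns below are sample data created only for the example.

```python
import os
import tempfile

import pandas as pd

# Hypothetical sample folder with three CSV files.
folder = tempfile.mkdtemp()
for name, col in [("jan.csv", "jan"), ("feb.csv", "feb"), ("mar.csv", "mar")]:
    pd.DataFrame({col: [1, 2, 3]}).to_csv(os.path.join(folder, name), index=False)

# Pick only the files we care about, by name, instead of globbing everything.
wanted = ["jan.csv", "mar.csv"]
frames = [pd.read_csv(os.path.join(folder, name)) for name in wanted]

# axis=1 concatenates horizontally (columns side by side, aligned on the row index).
wide = pd.concat(frames, axis=1)
print(wide)
```

Horizontal concatenation aligns rows by index, so files with different row counts will produce NaN in the shorter columns.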
@pallabiroy9091 3 years ago
Very helpful. Thank you.
@DataMites 3 years ago
Glad it was helpful!
@riyazbagban9190 2 years ago
Sir, I have plain text files. How can I add the contents of each text file to a cell, so that each text file becomes one row of the dataset?
@nidhi_singh9494 1 year ago
Thank you for the wonderful explanation. My query is: I have .csv files, and I need to drop the first 20 rows of every file and then concatenate them all. Any suggestions?
@DataMites 1 year ago
Use a simple for loop to remove the first 20 records from every file, and then concatenate those files.
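The loop suggested above could look roughly like this. It is a sketch that assumes the first line of each file is a header worth keeping and that "first 20 rows" means the first 20 data records; the sample files are generated only so the example runs.

```python
import glob
import os
import tempfile

import pandas as pd

# Hypothetical sample data: two CSV files with 25 records each.
folder = tempfile.mkdtemp()
for name in ("one.csv", "two.csv"):
    pd.DataFrame({"value": range(25)}).to_csv(os.path.join(folder, name), index=False)

frames = []
for path in sorted(glob.glob(os.path.join(folder, "*.csv"))):
    df = pd.read_csv(path)
    # Keep everything from the 21st record onward, i.e. drop the first 20 records.
    frames.append(df.iloc[20:])

combined = pd.concat(frames, ignore_index=True)
print(combined)
```

If the rows to drop should never be parsed at all, `pd.read_csv(path, skiprows=range(1, 21))` skips file lines 1 to 20 while keeping the header on line 0.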
@cynthiaphillips7143 2 years ago
Thank you very much. Now, how can I save the new concatenated data to a txt file?
@DataMites 2 years ago
Check with np.savetxt() or df_object.to_csv('filename.txt')
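A minimal sketch of the `to_csv` route, assuming a tab-separated text file is wanted; the dataframe contents and the output file name are placeholders.

```python
import os
import tempfile

import pandas as pd

# Stand-in for the concatenated dataframe produced earlier.
combined = pd.DataFrame({"id": [1, 2], "name": ["a", "b"]})

# Hypothetical output location; any path ending in .txt works the same way.
out_path = os.path.join(tempfile.mkdtemp(), "combined.txt")

# sep="\t" writes tab-separated text; index=False leaves out the row index.
combined.to_csv(out_path, sep="\t", index=False)

with open(out_path, encoding="utf-8") as fh:
    text = fh.read()
print(text)
```

`to_csv` does not care about the file extension, so the separator is what decides how the text file looks.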
@amolmzope 4 years ago
Thank you so much sir
@DataMites 3 years ago
All the best
@gulshankumar17 3 years ago
I use a Dask dataframe for reading multiple files, something like:
import dask.dataframe as dd
df = dd.read_csv('*.txt').compute()
@DataMites 3 years ago
Good work
@salamalsamman1314 3 years ago
Thank you!
@DataMites 3 years ago
You're welcome!
@mohammadtamimkashifi9570 3 years ago
Great work!
@DataMites 3 years ago
Thanks!
@chinmaya.6934 4 years ago
Sir, can you tell me how to add a new "source file name" column to the dataframe? The purpose is to know which file each piece of data came from.
@ThankGodItsToday 4 years ago
You can assign it directly using dataframe.loc; it creates the column if it doesn't exist.
@chinmaya.6934 4 years ago
@ThankGodItsToday Thanks for the response. Will check it out!
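As a sketch of that idea: assign the file name to a new column inside the reading loop, before the frames are concatenated. Plain column assignment is used here in place of `.loc`, and the column name `source_file` and the sample files are assumptions made for the example.

```python
import glob
import os
import tempfile

import pandas as pd

# Hypothetical sample data: two small CSV files.
folder = tempfile.mkdtemp()
pd.DataFrame({"x": [1, 2]}).to_csv(os.path.join(folder, "a.csv"), index=False)
pd.DataFrame({"x": [3]}).to_csv(os.path.join(folder, "b.csv"), index=False)

frames = []
for path in sorted(glob.glob(os.path.join(folder, "*.csv"))):
    df = pd.read_csv(path)
    # Record which file each row came from; the column is created on assignment.
    df["source_file"] = os.path.basename(path)
    frames.append(df)

combined = pd.concat(frames, ignore_index=True)
print(combined)
```

Using `os.path.basename` keeps only the file name; storing `path` itself would keep the full location.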
@dheerajkumark5602 4 years ago
Sir, please make more pandas, Dask, and scikit-learn videos.
@ThankGodItsToday 4 years ago
Sure, it will take some time though.
@yousfoss4367 3 years ago
Please, how should this be handled if you have three groups of files: test files, train files, and the main file? Thanks.
@DataMites 3 years ago
Hi YOUS FOSS, you can manually set the location of the files and read them using Python.
@yousfoss4367 3 years ago
@DataMites Thanks a lot.
@lilymaths4584 3 years ago
Glob is such a useful but funny library!
@DataMites 3 years ago
Hi Lily Maths, yes, glob is a useful library.
@pierreclaude1345 2 years ago
Hi, thank you for the great video. My question is how to include encoding='utf-8'?
@DataMites 2 years ago
Use the 'encoding' parameter of read_csv(). For background on encodings, kindly refer to www.w3.org/International/questions/qa-what-is-encoding
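For illustration, the `encoding` argument can be passed on every `read_csv` call inside the glob loop. This is a small sketch with an invented sample file containing non-ASCII text.

```python
import glob
import os
import tempfile

import pandas as pd

# Hypothetical sample file containing non-ASCII characters, written as UTF-8.
folder = tempfile.mkdtemp()
with open(os.path.join(folder, "cities.csv"), "w", encoding="utf-8", newline="") as fh:
    fh.write("city\nZürich\nMálaga\n")

# Pass encoding="utf-8" to each read_csv call while looping over the matched files.
frames = [
    pd.read_csv(path, encoding="utf-8")
    for path in sorted(glob.glob(os.path.join(folder, "*.csv")))
]
df = pd.concat(frames, ignore_index=True)
print(df)
```

If the files were saved in a different encoding, the same parameter takes that name instead, for example `encoding="latin-1"`.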
@aakash2402 3 years ago
Can we follow the same procedure for JSON files as well?
@DataMites 3 years ago
Hi Aakash Thorat, yes, you can, provided that all the JSON files have a similar structure and can be loaded with pandas as dataframes.
@satishkumar-ir9wy 3 years ago
Sir, can you help me read multiple Parquet files from multiple directories?
@DataMites 2 years ago
If we try to read more than one file the data gets overlapped. What we can do is to read different files to different variables and then concatenate them to a single variable.
@NishantKumar-dw5er 3 years ago
Awesome
@DataMites 3 years ago
Thank You!
@tayseeruthman9525 4 years ago
THANK YOU VERY MUCH
@DataMites 4 years ago
You're welcome!
@sarahferko2257 3 years ago
Thank you! What if you are trying to do something similar with PDF files?
@DataMites 3 years ago
Hi Sarah Ferko, to work with PDF files there are other packages such as textract, PyPDF2, etc.
@hazelerturk9314 3 years ago
Thanks a lot for this! I have two empty files among my 207 files. I believe it gives an error for those files. Is there a solution for such cases?
@DataMites 3 years ago
Hi, you can always check whether a file is empty before reading it.
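One way to do that check, as a hedged sketch: skip zero-byte files using `os.path.getsize`, and also catch `pandas.errors.EmptyDataError` for files that have bytes but nothing to parse. The sample files exist only to make the example runnable.

```python
import glob
import os
import tempfile

import pandas as pd

# Hypothetical sample folder: one file with data and one zero-byte file.
folder = tempfile.mkdtemp()
pd.DataFrame({"x": [1, 2]}).to_csv(os.path.join(folder, "data.csv"), index=False)
open(os.path.join(folder, "empty.csv"), "w").close()

frames, skipped = [], []
for path in sorted(glob.glob(os.path.join(folder, "*.csv"))):
    # Cheap check first: a zero-byte file cannot contain a table.
    if os.path.getsize(path) == 0:
        skipped.append(os.path.basename(path))
        continue
    try:
        frames.append(pd.read_csv(path))
    except pd.errors.EmptyDataError:
        # Raised by pandas when a file has no columns to parse.
        skipped.append(os.path.basename(path))

combined = pd.concat(frames, ignore_index=True)
print(combined)
print("skipped:", skipped)
```

Keeping a `skipped` list makes it easy to report which of the files were left out.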
@yousfoss4367 3 years ago
Thanks a lot. What about multiple text files, please? Thanks.
@DataMites 3 years ago
Hi YOUS FOSS, you can use a for loop over the txt files in a given directory and process them as shown in the video.
@yousfoss4367 3 years ago
@DataMites Thanks a lot.
@GodX36999 3 years ago
@DataMites Thank you, sir, but what if we just want to pick some of them from a list of specified file names (not all files in the folder)?
@pennylaine8703 3 years ago
Thank you! :)
@DataMites 3 years ago
You're welcome!
@mariakosberg9244 3 years ago
Hi, I really liked this video; however, I am struggling to do the same with my 120 log files. The reason is that the columns are not cleanly defined in the files. The column names exist, but on the same row as a descriptor for the file: three rows contain descriptors on their own, and on the 4th row a descriptor shares the row with the column names. Also, all the data is separated by spaces, so I need to set delim_whitespace=True, but I can't do that with concat. I can fix a single file by using delim_whitespace and passing names= with each column name, but I can't manage it together with the concat function. Is anyone able to assist?
@DataMites 3 years ago
Hi Maria Kosberg, based on your query, you can remove the top 'n' lines from your file by opening it and saving a new txt file without those lines (you can automate this if you need to remove the same number (n) of top lines that you don't want in your data frame). After that, you can go with the approach suggested in the video itself.
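An alternative to rewriting the files, sketched under assumptions: the parsing options belong to `read_csv`, not to `concat`, so each file can be read with `skiprows`, a whitespace separator, and explicit `names`, and the resulting frames concatenated afterwards. The four header lines, the column names `time` and `temp`, and the `.log` extension are guesses based on the description. `sep=r"\s+"` is used because `delim_whitespace` is deprecated in recent pandas versions.

```python
import glob
import os
import tempfile

import pandas as pd

# Hypothetical log layout: three descriptor lines, a fourth line mixing a
# descriptor with the column names, then whitespace-separated data.
log_text = (
    "run: demo\n"
    "device: sensor-1\n"
    "operator: test\n"
    "columns: time temp\n"
    "0 20.5\n"
    "1   21.0\n"
)
folder = tempfile.mkdtemp()
for name in ("first.log", "second.log"):
    with open(os.path.join(folder, name), "w", encoding="utf-8") as fh:
        fh.write(log_text)

column_names = ["time", "temp"]  # assumed names, supplied by hand

frames = []
for path in sorted(glob.glob(os.path.join(folder, "*.log"))):
    df = pd.read_csv(
        path,
        skiprows=4,          # skip the four header lines
        sep=r"\s+",          # split on any run of whitespace
        header=None,         # no usable header row remains in the file
        names=column_names,  # use our own column names
    )
    frames.append(df)

combined = pd.concat(frames, ignore_index=True)
print(combined)
```

This leaves the original log files untouched, which matters when there are 120 of them.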