I love the way you are blowing me with all these purrr, tibble and readr libraries functions. I have told my friends about you and your course. We are planning to purchase and do it. Thank you Jonathan!
@JonathanNg5 жыл бұрын
Thanks Ezekiel, that's awesome. It's much easier to add structure, details and exercise files to a course. I'm also doing a bit of one on one video coaching for a handful of people who sign up early.
@allahjoseph Жыл бұрын
Here's the code for plug and play. Since "R" packages are always updating, here is how this code is used as of today via tidyverse: desired_name_df
@vivekthaker79705 жыл бұрын
Fantastic, I piped in the read_excel into your first code and it worked like a charm. Faster than power query! Cheers!
@JonathanNg5 жыл бұрын
Thanks for your comment. That's great to hear and I'm glad this was able to help you out.
@allahjoseph Жыл бұрын
super cheers and an upvote
@JonathanNg5 жыл бұрын
I completely forgot to point out at 7:52 that there is also specifically an Import from Excel option.
@DG-ju2wf3 жыл бұрын
Brilliant, excellent work.Highly appreciated
@avinashbachani32854 жыл бұрын
Hello. I want to do some statistical analysis in R and will import data from excel. I want to ask that if my another sheet is coded based on questionnaires survey then how can i analyze the data where as the coding would be done on another sheet of excel.(Same file)
@JonathanNg4 жыл бұрын
Absolutely. This video should show you how. kzbin.info/www/bejne/aYW4qqNqj7Kobpo
@avinashbachani32854 жыл бұрын
May i know how to run t test analysis?
@lewismarkpickersgill87794 жыл бұрын
Hi Jonathan. Thank you for this, it is very helpful but I need one more step to really solve a problem I have. I have about 300 excel files which I would like to merge into one table. The challenge I have is that none of the files have a datetime column from which to build a history of changes (This is a legacy data collection issue which we have now resolved). I would like to finish with one table with a datetime column so that I can model the changes in the files across time. Is it possible at importation of the file, or at any other stage in the process, to append a datetime column based on the time that the file was originally created? Thank you.
@ngovanthiu12772 жыл бұрын
Thank you for this video, it is really helpful !
@ambyvlogs4 жыл бұрын
How do i check the control totals of each file and display them? How can we be sure that r read all files correctly.
@khoale42383 жыл бұрын
Hi, what if I wanto to skip the first 7 lines in each of my CSV files before creating the dataframe. Is there a command for this? Thank you
@JonathanNg3 жыл бұрын
Definetly read_csv(skip = 7)
@khoale42383 жыл бұрын
@@JonathanNg when I tried this: dir("data",full.names = T) %>% map_df(read_csv(skip = 7)), I get the followin eerror: Error in read_delimited(file, tokenizer, col_names = col_names, col_types = col_types, : argument "file" is missing, with no default Can you please advice how to solve this error. Thanks
@JonathanNg3 жыл бұрын
@@khoale4238 when used inside a map function remove the brackets dir("data",full.names = T) %>% map_df(read_csv,skip = 7), I just wanted to make it clear that skip is a parameter of the read_csv function.
@Sumner184 жыл бұрын
Suppose I have 76 files, each with 36 sheets, and each sheet with about 1300 rows. How would you tweak this code to combine all of that data?
@JonathanNg4 жыл бұрын
Great question. Try combining the code from this video kzbin.info/www/bejne/aYW4qqNqj7Kobpo
@danadaurenkyzy4 жыл бұрын
Nice tutorial, thanks! And can we do the same but using multiple .xls files with read_xls instead of read_csv?
@JonathanNg4 жыл бұрын
Yep, you can use the readxl library. Generally I try to always use csv over Excel files because you know it's only data in a specific format. Excel files can contain, formulas, formatting, blank rows and columns, etc. All of these things make the data more difficult to process for true data analysis (even for other Excel users). At point 7:52 you can see a menu that will assist you with the import of Excel files. Other videos such as this go over the methods which are required to extract data from untidy Excel sheets and will probably give you an idea of why data scientist tend to avoid Excel.
@danadaurenkyzy4 жыл бұрын
@@JonathanNg okay, Thanks!
@jorgequevedo80545 жыл бұрын
Error: Evaluation error: zip file 'C:\Users\coke_\Desktop\importaciones 2019\Siemens\~$1.xlsx' cannot be opened.
@JonathanNg5 жыл бұрын
R uses Unix style folder separators. Replace \ with /
@2adamast3 жыл бұрын
CVS is semicolon separated on some systems, making it a pain
@JonathanNg3 жыл бұрын
Not a big deal. Just use read_delim(file, delim = ";")
@josephreed81763 жыл бұрын
Hi Jonathan, I am trying to combine multiple CSV files which were all exported from the same site and have the same layout and everything. I keep getting the error "Error: Can't combine `GeoID` and `GeoID` ". Which at face surface makes sense but when I open the CSV files they are the same. Also when I try to combine all 50 it gives me an error but I can combine some of the states without receiving the error. It is really hard to troubleshoot and was wondering if you had any suggestions. Thanks!
@fernandocuesta32324 жыл бұрын
God bless you man. I can´t thank you enough for this Blessings you are awesome
@zanaibrahim92104 жыл бұрын
Thank you for this video, it is really helpful. I only have a question about the "csv" files, my CSV files are (comma delimited when saved). The problem is when uploaded i only see 2 columns out of 19? How can I fix this, because changing: the ' commas to tab' also didnot help. I hope you will be able to help me. Thank you
@kevinlin33305 жыл бұрын
Hi~ thank you so much for the video. I came up with an error: Column `session` can't be converted from character to numeric. Hope you could help me with it.. thank you!
@JonathanNg5 жыл бұрын
Thanks for your comment. That means the session column is inconsistent between files. Since the first file the field was recognized as a numeric field, later files containing characters will fail. This will always be an issue when dealing with messy data and frequently occurs with spreadsheets due to manual data entry and Excel auto-correct. Ideally, you'll want to address this at source by using un-altered systems exports to csv format. This approach if possible is much easier than dealing with the issue further down the track. If that is not an option you'll want to create a column specification when importing the data which I cover in more detail in some of my courses. Note that when you coerce text to a number, it may get dropped meaning that you could lose some data. It's worth to be aware of what data get's dropped on any import to see if it has a material impact on your results. To prevent dropped data you can first import as text and manually coerce to numbers for greater control. If you need further assistance with this please consider joining my training where I can spend the time to build out a full example for you if you need it. I've provided a discount here if you'd like to check it out. datastrategywithjonathan.com/p/dataschool/?coupon_code=START_60 Hope that helps Thanks Jonathan
@jugalkishore8924 жыл бұрын
Great tutorial, thanks! 1 doubt, suppose I have 3 excel files, name as Audi, BMW, Merc and able to merge using "Tidyverse" lib. Now, I want to know which rows belongs to which file name. So, can we create a new column having the file name using this lib. Appreciate your help. Below is the command I am using: consolidate % map_df(read_excel)
@vanessamohr80732 жыл бұрын
Great video! Thanks :)
@sidrarafique8355 жыл бұрын
Hi. i am interested in reading for example column 2 from all the sheets of an excel file and then rename those column with their sheet name and then save it. i have 10 such excel files and i have to do all this one by one with all 10 excel files. please help me with this.
@JonathanNg5 жыл бұрын
Perfect use case for automation. Although I usually advise against using VBA, it sounds like VBA would be more suitable in this situation. VBA allows you to control elements of a spreadsheet. So if your desired result is to save back to an Excel file then VBA is probably the way to go. R is more suitable for importing many datasets. It's much faster, will give you more tools for cleaning, transforming and analyzing the data with much less code than VBA. If you are looking for an automated reporting solution. My first option for building a report is this kzbin.info/www/bejne/bIi3knp7hLt9bNU Automation can be done like this kzbin.info/www/bejne/emeUknWNobuFoJI If you really need to use Excel then this is the best solution I've found here kzbin.info/www/bejne/j6PTeKygedpleNE Hope that helps
@sidrarafique8355 жыл бұрын
@@JonathanNg Thanks alott.. it helps alott
@JonathanNg3 жыл бұрын
Join 15,000 to Learn Advanced Data Analytics Skills. Through step by step lectures, complete with downloadable code examples and student support. R Tidyverse Reporting and Analytics for Excel Users Level up your advanced data analytic skills. Learn one of the fastest easiest ways to work with large datasets. www.udemy.com/course/r-tidyverse-reporting-and-analytics-for-excel-users/?referralCode=E6333F435381F1A83CCD R Shiny Flex Dashboards and Interactive Data Visualizations The fastest, easiest way to get up to speed with R Shiny to transform your analysis into data driven applications. www.udemy.com/course/r-shiny-flexdashboards/?referralCode=15405DCD94D9E80B5440 Easy Excel Dashboards, Models, Visualizations with Power Query Better than VBA. A no code way to easily transform data and build automated reports with Microsoft Excel www.udemy.com/course/easy-excel-dashboards-models-visualizations-power-query/?referralCode=331E9DFAE5A544402066 What do you want to know about advanced analytics and data science? Leave a comment below.
@statlab_stat.solution2 жыл бұрын
Great!!!
@stevennye50754 жыл бұрын
Interesting
@djangoworldwide7925 Жыл бұрын
You talk so much. Can't believe you just explained how to use the view function. Jesus