How I Work With MILLIONS OF ROWS DATA using PYTHON | PYSPARK & BIG DATA

  Рет қаралды 11,543

Mo Chen

Mo Chen

Күн бұрын

Пікірлер: 47
@mo-chen
@mo-chen Жыл бұрын
🎉 Check out Bright Data ➡︎ brdta.com/datawithmo
@hamedalcherif9064
@hamedalcherif9064 Жыл бұрын
Big Mo is back.
@mo-chen
@mo-chen Жыл бұрын
Haha I am indeed! Thanks a lot for watching 😁
@kevingonzalo9087
@kevingonzalo9087 Жыл бұрын
nice comment
@lorenzreparip4525
@lorenzreparip4525 Жыл бұрын
Now that I've seen this. I am more motivated to learn phyton!! Thanks!
@mo-chen
@mo-chen Жыл бұрын
I’m glad to hear that! Thanks for watching 😃
@nikjojo
@nikjojo Жыл бұрын
do you visualise your findings more on Python or Tableau?
@mo-chen
@mo-chen Жыл бұрын
Tableau just because it’s so much easier and looks way nicer 😃
@sawdawyah
@sawdawyah 11 ай бұрын
If I can press this 👍more than once I would press it MILLIONS of times [ Thanks! 😁 ] AAAnd I love your videos a lot
@mo-chen
@mo-chen 11 ай бұрын
That's very kind of you. Thanks a lot for watching!
@nhimallansupramaniam2626
@nhimallansupramaniam2626 Жыл бұрын
Mo i love your videos. Please do a data analytics tutorial that covers python and pyspark
@mo-chen
@mo-chen Жыл бұрын
Thanks for the kind words! I have a couple other videos on data analysis with Python on the channel already, feel free to check them out. And I'll try my best to make more content with PySpark 😁
@TheRhinorock
@TheRhinorock 7 ай бұрын
What would be preferred method reading billions of records and aggregating over 400 fields sum, min, max and writing into file or DB.
@samikshabhosale6634
@samikshabhosale6634 11 ай бұрын
Is this data has changed now? On website?
@karl2477
@karl2477 Жыл бұрын
where is your jumper from?
@mo-chen
@mo-chen Жыл бұрын
It’s a Massimo Dutti jumper
@erickajanee
@erickajanee Жыл бұрын
Would you recommend any MBA in analytics schools? In person and online? If so maybe a video idea!
@mo-chen
@mo-chen Жыл бұрын
MBA is something that more experienced workers tend to do later down in their careers. If you're starting out, you should focus on your core data analyst skills first. Thanks a lot for watching 😄
@sharath6346
@sharath6346 Жыл бұрын
The code just look like SQL query…. Is pyspark similar to SQL?
@mo-chen
@mo-chen Жыл бұрын
Yes, it has lots of SQL and Python syntax as well. Thanks a lot for watching 😄
@airmen_fresh
@airmen_fresh Жыл бұрын
Out of curiosity what are some pro's and cons' to becoming a data analyst? I'm currently in an Entry Level position to IT (help Desk) and am looking to upgrade or elevate myself in the IT field and have an interest in this field.
@mo-chen
@mo-chen Жыл бұрын
Pros for me are that I really enjoy my work and get paid well for it. No cons in general. If I really didn't like what I did on a daily basis, I'd just do something different 😃
@airmen_fresh
@airmen_fresh Жыл бұрын
@@mo-chen I'm happy you find something you enjoy and get paid for but what would you say the most complaints and or negativity have you heard about your job?
@balixong9704
@balixong9704 Жыл бұрын
Would you use google sheets over microsoft excel? If yes, then why?
@mo-chen
@mo-chen Жыл бұрын
I wouldn't. Google Sheets is free which is why most people use it.
@irfanali8106
@irfanali8106 Жыл бұрын
Hi Mo Chen, I'm a big fan of your work! I've been learning data science for the past 3 years, but I'm not sure how to start my career. Your KZbin channel has been a great resource for me, and I'm grateful for your kindness and loyalty. how to access the brightdata, site Error(DNS_PROBE_FINISHED_NXDOMAIN) or sample data? I have one question. I'm not able to access the site "brightdata" because it's blocked in my country. Would you be able to share same data samples with us so that I can practice on this project? I would be very grateful for your support. Thanks!
@mo-chen
@mo-chen Жыл бұрын
Thanks so much for the kind words 😁Using a VPN would be the best way. The sample data doesn't contain many rows so I wouldn't build the project on that. Thanks a lot for watching!
@welcometomathy
@welcometomathy Жыл бұрын
Always thought big data was about multiple types of data as dataset like images videos sound, 3d objects and other things stored as a data base . My 100GB power bi dataset doesn't fit big data?
@mo-chen
@mo-chen Жыл бұрын
There is no clear definition of big data in terms of size. What you're mentioning is unstructured data. Your Power BI 100gb data can safely be considered big data. Thanks a lot for watching 😁
@mrtayyab3101
@mrtayyab3101 Жыл бұрын
Please bring a next Streamlit tutorial
@mo-chen
@mo-chen Жыл бұрын
Great video idea, I'll see what I can do in the future!
@samira_pmn6488
@samira_pmn6488 Жыл бұрын
hello there , can I use this spark things for my HDF5 dataset too? it is so big and exactly as you said I can't work with it even with chunking :(
@mo-chen
@mo-chen Жыл бұрын
Yes, absolutely!
@amanpoojary5782
@amanpoojary5782 Жыл бұрын
Hi, from where can I learn excel for Data Analytics?? I am fully confused.
@mo-chen
@mo-chen Жыл бұрын
Please see the website link I put in my other answer to your other comment 😄
@ajinkyapantode5100
@ajinkyapantode5100 Жыл бұрын
Do you provide 1on1 mentorship
@mo-chen
@mo-chen Жыл бұрын
121 mentoring is not something I do right now unfortunately 😅
@CaribouDataScience
@CaribouDataScience Жыл бұрын
I vote for upgrade computer.
@mo-chen
@mo-chen Жыл бұрын
If money is no issue, of course 😃
@IlhamRhamadan-mf8yx
@IlhamRhamadan-mf8yx Жыл бұрын
here we go☕
@mo-chen
@mo-chen Жыл бұрын
Thanks for watching 😃
@aayushdedhia5781
@aayushdedhia5781 Жыл бұрын
Is the dataset free?
@mo-chen
@mo-chen Жыл бұрын
The sample is 😁
@Carbv1
@Carbv1 Жыл бұрын
Ok that focus cursor is not helping. It’s distracting. Great content though.
@mo-chen
@mo-chen Жыл бұрын
I'm glad you liked the video! Most people really like the cursor highlighter so I'll keep it for now. Thanks a lot for watching 😁
How I use Python as a Data Analyst
13:56
Luke Barousse
Рет қаралды 383 М.
Lamborghini vs Smoke 😱
00:38
Topper Guild
Рет қаралды 57 МЛН
Quilt Challenge, No Skills, Just Luck#Funnyfamily #Partygames #Funny
00:32
Family Games Media
Рет қаралды 47 МЛН
Creative Justice at the Checkout: Bananas and Eggs Showdown #shorts
00:18
Fabiosa Best Lifehacks
Рет қаралды 34 МЛН
Симбу закрыли дома?! 🔒 #симба #симбочка #арти
00:41
Симбочка Пимпочка
Рет қаралды 6 МЛН
This is how I actually clean data using Power Query
27:49
Mo Chen
Рет қаралды 53 М.
Intro to Python Dask: Easy Big Data Analytics with Pandas!
20:31
Bryan Cafferky
Рет қаралды 15 М.
SQL with PYTHON | Manage SQL databases using PYTHON ONLY
18:03
Do these Pandas Alternatives actually work?
20:19
Rob Mulla
Рет қаралды 15 М.
A Day in the life of a Data Analyst in Chicago
7:00
Justin Shin
Рет қаралды 906 М.
How to work with big data files (5gb+) in Python Pandas!
11:20
TechTrek by Keith Galli
Рет қаралды 41 М.
I Tried 50 Data Analyst Courses. Here Are Top 5
8:41
Stefanovic
Рет қаралды 133 М.
Exploratory Data Analysis with Python | PANDAS
18:38
Mo Chen
Рет қаралды 21 М.
Lamborghini vs Smoke 😱
00:38
Topper Guild
Рет қаралды 57 МЛН