I'm a test automation engineer who wants to begin with the path of data engineer. I've finished a data analyst bootcamp and I think it was a good first step in this world. How do I transform in a sr like you? XD It's a lot of thing to do to be a data engineer. Thanks for the video!
@GambillDataEngineering16 сағат бұрын
Thanks for your comment and for watching the video! Being a test automation engineer and already completing a data analyst bootcamp are fantastic steps toward becoming a data engineer. Here’s what I’d suggest: Leveraging Current Skills: As an automation engineer, you already have experience with scripting, debugging, and working with systems. Use those skills to start learning Python and SQL. Foundations: Since you’ve done a data analyst bootcamp, build on that knowledge by diving into: SQL: Master joins, subqueries, CTEs, and performance optimization. Data Modeling: Learn how to structure data for pipelines and reporting. ETL/ELT: Get hands-on experience with tools like Apache Airflow, dbt, or Azure Data Factory. Explore Cloud Platforms: Data engineers often work with cloud providers like AWS, Azure, or GCP. Start with foundational services like S3 (AWS) or Blob Storage (Azure) and gradually move into things like data pipelines and warehousing (e.g., Snowflake, BigQuery). Build Small Projects: For example, automate a data pipeline that pulls, cleans, and stores data from an API. Network and Upskill: Keep connecting with others in the field, (Like you are doing with me) , and consider either the AWS or Azure Data Engineer Certifications. You’re already on the right path! Keep learning, experimenting, and challenging yourself, and you’ll be well on your way to senior roles. Let me know if there’s anything specific I can help with-happy to be part of your journey!
@rotan90Күн бұрын
Hey Chris, From my perspective every aspiring Data Engineer should start asking SHIT LOAD OF QUESTIONS how something works. I see something being refreshed everyday - how do they managed to do ETL on this data and make insights out of it ? I see data about something being refreshed every 5 seconds - how do they do it ? Don't listen to people who say "nah bro, only Python, i dont give a damn about SQL or cloud - python is on fire right now!" - total bs. If you don't know how to reduce ETL process runing time/cost, prepare data on time or features in your pipeline are crashing one after another (even after fixing it) then you are worthless to the company. You are supposed to solve problems that business has or give them data that they can rely on, EOT. IT most of the time is a cost for the company, not something that brings money. I'm Data engineer (or to be precise Azure Data Engineer/Data Analyst) from Poland and I see that companies are looking for people you know how to solve problems, have 2-3 ideas how to do sth and test it. Data modeling, average ++ SQL, Azure/AWS from anaytics and devops point of view, some viz tool (Power BI or Tableau) and (at least here) you are ready to go for junior+ role.
@bestechdeals45393 күн бұрын
Subbed! What do you recommend to get started for someone with a background in college math and some codding?
@GambillDataEngineering3 күн бұрын
If you have a good foundation with math and some coding then SQL should be easy to pick up. I'd start there and then pick up a visualization tool like power bi and a ETL tool. That way if you find a analysis role, you can go for that.
@GambillDataEngineering3 күн бұрын
Some links to various tools and podcasts that we discussed during the session: Marquin's sites: www.linkedin.com/in/marquin-s-71b05622/ journey-to-data-engineer.marquinsmith.com/ marquinsmith.com/ Tools: www.docker.com/ www.domo.com/business-intelligence/ motherduck.com --Duck db gemini.google.com/ Podcasts: acquired.fm talkpython.fm
@MarquinSmith4 күн бұрын
Thanks for the opportunity to share my data journey. I had a blast doing this.
@GambillDataEngineering3 күн бұрын
I'm glad you had a good time! We can probably do a whole video just on all the marketing analytics and data! I bet that would be helpful and interesting!!
@GambillDataEngineering5 күн бұрын
Looking for more business related data content - Businesses: Reach out for a free consultation gambilldataengineering.com/contact Learn more about starting your business's data journey: www.gambilldataengineering.com/business-concept-blog Individuals looking to learn more: www.gambilldataengineering.com/data-engineering The NEW Data Roadmap: www.gambilldataengineering.com/data-engineering-honeycomb
@adityatomar98206 күн бұрын
I'm in my final year of college and got an offer for Data Engineer role at a startup. Starting next Week❤ Really excited
@GambillDataEngineering6 күн бұрын
That's awesome!! Congratulations!!!
@argus33543 күн бұрын
on campus or off campus? i wish to start as a data engineer too as a fresher, any tips & guidance? detailled answer would be very helpful
@adityatomar98203 күн бұрын
@@argus3354 Hey, let me tell you the current situation. I applied for more than 500 openings in Data engineer, analyst and Business Intelligence role. Got calls from 4 company, gave interview in 2 of them, got an offer from one of them as a Data Engineer. Just be very very good with Spark, both theory and practical pyspark questions. Advanced SQL is must, you should be comfortable with writing window functions and joins, Cte, etc. Basic DSA questions in python is must. Focus on list and dictionary. Basic algorithms too. And just apply like hell. create a good one page resume, 2 3 good projects with end to end architecture. And that's it. Cold mail worked for me. Keep applying 😉
@GambillDataEngineeringКүн бұрын
I agree with Aditya. If you want to flow directly into a data engineer role you will need advanced SQL skills, some pyspark as well as the basics about data modeling as well as the soft skills to be able to communicate that in a way that is easy to understand to a non technical audience. I'd throw in some cloud skills either with Azure or AWS. I'd recommend sending linkedin messages to the hiring manager or recruiter after you apply to share your linkedin profile and convey how excited you are about applying. I hope that helps!
@adityatomar9820Күн бұрын
@@GambillDataEngineering that's how it works 🙌🏻❣️
@RomanShchurko6 күн бұрын
thanks for the video :) one pretty basic recommendation is to avoid "select *" in the queries, and the second is to find out which IDE is the most convenient for you (DBeaver, DataGrip, etc), since they can help you in many ways, like fix formatting, identify possible errors, generate ddl, etc.
@GambillDataEngineering6 күн бұрын
@RomanShchurko I love it! Absolutely Select * is not only poor practice but can be an efficiency killer! And, IDE is an important decision early on as it can make or break some early habits! DBeaver is a great one! I've also used Toad a bit! I'm so glad you enjoyed the video and thank you for sharing!!
@DươngThịHin-p6b9 күн бұрын
Thanks for sharing! Just a quick off-topic question: I have TRC20 USDT in my OKX wallet and I have the recovery phrase: 『pride pole obtain together second when future mask review nature potato bulb』. What is the best way to transfer them to Binance?
@GambillDataEngineering8 күн бұрын
Hi Durong - Unfortunately I don't know a ton about Crypto, beyond how to pull this data from coin gecko!
@Joshfarmpig9 күн бұрын
AWS is pretty good
@GambillDataEngineering8 күн бұрын
AWS is really good. However, there is a point at which it is time to move to the cloud vs stay on prem. There are use cases that you have hit a certain point of scale that it is time to move to AWS or Azure and, from a data perspective and there is a time that it is just as good to stay on prem.
@GambillDataEngineering11 күн бұрын
What is your biggest hesitation moving to the cloud? ***Connect with Me*** facebook.com/gambilldataengineering/ instagram.com/thegambill/ www.linkedin.com/in/databasemanagement/ www.tiktok.com/@chris.gambill Free Business Templates: www.gambilldataengineering.com/business-templates Medallion Model: kzbin.info/www/bejne/bZm4eYOQmrF5m8k Azure Cost Estimation Tool: azure.microsoft.com/en-us/pricing/calculator Data Standards: kzbin.info/www/bejne/mprQeIaXi6hqd5Y Cloud Migration Blog: www.gambilldataengineering.com/business-concept-blog/5-steps-to-migrate-to-the-cloud Reach out for a free consultation: www.gambilldataengineering.com/contact
@akashbharadwaj112 күн бұрын
can you share similar points about data analytics career. thanks. insightful video this one. thanks.
@GambillDataEngineering12 күн бұрын
@@akashbharadwaj1 absolutely! I'll add it to the schedule!!
@Fredmaxwelline16 күн бұрын
Thanks for this video, It's so informative 😇
@GambillDataEngineering16 күн бұрын
@Fredmaxwelline Thank you for watching! Glad you found it helpful!
@Fredmaxwelline16 күн бұрын
@@GambillDataEngineering Yeah Your videos should be watched by a lot of people out there, What strategies are you trying to get your channel viral if I may ask?
@TheAdobe10116 күн бұрын
You might want to lay off the weed a little bit you're seeing in 16:9
@GambillDataEngineering16 күн бұрын
😄 🤣 😂 😆
@mohan-h9o18 күн бұрын
Hi sir how are you your video very usefull
@RidahShoorbajee20 күн бұрын
Too your question, what is holding me back, is that I use ChatGBT to do my code, Since im very new to the IT industry, I realized that in Technical interviews, You can not use chatGBT, Isnt ChatGBT an AI that can help us new folk code better and faster ? I don't have bachelors in computer science so using chat GBT is so much easier for me, I just wish I can use on a technical interview. On the flip side, I love this new challenge where i actually have to sit down and find time to learn the basics of python and how to type in python with out AI helping me.
@ALEX86ZILBER20 күн бұрын
Thank, very interesting, hope the machines will not replace us soon…
@GambillDataEngineering20 күн бұрын
I think we'll see some changes in the next few years. Focus on learning the tools and techniques that allow you to leverage and control AI. The next video I just finished and will be publishing for monday is a example of using Chat GPT to brainstorm how to optimize a SQL query!
@abdooraahmaanx20 күн бұрын
Hello sir, I am new in the channel, I want to ask if there a roadmap to break into Data Engineering feild and thanks alot for these informative videos
@GambillDataEngineering20 күн бұрын
Absolutely! I did this video on the data engineering roadmap! kzbin.info/www/bejne/aXeUe2yCht55lZI
@GambillDataEngineering20 күн бұрын
@@abdooraahmaanx I also have a more granular version on my website here: www.gambilldataengineering.com/data-engineering-honeycomb
@RidahShoorbajee21 күн бұрын
How do I handle multiple changing priorities: I ask my self, Wich one has the most crucial effect on the output of the company ? Wich pipe line must be repaired or which pipe line must be live at all costs. Are there any pipelines or databases that do not cuase harm to the companies operations ? The tasts with least harm I would push them to the last minute while the tasks that effect the lifeline of the database i would work on them first
@GambillDataEngineering21 күн бұрын
That is a great answer. My feedback would be to change push to prioritize. Wording matters in your interviews and this change will help! Find a way to make this more concise something like: I prioritize my workload by identifying which tasks have the biggest impact on the company. Then identifying which pipelines or processes are "mission critical" and can cause a business outage if they go down, finally the tasks and pipelines that have the least impact I push towards the end of the priority list.
@RidahShoorbajee21 күн бұрын
I love working in hybrid and remote settings, I always ask and communicate through the proper channels and always let me supervisors know
@GambillDataEngineering21 күн бұрын
Good answer. To enhance it: I love working in hybrid and remote settings. I find that using tools like teams and slack are great ways to stay connected when you are not in person. In addition, I keep my leaders in the loop as needed. Be careful about the "always let my supervisory know" part. It can come off as dependent or unable to take ownership.
@RidahShoorbajee21 күн бұрын
How to explain technical process to a non tech audience: I would Create visualizations, and explain it to them by creating examples they would understand, I would try and connect and relate it to something they understand like sports
@GambillDataEngineering21 күн бұрын
This is a great answer! Any time you can create a picture it will help, having an analogy about something relatable enhances understanding as well!
@RidahShoorbajee21 күн бұрын
INNER JOIN: when retriving information from two tables that have matching columns LEFT JOIN when getting ALL information by adding a right table RIGHT JOIN when getting ALL information by adding a left table OUTER FULL JOIN when getting information from two tables
@GambillDataEngineering21 күн бұрын
Good answer. Be sure to have some examples prepared as well to show you have a solid grasp on joins.
@RidahShoorbajee21 күн бұрын
This is great review for my Junior data engineer interview! Great video with great info and great info to review! fingers crossed I get it!
@GambillDataEngineering21 күн бұрын
Good Luck!! I will be keeping you in my thoughts! Let me know if you get it!!
@RidahShoorbajee20 күн бұрын
@@GambillDataEngineering Howdy! I i flunked the technical interview, My python skills are not the best. I have to brush up on how to open python and create a database and upload it to a server and use SQL with in Python. I nailed the SQL portions, I made great joins, I nailed the theory portion, I nailed the Critical thinking portion, I nailed the Normalization questions. you feedback helped me ALOT, especially on saying " I like to take ownership" Im someone who never had a B.S in computer science, So im learning on my own, Ive loved your videos so far! Im very new to the Data engineer world, so there is so much to learn.
@GambillDataEngineering20 күн бұрын
I'm glad you did well on the rest and that the feedback helped! I hate that you had a hard time with the python part! You will get there. I have some videos on python if you want to check those out and if there is something specific you need let me know and I'll put something together for you!
@ALEX86ZILBER24 күн бұрын
I would love to know if the AI will replace data related jobs like Data Engineers or Data Analysts…😢
@GambillDataEngineering24 күн бұрын
That is a great question! Watch out for a video I'm working on exactly that topic!
@jackd833625 күн бұрын
Interactive time - 🔔
@A5tr0RBX26 күн бұрын
No, THANK YOU!❤
@FatemaAkter-js6ne26 күн бұрын
I appreciate you and I want to share with you some tips that you can use for your channel, I look forward to hearing from you
@humsterburger27 күн бұрын
We need more ppl to make videos like you ❤
@GambillDataEngineering27 күн бұрын
@humsterburger thank you! I'm glad you like them! Let me know if there is anything specific you would like to see covered!
@apamwamba29 күн бұрын
FRAMING??
@EasyStreet88599Ай бұрын
Hello thank you for this video I didn’t know this channel. What do you think of msbi? I ve switched to power bi one year ago full time but now employers contact me for msbi while I think it’s a technology that has no futur as the last versions are full of bugs…
@GambillDataEngineeringАй бұрын
I'm glad you found the channel-welcome! When it comes to Microsoft’s business intelligence tools, I agree that they’re definitely legacy and are gradually fading out. However, they still hold a significant market share in reporting and ETL processes. Many companies haven’t fully transitioned yet, so a lot of your future work might involve migrating tools like SSRS and SSIS into modern platforms like Databricks or Azure Data Factory. Because of that, MSBI still plays an important role in the data ecosystem. But I’d love to hear your perspective-do you see it sticking around in any capacity, or do you think it’s on its way out entirely? Let’s debate!
@EasyStreet8859927 күн бұрын
@@GambillDataEngineering Thank you very much for your feedback! I agree that a significant portion of the market still relies on Microsoft BI and will need professionals to manage migrations to cloud-based solutions like Azure Data Factory. On the other hand, while most people seem to believe the cloud represents the future, I know some corporate owners still prefer to keep their data on-premises. For instance, I once worked with a retail client whose CTO believed the cloud was not the future. Now, I find myself wondering: could substantial hardware improvements in the future make cloud migration unnecessary? While this idea may seem far-fetched, history has shown that being right too early is not always advantageous. Thanks to our discussion, I plan to study how to transition from MSBI to Data Factory and analyze the feasibility of companies remaining on-premises. This exploration will also give me an opportunity to write my first article on the subject.
@GambillDataEngineering17 күн бұрын
I do believe continued hardware improvements and mid- to small business may be able to successfully stay on prem. It really depends on the use cases and the business needs! There is still a ton that can be done on prem!!
@mohan-h9oАй бұрын
Sir translate tamil please
@GambillDataEngineeringАй бұрын
Added Tamil to the captions!
@jackd8336Ай бұрын
That’s awesome Chris!
@ZooGod-si4hoАй бұрын
Good morning! I'm new to python and learning more, from what I was reading, if I'm correct I see the API already enforces secure HTTPS connections. So what is the logic behind creating the TLS wrapper? Is this just good practice in general? Is it something optional you decided to do? Thanks ! love the content so far btw
@GambillDataEngineeringАй бұрын
You are correct. but the API wants the TLS encryption. In this specific case the Marvel API required TLS. :(
@parvavlogs3722Ай бұрын
Joins in SQL combine rows from two or more tables based on related columns. Here's why each type is needed: 1. INNER JOIN: Retrieves matching rows from both tables. - Use: When you only need data that exists in both tables. 2. LEFT JOIN (OUTER): Retrieves all rows from the left table and matching rows from the right table; unmatched rows in the left table show NULL. - Use: When you need all data from one table, even if there's no match in the other. 3. RIGHT JOIN (OUTER): Retrieves all rows from the right table and matching rows from the left table; unmatched rows in the right table show NULL. - Use: When you need all data from the other table, even if there's no match in the first. 4. FULL JOIN (OUTER): Retrieves all rows from both tables, with NULL where no match exists. - Use: When you need all data from both tables, regardless of matching. 5. CROSS JOIN: Combines every row from one table with every row from another (Cartesian product). - Use: When you need all combinations of rows from both tables. 6. SELF JOIN: Joins a table to itself. - Use: When you need to compare rows within the same table.
@GambillDataEngineeringАй бұрын
@parvavlogs3722 great answer you clearly have a good grasp on the mechanics of joins. My feedback would be to make your answer sound less clinical in your interview, and provide a specific reason / use case as to why you might want a Cartesian product.
@parvavlogs3722Ай бұрын
Joins in SQL combine rows from two or more tables based on related columns. Here's why each type is needed: 1. INNER JOIN: Retrieves matching rows from both tables. - Use: When you only need data that exists in both tables. 2. LEFT JOIN (OUTER): Retrieves all rows from the left table and matching rows from the right table; unmatched rows in the left table show NULL. - Use: When you need all data from one table, even if there's no match in the other. 3. RIGHT JOIN (OUTER): Retrieves all rows from the right table and matching rows from the left table; unmatched rows in the right table show NULL. - Use: When you need all data from the other table, even if there's no match in the first. 4. FULL JOIN (OUTER): Retrieves all rows from both tables, with NULL where no match exists. - Use: When you need all data from both tables, regardless of matching. 5. CROSS JOIN: Combines every row from one table with every row from another (Cartesian product). - Use: When you need all combinations of rows from both tables. 6. SELF JOIN: Joins a table to itself. - Use: When you need to compare rows within the same table.
@marquinsmith970Ай бұрын
Hi Chris, Great content as usual. If possible I would also like to hear about your opinions on what level of data engineer (eg Senior, Lead, Staff) certain tasks or project components fall under. for example in the migration discussion you talked about having to decide between SQL server vs serverless database. How senior would a data engineer need to be to be able to decide or have input on these kinds of things. Context: I'm looking to make a career progression and curious what typical responsibilities fall under the different levels of seniority for a data engineer.
@GambillDataEngineeringАй бұрын
@marquinsmith970 that is awesome 👌 . First, thank you for watching! Absolutely, I will start working on putting that together. Maybe I can review that on next Friday night's live session!
@Vincent45678Ай бұрын
#cfbr
@Vincent45678Ай бұрын
And also how much DSA is required for becoming a data engineer sir for freshers. I have 6 months of time. Is this time enough?
@GambillDataEngineeringАй бұрын
Great question, Vincent! While data structures and algorithms (DSA) are not as heavily emphasized for data engineering as they are for software engineering roles, having a solid grasp of the fundamentals is still important. For a data engineer, focus on concepts like: 1. Arrays, linked lists, and hashmaps for data organization. 2. Sorting and searching algorithms for optimizing queries. 3. Trees and graphs for understanding hierarchical and relational data. 4. Basics of time and space complexity for efficient data handling. Six months is definitely enough time if you plan and practice consistently. I'd recommend dedicating an hour or two daily, focusing on applying DSA knowledge to data engineering tasks, like optimizing SQL queries or improving data pipeline performance. You could also explore real-world examples to solidify your understanding. If you're new to DSA, platforms like LeetCode and GeeksforGeeks are good places to practice. Aim for consistency rather than intensity, and you'll see results. Good luck!
@Vincent45678Ай бұрын
@@GambillDataEngineering Thank u sir 🙏
@Vincent45678Ай бұрын
Good evening Sir! Should i do DSA in python or java sir. Just started my preparation Sir
@GambillDataEngineeringАй бұрын
If you're just starting out and aiming to be a data engineer, Python is likely the better choice because of its versatility and relevance to the field. However, if you're already comfortable with Java or planning to work in roles where Java is heavily used, that could also be a great option. My suggestion: Choose the language you’re more comfortable with or the one more aligned with your career goals. Both are good, but for data engineering, Python often gives you a practical edge.
@Kumar-vh7utАй бұрын
Awesome
@GambillDataEngineeringАй бұрын
Thank you Kumar. I was wanting to share a story about the impacts of data engineering. In addition this will feed into my interview tip video coming tomorrow talking about using specific stats in your interviews!
@Vincent45678Ай бұрын
It'll be very helpful if u could give me any advices or suggestions sir!
@GambillDataEngineeringАй бұрын
Absolutely- shoot me an email with some details about where you are with your skills and what your current job is and I can build you a roadmap if you want.
@Vincent45678Ай бұрын
@GambillDataEngineering actually sir I am an engineering student ☺
@Vincent45678Ай бұрын
@GambillDataEngineering Btech 3rd year sir
@GambillDataEngineeringАй бұрын
@Vincent45678 - I'd make sure you are working on building a portfolio. Decide on a topic you are passionate about and then find a dataset. Build a full project from beginning to end from that point. Write a python script or create a pipeline set up transformations with dlt or within your script, set up orchastration of some kind (aka schedule your pipeline with a tool or you can schedule things in windows task scheduler even), model the data in your db by creating dimensions and facts, finally create a report in a visualization tool (power bi, looker, Tableau, etc.). I hope that helps! Also as you are preparing for interviews practice with a friend, mentor, or in front of a mirror. Record yourself answering questions and then critique your own answers. Day of interviews relax no more prep unless it is to remind yourself of syntax or some specific thing right before.
@Vincent45678Ай бұрын
Today i am starting my journey sir need ur blessings ☺
@GambillDataEngineeringАй бұрын
@Vincent45678 That is exciting! Let me know if there is anything you have issues with and I can try to put something together for you to help!
@CDMAG10Ай бұрын
Coming up to 3 years of being a Data Engineer and love it so far! Biggest surprising challenge I found was the degree to understanding various Business contexts, Translating technical work into value for stakeholders, like analysts and decision-makers, was an essential skill but one that also requires clear communication skills and business awareness!
@GambillDataEngineeringАй бұрын
That is awesome. Data Engineering is something that I love. It is my wife's hell and my mother has no idea what I do but, I love it! 100% Communication and learning how to explain what you do to business stakeholders is paramount to having a successful career and move up the ladder! If you are focusing on a specific domain having domain knowledge will take you far too... especially in healthcare and finance.