Please provide me a link for this particular data set
@ilyeskesri65245 ай бұрын
Good tutorial thanks
@ansumankumar47978 ай бұрын
Very nice explanation.
@InsightsByRish8 ай бұрын
Thank you. Pleased to know you liked the content.
@ibrahimnazari83766 ай бұрын
nice explanation, thanks
@latifaharums2 ай бұрын
hello, may i ask for predict testing?
@InsightsByRish2 ай бұрын
If you're asking how to make predictions on test data, you just need to load your trained model in pickle format and pass the test data to it
@hoangha66803 ай бұрын
Thanks for the video. But I think filling missing values with the mean in the Target variable isn't a good idea. Since it can lead to bias or loss of information
@InsightsByRish3 ай бұрын
So according to you, how should the missing values from target col be handled?
@hoangha66803 ай бұрын
@@InsightsByRish in this case, we can consider dropping those rows
@InsightsByRish3 ай бұрын
@@hoangha6680 It can only be done when you have a large volume of data. If you're dropping rows from a dataset that's already small, you'll lose a significant portion of data, which can eventually hamper your model's performance.
@frootiiiiiiii2 ай бұрын
Mam why this y=df['price '] used here insted of y=df[['price']] why 1D series used here
@InsightsByRish2 ай бұрын
y = df['Price'] here the single square bracket symbolizes the data in 'array format' whereas in y = df[['Price]] the double square bracket symbolizes data in 'data frame format'. And traditionally the target column/feature needs to be in array format and not in data frame. (That's a good practice)
@frootiiiiiiii2 ай бұрын
@@InsightsByRish Ma'am, after applying all these techniques, the Mean Absolute Error is still around 11k. How can I fix this? Could it be that the MinMax Scaler needs to be changed to Standard Scaler or something else? Please suggest a solution.
@ibrahimdenisfofanah64202 ай бұрын
@@frootiiiiiiii try changing the MinMax Scaler to Standard scaler
@vikashsingh68886 ай бұрын
It shows an error when I select_datatype('object') and when I change it to int then my column size changes please help 🙏 At line 27
@InsightsByRish6 ай бұрын
It's not select_datatype('object'). It is select_dtypes('object').
@vikashsingh68886 ай бұрын
Yes I tried this one also but it's showing issue... While I run the encoder it shows an error that you have to select int or str
@vikashsingh68886 ай бұрын
I tried it also but it's showing error while I run the encoder part its showing an error that you should select int or str in encoder.fit(df[cat_col]) @@InsightsByRish
@InsightsByRish6 ай бұрын
@@vikashsingh6888 Hey, not really able to understand what's going wrong with your code, but I would suggest that restart your kernel and again implement the code from beginning line by line as it is (I've provided the code on GitHub).
@vikashsingh68886 ай бұрын
If you will provide your insta or something for contact even email I will send you the ss .. if you can it will help me 🙏
@flavourlyfreshasmr1517 ай бұрын
Can you explain, how you remove columns and rows from the original dataset, logic?
@InsightsByRish7 ай бұрын
Logic is the columns that don't help us in any way for making predictions are removed. Like the User ID column which has zero correlation with the target column. And for rows they are either removed if they contain null values or outliers.
@flavourlyfreshasmr1517 ай бұрын
@@InsightsByRish can you make separate video to discuss which features we have to take and which should remove , in our particular problem statement.you r videos are so helpful
@InsightsByRish7 ай бұрын
Thank you! Will make one on that topic soon.
@flavourlyfreshasmr1517 ай бұрын
@@InsightsByRish Great 👍
@alaa_abdestar3 ай бұрын
kaggle dataset link plz??
@InsightsByRish3 ай бұрын
I downloaded the dataset from GeeksforGeeks : media.geeksforgeeks.org/wp-content/uploads/20240905183434/HousePricePrediction.xlsx
@akshit_trivedi7 ай бұрын
Can you please make a playlist of all projects?
@InsightsByRish7 ай бұрын
Yes sure..!
@Luminouss120966 ай бұрын
from sklearn.linear_model import LinearRegression model = LinearRegression() model.fit(X_train, y_train) mam this part of code show error how can i solve this
@InsightsByRish6 ай бұрын
What's the error?
@Luminouss120966 ай бұрын
@@InsightsByRish model.fit(X_train, y_train) Input X contains NaN. LinearRegression does not accept missing values encoded as NaN natively. For supervised learning, you might want to consider sklearn.ensemble.HistGradientBoostingClassifier and Regressor which accept missing values encoded as NaNs natively. Alternatively, it is possible to preprocess the data, for instance by using an imputer transformer in a pipeline or drop samples with missing values.
@InsightsByRish6 ай бұрын
@@Luminouss12096 I think your input data frame (X) still consist of missing values. Try removing that before building the model. If you still encounter the error then mail your .ipynb file on insightswithrish@gmail.com.
@hriupadhyay57636 ай бұрын
Can you please provide a link for this particular data set please
@InsightsByRish6 ай бұрын
Hey, I've provided the GitHub repository link in description box. Dataset is available there.
@MuhammadEhtisham-cm5vr6 ай бұрын
plz make a seperate videos for feature engineering and scaling
@InsightsByRish6 ай бұрын
Ok, will make soon.
@jaisiddhisuperpowerpltd18725 ай бұрын
mam , are you indian or foreigner
@green_valley_farms7 ай бұрын
can you please share the code
@InsightsByRish7 ай бұрын
Hey, I have provided the link of GitHub repository for the code in the description box.