Week 5 Assignment – Supervised Learning (Regression)

Course: Applied Data Science with AI
β€’Instructor: Dr. Muhammad Mohsin Nazir
Source: Kaggel.com
DataSet Name: Customer_support_data

Overview Of Week 5 Assignment

Goal:

Build a baseline regression model to predict a numeric target variable from your dataset. In your case:

The assignment focuses on supervised learning using regression, splitting the data into training and testing sets, building a Linear Regression model, evaluating it with MAE, MSE, RMSE, Explained Variance, and visualizing predictions.

Step 1: Load Dataset

πŸ”Ž First 5 Rows (Preview)

Unique idchannel_namecategorySub-categoryCustomer Remarks Order_idorder_date_timeIssue_reported atissue_responded Survey_response_DateCustomer_CityProduct_categoryItem_price connected_handling_timeAgent_nameSupervisorManager Tenure BucketAgent ShiftCSAT Score
7e9ae164-6a8b-4521-a2d4-58f7c9fff13fOutcallProduct QueriesLife Insuranceβ€” c27c9bb4-fa36-4140-9f1f-21009254ffdbβ€”01/08/2023 11:1301/08/2023 11:47 01-Aug-23β€”β€”β€”β€”Richard BuchananMason GuptaJennifer Nguyen On Job TrainingMorning5
b07ec1b0-f376-43b6-86df-ec03da3b2e16OutcallProduct QueriesProduct Specific Informationβ€” d406b0c7-ce17-4654-b9de-f08d421254bdβ€”01/08/2023 12:5201/08/2023 12:54 01-Aug-23β€”β€”β€”β€”Vicki CollinsDylan KimMichael Lee >90Morning5
200814dd-27c7-4149-ba2b-bd3af3092880InboundOrder RelatedInstallation/demoβ€” c273368d-b961-44cb-beaf-62d6fd6c00d5β€”01/08/2023 20:1601/08/2023 20:38 01-Aug-23β€”β€”β€”β€”Duane NormanJackson ParkWilliam Kim On Job TrainingEvening5
eb0d3e53-c1ca-42d3-8486-e42c8d622135InboundReturnsReverse Pickup Enquiryβ€” 5aed0059-55a4-4ec6-bb54-97942092020aβ€”01/08/2023 20:5601/08/2023 21:16 01-Aug-23β€”β€”β€”β€”Patrick FloresOlivia WangJohn Smith >90Evening5
ba903143-1e54-406c-b969-46c52f92e5dfInboundCancellationNot Neededβ€” e8bed5a9-6933-4aff-9dc6-ccefd7dcde59β€”01/08/2023 10:3001/08/2023 10:32 01-Aug-23β€”β€”β€”β€”Christopher SanchezAustin JohnsonMichael Lee 0-30Morning5

Step 2: Preprocessing

Preprocessing Summary

βœ… Preprocessing Done!

Shape after preprocessing: (85907, 3)

Explanation:
Shape = (Rows, Columns)
Rows = number of customer interactions
Columns = number of features (predictors) + target (CSAT Score)

Step 3: Define Features and Target

Features and Target

Features Shape: (85907, 2) β†’ Predictor variables used for regression

Target Shape: (85907,) β†’ CSAT Score values to predict

Step 4: Train/Test Split

Train/Test Split Completed

Training Samples: 68,725 β†’ Used to train the regression model

Testing Samples: 17,182 β†’ Used to evaluate model predictions

Step 5: Build Linear Regression Model

Linear Regression Coefficients

Item_price: -0.000021 β†’ Effect on CSAT Score per unit change in Item_price

connected_handling_time: 0.000140 β†’ Effect on CSAT Score per unit change in connec

Step 6: Predict on Testing Set

Model Evaluation Metrics (Manual Calculation)

Formulas used:

Step 7: Evaluate Model

Model Evaluation Metrics (Values)

MAE: 1.04 β†’ Average absolute prediction error

MSE: 1.87 β†’ Average squared prediction error

RMSE: 1.37 β†’ Standard deviation of prediction errors

Explained Variance: 0.01 β†’ Portion of variance explained by model

Step 8: Visualize Results

Correlation Analysis Visualization
Correlation Analysis Visualization

Step 9: Display Predicted vs Actual Table

Predicted vs Actual CSAT

Index Actual CSAT Predicted CSAT Residual
6787154.2604570.739543
4018754.2780210.721979
6007554.2604570.739543
6956054.2604570.739543
260554.2750760.724924
7332754.2604570.739543
438214.260457-3.260457
1040554.2604570.739543
2449454.2604570.739543
547314.260457-3.260457

πŸ“Œ Project Milestone – Week 5

"Build your first baseline regression model."

Explanation in context of your project (E-commerce Recommendation System):

This milestone demonstrates the first working predictive model for your semester-long project and sets the foundation for further analysis and model optimization.