Loan Default Risk Prediction
Python, Scikit-learn
→
Summary
Developed an end-to-end classification model to predict borrower loan default risk. Performed data cleaning, missing value treatment, and feature engineering to improve model performance. Applied one-hot encoding and feature scaling while preventing data leakage through proper train-test splitting. Trained and compared Logistic Regression, Random Forest, and Gradient Boosting models. Evaluated models using Accuracy, Precision, Recall, F1-score, ROC-AUC, and Confusion Matrix. Saved trained model for inference on unseen data.