The Importance of Credit Default Prediction
In 2022 alone, American Express generated approximately $53 billion in revenue. TransUnion projected a surge in credit card delinquencies, from 2.1% at the end of 2022 to 2.6% by the end of 2023. Anticipating which customers are most likely to default on their credit card accounts enables issuers to mitigate risk and exposure proactively.
Given the abundance of readily accessible customer data and many indicators, employing Machine Learning algorithms to forecast defaults presents a lucrative opportunity.
- Python
- Data Analysis and Preprocessing
- Machine Learning
- Feature Selection
- Hyperparameter Tuning
- Model Evaluation
- Data Visualization
- Libraries: pandas, numpy, scikit-learn, xgboost, keras, tensorflow
- Credit Default Prediction: The model predicts the probability of credit card default, which can help financial institutions assess customer risk.
- Risk-Based Strategies: The code implements both conservative and aggressive strategies based on different prediction thresholds, allowing for flexible risk management.
Data
The historical data from credit card transactions covers 458,913 customers over a span of 13 months, with 190 variables categorized into Payment, Spend, and Balance. Each month contains between 30,000 and 40,000 observations, and the percentage of customers defaulting in each month ranges from 23% to 28%.
Target Variable = 1 if the customer defaulted on the credit card payment, 0 if the customer did not default.
Dataset: https://www.kaggle.com/competitions/amex-default-prediction/overview
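A minimal sketch of loading this data, assuming the Kaggle competition's file layout (train_data.csv with a customer_ID key and a statement-date column S_2, and train_labels.csv with the binary target); file names and column choices here are assumptions, not part of the original write-up.

```python
import pandas as pd

# Assumed Kaggle file names; adjust paths to the local copy of the data.
train = pd.read_csv("train_data.csv")     # 190 feature columns + customer_ID + S_2 (statement date)
labels = pd.read_csv("train_labels.csv")  # customer_ID + target (1 = default, 0 = no default)

df = train.merge(labels, on="customer_ID", how="left")

# Rough check of the default rate by statement month
# (expected to fall in the 23%-28% range described above).
df["month"] = pd.to_datetime(df["S_2"]).dt.to_period("M")
print(df.groupby("month")["target"].mean())
```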
Features
All features are divided into five categories: Delinquency, Payment, Balance, Risk, and Spend.
Feature Selection
Built two XGBoost models to rank features by their feature importance scores.
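As a rough illustration of this ranking step (not the exact original code), the sketch below fits an XGBoost classifier and sorts features by importance score; X_train and y_train are placeholders for the preprocessed feature DataFrame and target.

```python
import pandas as pd
from xgboost import XGBClassifier

# X_train (DataFrame) / y_train are placeholders for the preprocessed features and target.
model = XGBClassifier(n_estimators=100, learning_rate=0.1, eval_metric="auc")
model.fit(X_train, y_train)

# Rank features by importance score and keep the top-ranked ones for the final model.
importance = (
    pd.Series(model.feature_importances_, index=X_train.columns)
      .sort_values(ascending=False)
)
print(importance.head(20))
```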
XGBoost - Grid Search
The grid search explored the following combinations (a sketch of the search follows the list):
- Number of trees: 50, 100, and 300 :– 50 to reduce complexity and variance; 300 was tried later for lower bias
- Learning rate: 0.01, 0.1 :– 0.1 is the conventional choice; 0.01 checks whether a slower learning rate reaches the global minimum smoothly without overshooting
- % of observations used in each tree: 50%, 80% :– 50% for faster training and 80% to avoid overfitting
- % of features used in each tree: 50%, 100% :– 50% to avoid overfitting and speed up training, 100% for better results and lower bias
- Weight of default observations: 1, 5, 10 :– since most observations are non-defaults, weights greater than 1 help balance the classes
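A minimal sketch of this search using scikit-learn's GridSearchCV over the xgboost sklearn API (the original may have looped over combinations manually); X_train and y_train are placeholders for the selected features and target.

```python
from sklearn.model_selection import GridSearchCV
from xgboost import XGBClassifier

# Grid mirroring the combinations listed above.
param_grid = {
    "n_estimators": [50, 100, 300],     # number of trees
    "learning_rate": [0.01, 0.1],
    "subsample": [0.5, 0.8],            # % of observations used in each tree
    "colsample_bytree": [0.5, 1.0],     # % of features used in each tree
    "scale_pos_weight": [1, 5, 10],     # weight of default observations
}

search = GridSearchCV(
    estimator=XGBClassifier(eval_metric="auc"),
    param_grid=param_grid,
    scoring="roc_auc",
    cv=3,
    n_jobs=-1,
)
search.fit(X_train, y_train)  # X_train / y_train: placeholder feature matrix and target
print(search.best_params_)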
Plot 1: Bias-variance tradeoff at X = 0.94 and Y = 0.0075 (the difference in Y is small, so the lowest-bias configuration is preferred)
Plot 2: Linear relationship between train AUC and test2 AUC, so the highest train AUC is preferred
Final XGBoost Model Parameters
Rank Ordering
In rank ordering, as we move to higher prediction-score ranges (i.e., raise the threshold), the observed default rate rises, confirming that the model ranks riskier customers higher.
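A minimal sketch of such a rank-ordering check, assuming hold-out predictions `proba` and actual labels `y_true` (both placeholders): scores are cut into ten bins and the observed default rate should increase from the lowest bin to the highest.

```python
import pandas as pd

# proba: predicted default probabilities on a hold-out set; y_true: actual labels.
ranked = pd.DataFrame({"proba": proba, "actual": y_true})

# Split scores into 10 equal-sized bins and inspect the default rate per bin.
ranked["bin"] = pd.qcut(ranked["proba"], q=10, labels=False, duplicates="drop")
table = ranked.groupby("bin").agg(
    avg_score=("proba", "mean"),
    default_rate=("actual", "mean"),
    customers=("actual", "size"),
)
print(table)
```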
SHAP Analysis
❒ BeeSwarm - Explains the cumulative impact of features on the model output
Higher values of P_2 drive the score down, meaning the higher this payment variable, the lower the probability of default
Most features increase their impact on the model output as the feature value increases
❒ Waterfall - Explains the prediction for a specific observation
Expected model output = -1.308; output for the 1,100th customer = -4.311
P_2 single-handedly drives the prediction down by 1.26, whereas the 37 other features collectively drive it down by 1.17
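A minimal sketch of producing these two plots with the shap package (not listed among the libraries above, but the standard tool for this analysis); `final_model` and `X` are placeholders for the fitted XGBoost model and its feature matrix.

```python
import shap

# Tree explainer for the fitted XGBoost model on the feature matrix X.
explainer = shap.TreeExplainer(final_model)
shap_values = explainer(X)

# Beeswarm: impact of each feature across all observations.
shap.plots.beeswarm(shap_values)

# Waterfall: feature contributions for a single customer (index 1100 here).
shap.plots.waterfall(shap_values[1100])
```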
Neural Network
NN Grid Search
Combinations of hyperparameters in the grid search (a sketch of one configuration follows the list):
- Number of hidden layers: 2, 4 :– 4 to increase complexity and lower bias, 2 for faster runtime
- Nodes in each hidden layer: 4, 6 :– 4 for a simpler network and 6 for a more complex one
- Activation function: ReLU, tanh :– ReLU does not saturate (though it is not zero-centered), while tanh can cause vanishing gradients
- Dropout regularization: 50%, none (keep rate 100%) :– 50% to decrease complexity and avoid overfitting
- Batch size: 100, 10,000 :– 100 is not small enough to overfit each batch, and 10,000 allows faster processing
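A minimal Keras sketch of one grid point, assuming placeholder arrays X_train, y_train, X_val, y_val and an arbitrary epoch count; the full search would loop this builder over the combinations listed above.

```python
from tensorflow import keras
from tensorflow.keras import layers

def build_model(hidden_layers=4, nodes=6, activation="relu", dropout=0.5):
    """Build one configuration from the grid; binary output with sigmoid."""
    model = keras.Sequential()
    model.add(layers.Input(shape=(X_train.shape[1],)))
    for _ in range(hidden_layers):
        model.add(layers.Dense(nodes, activation=activation))
        if dropout:
            model.add(layers.Dropout(dropout))
    model.add(layers.Dense(1, activation="sigmoid"))
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=[keras.metrics.AUC(name="auc")])
    return model

# Train a single configuration; epochs=20 is an assumed value.
model = build_model(hidden_layers=4, nodes=6, activation="relu", dropout=0.5)
model.fit(X_train, y_train, batch_size=100, epochs=20,
          validation_data=(X_val, y_val))
```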
Final Model
Winners under various categories:
| Category | Winner |
|---|---|
| Bias | XGBoost |
| Variance | Neural Network (diff in Std Dev is negligible) |
| Explainability | XGBoost (SHAP Analysis) |
Strategy
The conservative strategy has a lower threshold than the aggressive one; hence it accepts fewer applicants.
0.5 :– Aggressive strategy: we want to increase revenue while keeping the default rate below 10%
0.3 :– Conservative strategy: the default rate drops almost by half while revenue is not drastically affected
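A minimal sketch of applying these two thresholds, assuming placeholder numpy arrays `proba` (predicted default probabilities for applicants) and `y_true` (actual outcomes):

```python
import numpy as np

def evaluate_strategy(proba, y_true, threshold):
    """Accept applicants whose predicted default probability is below the threshold."""
    accepted = proba < threshold
    default_rate = y_true[accepted].mean() if accepted.any() else 0.0
    return accepted.sum(), default_rate

# Conservative (0.3) accepts fewer applicants; aggressive (0.5) accepts more
# while aiming to keep the observed default rate below 10%.
for name, thr in [("Conservative", 0.3), ("Aggressive", 0.5)]:
    n_accepted, dr = evaluate_strategy(proba, y_true, thr)
    print(f"{name} (threshold {thr}): accepted={n_accepted}, default rate={dr:.1%}")
```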
Thank You. Let’s keep learning and growing together!










