This repository is dedicated to my book 📕 𝐀𝐩𝐩𝐥𝐢𝐞𝐝 𝐃𝐚𝐭𝐚 𝐒𝐜𝐢𝐞𝐧𝐜𝐞 𝐟𝐨𝐫 𝐂𝐫𝐞𝐝𝐢𝐭 𝐑𝐢𝐬𝐤: 𝗔 𝗣𝗿𝗮𝗰𝘁𝗶𝗰𝗮𝗹 𝗚𝘂𝗶𝗱𝗲 𝗶𝗻 𝗥 𝗮𝗻𝗱 𝗣𝘆𝘁𝗵𝗼𝗻 and other topics related to credit risk modeling. It will be regularly updated with GitHub pages, slides, and PDF documents covering various modeling subjects.
The motivation behind writing this book and creating the repository stems from the observed gap between academic literature, industry practices, and the evolving landscape of data science. While there's been a notable increase in literature on credit risk modeling, discrepancies persist. The evolution of data science has led to significant automation in processes. Still, it has also brought the risk of overreliance on pre-programmed procedures, sometimes leading to the misuse of statistical methods. Moreover, many practitioners entering credit risk modeling often overlook fundamental principles, hindering their professional development. Hence, the repository aims to serve as a centralized hub for continuous education and consolidating essential concepts.
The repository and book will encompass practical examples utilizing both R
and Python
.
Below are links providing an overview of the repository's main topics, which include summaries from the book and insights gleaned from practical experience.
The Vasicek Distribution (Probability of Default Models):
- The Functional Form and Parameters Estimation Methods (pdf, presentation)
- Shiny Application for Estimating the Parameters of the Vasicek Distribution
- Asset Correlation Estimation - Maximum Likelihood: Analytical vs Numerical Optimization Approach (pdf, presentation)
- The Logistic Vasicek Distribution (pdf, presentation)
- Asset Correlation Estimation - Maximum Likelihood: Normal vs Logistic Vasicek Distribution (pdf, presentation)
- Asset Correlation Estimators and Bias Quantification (pdf, presentation)
- The Vasicek PD Model and Transition Matrices - Optimization of the Systemic Factor Z (pdf, presentation)
Loss Given Default:
- Loss Given Default as a Function of the Default Rate (pdf, presentation)
- The Vasicek LGD Model - The Functional Form and Parameters Estimation Method (pdf, presentation)
- The Vasicek LGD Model - Simulating the Distribution of the Parameters (pdf, presentation)
- The Vasicek LGD Model - The Bias Quantification of the Sensitivity Parameter (pdf, presentation)
Low Default Portfolios:
- Likelihood Approaches to Low Default Portfolios - Andrija Djurovic's Adjustment of Alan Forrest's Method to the Multi-Year Period Design: PD Domain Search Approach (pdf, presentation)
- Likelihood Approaches to Low Default Portfolios - Andrija Djurovic's Adjustment of Alan Forrest's Method to the Multi-Year Period Design: PD Optimization Approach (pdf, presentation)
- Estimating Probabilities of Default for Low Default Portfolios - Pluto-Tasche Approach (pdf, presentation)
- Conservative Estimation of Default Probabilities - Benjamin-Cathcart-Ryan Approach (pdf presentation)
Measuring Concentration Risk:
- Measuring Concentration Risk - A Partial Portfolio Approach (pdf presentation,
R
&Python
code) - Simulation dataset
Model Risk Management:
- Model Shift and Model Risk Management (pdf, presentation)
- The Instability of WoE Encoding in PD Modeling (pdf, presentation)
- Discriminatory Power Shortfalls in IRB Credit Risk Models - Risk-Weighted Assets Impact Analysis (pdf, presentation)
- The Economic Value of Credit Rating Systems - Quantifying the Benefits of Improving an Internal Credit Rating System (pdf, presentation)
- Heterogeneity Shortfalls in IRB Credit Risk Models - Risk-Weighted Assets Impact Analysis (pdf, presentation)
- Heterogeneity Shortfalls in IRB Credit Risk Models - Portfolio Returns Impact Analysis (pdf, presentation)
Model Development and Validation:
- Common Inconsistencies in Probability of Default Modeling (pdf, presentation)
- IRB PD Periodic Model Validation - Quantitative Testing Procedures (pdf, presentation)
- IRB Model Validation - Technical Aspects of Automated Reports (pdf,
R
&Python
code) - Validation of Credit Risk Models - Does the P-value Provides Sufficient Insight? (pdf, presentation)
- Validation of Credit Risk Models - On Favorable P-values in Statistical Tests (pdf, presentation)
- Margin of Conservatism Type C in PD Modeling - Central Tendency Uncertainty in the Presence of Autocorrelation (pdf, presentation)
- Time Series Analysis in Credit Risk Modeling - OLS vs Yule-Walker Estimator for Autoregressive Coefficients (pdf, presentation)
- Hypothesis Testing in Credit Risk - A Visual Approach for Deeper Understanding (pdf, presentation)
- On Testing the Concentration in the Rating Grades - The Initial and Periodic PD Model Validation (pdf, presentation)
- The Binomial Tests for PD Model Validation - The Independent and Correlated Binomial Distributions (pdf, presentation)
- The Model-Based Heterogeneity Testing (pdf, presentation)
- Risk-Weighted Assets as a Function of Probability of Default (pdf, presentation)
- Principal Component Analysis for IFRS9 Forward-Looking Modeling (pdf presentation,
R
&Python
code) - Bootstrap Hypothesis Tests (html, GitHub page)
- Bootstrap Hypothesis Tests (pdf, presentation)
- Statistical Binning of Numeric Risk Factors - PD Modeling (pdf, presentation)
- Statistical Binning and Model Validation - How the Choice of Binning Algorithm Influences Model Validation (pdf presentation,
R
&Python
code) - Hosmer-Lemeshow VS Z-score Test on Portfolio Level (html, GitHub slides)
- Hosmer-Lemeshow VS Z-score Test on Portfolio Level (pdf, presentation)
- Power Play: Probability of Default Predictive Ability Testing (pdf, presentation)
- Nested Dummy Encoding (pdf, presentation)
- Marginal Information Value (pdf, presentation)
OLS Regression:
- Consequences of Violating the Normality Assumption for OLS Regression (pdf,
R
&Python
code) - Consequences of Heteroscedasticity for OLS Regression (pdf,
R
&Python
code) - Consequences of Multicollinearity for OLS Regression (pdf,
R
&Python
code) - Consequences of Autocorrelation for OLS Regression (pdf,
R
&Python
code)
Effective Interest Rate:
Loan Repayment Plan: