Create an India credit risk(default) model, using the data provided in the spreadsheet raw-data.xlsx, and validate it on validation_data.xlsx. Please use the logistic regression framework to develop the credit default model.

Hints :

Data description – Please direct them to the video – Default Risk Prediction. After removing variables for multicollinearity, we should try to take at least one variable for creating the model from each of the 4 factors namely –

1) Profitability

2) Leverage

3) Liquidity

4) Company’s size

For Default Risk Estimation, all the variables are bifurcated in different buckets in the categories tab in raw_data file.

Creation of new variables – This is an important step in the project as the company which is the biggest in size, will also have bigger asset size, cash flows, etc. (Hint: We need to think in terms of ratios – Equity to asset ratio, debt to equity ratio, etc)

Dependent variable – We need to create a default variable which should take the value of 1 when net worth is negative & 0 when net worth is positive.

Validation Dataset –  We need to build the model on the raw dataset and check the model performance measures on the validation dataset.

Please find attached the files to be referred.

Please note the following:

  • You have to submit 2 files      :
    1. Business Report not exceeding 3000 words. In this, you need to submit all the answers to all the questions in a sequential manner. Your answer should include detailed explanations & inferences to all the questions. Your report should not be filled with codes.
    2. R code file: This is a must and will be used for reference.

 

Criteria Points

 

1. Outlier Treatment – Outlier   Treatment

2. Missing Value   Treatment

3. New Variables Creation   (One ration for profitability, leverage, liquidity and company’s size each )

4. Check for   multicollinearity

5. Univariate &   bivariate analysis

6. Build Logistic   Regression Model on most important variables

7. Analyze coefficient   & their signs

8. Predict accuracy of   model on dev and validation datasets

9. Sort the data in   descending order based on probability of default and then divide into 10 deciles  based on probability & check how well the model has performed

Found something interesting ?

• On-time delivery guarantee
• PhD-level professional writers
• Free Plagiarism Report

• 100% money-back guarantee
• Absolute Privacy & Confidentiality
• High Quality custom-written papers

Related Model Questions

Feel free to peruse our college and university model questions. If any our our assignment tasks interests you, click to place your order. Every paper is written by our professional essay writers from scratch to avoid plagiarism. We guarantee highest quality of work besides delivering your paper on time.

Grab your Discount!

25% Coupon Code: SAVE25
get 25% !!