![]() |
This is about finding accurate predictors of individual risk in the credit portfolios. The reader is invited to repeat these steps using the example on a fictitious mortgage loan data set to get a feeling for the procedure and also for the applicability of the microCortex computer environment. This data set refers to mortgage loans to individuals.
You will understand how to:
|
Just follow the steps and... enjoy it!
The data set used here refers to mortgage loans to individuals. The likelihood of repayment is measured as a simple: "Yes, the client will pay back the loan" or "No, the client will not pay back the loan".
ANN's work through a process of learning with examples from the past in order to predict the future - the same is to say, learning a generalizable association amongst data or simply training the ANN. This means you have to set a record of your clients' past behavior leading the ANN to learn which client profiles tend to fail the repayment and which don't. This record must have two main categories of data:
The input data (Fig.1) - the set of values for each criteria you think can influence the loan repayment.
Example: customer's age, income, number of children. Note that you will also be able to access the sensitivity analysis (how strongly each variable determines the repayment).
The output data (Fig.2) - the set of records with the "answer" of each time you loaned money: "Yes, the client payed back the loan" or "No, the client didn't pay back the loan".
If you gather that data in a spreadsheet you get something like shown in the figures:
Your Spreadsheet
Fig. 1 - Input Data in a spreadsheet
Fig. 2-Output Data in a spreadsheet
Note: The sample data used to generate it is in the links at the bottom of this page.
Please remember this sample data is totally fictitious. It's not our purpose to give you here an accurate view of the risk management in banking. Real world application of ANN's to Credit Risk Assessment can possibly understand the use of other criteria and relationships amongst data different from the ones shown here.
An example: "beeing divorced" is said here as having a strong influence on the mortgage likelihood of repayment - although this can be true don't take it as a cientific or even empirical basis to say this is what happens in the real world.
One last call about two important issues in your data:
Data can now be pasted into the right boxes.
First, the Inputs:
Fig. 3 - Pasting input data into browser
... then the Outputs:
Fig. 4 - Pasting output data into browser
Note: Data is presented in separate spreadsheets for a better understanding of the process, but obviously it can be placed in one spreadsheet only, as it usually is.
While entering data pay attention to the following Submitting Rules (check Figs. 3 and 4):
After this final operation you get a new screen (white box below) confirming that the data was submitted and an ID number is assigned so you can retrieve it and use it later:
|
Job submitted.
Please wait until you receive an email indicating that your job is finished, and then go to the ANN List page. TestUser_977142474_7069_bee3133fe707c783f61bddcbe6c42612
An e-mail was sent to TestUser@microcortex.com |
You also get an email with a similar message.
From now on the ANN can be retrieved, at any time, by going to the ANN List link after you log on.
By clicking on the ANN ID, you will move to the ANN Analysis page, where you can have a glance at the quality of the trained ANN:
Fig. 6 - Statistics of the trained ANN: Sensitivity |
|
For those more familiar with statistical analysis the ANN Analysis page gives a good impression on the statistical quality measure of the trained ANN. If that is your case, check the details on statistical results.
|
After pressing the "Predictions" button you are asked for " input questions".
As an example, the following 4 input sets can be submitted to request ANN predictions for the output - "Will they pay or not?":
|
The input questions are lines of data values, one column for each variable, in the same format used in the beginning for the input data. Once again you can simply copy and paste values from a tab-delimited text file or any spreadsheet. Apart from having as many input variables (columns) as the input data set used to train the ANN there is no other restriction.
The corresponding 4 output predictions, are generated bellow the box for submitting:
|
||
|
||
|
||
If you feel comfortable about ANN terms, you may check the section for advanced users for more details on analysing the trained ANN in the "Walk Through Guide".