| |||||||||||||||||
![]() |
![]() |
![]() |
|||||||||||||||
|
Adaptive Learning Numerical Example
Suppose we want to make a game to improve the ability of user to memorize only four very difficult Japanese Kanji characters. For simplicity, we call this Kanji character “a”, “b”, “c”, and “d” and in our example, user makes only 10 trials. In this example, we will use to make the program can learn adaptively to user response. In the first table, user input is recorded as “Right” if the answer is correct and “Wrong” if the user make mistake. In our example, the first generated character is “a” and the user give wrong answer. The second generated character is “b” and the user answer correctly. The fourth generated character is “a” again and this time our user pass correct answer, and so on. The user can only pass one input for every trial. Thus in each column of the table, only single entry with binary values is allowed, either “Right” or “Wrong”. Table 1 User response
Let us define Table 2 Cumulative number of entry for each character,
Histogram In the table 3, we show the histogram of weighted answer
Table 3 Histogram of reward and punishment,
If This probability of success is calculated by dividing each entry of Table 3 by the entry of table 2. The sum of each column is the denominator of equation (A) while each entry of the new table is the nominator of equation (A). The result of computation is shown in Table 4. The summation of probability in each trial is always 100%. Table 4 Probability that character
We need to divide the histogram Our target in this game is to get the cumulative probability of failure. We need to design the program response such that the characters that have small probability of correct answers will be asked more frequently. In other words, we want to get the failure distribution instead of success distribution. The probability distribution we have so far represents the success answers. The probability distribution of failure is given by normalizing one minus the probability of success. Table 5 gives probability distribution of failure of user response based on equation (B). Table 5 Probability distribution of failure
To examine the adaptive learning, we shall compare user
response in table 1 and the Probability distribution of failure (represents
the computer response) in Table 5. The first trial is unstable to be
compared because it has only single data. In the second trial, character
“b” get correct answer and the probability of failure reduce so much
it has smaller chance to be asked again. In trial 3, the user give wrong
answer to character “c”, and the probability of failure between character
“a” and “c” is now the same (because both has the same record of one
“wrong” answer) while character “d” has higher probability. This computer
response is very good because it will give better chance to the character
that has not been generated to be shown. Trial 4 gives correct answer
to character “a” make probability of “a” is lower (i.e. less chance
to be shown). Notice that in trial 4 the character “a” has 1 “wrong”
and 1 “right” answer, while character “b” only has one “right” answer.
The failure probability of “b” is smaller than “a”. This is the exact
response what we want. If we do not normalize the probability of success
by the cumulatively entry number .Table 6 Cumulative Probability of failure
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Table 6 shows the cumulative probability
of failure from Table 5. The inverse of the cumulative probability distribution
of failure can be used for Monte Carlo algorithm as in the .
For example, in trial 10, suppose the random number generated by computer
is denoted by Show character “a” if Show character “b” if Show character “c” if Show character “d” if Next: Adaptive learning with memory
These tutorial is copyrighted. Preferable reference for this tutorial is Teknomo, Kardi. Learning Algorithm Tutorials. http:\\people.revoledu.com\kardi\ tutorial\Learning\
Send your comments, questions and suggestion to author of this tutorial
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
© 2006 Kardi Teknomo. All Rights Reserved. Designed by CNV Media |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||