Kardi Teknomo
Kardi Teknomo Kardi Teknomo Kardi Teknomo


Visit Tutorials below:
Adaptive Learning from Histogram
Adjacency matrix
Analytic Hierarchy Process (AHP)
ArcGIS tutorial
Arithmetic Mean
Bayes Theorem
Bootstrap Sampling
Bray Curtis Distance
Break Even Point
Chebyshev Distance
City Block Distance
Conditional Probability
Continued Fraction
Data Analysis from Questionnaire
Data Revival from Statistics
Decimal to Rational
Decision tree
Difference equations
Digital Root
Discriminant analysis
Eigen Value using Excel
Euclidean Distance
Euler Integration
Euler Number
Excel Iteration
Excel Macro
Excel Tutorial
Feasibility Study
Financial Analysis
Generalized Inverse
Generalized Mean
Geometric Mean
Ginger Bread Man and Chaos
Graph Theory
Growth Model
Hamming Distance
Harmonic Mean
Hierarchical Clustering
Independent Events
Incident matrix
Jaccard Coefficient
Kernel basis function
Kernel Regression
k-Means clustering
K Nearest Neighbor
LAN Connections Switch
Learning from data
Lehmer Mean
Linear Algebra
Logarithm Rules
Mahalanobis Distance
Market Basket Analysis
Mean Absolute Deviation
Mean and Average
Mean, median, mode
Minkowski Distance
Minkowski Mean
Monte Carlo Simulation
Multi Agent System
Multicriteria decision making
Mutivariate Distance
Newton Raphson
Non-Linear Transformation
Normalization Index
Normalized Rank
Ordinary Differential Equation
Page Rank
Power rules
Prime Factor
Prime Number
Q Learning
Quadratic Function
Rank Reversal
Recursive Statistics
Regression Model
Reinforcement Learning
Root of Polynomial
Scenario Analysis
Sierpinski gasket
Sieve of Erastosthenes
Similarity and Distance
Solving System Equation
Standard deviation
Summation Tricks
Support Vector Machines
System dynamic
Time Average
Tower of Hanoi
Vedic Square
Visual Basic (VB) tutorial
What If Analysis

Q-Learning By Examples

by Kardi Teknomo

Q-Learning e-book

Share this: Google+

In this tutorial, you will discover step by step how an agent learns through training without teacher in unknown environment. Reinforcement learning is training paradigm for agents in which we have example of problems but we do not have the immediate exact answer. For playing a game, for instance, an agent will make series of decisions to move and only later will find out whether those decisions are right or wrong. Reinforcement learning paradigm is similar to real life of how we learn.

In this tutorial, you will find out part of reinforcement learning algorithm called Q-learning. Reinforcement learning algorithm has been widely used for many applications such as robotics, multi agent system, game, motion planning, navigation, and etc.

Instead of learning the theory of reinforcement that you can read it from many books and other web sites (see Resources for more references), this tutorial will introduce the concept through simple but comprehensive numerical examples. If you purchase the e-book of this tutorial, you will also receive the companion worksheet and the matlab files.

Tired of ads? Read it off line on any device. Click here to purchase the complete E-book of this tutorial

Let us start the tutorial (clicks the topic below).

Modeling the Environment
Agent, State and Action Introduction
Q Learning

Q Learning Algorithm
Numerical Example
Another Q learning Example: Tower of Hanoi
Q-Learning Solution for Tower of Hanoi

Q Learning using Matlab
Q Learning using MS Excel
Practice make perfect

Click here to purchase the complete E-book of this tutorial

Share and save this tutorial
Add to: Del.icio.us  Add to: Digg  Add to: StumbleUpon   Add to: Reddit   Add to: Slashdot   Add to: Technorati   Add to: Netscape   Add to: Newsvine   Add to: Mr. Wong Add to: Webnews Add to: Folkd Add to: Yigg Add to: Linkarena Add to: Simpy Add to: Furl Add to: Yahoo Add to: Google Add to: Blinklist Add to: Blogmarks Add to: Diigo Add to: Blinkbits Add to: Ma.Gnolia Information

Send your feedback and comments for this tutorial

This tutorial is copyrighted.

Preferable reference for this tutorial is

Teknomo, Kardi. 2005. Q-Learning by Examples. http://people.revoledu.com/kardi/tutorial/ReinforcementLearning/index.html


© 2006 Kardi Teknomo. All Rights Reserved.
Designed by CNV Media