ML/AI - Reinforcement Learning & MDP
Have you ever thought about understanding the basic concepts of Machine Learning & AI by doing some simple hands-on activities. In this course, we will focus on Reinforcement Learning and MDP.
Overview
Prerequisites
Reinforcement Learning
Three spaces
Observation space
Action space
Markov Decision Process (MDP)
MDP explained
MDP state
Reward for action
MDP state example
State space
State after an action
Maximise reward
Summary
Recap
MDP Recap continued
Possible states
An example
Reward based actions
Determine if the action is good
Reward Notation
Probability of Decision
Probability Notation
Formal definition
MDP defined
Why MDP
Policy
Policy example
Need for Policy
How to build Policy
Recap
Goodness of a state
Find value of state
Actions we can take
States an action lead to
How good a state is
Repeat for every state
Math Notation
Determine good action
More Specific Math Notation
Include Gamma
Congratulations on completing this course!
The instructor was well spoken as always, made his points clear, and had materials prepped so there were no delays in the material being delivered. I wish so...
Read MoreThe instructor was well spoken as always, made his points clear, and had materials prepped so there were no delays in the material being delivered. I wish some illustrations and the final equation at the end were a bit clearer.
Read LessIt would be more beneficial if there was a real life example in the introduction that could directly relates to this topic, and how does AI learn based on th...
Read MoreIt would be more beneficial if there was a real life example in the introduction that could directly relates to this topic, and how does AI learn based on the same principles. It will be fun if there is higher emphasis on why there would be a policy framework formed
Read Less