Q Learning

Q-learning is a model-free reinforcement learning algorithm used to learn the value of an action in a particular state. The ‘Q’ in Q-learning stands for ‘quality’, which represents how useful a given action is in gaining some future reward. It does not require a model of the environment, and it can handle problems with stochastic transitions and rewards without requiring adaptations.

Q Learning

Areas of application

  • 1. Robotics and Autonomous Systems
  • 2. Game Playing and AI
  • 3. Recommendation Systems
  • 4. Financial Trading and Investment
  • 5. Healthcare and Medical Decision Making

Example

For example, consider an autonomous vehicle navigating through a maze. The Q-learning algorithm can be used to learn the optimal action (e.g. left or right turn) in each state (e.g. at a fork in the road) to maximize the reward (e.g. reaching the destination).