Reinforcement Learning 101

February 26, 2024

Reinforcement Learning 101

Any new subject can be learned in the same way as we learn any new language (Learning new vocabulary → Understanding Language syntax → Conversing with localites)

Objective of reinforcement learning is to implement automation, optimization and discovery with minimal human intervention.

In this blog I will give an introduction to Reinforcement learning vocab

Agent

Single
Multi

Independent Q-learning
CTDE
MADDPG

Environment

Stationary
Non-Stationary

State
Action
Episode, Visit
Reward

Long Term
Short Term

Markov Decision Process
Bellman Equation
Transition Probability
Discounting Factor
Policy
Value Function

State

Monte Carlo Method to learn

Action

Model

Free

Q-Learning
SARSA
DQN

Based

Dyna-Q
Monte Carlo Tree Search
PILCO

Policy Gradient Method

REINFORCE

Iteration

Policy
Value

Evaluation

Policy

Improvement

Policy

Exploration vs Eploitation

Exporation Strategies

Initialization
Termination
Actor Critic Methods

I'll keep updating this blog whenever I have time

Search This Blog

Hello World!

Reinforcement Learning 101

Comments

Post a Comment

Popular Posts

Understanding higher dimensions