Showing posts from March, 2025

Markov Decision Process (MDP) Concept

The MDP extends the Markov Reward Process (MRP) by adding actions (A) and a policy (π). Whereas an MRP only evaluates the overall value of an episode or environment, an MDP seeks a policy that maximizes the value of the env…

Multicore
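
To make the idea of "a policy that maximizes value" concrete, here is a minimal sketch using value iteration, one standard way to solve a small MDP (the post itself may use a different method). The two-state environment, its action names, rewards, and the discount of 0.5 are all hypothetical, chosen only for illustration.

```python
# Hypothetical toy MDP (not from the post): deterministic transitions,
# TRANSITIONS[state][action] = (next_state, reward).
TRANSITIONS = {
    "A": {"stay": ("A", 0.0), "go": ("B", 1.0)},
    "B": {"stay": ("B", 2.0), "go": ("A", 0.0)},
}


def value_iteration(transitions, gamma=0.5, sweeps=100):
    """Return (state values, greedy policy) for a deterministic MDP."""
    values = {state: 0.0 for state in transitions}
    for _ in range(sweeps):
        # Bellman optimality backup: best immediate reward plus
        # discounted value of the state the action leads to.
        values = {
            state: max(r + gamma * values[nxt] for nxt, r in actions.values())
            for state, actions in transitions.items()
        }
    # The policy picks, in each state, the action with the highest backup.
    policy = {
        state: max(
            actions,
            key=lambda a: actions[a][1] + gamma * values[actions[a][0]],
        )
        for state, actions in transitions.items()
    }
    return values, policy


values, policy = value_iteration(TRANSITIONS)
print(values, policy)
```

In this toy setup the resulting policy moves from A to B and then stays in B, since B's repeated reward outweighs anything available in A.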

Markov Reward Process (MRP)

The Markov Reward Process (MRP) extends the Markov Chain with a reward and a discount factor, gamma (γ), which represents how future rewards lose value over time. While a Markov Chain consists of states (S) and a …

Multicore
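
The role of the discount factor γ can be illustrated with a short sketch that computes the discounted return of one reward sequence, G = r₀ + γ·r₁ + γ²·r₂ + …. The function name and the sample rewards are assumptions made for this example, not taken from the post.

```python
def discounted_return(rewards, gamma):
    """Sum a reward sequence, discounting step t by gamma ** t."""
    total = 0.0
    # Work backwards so each step applies G_t = r_t + gamma * G_{t+1}.
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


# Hypothetical episode: rewards 1, 0, 2 with gamma = 0.5
# gives 1 + 0.5 * 0 + 0.25 * 2 = 1.5.
print(discounted_return([1.0, 0.0, 2.0], gamma=0.5))
```

With γ = 1 the result is the plain sum of rewards; smaller values of γ make later rewards count for less.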