Reinforcement learning

3 min readDec 22, 2020

What is Reinforcement Learning?

RL is a machine learning technique focused on feedback in which an agent learns to behave in an environment by performing actions and seeing the consequences of actions.

The agent gets positive feedback for each good action, and the agent gets negative feedback or punishment for each poor action. In RL, unlike supervised learning, the agent learns automatically using feedback without any labelled data.

Since there is no labelled data, the agent is only bound to learn from his experience.

Steps for Implementing Reinforcement Learning

The principle of “cause and effect” can be converted into the following steps for the RL agent:

1)The agent shall observe the input state
2)Conduct shall be decided by the decision-making mechanism (policy)
3)The action shall be taken
4)The agent shall receive a scalar incentive or reinforcement from the environment
5)Information on the incentive offered for that state/action pair is reported

Example of Reinforcement Learning.
· self-driving cars
· healthcare
· news recommendation
· gaming

Characteristics of Reinforcement Learning
· Here are major aspects of reinforcement learning
· There is no boss, just an individual number or incentive signal,
· Sequential Entscheidungsfinding
· In reinforcement questions, time plays a key role in
· Feedback is always delayed, not immediate.
· The acts of the agent decide the resulting information it receives.
Types of Reinforcement learning
Two types of reinforcement learning are
1) Positive 2) Negative
Positive is where reward is given for the presentation of the desired behaviour, and negative is taken away from an unpleasant aspect of a person’s life if the desired behaviour is accomplished.
Applications of Reinforcement Learning
· For industrial automation, Reinforcement learning can be used in robotics.
· In machine learning and data processing, Reinforcement learningcan be used for
· Reinforcement learning can be used to build training systems which according to student requirements, provide personalised instruction and materials.
Why use Learning for Reinforcement?
Here are the key factors for using Reinforcement Learning:
· It allows you to figure out which situation demands a response.
· It helps you to find out which action offers the highest reward over a prolonged period of time.
· Reinforcement Learning also provides a reward feature to the learning agent.
· It also helps the right strategy for receiving big incentives to be worked out.When can reinforcement learning not be used?
The entire condition is that you can’t apply the reinforcement learning paradigm.Here are certain situations when the reinforcement learning paradigm can not be used.
· If you have enough knowledge with a supervised learning approach to solve the problem
· You need to note the computing-heavy and time-consuming Reinforcement Learning is,In particular, when the space for action is high.
Summary

Reinforcement learning is a form of machine learning.

· Helps you to discover over the longer term which behaviour produces the greatest reward.

· Agent, State, Compensation, Environment, Value System Environmental model, Model-based approaches, are some important terminology used in the RL method of learning

· Your pet is an agent that is introduced to the world as an indicator of reinforcement learning.The biggest characteristic of this method is that there is no supervisor, only a real number or reward signal

· Two kinds of reward instruction are 1) positive 2) negative
1) Markov Decision Process 2) Q Learning Process 2) Two commonly used learning models are

· The Reinforcement Learning Method works on environmental interaction, while the supervised learning method works on sample or example data provided.

· Technology or reinforcement preparation approaches are: factory automation robots and business management planning

· If you have enough data to fix the problem, you need not use this process.
The key problem of this approach is that parameters will impact the speed of learning.

Reinforcement learning

Written by Popat Avani