What is the input of the neural network during reinforcement learning and what do we have at the output?

I

intTosha2017-11-03 20:30:50

Neural networks

intTosha, 2017-11-03 20:30:50

Good day.
When reading a book about reinforcement learning, the following questions arose:
1. What is the input? Environment conditions? Action value? If the former, how is the latter used? That is .. How the value of actions affects the choice, if we submit states to the input. And vice versa. How does the agent learn about the states of the environment if the input is the Value of actions. I'm sorry for the stupid question, but I'm just confused.
2. Do I understand correctly that there should be possible actions at the output?
3. What does a neural network look like? Is it a perceptron or something else?
Can you give a simple example from a game like Dendivian Ping-Pong? I don’t understand the example of a multi-armed bandit, at least kill ...

Reply

Answer the question

In order to leave comments, you need to log in

2 answer(s)

I

ivodopyanov, 2017-11-06
@ivodopyanov

As input - the state of the environment, somehow encoded, plus a possible action. The output is the value of this action. In their first article, DeepMind calculated values for all actions at once, so there was not one output, but N. But this is not necessary.
Plus, there are more complex actor-critic reinforcement learning models.
The architecture of the neural network depends on the way the environment is encoded. It can be a persptron, or it can be convolutional neural networks if we give a visual signal. And it can contain recurrent networks, if the input is text or some other sequence.

A

Arseny Kravchenko, 2017-11-06
@Arseny_Info

One of the easiest examples https://github.com/pytorch/examples/blob/master/re...