What is the activation function for?

R

RandomProgrammer2020-07-13 15:15:27

Neural networks

RandomProgrammer, 2020-07-13 15:15:27

Started studying machine learning. I tried to figure out how the perceptron works, and I ran into the fact that I don’t know what the activation function is for and what it should do.
With the rest of its parts (input data, weights, neuron), how they work and what they are for, I have already figured out. Please explain what the activation function is for, if possible with a specific example.

Reply

Answer the question

In order to leave comments, you need to log in

2 answer(s)

F

freeExec, 2020-07-13
@RandomProgrammer

For non-linearity, otherwise everything can be reduced to some simpler formula than 100500 coefficients.

M

mayton2019, 2020-07-17
@mayton2019

The activation function cannot be explained in isolation from the neuron. The main requirements for it are continuity and monotonicity over the entire domain of definition. Either the logistic sigmoid or the hyperbolic tangent is taken as such a function. They are saturated at the borders. That is, the infinite growth of the argument does not lead to the growth of the function.
The function is related to Kolmogorov Arnold's theorem.