ReLU has an information-blocking problem: for roughly zero-mean inputs it is off about 50% of the time (f(x) = 0).
Information about the input is lost before it can be fully used; information that could be used to construct the output is intercepted before it reaches the output layer.
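As a concrete reference point, here is the standard ReLU step as a minimal plain-Java sketch (illustrative, not from the linked thread):

```java
// Standard ReLU: a negative pre-activation is clipped to 0, so for that example
// the neuron's outgoing weights transmit no information to the next layer.
float relu(float x) {
    return (x >= 0f) ? x : 0f;
}
```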
ResNet, with its skip connections, is one solution to this information-flow problem.
Another solution is to go slightly beyond the activation function concept:
https://discourse.processing.org/t/relu-is-half-a-cookie/32134
The idea is to double the number of weights in the neural network, giving each neuron two forward-connected weight vectors rather than the traditional one.
The activation function is then only ever the identity, f(x) = x (do nothing). Instead, the sign of x is used to select between the forward-connected weight vectors:
if (x >= 0) {
    use forward-connected weight vector A
} else {
    use forward-connected weight vector B
}
When x >= 0 the behavior is the same as ReLU's; when x < 0, instead of blocking the signal, x is sent forward through the alternative weight vector B.
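A minimal sketch of that idea in plain Java (class and field names such as TwoVectorLayer, weightsA, weightsB are my own, not from the linked thread): each neuron owns two outgoing weight vectors, and the sign of its pre-activation decides which one carries its value to the next layer.

```java
// Sketch of a layer where each neuron has two forward-connected weight vectors
// and the activation is the identity; the sign of x only selects the vector.
public class TwoVectorLayer {
    final int inputs, outputs;
    final float[][] weightsA; // weightsA[i] = outgoing vector of neuron i when x[i] >= 0
    final float[][] weightsB; // weightsB[i] = outgoing vector of neuron i when x[i] <  0

    TwoVectorLayer(int inputs, int outputs) {
        this.inputs = inputs;
        this.outputs = outputs;
        this.weightsA = new float[inputs][outputs];
        this.weightsB = new float[inputs][outputs];
    }

    // x[i] is the pre-activation of neuron i; f(x) = x, so x is passed through
    // unchanged and only the choice of outgoing weight vector depends on its sign.
    float[] forward(float[] x) {
        float[] out = new float[outputs];
        for (int i = 0; i < inputs; i++) {
            float[] w = (x[i] >= 0f) ? weightsA[i] : weightsB[i];
            for (int j = 0; j < outputs; j++) {
                out[j] += x[i] * w[j];
            }
        }
        return out;
    }
}
```

With weightsB set to all zeros this reduces exactly to a ReLU layer, which is one way to see that the scheme generalizes ReLU rather than replacing it.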