Int8 neural networks supported?

MPdaedalus · July 25, 2020, 8:33pm

Hi

Just getting to grips with dl4j. I was wondering whether dl4j supports neural networks that use int8 for layers, weights, etc. instead of fp16 or fp32. I have a Pascal GPU, which is useless for fp16 but can do int8 with a 4x speedup over fp32. Apart from an int8 compressor in the source, I could not see whether dl4j has options to use integer ops for layers such as 1D convolution, max pooling, etc.

Thanks in advance.

agibsonccc · July 26, 2020, 9:02am

You typically don’t want to build models by default with int8. Int8 is typically used for inference. Dl4j does have the int8 data type but I wouldn’t recommend trying to use it for training.
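
To illustrate the inference-side use of int8 mentioned above, here is a minimal plain-Java sketch of symmetric post-training quantization: float weights from an already trained model are mapped to signed bytes with a single scale factor. It does not use any DL4J or ND4J API. The class name `Int8QuantSketch` and its methods are hypothetical names made up for this example, and per-tensor symmetric scaling is just one common scheme, assumed here for simplicity.

```java
import java.util.Arrays;

// Hypothetical illustration only; not part of DL4J or ND4J.
final class Int8QuantSketch {

    // Pick a scale so that the largest weight magnitude maps to 127.
    static float scaleFor(float[] weights) {
        float maxAbs = 0f;
        for (float w : weights) {
            maxAbs = Math.max(maxAbs, Math.abs(w));
        }
        // Avoid division by zero for an all-zero tensor.
        return maxAbs == 0f ? 1f : maxAbs / 127f;
    }

    // Round each weight to the nearest int8 step and clamp to [-127, 127].
    static byte[] quantize(float[] weights, float scale) {
        byte[] q = new byte[weights.length];
        for (int i = 0; i < weights.length; i++) {
            int rounded = Math.round(weights[i] / scale);
            q[i] = (byte) Math.max(-127, Math.min(127, rounded));
        }
        return q;
    }

    // Map int8 values back to approximate float weights.
    static float[] dequantize(byte[] q, float scale) {
        float[] w = new float[q.length];
        for (int i = 0; i < q.length; i++) {
            w[i] = q[i] * scale;
        }
        return w;
    }

    public static void main(String[] args) {
        float[] weights = {0.5f, -1.0f, 0.25f, 0.0f};
        float scale = scaleFor(weights);
        byte[] q = quantize(weights, scale);
        System.out.println("int8 weights:  " + Arrays.toString(q));
        System.out.println("reconstructed: " + Arrays.toString(dequantize(q, scale)));
    }
}
```

Each reconstructed weight differs from the original by at most about half a quantization step (`scale / 2`), which is the kind of error a trained network can often absorb at inference time.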

MPdaedalus · July 26, 2020, 9:24am

Yeah, I thought that might be the case. A shame really, looks like I will have to save up for a Turing GPU. Some of the benchmarks for int8 performance are insane, 120 TeraOPS per card, even if memory bandwidth is the limiting factor in the real world.

In principle I can’t see why byte values could not be used for everything: there is enough precision in 256 discrete values to capture most patterns, and there are some research papers showing that even fp16 is overkill for most problems, as the extra precision makes no major difference to the overall performance of the network. I’m guessing that the backprop, activation functions, convolution kernels, etc. might need to be reworked to actually train using int8.
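
One reason training is harder than inference in int8 can be sketched in plain Java: a gradient step smaller than half a quantization step rounds away to nothing, so the weight never moves. This is only an illustration under assumed values (a scale of 1/127 and a fixed step of 0.001). The class `Int8UpdateSketch` is hypothetical and says nothing about how dl4j implements training.

```java
// Hypothetical illustration only; not part of DL4J or ND4J.
final class Int8UpdateSketch {

    // Round a float weight to the nearest int8 step, clamped to [-127, 127].
    static byte quantize(float w, float scale) {
        int rounded = Math.round(w / scale);
        return (byte) Math.max(-127, Math.min(127, rounded));
    }

    // Apply n identical small steps to a weight kept in fp32.
    static float trainFp32(float w, float step, int n) {
        for (int i = 0; i < n; i++) {
            w -= step;
        }
        return w;
    }

    // Apply the same steps, but re-quantize the weight to int8 after each one.
    static byte trainInt8(byte q, float scale, float step, int n) {
        for (int i = 0; i < n; i++) {
            float updated = q * scale - step;
            q = quantize(updated, scale);
        }
        return q;
    }

    public static void main(String[] args) {
        float scale = 1f / 127f; // assumed quantization step
        float step = 0.001f;     // assumed learning rate times gradient
        byte start = quantize(0.5f, scale);
        System.out.println("fp32 after 1000 steps: " + trainFp32(0.5f, step, 1000));
        System.out.println("int8 before: " + start
                + ", after 1000 steps: " + trainInt8(start, scale, step, 1000));
    }
}
```

Schemes that do train at low precision generally have to work around this, for example by keeping a higher-precision copy of the weights for the update step.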
