How to perform mixed-precision training?

cqiaoYc · February 24, 2025, 2:57am

May I ask where I can find an example of mixed precision training? I converted the feature array and label array to half precision, and set the data type to half precision in the network configuration, but it fails to compute the gradients correctly. When I set the data type to FLOAT, it prompts a precision mismatch exception.

agibsonccc · February 24, 2025, 3:36am

@cqiaoYc unfortunately support for that will only be in my mid term rewrite that’s being worked of the internals.
That will be in the next release. That might be possible in cudnn somehow but I haven’t explored much on that.

Topic		Replies	Views
HALF data type with GPU backend DL4J	1	380	March 3, 2020
Cannot perform gradient check: Datatype is not set to double precision DL4J	2	647	July 1, 2020
ND4JIllegalStateException ND4J	6	528	December 8, 2020
Basic deeplearning4j classification example DL4J	4	1002	February 3, 2020
NVIDIA Tensor Cores Usage Tuning Help	0	95	February 13, 2024

How to perform mixed-precision training?

Related topics