[solved] Network in float16



I have a working network that processes images in float32, using the C++ Symbol API. I am now trying to convert the network to process in float16 (a.k.a. half float). I am using the GPU for the computations.
After getting errors saying that convolutions or batch normalizations (for instance) can't have mixed input types, I converted every input (including the kernel weights, biases, means, etc.) to float16 using the "Cast" symbol. However, I now get "Check failed: e.node->is_variable() Mutation target can only be Variable". So I conclude that the kernel symbol, which is a variable Symbol mapped to an NDArray, cannot be cast to float16. And I can't find any way to feed the data in float16 directly (not even in NDArray).
So how can I do it?



Hi @dmidge, unfortunately there is no documented way to do inference in fp16 in C++ at the moment. I know it's a pretty big gap, and I hope it gets closed soon. Please register your interest on this github issue: https://github.com/apache/incubator-mxnet/issues/14159

I think that to do inference in fp16, what you can do in Python is:

import mxnet as mx
from mxnet import gluon

ctx = mx.gpu()  # fp16 inference needs a GPU context

net = gluon.model_zoo.vision.resnet18_v2(pretrained=True, ctx=ctx)
net.cast('float16')  # cast the backbone parameters to fp16

export_net = gluon.nn.HybridSequential()
with export_net.name_scope():
    # first layer casts the incoming fp32 data to fp16
    export_net.add(gluon.nn.HybridLambda(lambda F, x: F.cast(x, 'float16')))
    export_net.add(net)
export_net.hybridize()
export_net(mx.nd.ones((1, 3, 224, 224), ctx=ctx))  # forward pass to build the graph
export_net.export('my_model', 0)

You can then load my_model-symbol.json and my_model-0000.params in C++ and feed that network fp32 data, which will be converted to fp16 in the first layer of the network.

Note that fp16 is only supported on GPU for now.

edit: Just re-read your question. The reason you get these errors is that some layers, typically batch norm, use fp32 for accumulation, so not all parameters should be converted to fp16.


Hi @ThomasDelteil,

Thank you for this information. I indeed didn't see any C++ tutorial about that, nor examples in the repository, so I suspected that was the reason. But thank you for this explanation! Despite my research, I missed the github feature request.
I added a comment pointing to this forum thread.



Thanks to your post, I think I managed to get things working (at least partially) with float16. In my case, I needed to keep everything in the batch norm in float32. However, I don't fully understand why I have no control over the type the batch norm should use. But at least this matter seems solved.


I am working on offline conversion of fp32 models to fp16, which should be built on top of AMP support: https://github.com/apache/incubator-mxnet/issues/14584 . This will also add support for the C++ API and other frontends. Stay tuned.



Perfect, thank you!