Writing Python code for both ndarray and symbol


#1

I am implementing a fairly complex criterion function in MXNet, and I’d like to support both Gluon and the old way of binding executors.
This means I have to write all my code twice, once with mx.nd and once with mx.sym, even though it is otherwise pretty much the same. (I am aware there are differences: I can access shape and dtype in mx.nd but not in mx.sym; I can overwrite array content and use the out=XYZ option in mx.nd but not in mx.sym. But most of it is the same.)

Is there some trick to avoid having to copy & paste?

Sorry if this is obvious.


#2

Use gluon.HybridBlock.


#3

Hi Eric,
so you are saying I should abandon my mx.sym code and instead use HybridBlock, hybridizing later on if speed becomes an issue.

I am happy to do that, but I also need a CustomOp, and it seems what I am doing with mx.sym.Custom does not work with mx.nd.Custom (another topic I just posted).


#4

I would start out using a HybridBlock from the beginning. It will use either the NDArray or Symbol API based on what mode it is in. You will have to make sure you do not use features that are not in both APIs, such as implicit broadcasting, which NDArray operators support but Symbol operators do not. And you may have to implement a CustomOp for some cases.


#5

Hello Chris, thanks for your advice.
Gluon is new to me.

In fact, I am implementing a loss function (albeit quite a complex one). I thought Blocks were reserved for layers in the middle of the network? Anyway, much to learn. I need to look into HybridBlock. Is there a good tutorial for people who know the “old” MXNet way and want to transfer to Gluon?


#6

OK, I went through:
http://gluon.mxnet.io/chapter07_distributed-learning/hybridize.html
to understand HybridBlock. I see now how it works.

Allow me just one more question: how can hybridize() work on something like the example in that chapter if the input dimension (the dimension of the first layer) is not specified?

I’d expect hybridize() to do something like bind executors to all HybridBlock parts of the graph, to make things fast. But for that, it needs to know all shapes, including the input shape, right?

I am asking because the layer (or loss function) I want to write also has to know the shape of its input. Can this somehow be accessed in HybridBlock.hybrid_forward?


#7

hybridize() will build your symbol graph but will not bind it. I believe binding happens when you actually call the block, at which point the input shapes are known from the data.

You can also just build a symbol graph by calling the block (or hybrid_forward directly) with a symbolic input variable, and then use it like any other symbol graph.