Maximizing with a non-standard probability model

brianmannmath · October 30, 2017, 7:40pm

I have a probability model where P(y = 1) = sigmoid( -|| xY ||^2 + c ). I figured I could pretty easily use MXNet to run the maximum likelihood optimization, since it’s essentially logistic regression, except instead of a dot product, you have something like Mahanalobis distance (trying to optimize parameters Y and c given data x,y). I tried something like:


data = mx.sym.Variable(“data”)

target = mx.sym.Variable(“target”)

Y = mx.sym.Variable(“weight”)

fc = mx.sym.FullyConnected(data=data, weight=Y, no_bias=True, num_hidden=10)

bias = mx.sym.Variable(“bias”)

norm = -mx.sym.sum(mx.sym.square(fc)) + bias

out = mx.sym.LogisticRegressionOutput(data=norm, label=target)

model = mx.mod.Module(symbol=out, data_names=[‘data’], label_names=[‘target’])

but there are two problems. (1) I cannot use a batch size greater than 1, even if I force bias to have shape (1,) and (2) after training for any number of epochs, the resulting weights are all nan. Am I making a simple mistake here? Is it possible to use MXNet to optimize a function like this?

eric-haibin-lin · November 17, 2017, 11:23pm

Can you post the error message for (1)? Did you have a infer_shape error?

Topic		Replies	Views
Max-norm constraint / regularizer on different layer how-to	2	1087	February 28, 2019
Running inference with varying input size	3	1144	October 20, 2019
How to eliminated the weight decay on the bias and batch nomalization?	4	479	August 16, 2019
Moving PyMC3 from Theano to MXNet Discussion	10	4145	June 3, 2019
Possible to Use MXNet gluon.Trainer without a Neural Network? Gluon	1	1017	April 23, 2018

Maximizing with a non-standard probability model

Related Topics