Is there a simpler way of implementing the update rule (eq. (7) in the Meng et al. 2019 paper) than by re-writing C code similar to the existing update rule for plain SGD?
Update: problem solved. I managed to run the algorithm by writing the Riemannian gradient steps in Python, using gradients computed inside autograd.record() and manually setting the values of only the modified weight rows inside the training loop. Note: the set_data method does not allow sparse gradients to be used efficiently, so I had to implement that part myself.
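For readers hitting the same issue, here is a minimal framework-agnostic sketch of the approach described above, written in NumPy rather than MXNet for clarity. It assumes the manifold is the unit sphere (so the Riemannian gradient is the tangent-space projection of the Euclidean gradient, followed by a renormalization retraction); the function names and the choice of manifold are illustrative assumptions, not the paper's exact eq. (7). The row loop mirrors the "update only the modified rows" trick, since in the sparse setting only rows with nonzero gradient need to be touched.

```python
import numpy as np

def riemannian_sgd_step(x, g, lr):
    """One hypothetical Riemannian SGD step on the unit sphere.

    x  : current point on the sphere (unit-norm vector)
    g  : Euclidean gradient at x
    lr : learning rate
    """
    # Project the Euclidean gradient onto the tangent space at x.
    riem_grad = g - np.dot(x, g) * x
    # Take the step, then retract back onto the sphere by renormalizing.
    x_new = x - lr * riem_grad
    return x_new / np.linalg.norm(x_new)

# Toy embedding matrix with unit-norm rows, and a sparse gradient
# where only some rows are nonzero (illustrative data).
W = np.eye(4)                       # 4 embeddings on the unit sphere
G = np.zeros((4, 4))
G[1] = [0.2, -0.5, 0.1, 0.0]        # only row 1 received a gradient

# Update only the rows whose gradient is nonzero, as in the
# manual sparse update inside the training loop.
touched = np.flatnonzero(np.abs(G).sum(axis=1) > 0)
for r in touched:
    W[r] = riemannian_sgd_step(W[r], G[r], lr=0.1)
```

Because untouched rows are skipped entirely, this keeps the per-step cost proportional to the number of active rows rather than the full embedding table, which is what the built-in dense set_data path could not provide.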