New Compression Strategy

dbasu · December 11, 2018, 10:11am

How do I implement a new compression method to use instead of the default one over here?

Thanks!

safrooze · December 11, 2018, 10:51pm

What are you compressing? Gradients or the model itself?

dbasu · December 12, 2018, 12:21am

I’m trying to compress the gradients from every worker in a distributed setup.

safrooze · December 12, 2018, 2:49am

To do it properly, you’d have to modify the C++ source code. The best approach would be to follow the implementation of two-bit compression (I’d start by searching for kTwoBit in the source code and following its path). Once your method is in place, you can build from source and follow the instructions in the link you provided to select your new compression algorithm.
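Before touching the C++ code, it can help to prototype the logic in plain Python. The sketch below is not the MXNet source. It is a minimal illustration of threshold-based two-bit quantization with a per-worker residual, which is how the two-bit scheme is described in the gradient compression docs as I understand them. The function names and the use of -1/0/+1 integer codes are my own assumptions for readability; a real implementation would pack each code into two bits.

```python
from typing import List, Tuple


def two_bit_compress(
    grad: List[float], residual: List[float], threshold: float
) -> Tuple[List[int], List[float]]:
    """Quantize each gradient element to -1, 0 or +1 (hypothetical helper).

    The residual carries over whatever was not sent in earlier steps, so
    small gradients accumulate until they cross the threshold.
    """
    codes: List[int] = []
    new_residual: List[float] = []
    for g, r in zip(grad, residual):
        acc = g + r  # add the leftover error from previous iterations
        if acc >= threshold:
            codes.append(1)  # send +threshold
            new_residual.append(acc - threshold)
        elif acc <= -threshold:
            codes.append(-1)  # send -threshold
            new_residual.append(acc + threshold)
        else:
            codes.append(0)  # send nothing, keep everything as residual
            new_residual.append(acc)
    return codes, new_residual


def two_bit_decompress(codes: List[int], threshold: float) -> List[float]:
    """Rebuild the approximate gradient on the receiving side."""
    return [c * threshold for c in codes]


if __name__ == "__main__":
    residual = [0.0, 0.0, 0.0]
    codes, residual = two_bit_compress([0.7, -0.6, 0.2], residual, 0.5)
    print(codes, two_bit_decompress(codes, 0.5))
```

A new strategy would swap out the branching logic above while keeping the same compress/decompress split, which is roughly the shape you would then port to C++ next to the existing two-bit code path.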

safrooze · December 12, 2018, 3:27am

Also, if you decide to go down this path, I highly recommend reviewing the steps in this guide to make sure your development environment is set up for easy debugging.
