If we have several loss functions in a network, like the end-to-end Faster R-CNN example below:

rpn_eval_metric = metric.RPNAccMetric()
rpn_cls_metric = metric.RPNLogLossMetric()
rpn_bbox_metric = metric.RPNL1LossMetric()
eval_metric = metric.RCNNAccMetric()
cls_metric = metric.RCNNLogLossMetric()
bbox_metric = metric.RCNNL1LossMetric()
eval_metrics = mx.metric.CompositeEvalMetric()
for child_metric in [rpn_eval_metric, rpn_cls_metric, rpn_bbox_metric,
                     eval_metric, cls_metric, bbox_metric]:
    eval_metrics.add(child_metric)
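For context, here is a minimal pure-Python sketch (not the actual MXNet implementation, all class names here are hypothetical) of what a composite metric does: it only accumulates statistics from labels and predictions, and it has no access to gradients or parameters.

```python
# Hypothetical minimal metric mimicking the CompositeEvalMetric pattern.
# Metrics only record statistics; they never touch model parameters.
class AccMetric:
    def __init__(self):
        self.correct = 0
        self.total = 0

    def update(self, labels, preds):
        # Count how many predictions match the labels.
        for label, pred in zip(labels, preds):
            self.correct += int(label == pred)
            self.total += 1

    def get(self):
        return "acc", self.correct / max(self.total, 1)


class CompositeMetric:
    def __init__(self):
        self.children = []

    def add(self, child):
        self.children.append(child)

    def update(self, labels, preds):
        # Forward the same batch to every child metric.
        for child in self.children:
            child.update(labels, preds)

    def get(self):
        return [child.get() for child in self.children]


composite = CompositeMetric()
composite.add(AccMetric())
composite.update([1, 0, 1], [1, 1, 1])
print(composite.get())  # [('acc', 0.6666666666666666)]
```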
How does EvalMetric update the parameters? Does each loss function update all the parameters in turn after the forward pass?
Maybe I described it unclearly. I only want to know: does the gradient used to update the parameters come from each loss function separately, updating the parameters several times, or from the sum of all the loss functions, updating them only once?
I'm not sure what code you're looking at, but typically metrics are not used for training the network; they are for evaluating how well the network has trained. These metrics are not training losses, and they do not affect gradient computation or parameter updates.
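A toy scalar example (plain Python, not MXNet) may make the loss/metric split concrete: only the loss produces a gradient and an update, while the metric just observes the prediction.

```python
# Toy single-parameter model: pred = w * x, trained with squared error.
w = 0.0
lr = 0.1
x, y = 2.0, 4.0            # one training example, target y = 4

pred = w * x               # forward pass
loss = (pred - y) ** 2     # the loss drives training
grad = 2 * (pred - y) * x  # d(loss)/dw
w -= lr * grad             # optimizer step uses the loss gradient

# Metric: recorded purely for monitoring; it never feeds back into w.
metric_abs_err = abs(pred - y)

print(w, metric_abs_err)  # 1.6 4.0
```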
If I am understanding it correctly, it indicates that this is an evaluation metric used while fitting the model, i.e. training it (mostly implying some sort of hyper-parameter tuning of the model).
You are correct @ChaiBapchya. This is the list of evaluation metrics which are used only for monitoring the training progress. They are not used during training optimization.
Yes, it is the list of evaluation metrics, but I want to know how each metric updates the parameters when training the net. Does every metric update all the parameters one by one?
@zhanlong.hao, the metrics do not contribute to updates to the parameters. The loss values are what are used for gradient computation and backward propagation, which leads to parameter updates according to the optimizer rule. Metrics are simply used for monitoring.
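On the gradient question itself: in an end-to-end model the loss terms are added into one total loss, backward propagates the gradient of that sum, and the optimizer updates each parameter once per iteration. Because differentiation is linear, the gradient of the summed loss equals the sum of the per-loss gradients, so this one update is equivalent to accumulating every loss's gradient first. A toy check in plain Python (the loss terms here are made up for illustration):

```python
# Two hypothetical loss terms sharing one parameter w:
#   loss1 = (w - 3)^2,  loss2 = (w + 1)^2
w = 1.0
lr = 0.1

def grad_loss1(w):
    return 2 * (w - 3)   # d(loss1)/dw

def grad_loss2(w):
    return 2 * (w + 1)   # d(loss2)/dw

# One update from the summed loss: grad of (loss1 + loss2)
w_sum = w - lr * (grad_loss1(w) + grad_loss2(w))

# Equivalent: accumulate each loss's gradient, then apply once
g = grad_loss1(w) + grad_loss2(w)
w_acc = w - lr * g

print(w_sum == w_acc)  # True
```

Note that this is different from applying each loss's update sequentially (stepping on loss1's gradient, then recomputing loss2's gradient at the moved parameter); the standard end-to-end setup does the single combined update.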