Filter Pruning scheduler does not work as expected #353

Closed
AlexKoff88 opened this issue Dec 14, 2020 · 3 comments
@AlexKoff88
Contributor

I observed strange behavior when fine-tuning MobileNet v2 with filter pruning. I set "pruning_target" to 0.3 and "pruning_steps" to 15, but the desired ratio was not reached after 15 epochs; it was only achieved after the 25th epoch.

Please treat this with high priority.

Here is the config:

{
    "model": "mobilenet_v2",
    "pretrained": true,
    "batch_size": 512,
    "epochs": 100,
    "input_info": {
        "sample_size": [1, 3, 224, 224]
    },
    "optimizer": {
        "type": "SGD",
        "base_lr": 0.1,
        "weight_decay": 1e-5,
        "schedule_type": "multistep",
        "steps": [20, 40, 60, 80],
        "optimizer_params": {
            "momentum": 0.9,
            "nesterov": true
        }
    },
    "compression": [
        {
            "algorithm": "filter_pruning",
            "pruning_init": 0.1,
            "params": {
                "schedule": "exponential",
                "pruning_target": 0.3,
                "pruning_steps": 15,
                "weight_importance": "geometric_median"
            }
        }
    ]
}
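For context, the expectation behind this config is that the pruning rate ramps from "pruning_init" (0.1) to "pruning_target" (0.3) over the first 15 epochs and then stays constant. The snippet below is only a minimal illustrative sketch of such an exponential ramp, not the actual NNCF scheduler code; the function name and formula are assumptions for illustration.

    # Illustrative sketch (NOT the actual NNCF scheduler): one way an exponential
    # schedule can ramp the pruning rate from pruning_init to pruning_target
    # over pruning_steps epochs and then hold it constant.
    import math

    def exponential_pruning_rate(epoch: int,
                                 pruning_init: float = 0.1,
                                 pruning_target: float = 0.3,
                                 pruning_steps: int = 15) -> float:
        """Pruning rate scheduled for a given epoch (illustrative formula)."""
        if epoch >= pruning_steps:
            return pruning_target
        progress = epoch / pruning_steps
        # Exponential interpolation chosen so that rate(0) == pruning_init
        # and rate(pruning_steps) == pruning_target.
        k = math.log(pruning_target / pruning_init)
        return pruning_init * math.exp(k * progress)

    # With the config above, the rate should reach 0.3 at epoch 15 and stay there.
    for e in (0, 5, 10, 15, 25):
        print(f"epoch {e:2d}: pruning rate {exponential_pruning_rate(e):.3f}")

The reported issue is that the weights only actually reached the 0.3 sparsity level around epoch 25, i.e. well after the schedule had already saturated.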
@mkaglins
Contributor

@AlexKoff88 do you observe this only with the exponential scheduler, or are other schedulers affected as well?

@AlexKoff88
Contributor Author

@mkaglins, I have no idea; you could check whether this is a generic issue or a scheduler-specific one.

@mkaglins
Contributor

Problem analysis summary:

  1. The problem affects only the exponential and exponential_with_bias schedulers.
  2. The problem is caused by the momentum parameter of the optimizer: momentum statistics accumulated in earlier epochs (with smaller pruning rates) are added to the weights in later epochs (with higher pruning rates) and make the pruned weights non-zero again.
  3. After a couple of training steps this effect vanishes and the pruning rate of the weights becomes equal to the pruning rate of the masks.

As a solution, it was decided to apply the pruning masks on every training step, which significantly speeds up the decay of the non-zero momentum elements (see the sketch below). A warning about this effect will also be added to the release notes.
Changes are done in #365
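To make point 2 more concrete, here is a minimal, self-contained PyTorch sketch (illustrative only, not NNCF code): stale SGD momentum revives weights that were just zeroed by a pruning mask, and re-applying the mask after each optimizer step keeps them at zero. The tensor names and the toy loss are assumptions for illustration.

    # Illustrative sketch (not NNCF code): momentum accumulated before a weight is
    # masked pushes it away from zero on later steps; re-applying the mask after
    # every optimizer step prevents this.
    import torch

    torch.manual_seed(0)
    w = torch.nn.Parameter(torch.randn(4))
    opt = torch.optim.SGD([w], lr=0.1, momentum=0.9)
    mask = torch.tensor([1.0, 1.0, 0.0, 0.0])  # "prune" the last two weights

    # A few steps before pruning build up momentum statistics for all weights.
    for _ in range(5):
        opt.zero_grad()
        (w ** 2).sum().backward()
        opt.step()

    # Prune once, as if the scheduler raised the pruning rate at an epoch boundary.
    with torch.no_grad():
        w.mul_(mask)

    # One more step: the masked weights get zero gradient, but the stale momentum
    # buffer still moves them away from zero.
    opt.zero_grad()
    ((w * mask) ** 2).sum().backward()
    opt.step()
    print("without re-masking:", w.data)  # last two entries are non-zero again

    # Re-applying the mask after the step (the approach adopted for the fix)
    # keeps the pruned weights at zero while the momentum decays.
    with torch.no_grad():
        w.mul_(mask)
    print("with re-masking:   ", w.data)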
