Name		Name	Last commit message	Last commit date
parent directory ..
examples		examples
nncf		nncf
tests		tests
README.md		README.md
__init__.py		__init__.py
requirements.txt		requirements.txt

README.md

This directory contains the experimental implementation of the Neural Network Compression Framework for TensorFlow. The implementation is subject to change without notice and offers no guarantees, including for the public API. Stay tuned.

Neural Network Compression Framework for TensorFlow (NNCF TF)

This repository contains a TensorFlow*-based framework and samples for neural networks compression.

The framework is the implementaiton of the Neural Network Compression Framework (NNCF) for TensorFlow*.

The framework is organized as a Python* package that can be built and used in a standalone mode. The framework architecture is unified to make it easy to add different compression methods.

The samples demonstrate the usage of compression algorithms for two different use cases on public models and datasets: Image Classification, Object Detection. Compression results achievable with the NNCF-powered samples can be found in a table at the end of this document.

Key Features

Support of various compression algorithms, applied during a model fine-tuning process to achieve best compression parameters and accuracy:
- Quantization
- Sparsity
Automatic, configurable model graph transformation to obtain the compressed model. The model is wrapped by the custom class and additional compression-specific layers are inserted in the graph.

NOTE: Only Keras models created using Sequential or Keras Functional API are supported.
Common interface for compression methods.
Distributed training support.
Configuration file examples for each supported compression algorithm.
Exporting compressed models to Frozen Graph or TensorFlow* SavedModel ready for usage with OpenVINO™ toolkit.

Usage

The NNCF TF is organized as a regular Python package that can be imported in an arbitrary training script. The basic workflow is loading a JSON configuration script containing NNCF-specific parameters determining the compression to be applied to your model, and then passing your model along with the configuration script to the nncf.create_compressed_model function. This function returns a transformed model ready for compression fine-tuning, and handle to the object allowing you to control the compression during the training process:

import beta.nncf
from beta.nncf import create_compressed_model
from beta.nncf import NNCFConfig

# Instantiate your uncompressed model
from tensorflow.keras.applications import ResNet50
model = ResNet50()

# Apply compression according to a loaded NNCF config
nncf_config = NNCFConfig.from_json("resnet50_imagenet_int8.json")
compression_ctrl, compressed_model = create_compressed_model(model, nncf_config)

# Now use compressed_model as a usual Keras model

# ... the rest of the usual TensorFlow-powered training pipeline

# Export to Frozen Graph, TensorFlow SavedModel or .h5  when done fine-tuning 
compression_ctrl.export_model("compressed_model.pb", save_format='frozen_graph')

Model Compression Samples

For a quick start with NNCF-powered compression, you can also try the sample scripts, each of them provides a basic training pipeline for Image Classification and Object Detection correspondingly.

To run the samples please refer to the corresponding tutorials:

System requirements

Ubuntu* 16.04 or later (64-bit)
Python* 3.6 or later
NVidia CUDA* Toolkit 10.1
TensorFlow* 2.3.1

NNCF TF compression results

Achieved using sample scripts and NNCF TF configuration files provided with this repository. See README files for sample scripts for links to exact configuration files and pre-trained models.

Quick jump to the samples:

Classification
Object Detection
Instance Segmentation

Classification

Model	Compression algorithm	Dataset	TensorFlow FP32 baseline	TensorFlow compressed accuracy
Inception V3	INT8 w:sym,per-tensor a:sym,per-tensor	ImageNet	77.9	78.41
Inception V3	Sparsity 54% (Magnitude)	ImageNet	77.9	77.87
Inception V3	INT8 w:sym,per-tensor a:sym,per-tensor + Sparsity 54% (Magnitude)	ImageNet	77.9	77.52
MobileNet V2	INT8 w:sym,per-tensor a:sym,per-tensor	ImageNet	71.85	71.96
MobileNet V2	Sparsity 35% (Magnitude)	ImageNet	71.85	72.36
MobileNet V2	INT8 w:sym,per-tensor a:sym,per-tensor + Sparsity 35% (Magnitude)	ImageNet	71.85	72.17
ResNet50	INT8 w:sym,per-tensor a:sym,per-tensor	ImageNet	75.04	75.04
ResNet50	Sparsity 50% (Magnitude)	ImageNet	75.04	75
ResNet50	INT8 w:sym,per-tensor a:sym,per-tensor + Sparsity 50% (Magnitude)	ImageNet	75.04	74.46
TensorFlow Hub MobileNet V2	Sparsity 35% (Magnitude)	ImageNet	71.84	71.73

Object detection

Model	Compression algorithm	Dataset	TensorFlow FP32 baseline mAP	TensorFlow compressed mAP
RetinaNet	INT8 w:sym,per-tensor a:sym,per-tensor	COCO2017	33.44	33.3
RetinaNet	Sparsity 50% (Magnitude)	COCO2017	33.44	33.13

Instance Segmentation

Model	Compression algorithm	Dataset	TensorFlow FP32 baseline mAP	TensorFlow compressed mAP
MaskRCNN	INT8 w:sym,per-tensor a:sym,per-tensor	COCO2017	bbox: 37.33 segm: 33.56	bbox: 37.25 segm: 33.59
MaskRCNN	Sparsity 50% (Magnitude)	COCO2017	bbox: 37.33 segm: 33.56	bbox: 36.93 segm: 33.23

Legal Information

[*] Other names and brands may be claimed as the property of others.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

beta

beta

README.md

This directory contains the experimental implementation of the Neural Network Compression Framework for TensorFlow. The implementation is subject to change without notice and offers no guarantees, including for the public API. Stay tuned.

Neural Network Compression Framework for TensorFlow (NNCF TF)

Key Features

Usage

Model Compression Samples

System requirements

NNCF TF compression results

Classification

Object detection

Instance Segmentation

Legal Information

Files

beta

Directory actions

More options

Directory actions

More options

Latest commit

History

beta

Folders and files

parent directory

README.md

This directory contains the experimental implementation of the Neural Network Compression Framework for TensorFlow. The implementation is subject to change without notice and offers no guarantees, including for the public API. Stay tuned.

Neural Network Compression Framework for TensorFlow (NNCF TF)

Key Features

Usage

Model Compression Samples

System requirements

NNCF TF compression results

Classification

Object detection

Instance Segmentation

Legal Information