
"Protobuf parsing failed" error when loading a quantized Mistral model #20113

Closed
idruker-cerence opened this issue Mar 27, 2024 · 1 comment

@idruker-cerence

Describe the issue

I exported the Mistral model to ONNX format using the optimum-cli tool:

optimum-cli export onnx --task text-generation-with-past -m "mistralai/Mistral-7B-v0.1" <path/to/output-onnx-model>

I was able to load and run the model using onnxruntime. Then I quantized it:

optimum-cli onnxruntime quantize --avx2 --onnx_model <path/to/output-onnx-model>/model.onnx --output <path/to/output-onnx-quantized-model>

Attempting to load the quantized model fails with the error "Protobuf parsing failed".

What have I missed?

P.S. The quantized model is successfully read by Netron.

To reproduce

See the description
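For reference, the failure shows up at session creation time. Below is a minimal loading sketch, shown with the Python API for brevity (the C++ Ort::Session constructor fails at the same point); the file name model_quantized.onnx is an assumption based on optimum's default output name, so adjust the path as needed:

import onnxruntime as ort

# Creating the session is where "Protobuf parsing failed" (INVALID_PROTOBUF) is raised.
session = ort.InferenceSession(
    "<path/to/output-onnx-quantized-model>/model_quantized.onnx",
    providers=["CPUExecutionProvider"],
)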

Urgency

No response

Platform

Linux

OS Version

Ubuntu 20.04

ONNX Runtime Installation

Built from Source

ONNX Runtime Version or Commit ID

1.17.1

ONNX Runtime API

C++

Architecture

X64

Execution Provider

Default CPU

Execution Provider Library Version

No response

@idruker-cerence
Author

Apparently, the root cause was that quantization with the --avx2 preset was not supported on the target machine. I made it work by explicitly setting the quantization parameters in the ORTConfig.json file.
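For anyone running into the same thing, here is a rough sketch of setting the quantization parameters explicitly through the optimum Python API instead of the --avx2 preset; the values below are illustrative assumptions, not necessarily what ended up in my ORTConfig.json:

from optimum.onnxruntime import ORTQuantizer
from optimum.onnxruntime.configuration import QuantizationConfig
from onnxruntime.quantization import QuantFormat, QuantizationMode, QuantType

# Explicit dynamic int8 quantization settings instead of the --avx2 hardware preset.
qconfig = QuantizationConfig(
    is_static=False,
    format=QuantFormat.QOperator,
    mode=QuantizationMode.IntegerOps,
    activations_dtype=QuantType.QUInt8,
    weights_dtype=QuantType.QInt8,
    per_channel=False,
    reduce_range=False,
    operators_to_quantize=["MatMul"],
)

quantizer = ORTQuantizer.from_pretrained("<path/to/output-onnx-model>", file_name="model.onnx")
quantizer.quantize(save_dir="<path/to/output-onnx-quantized-model>", quantization_config=qconfig)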

Closing.
