Commit
·
b7c4022
1
Parent(s):
3518c77
update int8 onnx model and readme
Browse filesSigned-off-by: dujun <[email protected]>
- README.md +3 -3
- model.onnx +2 -2
README.md
CHANGED
|
@@ -63,14 +63,14 @@ This is an INT8 ONNX model quantized with [Intel® Neural Compressor](https://gi
|
|
| 63 |
|
| 64 |
The original fp32 model comes from the fine-tuned model [electra-small-discriminator-mrpc](https://huggingface.co/Intel/electra-small-discriminator-mrpc).
|
| 65 |
|
| 66 |
-
The calibration dataloader is the eval dataloader. The
|
| 67 |
|
| 68 |
#### Test result
|
| 69 |
|
| 70 |
| |INT8|FP32|
|
| 71 |
|---|:---:|:---:|
|
| 72 |
-
| **Accuracy (eval-f1)** |0.
|
| 73 |
-
| **Model size (MB)** |
|
| 74 |
|
| 75 |
|
| 76 |
#### Load ONNX model:
|
|
|
|
| 63 |
|
| 64 |
The original fp32 model comes from the fine-tuned model [electra-small-discriminator-mrpc](https://huggingface.co/Intel/electra-small-discriminator-mrpc).
|
| 65 |
|
| 66 |
+
The calibration dataloader is the eval dataloader. The calibration sampling size is 100.
|
| 67 |
|
| 68 |
#### Test result
|
| 69 |
|
| 70 |
| |INT8|FP32|
|
| 71 |
|---|:---:|:---:|
|
| 72 |
+
| **Accuracy (eval-f1)** |0.8957|0.8983|
|
| 73 |
+
| **Model size (MB)** |16|52|
|
| 74 |
|
| 75 |
|
| 76 |
#### Load ONNX model:
|
model.onnx
CHANGED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:11550dc03b7a071d26cd3b5191c9df3a83113227bb82a6e78f28e6c2ca59e6b3
|
| 3 |
+
size 16314577
|