Xenova HF Staff whitphx HF Staff commited on
Commit
8745360
·
verified ·
1 Parent(s): bc8c017

Add/update the quantized ONNX model files and README.md for Transformers.js v3 (#2)

Browse files

- Add/update the quantized ONNX model files and README.md for Transformers.js v3 (78c8247999e65f0d6caea90dc1a5e8be9afd523e)


Co-authored-by: Yuichiro Tachibana <whitphx@users.noreply.huggingface.co>

README.md CHANGED
@@ -5,4 +5,22 @@ library_name: transformers.js
5
 
6
  https://huggingface.co/facebook/blenderbot-400M-distill with ONNX weights to be compatible with Transformers.js.
7
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
  Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [🤗 Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).
 
5
 
6
  https://huggingface.co/facebook/blenderbot-400M-distill with ONNX weights to be compatible with Transformers.js.
7
 
8
+ ## Usage (Transformers.js)
9
+
10
+ If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@huggingface/transformers) using:
11
+ ```bash
12
+ npm i @huggingface/transformers
13
+ ```
14
+
15
+ **Example:** Text-to-text generation.
16
+
17
+ ```js
18
+ import { pipeline } from '@huggingface/transformers';
19
+
20
+ const generator = await pipeline('text2text-generation', 'Xenova/blenderbot-400M-distill');
21
+ const output = await generator('how can I become more healthy?', {
22
+ max_new_tokens: 100,
23
+ });
24
+ ```
25
+
26
  Note: Having a separate repo for ONNX weights is intended to be a temporary solution until WebML gains more traction. If you would like to make your models web-ready, we recommend converting to ONNX using [🤗 Optimum](https://huggingface.co/docs/optimum/index) and structuring your repo like this one (with ONNX weights located in a subfolder named `onnx`).
onnx/decoder_model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1f0e0ba43de5d3b5cf791f5eb388c70b7100bb5e9da49a12af6592b444e61413
3
+ size 220326706
onnx/decoder_model_fp16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9e7126b5db6999618623611eea135b999b0a60e7907b971b04ce8945766f8d4f
3
+ size 651092026
onnx/decoder_model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bbfca61d3d650fa2d8c9e8d9e4e9601c154bea110ca179c001a09e8c9ee529ea
3
+ size 367838820
onnx/decoder_model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b4a51e5d7b760d041af1961cb1e1ccad80f7739cda905088207ba9997183a26f
3
+ size 239986534
onnx/decoder_model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0a07f2e9707928c559e558303f2c6f3078963e096bc612e9503d0c9d9f40f48e
3
+ size 198911464
onnx/decoder_model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:284a13bd6a0d35de5922a02ab6fd55c33714e2e1439c3962b4d3b7a0149bbd9b
3
+ size 367838881
onnx/decoder_with_past_model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c034ee2d9e5a7d1138f3934db1ee598afe1bdcdf3c29b52a3e6c887f33994b46
3
+ size 197951151
onnx/decoder_with_past_model_fp16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:39007076d9e19fa47cfc08a8bee0b53437d796c715e79e5b2333673d7a7dc14a
3
+ size 572266455
onnx/decoder_with_past_model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8c4a6ee52f0734a1de4ae8a0ca30fa30c9f8a6d5879e762ae849c35c4e1dd79c
3
+ size 328240214
onnx/decoder_with_past_model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6efbe823717bc060dc07d3bf73087067b5925cbd2d7add16779b40b03fcb89db
3
+ size 215153571
onnx/decoder_with_past_model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8b2763639a2d3758feb57ca5b8acf976ac4b59f6199af26e53c21fe445df01e2
3
+ size 176607141
onnx/decoder_with_past_model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fdb9dbff6c76a04fbeaaebee17927433c362f8a1c5a6f47517ffcdb3d96b44b9
3
+ size 328240258
onnx/encoder_model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:734567dfc275d914ed700eda35e8b45ff0e9f2a4a1556d35fe90d6c1c92dacca
3
+ size 63963915
onnx/encoder_model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bcd4620f23f0a0bdaa67bc93bac07a006cca1db47e1a5dcb4f7ef50915821457
3
+ size 49935818
onnx/encoder_model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e28c7b9e4bba9483867aebb4bb2dcccab0fea7559a78aa651335ff5a9f02d87f
3
+ size 66421417
onnx/encoder_model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4733701bf9bb532684197232b447400cf80568f44a71b8c6d16586be32cb3834
3
+ size 43064355
onnx/encoder_model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a9389bdc5c551e4fed9b130c0d231832d83469bd2f06da571ef63ab7f83cee12
3
+ size 49935823
onnx/model.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c7acf5757afc0b635469c5d0166d21b37fa54fedd77f867f581a64f4caf12d5e
3
+ size 1302192393
onnx/model_bnb4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5b23bbb5947bee69a199b1f30a9187a6fc43fcd17620b1a51bc0f356cf870d6a
3
+ size 220881267
onnx/model_fp16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8aff9de176feb4f1dd33bd92fa25e4a139b9113205c4ca9cad6f9374307453c3
3
+ size 651973508
onnx/model_int8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:52e9ec872173ab2eaccf0f5b387f773b85147ff1725ffd9e70d8ee6219068b8c
3
+ size 327833726
onnx/model_q4.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:45c74558715df56c1eb621347cbe526fe7677f29295eb1b32ef576336a6f0fe0
3
+ size 240540231
onnx/model_q4f16.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c59a16ea9806dadc82e38b88811e1b44584ccc07380b9a6d614df47ad2c9af6f
3
+ size 199789862
onnx/model_uint8.onnx ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d85f73b2820c6cc90d2b0c6536fd94af8a7ccf728b6fc36820c6281b79c6b05a
3
+ size 327833787