Raphaël Bournhonesque
commited on
Commit
•
843c718
1
Parent(s):
a1d2014
add new version of the ingredient detection model
Browse files- .gitattributes +1 -0
- README.md +28 -0
- config.json +3 -0
- model.safetensors +3 -0
- onnx.tar.gz +3 -0
- predictions/test_predictions.html +0 -0
- predictions/test_predictions.jsonl.gz +3 -0
- predictions/test_predictions_agg.jsonl.gz +3 -0
- predictions/train_predictions.html +0 -0
- predictions/train_predictions.jsonl.gz +3 -0
- predictions/train_predictions_agg.jsonl.gz +3 -0
- special_tokens_map.json +3 -0
- tokenizer.json +3 -0
- tokenizer_config.json +3 -0
- training_args.bin +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
*.json filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
@@ -1,3 +1,31 @@
|
|
1 |
---
|
2 |
license: agpl-3.0
|
3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
license: agpl-3.0
|
3 |
---
|
4 |
+
This ingredient detection model was trained on the ingredient detection [dataset v1.1](https://huggingface.co/datasets/openfoodfacts/ingredient-detection/tree/v1.1) using [this training code](https://github.com/openfoodfacts/openfoodfacts-ai/tree/a9b4ad6a854fa6f8330b0ff3e6a67ad963c9b96b/ingredient_extraction/train).
|
5 |
+
|
6 |
+
Training was tracked on [Wandb](https://wandb.ai/raphaeloff/ingredient-detection-ner/runs/dwbdbjek/overview).
|
7 |
+
|
8 |
+
This release provides the following assets:
|
9 |
+
|
10 |
+
Training-related assets:
|
11 |
+
|
12 |
+
- `predictions` directory: predictions on train and test dataset of the model, in:
|
13 |
+
HTML format: easier to view
|
14 |
+
JSONL format: either the raw or the aggregated (post-processed) version
|
15 |
+
- the HuggingFace serialized model, in the root directory
|
16 |
+
|
17 |
+
Serving assets:
|
18 |
+
|
19 |
+
- `onnx.tar.gz`: the model exported to ONNX format
|
20 |
+
|
21 |
+
|
22 |
+
## Versions
|
23 |
+
|
24 |
+
### 1.1
|
25 |
+
|
26 |
+
New version based on [v1.1 of the dataset](https://huggingface.co/datasets/openfoodfacts/ingredient-detection/tree/v1.1).
|
27 |
+
“organic”/”issu de l’agriculture biologique” suffixes are now considered as part of the ingredient list.s
|
28 |
+
|
29 |
+
### 1.0
|
30 |
+
|
31 |
+
First version based on [v1.0 of the dataset](https://huggingface.co/datasets/openfoodfacts/ingredient-detection/tree/v1.0).
|
config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4b5daeda643d147fd578f1f9ae7d2ae2a2f6a8005132fc40edc3c55dc006c9d9
|
3 |
+
size 854
|
model.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ca586acd223448c0aff78372639cd6203466e753c5becfda2df38bd98f18e378
|
3 |
+
size 2235424156
|
onnx.tar.gz
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4adbca6b2e4e21d2c1ea77e3bc73125ae3bf6e98825550528e68dbc386a645bd
|
3 |
+
size 1832390840
|
predictions/test_predictions.html
ADDED
The diff for this file is too large to render.
See raw diff
|
|
predictions/test_predictions.jsonl.gz
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bfe6b529802f160be174c919dcbcb3b1daa8540a37e0aa95396612bfe6662b72
|
3 |
+
size 815368
|
predictions/test_predictions_agg.jsonl.gz
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f442d0747bcf66ece19f025a6f45c2ebf12b3d593c9150c498400b4fc4310249
|
3 |
+
size 231772
|
predictions/train_predictions.html
ADDED
The diff for this file is too large to render.
See raw diff
|
|
predictions/train_predictions.jsonl.gz
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6d20e8f45a4bc4121fee8d8dfa15f2ef6aa1721dd7a9ece41fabe8dd52374841
|
3 |
+
size 7241061
|
predictions/train_predictions_agg.jsonl.gz
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:178c40512407f62e087aae6004fa875ca6df223561a0cb66a9a6e3677a3ae8d4
|
3 |
+
size 2080479
|
special_tokens_map.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:06e405a36dfe4b9604f484f6a1e619af1a7f7d09e34a8555eb0b77b66318067f
|
3 |
+
size 280
|
tokenizer.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:70dc686438b64aca1ee6b60d1aed7f7cc93a432c73ec891b31d09a4758ee695d
|
3 |
+
size 17082835
|
tokenizer_config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f90024142df07163e5e6c5b9a6ad7c8c68b22a9112af11e3db4559a9ff90f737
|
3 |
+
size 1147
|
training_args.bin
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bb6b754d4831d03ef3b36a2ef36050f7c369bb72e6eeb8220e3a7224e5926f28
|
3 |
+
size 5048
|