Update README.md
README.md CHANGED
@@ -176,8 +176,10 @@ For more details, please refer to the [codes](https://github.com/llm-jp/llm-jp-j
 
 ### AnswerCarefully-Eval
 
-[AnswerCarefully-Eval](https://www.anlp.jp/proceedings/annual_meeting/2025/pdf_dir/Q4-19.pdf)
-We evaluated the models using `gpt-4-0613`.
+[AnswerCarefully-Eval](https://www.anlp.jp/proceedings/annual_meeting/2025/pdf_dir/Q4-19.pdf) assesses the safety of Japanese language model outputs using the LLM-as-a-Judge approach, based on the test set from [llm-jp/AnswerCarefully](https://huggingface.co/datasets/llm-jp/AnswerCarefully).
+We evaluated the models using `gpt-4-0613`.
+The scores represent the average values obtained from five rounds of inference and evaluation.
+
 
 | Model name | Acceptance rate (%, ↑) | Violation rate (%, ↓) |
 | :--- | ---: | ---: |
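The averaging step the updated README describes (five rounds of inference and judging, with the reported rates being the round averages) can be sketched as follows. This is a minimal illustration; the function name, sample numbers, and structure are hypothetical and not taken from the llm-jp evaluation code.

```python
def average_rates(rounds):
    """Average per-round (acceptance_rate, violation_rate) pairs.

    rounds: list of (acceptance_rate, violation_rate) tuples in percent,
    one tuple per inference/evaluation round (five rounds in the README).
    """
    n = len(rounds)
    acceptance = sum(r[0] for r in rounds) / n
    violation = sum(r[1] for r in rounds) / n
    return acceptance, violation


# Hypothetical judged results from five rounds (percentages).
rounds = [(75.0, 4.0), (74.0, 5.0), (76.0, 3.0), (75.5, 4.5), (74.5, 3.5)]
acc, vio = average_rates(rounds)
print(f"Acceptance rate: {acc:.1f}%, Violation rate: {vio:.1f}%")
# → Acceptance rate: 75.0%, Violation rate: 4.0%
```

Acceptance rate is reported higher-is-better (↑) and violation rate lower-is-better (↓), matching the arrows in the table header.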