nvidia
/

Llama3-ChatQA-2-8B

Text Generation

Model card Files Files and versions Community

root commited on Sep 9, 2024

Commit

e799630

·

1 Parent(s): 7afa617

update README

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -19,7 +19,8 @@ We introduce Llama3-ChatQA-2, which bridges the gap between open-source LLMs and
 [Llama3-ChatQA-2-70B](https://huggingface.co/nvidia/Llama3-ChatQA-2-70B) &ensp; [Evaluation Data](https://huggingface.co/nvidia/Llama3-ChatQA-2-70B/tree/main/data) &ensp; [Training Data](https://huggingface.co/datasets/nvidia/ChatQA2-Long-SFT-data) &ensp; [Retriever](https://huggingface.co/intfloat/e5-mistral-7b-instruct) &ensp; [Website](https://chatqa2-project.github.io/) &ensp; [Paper](https://arxiv.org/abs/2407.14482)
 ## Overview of Benchmark Results
-Results in [ChatRAG Bench](https://huggingface.co/datasets/nvidia/ChatRAG-Bench) are as follows:
 ![Example Image](overview.png)

 [Llama3-ChatQA-2-70B](https://huggingface.co/nvidia/Llama3-ChatQA-2-70B) &ensp; [Evaluation Data](https://huggingface.co/nvidia/Llama3-ChatQA-2-70B/tree/main/data) &ensp; [Training Data](https://huggingface.co/datasets/nvidia/ChatQA2-Long-SFT-data) &ensp; [Retriever](https://huggingface.co/intfloat/e5-mistral-7b-instruct) &ensp; [Website](https://chatqa2-project.github.io/) &ensp; [Paper](https://arxiv.org/abs/2407.14482)
 ## Overview of Benchmark Results
+<!-- Results in [ChatRAG Bench](https://huggingface.co/datasets/nvidia/ChatRAG-Bench) are as follows: -->
+We evaluate ChatQA 2 on short-context RAG benchmark (ChatRAG) (within 4K tokens), long context tasks from SCROLLS and LongBench (within 32K tokens), and ultra-long context tasks from In- finiteBench (beyond 100K tokens). Results are shown below.
 ![Example Image](overview.png)