Update README.md
README.md
CHANGED
@@ -1,5 +1,91 @@
|
|
1 |
-
---
|
2 |
-
license: other
|
3 |
-
license_name: openmdw
|
4 |
-
license_link: LICENSE
|
5 |
-
---
|
|
1 |
+
---
|
2 |
+
license: other
|
3 |
+
license_name: openmdw
|
4 |
+
license_link: LICENSE
|
5 |
+
---
|
6 |
+
# Seed-X-PPO-7B
|
7 |
+
<a href="https://github.com/ByteDance-Seed/Seed-X-7B/blob/main/Technical_Report.pdf">
|
8 |
+
<img src="https://img.shields.io/badge/Seed--X-Report-blue"></a>
|
9 |
+
<a href="XXXX">
|
10 |
+
<img src="https://img.shields.io/badge/Seed--X-Hugging%20Face-brightgreen"></a>
|
11 |
+
<a href="https://github.com/ByteDance-Seed/Seed-X-7B/blob/main/LICENSE.openmdw">
|
12 |
+
<img src="https://img.shields.io/badge/License-OpenMDW-yellow"></a>
|
13 |
+
|
14 |
+
## Introduction
|
15 |
+
We are excited to introduce **Seed-X**, a series of powerful open-source multilingual translation language models with 7B parameters, including instruction and reasoning models, that pushes the boundaries of translation capabilities.
|
16 |
+
We develop Seed-X as an accessible, off-the-shelf tool to support the community in advancing translation research and applications:
|
17 |
+
* **Exceptional translation capabilities**: Seed-X exhibits state-of-the-art translation capabilities, on par with or outperforming ultra-large models like Gemini-2.5, Claude-3.5, and GPT-4, as validated by human evaluations and automatic metrics.
|
18 |
+
* **Deployment and inference-friendly**: With a compact 7B parameter count and the Mistral architecture, Seed-X offers outstanding translation performance in a lightweight and efficient package, ideal for deployment and inference.
|
19 |
+
* **Broad domain coverage**: Seed-X excels on a highly challenging translation test set spanning diverse domains, including the internet, science and technology, office dialogues, e-commerce, biomedicine, finance, law, literature, and entertainment.
|
20 |
+

|
21 |
+
|
22 |
+
This repo contains the **Seed-X-PPO** model, with the following features:
|
23 |
+
* Type: Causal language models
|
24 |
+
* Training Stage: Pretraining & Post-training
|
25 |
+
* Support: Multilingual translation among 28 languages
|
26 |
+
|
27 |
+
| Languages | Abbr. | Languages | Abbr. | Languages | Abbr. | Languages | Abbr. |
|
28 |
+
| ----------- | ----------- |-----------|-----------|-----------|-----------| -----------|-----------|
|
29 |
+
|Arabic | ar |French | fr | Malay | ms | Russian | ru |
|
30 |
+
|Czech | cs |Croatian | hr | Norwegian Bokmal | nb | Swedish | sv |
|
31 |
+
|Danish | da |Hungarian | hu | Dutch | nl | Thai | th |
|
32 |
+
|German | de |Indonesian | id | Norwegian | no | Turkish | tr |
|
33 |
+
|English | en |Italian | it | Polish | pl | Ukrainian | uk |
|
34 |
+
|Spanish | es |Japanese | ja | Portuguese | pt | Vietnamese | vi |
|
35 |
+
|Finnish | fi |Korean | ko | Romanian | ro | Chinese | zh |
|
36 |
+
|
37 |
+
## Model Downloads
|
38 |
+
| Model Name | Description | Download |
|
39 |
+
| ----------- | ----------- |-----------|
|
40 |
+
| 👉 **Seed-X-Instruct** | Instruction-tuned for alignment with user intent. |🤗 [Model](https://huggingface.co/ByteDance-Seed/Seed-X-Instruct-7B)|
|
41 |
+
| Seed-X-PPO | RL trained to boost translation capabilities. | 🤗 [Model](https://huggingface.co/ByteDance-Seed/Seed-X-PPO-7B)|
|
42 |
+
|Seed-X-RM | Reward model to evaluate the quality of translation.| 🤗 [Model](https://huggingface.co/ByteDance-Seed/Seed-X-RM-7B)|
|
43 |
+
|
44 |
+
## Quickstart
|
45 |
+
Here is a simple example demonstrating how to load the model and perform translation using `vllm`:
|
46 |
+
```python
|
47 |
+
from vllm import LLM, SamplingParams
|
48 |
+
|
49 |
+
model_path = "./ByteDance-Seed/Seed-X-Instruct-7B"
|
50 |
+
|
51 |
+
model = LLM(model=model_path,
|
52 |
+
max_num_seqs=512,
|
53 |
+
tensor_parallel_size=8,  # number of GPUs to shard the model across; adjust to your hardware
|
54 |
+
enable_prefix_caching=True,
|
55 |
+
gpu_memory_utilization=0.95)
|
56 |
+
|
57 |
+
messages = [
|
58 |
+
"Translate the following English sentence into Chinese:\nMay the force be with you <zh>", # without CoT
|
59 |
+
"Translate the following English sentence into Chinese and explain it in detail:\nMay the force be with you <zh>" # with CoT
|
60 |
+
]
|
61 |
+
|
62 |
+
sampling_params = SamplingParams(temperature=0,
|
63 |
+
max_tokens=4096,
|
64 |
+
skip_special_tokens=True)
|
65 |
+
|
66 |
+
results = model.generate(messages, sampling_params)
|
67 |
+
responses = [res.outputs[0].text.strip() for res in results]
|
68 |
+
|
69 |
+
print(responses)
|
70 |
+
```
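
Both prompts in the example above follow the same pattern: an English instruction naming the source and target language, the source text on a new line, and a trailing target-language tag such as `<zh>`. The sketch below builds such prompts for other language pairs. It assumes the pattern generalizes to every abbreviation in the language table, which the example only demonstrates for English to Chinese; `LANGUAGES` and `build_prompt` are hypothetical names introduced here for illustration and are not part of the Seed-X release.

```python
# Language tags and names copied from the supported-languages table
# (spelling kept exactly as the table gives it).
LANGUAGES = {
    "ar": "Arabic", "cs": "Czech", "da": "Danish", "de": "German",
    "en": "English", "es": "Spanish", "fi": "Finnish", "fr": "French",
    "hr": "Croatian", "hu": "Hungarian", "id": "Indonesian", "it": "Italian",
    "ja": "Japanese", "ko": "Korean", "ms": "Malay", "nb": "Norwegian Bokmal",
    "nl": "Dutch", "no": "Norwegian", "pl": "Polish", "pt": "Portuguese",
    "ro": "Romanian", "ru": "Russian", "sv": "Swedish", "th": "Thai",
    "tr": "Turkish", "uk": "Ukrainian", "vi": "Vietnamese", "zh": "Chinese",
}


def build_prompt(text, src, tgt, with_cot=False):
    """Hypothetical helper: build a prompt in the style of the Quickstart.

    Assumption: the "<tag>" suffix and instruction wording shown for
    English -> Chinese apply unchanged to the other supported languages.
    """
    for code in (src, tgt):
        if code not in LANGUAGES:
            raise ValueError(f"unsupported language code: {code!r}")
    instruction = (
        f"Translate the following {LANGUAGES[src]} sentence "
        f"into {LANGUAGES[tgt]}"
    )
    if with_cot:
        # Wording taken from the "with CoT" prompt in the Quickstart.
        instruction += " and explain it in detail"
    return f"{instruction}:\n{text} <{tgt}>"


# Reproduces the two Quickstart prompts.
messages = [
    build_prompt("May the force be with you", "en", "zh"),
    build_prompt("May the force be with you", "en", "zh", with_cot=True),
]
```

The resulting `messages` list can be passed to `model.generate` exactly as in the example above.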
|
71 |
+
## Evaluation
|
72 |
+
We evaluated Seed-X on a diverse set of translation benchmarks, including FLORES-200, WMT-25, and a publicly released [challenge set](https://github.com/ByteDance-Seed/Seed-X-7B/tree/main/challenge_set) accompanied by human evaluations.
|
73 |
+

|
74 |
+
For detailed benchmark results and analysis, please refer to our [Technical Report](https://github.com/ByteDance-Seed/Seed-X-7B/blob/main/Technical_Report.pdf).
|
75 |
+
|
76 |
+
## License
|
77 |
+
This project is licensed under OpenMDW. See the [LICENSE](https://github.com/ByteDance-Seed/Seed-X-7B/blob/main/LICENSE.openmdw) file for details.
|
78 |
+
|
79 |
+
## Citation
|
80 |
+
If you find Seed-X useful for your research and applications, feel free to give us a star ⭐ or cite us using:
|
81 |
+
```bibtex
|
82 |
+
@Article{XXX,
|
83 |
+
title={XXXXXXXXXXX},
|
84 |
+
author={XXX,XXX,XXX,XXX},
|
85 |
+
year={2025},
|
86 |
+
eprint={XXXX.XXXXX},
|
87 |
+
archivePrefix={arXiv},
|
88 |
+
primaryClass={cs.XX}
|
89 |
+
}
|
90 |
+
```
|
91 |
+
We will soon publish our technical report on arXiv.
|