Translation
Safetensors
mistral
YuLu0713 commited on
Commit
8aa8dbb
·
verified ·
1 Parent(s): 7b62e6d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +91 -5
README.md CHANGED
@@ -1,5 +1,91 @@
1
- ---
2
- license: other
3
- license_name: openmdw
4
- license_link: LICENSE
5
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ license_name: openmdw
4
+ license_link: LICENSE
5
+ ---
6
+ # Seed-X-PPO-7B
7
+ <a href="https://github.com/ByteDance-Seed/Seed-X-7B/blob/main/Technical_Report.pdf">
8
+ <img src="https://img.shields.io/badge/Seed--X-Report-blue"></a>
9
+ <a href="XXXX">
10
+ <img src="https://img.shields.io/badge/Seed--X-Hugging Face-brightgreen"></a>
11
+ <a href="https://github.com/ByteDance-Seed/Seed-X-7B/blob/main/LICENSE.openmdw">
12
+ <img src="https://img.shields.io/badge/License-OpenMDW-yellow"></a>
13
+
14
+ ## Introduction
15
+ We are excited to introduce **Seed-X**, a powerful open-source multilingual translation language model series, including instruction and reasoning models, with 7B parameters pushing the boundaries of translation capabilities.
16
+ We develop Seed-X as an accessible, off-the-shelf tool to support the community in advancing translation research and applications:
17
+ * **Exceptional translation capabilities**: Seed-X exhibits state-of-the-art translation capabilities, on par with or outperforming ultra-large models like Gemini-2.5, Claude-3.5, and GPT-4, as validated by human evaluations and automatic metrics.
18
+ * **Deployment and inference-friendly**: With a compact 7B parameter count and mistral architecture, Seed-X offers outstanding translation performance in a lightweight and efficient package, ideal for deployment and inference.
19
+ * **Broad domain coverage**: Seed-X excels on a highly challenging translation test set spanning diverse domains, including the internet, science and technology, office dialogues, e-commerce, biomedicine, finance, law, literature, and entertainment.
20
+ ![performance](imgs/model_comparsion.png)
21
+
22
+ This repo contains the **Seed-X-Instruct** model, with the following features:
23
+ * Type: Causal language models
24
+ * Training Stage: Pretraining & Post-training
25
+ * Support: Multilingual translation among 28 languages
26
+
27
+ | Languages | Abbr. | Languages | Abbr. | Languages | Abbr. | Languages | Abbr. |
28
+ | ----------- | ----------- |-----------|-----------|-----------|-----------| -----------|-----------|
29
+ |Arabic | ar |French | fr | Malay | ms | Russian | ru |
30
+ |Czech | cs |Croatian | hr | Norwegian Bokmal | nb | Swedish | sv |
31
+ |Danish | da |Hungarian | hu | Dutch | nl | Thai | th |
32
+ |German | de |Indonesian | id | Norwegian | no | Turkish | tr |
33
+ |English | en |Italian | it | Polish | pl | Ukrainian | uk |
34
+ |Spanish | es |Japanese | ja | Portuguese | pt | Vietnamese | vi |
35
+ |Finnish | fi |Korean | ko | Romanian | ro | Chinese | zh |
36
+
37
+ ## Model Downloads
38
+ | Model Name | Description | Download |
39
+ | ----------- | ----------- |-----------
40
+ | 👉 **Seed-X-Instruct** | Instruction-tuned for alignment with user intent. |🤗 [Model](https://huggingface.co/ByteDance-Seed/Seed-X-Instruct-7B)|
41
+ | Seed-X-PPO | RL trained to boost translation capabilities. | 🤗 [Model](https://huggingface.co/ByteDance-Seed/Seed-X-PPO-7B)|
42
+ |Seed-X-RM | Reward model to evaluate the quality of translation.| 🤗 [Model](https://huggingface.co/ByteDance-Seed/Seed-X-RM-7B)|
43
+
44
+ ## Quickstart
45
+ Here is a simple example demonstrating how to load the model and perform translation using ```vllm```
46
+ ```python
47
+ from vllm import LLM, SamplingParams
48
+
49
+ model_path = "./ByteDance-Seed/Seed-X-Instruct-7B"
50
+
51
+ model = LLM(model=model_path,
52
+ max_num_seqs=512,
53
+ tensor_parallel_size=8,
54
+ enable_prefix_caching=True,
55
+ gpu_memory_utilization=0.95)
56
+
57
+ messages = [
58
+ "Translate the following English sentence into Chinese:\nMay the force be with you <zh>", # without CoT
59
+ "Translate the following English sentence into Chinese and explain it in detail:\nMay the force be with you <zh>" # with CoT
60
+ ]
61
+
62
+ sampling_params = SamplingParams(temperature=0,
63
+ max_tokens=4096,
64
+ skip_special_tokens=True)
65
+
66
+ results = model.generate(messages, sampling_params)
67
+ responses = [res.outputs[0].text.strip() for res in results]
68
+
69
+ print(responses)
70
+ ```
71
+ ## Evaluation
72
+ We evaluated Seed-X on a diverse set of translation benchmarks, including FLORES-200, WMT-25, and a publicly released [challenge set](https://github.com/ByteDance-Seed/Seed-X-7B/tree/main/challenge_set) accompanied by human evaluations.
73
+ ![humen_eval](imgs/humen_eval.png)
74
+ For detailed benchmark results and analysis, please refer to our [Technical Report](https://github.com/ByteDance-Seed/Seed-X-7B/blob/main/Technical_Report.pdf).
75
+
76
+ ## License
77
+ This project is licensed under OpenMDW. See the [LICENSE](https://github.com/ByteDance-Seed/Seed-X-7B/blob/main/LICENSE.openmdw) flie for details.
78
+
79
+ ## Citation
80
+ If you find Seed-X useful for your research and applications, feel free to give us a star ⭐ or cite us using:
81
+ ```bibtex
82
+ @Article{XXX,
83
+ title={XXXXXXXXXXX},
84
+ author={XXX,XXX,XXX,XXX},
85
+ year={2025},
86
+ eprint={XXXX.XXXXX},
87
+ archivePrefix={arXiv},
88
+ primaryClass={cs.XX}
89
+ }
90
+ ```
91
+ We will soon publish our technical report on Arxiv.