Categorizer Model and Training Dataset
Collection
1 item
β’
Updated
Project page: https://huggingface.co/BenchHub. Code: https://github.com/rladmstn1714/BenchHub
BenchHub-Cat-7b is a category classification model based on Qwen2.5-7B, fine-tuned to assign natural language queries to structured category triplets: (subject, skill, target)
.
Hyperparameter | Value |
---|---|
Sequence Length | 8192 |
Learning Rate | 2 Γ 10β»β΅ |
Batch Size (Effective) | 256 |
Epochs | 3 |
Scheduler | Cosine Decay |
Warmup Ratio | 0.05 |
Optimizer | Method from [19] |
Trainer | DeepSpeed ZeRO-3 |
Hardware | 4Γ A6000 48GB GPUs |
Training Time | ~5 hours per run |
Input: Natural language question or instruction
Output: Triplet (subject, skill, target)
, such as:
{ "subject_type": "history",
"task_type": "reasoning",
"target_type": "korea"}
### Instruction:
Classify the following query into subject, skill, and target.
### Query:
How did Confucianism shape education in East Asia?
### Output:
{ "subject_type": "history",
"task_type": "reasoning",
"target_type": "korea"}
Apache 2.0