metadata
license: mit
language:
- en
pipeline_tag: text-classification
tags:
- gpt2
- distributed-training
- megatron
- accelerate
Megatron-GPT2-Classification
Description
The megatron-gpt2-classification
model is a language model trained using Megatron and Accelerate frameworks. It has been fine-tuned for classification tasks and benefits from distributed training across 4 GPUs (RTX 4070).
Key Features
- Trained with Megatron and Accelerate.
- Distributed training on 4 GPUs (RTX 4070).
- Fine-tuned for classification tasks.