Himanshu13x
/

gpt2-medium355M-sft

Text Generation

Model card Files Files and versions

Himanshu13x commited on Aug 13

Commit

0bd86d6

·

verified ·

1 Parent(s): d605dcf

Update README.md

Files changed (1) hide show

README.md +17 -1

README.md CHANGED Viewed

@@ -5,4 +5,20 @@ language:
 base_model:
 - openai-community/gpt2-medium
 pipeline_tag: text-generation
----

 base_model:
 - openai-community/gpt2-medium
 pipeline_tag: text-generation
+---
+# GPT-2 (From Scratch in PyTorch) — Fine-Tuned Version
+This model is a **custom GPT-2 implementation** built entirely from scratch in **PyTorch** (no Hugging Face Transformers for the architecture itself) and **fine-tuned** on a custom dataset using **Supervised Fine-Tuning (SFT)**.
+## Model Details
+- **Architecture:** GPT-2 (from scratch)
+- **Variants Supported:** gpt2-small, gpt2-medium, gpt2-large, gpt2-xl
+- **Framework:** PyTorch
+- **Pretraining Source:** Loaded GPT-2 pretrained weights from OpenAI format
+- **Fine-Tuning Method:** Supervised Fine-Tuning (SFT)
+- **Fine-Tuning Data:** Custom dataset (domain-specific; see dataset section)
+- **Tokenization:** GPT-2 tokenizer style