Text2GPT ๐ค
Text2GPT is a lightweight text generation model fine-tuned from DistilGPT2, with 81.9M parameters, designed for efficient and coherent text generation. It leverages the power of transformers and supports Safetensors for secure model loading. Ideal for creative writing, text completion, and more! ๐
Features โจ
- Generates human-like text with minimal input ๐
- Supports Safetensors for safe and efficient loading ๐
- Fine-tuned for low-resource environments โก
- Compatible with Hugging Face
transformers
and vLLM ๐
Installation ๐ ๏ธ
Install the required dependencies:
pip install transformers torch safetensors
Usage ๐ฎ
Loading the Model with Transformers
Use the Hugging Face transformers
library to load and generate text:
from transformers import AutoTokenizer, AutoModelForCausalLM
# Load model and tokenizer
model_name = "kulia-moon/Text2GPT"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
# Generate text
input_text = "Once upon a time"
inputs = tokenizer(input_text, return_tensors="pt")
outputs = model.generate(**inputs, max_length=50, do_sample=True)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
Using Pipeline for Simplicity
For quick text generation:
from transformers import pipeline
pipe = pipeline("text-generation", model="kulia-moon/Text2GPT")
print(pipe("My name is Julien and I like to", max_length=30, do_sample=True)[0]["generated_text"])
vLLM Deployment for Scalability
Deploy with vLLM for high-throughput inference:
docker run --runtime nvidia --gpus all -v ~/.cache/huggingface:/root/.cache/huggingface -p 8000:8000 --ipc=host vllm/vllm-openai:latest --model kulia-moon/Text2GPT
Then, serve the model:
docker exec -it my_vllm_container bash -c "vllm serve kulia-moon/Text2GPT"
Widget Examples ๐ฑ๏ธ
Try these prompts on the model page:
- "Once upon a time" โณ
- "My name is Julien and I like to" ๐
- "Paris is an amazing place to visit," ๐ผ
- "I like traveling by train because" ๐
Example Output:
Input: "Once upon a time"
Output: "Once upon a time, a curious AI roamed the digital realm, crafting tales of wonder."
Model Details ๐
- Architecture: DistilGPT2-based, 6 layers, 81.9M parameters
- Base Model: distilbert/distilgpt2
- Safetensors: Supported, 81,912,576 parameters (non-sharded, non-quantized)
- Intended Use: Text generation, creative writing, dialogue completion
- Limitations: May produce biased or repetitive outputs; not optimized for sensitive tasks
Evaluation Report ๐
Evaluation metrics are under development. Preliminary tests suggest performance comparable to DistilGPT2 (perplexity ~21.1 on WikiText-103). Contributions for detailed metrics are welcome via discussions! ๐
Requirements โ๏ธ
- Python 3.8+
transformers>=4.30.0
torch>=2.0.0
safetensors>=0.4.0
License ๐
This model is licensed under the MIT License.
Community & Support ๐ฌ
Join the conversation or seek help at:
Contributions and feedback are welcome! ๐
- Downloads last month
- 16