Text2GPT 🤖

Text2GPT is a lightweight text generation model fine-tuned from DistilGPT2, with 81.9M parameters, designed for efficient and coherent text generation. It is built on Hugging Face Transformers and supports Safetensors for secure model loading. Ideal for creative writing, text completion, and more! 🚀


Features ✨

  • Generates human-like text with minimal input 📝
  • Supports Safetensors for safe and efficient loading 🔒
  • Fine-tuned for low-resource environments ⚡
  • Compatible with Hugging Face transformers and vLLM 🚀

Installation 🛠️

Install the required dependencies:

pip install transformers torch safetensors
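
To confirm the installation, a quick sanity check can print the installed versions (compare against the Requirements section below):

import transformers, torch, safetensors

# Print the versions of the three required packages.
print(transformers.__version__, torch.__version__, safetensors.__version__)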

Usage 🎮

Loading the Model with Transformers

Use the Hugging Face transformers library to load and generate text:

from transformers import AutoTokenizer, AutoModelForCausalLM

# Load model and tokenizer
model_name = "kulia-moon/Text2GPT"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Generate text
input_text = "Once upon a time"
inputs = tokenizer(input_text, return_tensors="pt")
outputs = model.generate(**inputs, max_length=50, do_sample=True)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
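
Sampling behaviour can be tuned with the standard generate() arguments. The values below are illustrative rather than settings recommended for this model, and the snippet reuses the model, tokenizer, and inputs loaded above:

# Reusing model, tokenizer, and inputs from the previous snippet.
outputs = model.generate(
    **inputs,
    max_new_tokens=50,          # generate up to 50 new tokens after the prompt
    do_sample=True,             # sample instead of greedy decoding
    temperature=0.8,            # <1.0 sharpens the distribution
    top_p=0.95,                 # nucleus sampling
    repetition_penalty=1.2,     # discourage repeated phrases
    pad_token_id=tokenizer.eos_token_id,  # GPT-2 models define no pad token
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))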

Using Pipeline for Simplicity

For quick text generation:

from transformers import pipeline

pipe = pipeline("text-generation", model="kulia-moon/Text2GPT")
print(pipe("My name is Julien and I like to", max_length=30, do_sample=True)[0]["generated_text"])
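
The pipeline accepts the same generation arguments. For example, drawing several samples from one prompt with a fixed seed (parameter values are illustrative):

from transformers import pipeline, set_seed

set_seed(42)  # make sampling reproducible
pipe = pipeline("text-generation", model="kulia-moon/Text2GPT")
for sample in pipe("Paris is an amazing place to visit,", max_length=30, do_sample=True, num_return_sequences=3):
    print(sample["generated_text"])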

vLLM Deployment for Scalability

Deploy with vLLM for high-throughput inference. The command below starts an OpenAI-compatible server on port 8000 (the container is named so the next command can reference it):

docker run --runtime nvidia --gpus all --name my_vllm_container -v ~/.cache/huggingface:/root/.cache/huggingface -p 8000:8000 --ipc=host vllm/vllm-openai:latest --model kulia-moon/Text2GPT

Alternatively, serve the model from inside an already running vLLM container named my_vllm_container:

docker exec -it my_vllm_container bash -c "vllm serve kulia-moon/Text2GPT"
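
Once the server is up, it exposes an OpenAI-compatible API on port 8000. A minimal Python query (prompt and parameters are illustrative):

import requests

response = requests.post(
    "http://localhost:8000/v1/completions",
    json={
        "model": "kulia-moon/Text2GPT",
        "prompt": "Once upon a time",
        "max_tokens": 50,
        "temperature": 0.8,
    },
)
# The response follows the OpenAI completions schema.
print(response.json()["choices"][0]["text"])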

Widget Examples 🖱️

Try these prompts on the model page:

  • "Once upon a time" โณ
  • "My name is Julien and I like to" ๐Ÿ˜Š
  • "Paris is an amazing place to visit," ๐Ÿ—ผ
  • "I like traveling by train because" ๐Ÿš‚

Example Output:

Input: "Once upon a time"
Output: "Once upon a time, a curious AI roamed the digital realm, crafting tales of wonder."

Model Details 📊

  • Architecture: DistilGPT2-based, 6 layers, 81.9M parameters
  • Base Model: distilbert/distilgpt2
  • Safetensors: Supported, 81,912,576 parameters (non-sharded, non-quantized)
  • Intended Use: Text generation, creative writing, dialogue completion
  • Limitations: May produce biased or repetitive outputs; not optimized for sensitive tasks
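
The reported parameter count can be checked directly from the downloaded checkpoint:

from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("kulia-moon/Text2GPT")
# Should roughly match the 81.9M figure above (tied embedding weights are counted once).
print(f"{sum(p.numel() for p in model.parameters()):,}")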

Evaluation Report 📈

Evaluation metrics are under development. Preliminary tests suggest performance comparable to DistilGPT2 (perplexity ~21.1 on WikiText-103). Contributions for detailed metrics are welcome via discussions! 🙌
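
A rough sketch of how a comparable perplexity number could be computed. This assumes the datasets package is installed and uses a simple chunked evaluation; it is not necessarily the exact procedure behind the figure above:

import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "kulia-moon/Text2GPT"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

# Tokenize the WikiText-103 test split as one long stream.
text = "\n\n".join(load_dataset("wikitext", "wikitext-103-raw-v1", split="test")["text"])
ids = tokenizer(text, return_tensors="pt").input_ids

chunk = 1024  # DistilGPT2 context window
nlls, n_tokens = [], 0
for i in range(0, ids.size(1), chunk):
    input_ids = ids[:, i : i + chunk]
    if input_ids.size(1) < 2:
        break
    with torch.no_grad():
        loss = model(input_ids, labels=input_ids).loss  # mean NLL over this chunk
    nlls.append(loss * (input_ids.size(1) - 1))
    n_tokens += input_ids.size(1) - 1

print("perplexity:", torch.exp(torch.stack(nlls).sum() / n_tokens).item())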

Requirements ⚙️

  • Python 3.8+
  • transformers>=4.30.0
  • torch>=2.0.0
  • safetensors>=0.4.0

License 📜

This model is licensed under the MIT License.

Community & Support 💬

Join the conversation or seek help in the discussions on the model page.

Contributions and feedback are welcome! 🌟
