--- license: mit language: - en library_name: transformers tags: - text-classification - yelp-reviews - gpt-2 - bert --- # **Model Description** This model predicts the star rating (1 - 5) of a Yelp review based on its text content. It was trained using **GPT-2** and **BERT**, with **BERT** achieving the best performance at **75%** validation accuracy. The model addresses class imbalance using weighted loss and optimizes hyperparameters to enhance generalization. # **Training Details** - **Dataset**: Yelp Reviews dataset (100,000 samples used) - **Preprocessing**: - **GPT-2 Tokenizer** with **Byte-Pair Encoding (BPE)** for rare words - Truncation (128 tokens) and padding for uniform input size - **Models Trained**: - **GPT-2**: Fine-tuned with a custom classification head, achieving **67% validation accuracy** - **BERT**: Fine-tuned with bidirectional attention, achieving **75% validation accuracy** - **Loss Function**: Weighted **Cross-Entropy Loss** to counteract class imbalance # **Limitations** - Performance may degrade on **highly informal or extremely short reviews** - **Class imbalance** still affects predictions for underrepresented ratings - Model was trained on **English-language** reviews only