EXt1's picture
Update README.md
08be4a5 verified
---
library_name: transformers
pipeline_tag: text-classification
datasets:
- EXt1/Thai-True-Fake-News
language:
- th
metrics:
- accuracy
base_model:
- microsoft/mdeberta-v3-base
---
# mdeberta-v3-base-thai-fakenews
This model is a fine-tuned version of the microsoft/mdeberta-v3-base model. It was fine-tuned using the EXt1/Thai-True-Fake-News dataset, a collection of Thai news articles labeled as either real or fake. The model is designed for fake news detection in the Thai language, achieving an accuracy of 91% on a test set. This model is part of the senior project of CPE35 students at King Mongkut's University of Technology Thonburi (KMUTT).
### Model Description
- **Base Mode: `microsoft/mdeberta-v3-base`**
- **Dataset: `EXt1/Thai-True-Fake-News`**
- **Model Size: 279M parameters**
- **Language: Thai**
- **Labels:**
- 0: True News
- 1: Fake News
### Evaluation Results
- **Loss: 0.25065**
- **Accuracy: 91% on the test set**
### Usage
```
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch
tokenizer = AutoTokenizer.from_pretrained("EXt1/mdeberta-v3-base-thai-fakenews")
model = AutoModelForSequenceClassification.from_pretrained("EXt1/mdeberta-v3-base-thai-fakenews")
text = "M-Flow ส่ง SMS แจ้งให้ชำระค่าปรับจราจรด้วยการคลิกลิงก์"
inputs = tokenizer(text, return_tensors="pt", truncation=True, padding=True)
with torch.no_grad():
logits = model(**inputs).logits
predicted_class = torch.argmax(logits, dim=1).item()
if predicted_class == 1:
print("ข่าวปลอม")
else:
print("ข่าวจริง")
```
### Use Cases
This model is designed for text classification tasks, specifically for distinguishing between true and fake news in the Thai language. It can be applied to various use cases, such as:
- Detecting fake news articles in the Thai language on social media or news websites.
- Supporting news verification systems or automated content moderation tools.