fact-check1-v3-final (Credo AI - Brain 2)

This is the definitive "Brain 2" model from the Credo AI project, a powerful and robust binary (FAKE/REAL) news classifier. This model represents the culmination of a multi-stage refinement process designed to create a highly accurate and unbiased fake news specialist.

The Model's Journey: From Specialist to Sage

This model is a fine-tuned version of microsoft/deberta-v3-large.

  • v1: The model was initially trained on a massive, combined corpus of over 50,000 news articles, achieving 99.9%+ accuracy on its core task. However, this intense specialization made it a "conspiracy theorist," classifying simple, factual statements as FAKE because they didn't match the stylistic patterns of the news it was trained on.

  • v2: The model underwent a "scientific calibration" by being fine-tuned on a dataset of pristine scientific facts (allenai/scifact). This corrected its bias against clean, encyclopedic text but was still too narrow.

  • v3 (This version): The model has completed its final "masterclass." It was fine-tuned on a perfectly balanced dataset containing:

    • General Knowledge: Thousands of clean, factual statements from Wikipedia (wiki_qa dataset) to broaden its understanding of truth.
    • Memory Reinforcement: A sample of classic fake news articles (mrisdal/fake-news) to remind it of its core mission and prevent catastrophic forgetting.

This final calibration, performed with an ultra-low learning rate, has produced a model that is both a powerful fake news detector and a robust, general-purpose fact-checker.

Intended Use

This model is intended to be used as a fast and powerful first-line filter for English-language news headlines and short texts.

from transformers import pipeline

classifier = pipeline("text-classification", model="Arko007/fact-check1-v3-final")

# Example 1: Simple Fact (The test it used to fail)
text1 = "The sun rises in the east."
print(classifier(text1))
# Expected output: [{'label': 'REAL', 'score': ...}]

# Example 2: Fake News
text2 = "BREAKING: Scientists confirm lizards are secretly running the government."
print(classifier(text2))
# Expected output: [{'label': 'FAKE', 'score': ...}]

Limitations

While this v3 model is far more robust, its core training was on political and general news. Its performance may be lower on highly specialized domains like financial analysis or deep scientific literature.

Downloads last month
56
Safetensors
Model size
0.4B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Space using Arko007/fact-check1-v3-final 1