Shh, don't say that! Domain Certification in LLMs


Collection: https://huggingface.co/collections/cemde/domain-certification-67ba4fb663f8d1348c3c2263

Certify your Large Language Model (LLM)!

With the code in this repository you can reproduce the workflows we use in our ICLR 2025 paper to achieve Domain Certification using our VALID algorithm.

We provide the guide models for our Medical Question Answering experiments.

| Model | Description |
| --- | --- |
| cemde/Domain-Certification-MedQA-Guide-Base | This is the base model trained on the ground-truth responses. |
| cemde/Domain-Certification-MedQA-Guide-Finetuned | This is the model trained on responses from Llama-3-8B. |
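
The two guide models listed above can likely be loaded from the Hugging Face Hub by repository ID. The following is a minimal sketch, not the authors' loading code: it assumes the checkpoints are causal language models compatible with the `transformers` Auto classes and that each repository ships a tokenizer. The `GUIDE_MODELS` mapping and the `load_guide` helper are hypothetical names introduced here for illustration.

```python
# Repository IDs taken from the model list above.
GUIDE_MODELS = {
    "base": "cemde/Domain-Certification-MedQA-Guide-Base",
    "finetuned": "cemde/Domain-Certification-MedQA-Guide-Finetuned",
}


def load_guide(variant="finetuned"):
    """Load a guide model and its tokenizer (hypothetical helper).

    Assumption: the checkpoint loads as a causal LM through the
    transformers Auto classes; this is not confirmed by the model card.
    """
    repo_id = GUIDE_MODELS[variant]  # raises KeyError for unknown variants

    # Imported lazily so the mapping can be used without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id)
    model = AutoModelForCausalLM.from_pretrained(repo_id)
    model.eval()  # inference mode: disables dropout
    return tokenizer, model
```

Calling `load_guide("base")` downloads the base guide model on first use. If the checkpoints turn out to use a different architecture or tokenizer setup, the code in this repository is the authoritative reference for loading them.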

Citation

@inproceedings{
emde2025shh,
title={Shh, don't say that! Domain Certification in {LLM}s},
author={Cornelius Emde and Alasdair Paren and Preetham Arvind and Maxime Guillaume Kayser and Tom Rainforth and Bernard Ghanem and Thomas Lukasiewicz and Philip Torr and Adel Bibi},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025},
url={https://arxiv.org/abs/2502.19320}
}

Model size: 184M params (Safetensors, tensor type F32)