Hammer: Robust Function-Calling for On-Device Language Models via Function Masking Paper • 2410.04587 • Published Oct 6, 2024 • 2
OmniGen2: Exploration to Advanced Multimodal Generation Paper • 2506.18871 • Published 4 days ago • 65
Taming the Titans: A Survey of Efficient LLM Inference Serving Paper • 2504.19720 • Published Apr 28 • 11
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents Paper • 2404.10774 • Published Apr 16, 2024 • 4
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving Paper • 2502.07640 • Published Feb 11 • 9
Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving Paper • 2506.17104 • Published 7 days ago • 1
LLMs Will Always Hallucinate, and We Need to Live With This Paper • 2409.05746 • Published Sep 9, 2024 • 5
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey Paper • 2503.12605 • Published Mar 16 • 35
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective Paper • 2505.15045 • Published May 21 • 54
view article Article Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios By quotientai and 3 others • May 2 • 19
ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems Paper • 2311.09476 • Published Nov 16, 2023 • 6
view article Article Good answers are not necessarily factual answers: an analysis of hallucination in leading LLMs By davidberenstein1957 and 1 other • May 7 • 38
LLM Hallucination Detection Papers Collection Collection of LLM hallucination and evaluation papers that I've been exploring and implementing. Some of them have my comments and annotated doodles. • 12 items • Updated Feb 20, 2024 • 13
view article Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.26k
Preference Leakage: A Contamination Problem in LLM-as-a-judge Paper • 2502.01534 • Published Feb 3 • 41
The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published Feb 3 • 115
view article Article Automatic Hallucination detection with SelfCheckGPT NLI By dhuynh95 • Nov 27, 2023 • 7