LLM Hallucination Detection Papers
Collection of LLM hallucination and evaluation papers that I've been exploring and implementing. Some of them have my comments and annotated doodles.
12 items • Updated Feb 20, 2024
Preference Leakage: A Contamination Problem in LLM-as-a-judge • arXiv:2502.01534 • Published Feb 3, 2025
The Differences Between Direct Alignment Algorithms are a Blur • arXiv:2502.01237 • Published Feb 3, 2025
Can Language Models Replace Programmers? REPOCOD Says 'Not Yet' • arXiv:2410.21647 • Published Oct 29, 2024
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions • arXiv:2410.20424 • Published Oct 27, 2024
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free • arXiv:2410.10814 • Published Oct 14, 2024
What Matters in Transformers? Not All Attention is Needed • arXiv:2406.15786 • Published Jun 22, 2024
Law of the Weakest Link: Cross Capabilities of Large Language Models • arXiv:2409.19951 • Published Sep 30, 2024
Prithvi WxC: Foundation Model for Weather and Climate • arXiv:2409.13598 • Published Sep 20, 2024
ClimDetect: A Benchmark Dataset for Climate Change Detection and Attribution • arXiv:2408.15993 • Published Aug 28, 2024
TableGPT: Towards Unifying Tables, Nature Language and Commands into One GPT • arXiv:2307.08674 • Published Jul 17, 2023