philschmid (Philipp Schmid)

upvoted a paper 6 months ago

MedGemma Technical Report

Paper • 2507.05201 • Published Jul 7 • 14

upvoted a paper 7 months ago

Mem0: Building Production-Ready AI Agents with Scalable Long-Term Memory

Paper • 2504.19413 • Published Apr 28 • 35

upvoted an article 9 months ago

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

Article • Mar 17 • 348

upvoted a paper 9 months ago

Why Do Multi-Agent LLM Systems Fail?

Paper • 2503.13657 • Published Mar 17 • 47

upvoted an article 10 months ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • Mar 12 • 479

upvoted a paper 11 months ago

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Paper • 2502.01534 • Published Feb 3 • 40

upvoted 2 articles 11 months ago

Open-R1: Update #1

Article • Feb 2 • 305

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

Article • Jan 31 • 51

upvoted a paper 11 months ago

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 99

upvoted 6 papers about 1 year ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 51

Pyramidal Flow Matching for Efficient Video Generative Modeling

Paper • 2410.05954 • Published Oct 8, 2024 • 40

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 121

upvoted an article about 1 year ago

Llama can now see and run on your device - welcome Llama 3.2

Article • Sep 25, 2024 • 191

upvoted a collection about 1 year ago

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 649

upvoted a paper about 1 year ago

EuroLLM: Multilingual Language Models for Europe

Paper • 2409.16235 • Published Sep 24, 2024 • 29

upvoted a paper over 1 year ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 140

upvoted a collection over 1 year ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 669

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Phi-4 Technical Report

Hymba: A Hybrid-head Architecture for Small Language Models