Maximilian Schneider PRO

DaMax96

AI & ML interests

None yet

Organizations

None yet

DaMax96's activity

upvoted 3 papers 2 days ago

Open Problems in Mechanistic Interpretability

Paper • 2501.16496 • Published 3 days ago • 11

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 2 days ago • 23

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 2 days ago • 48

upvoted 2 papers 9 months ago

Capabilities of Gemini Models in Medicine

Paper • 2404.18416 • Published Apr 29, 2024 • 24

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 256

upvoted a paper 10 months ago

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Paper • 2404.12253 • Published Apr 18, 2024 • 55

upvoted a paper 11 months ago

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Paper • 2402.19427 • Published Feb 29, 2024 • 53

upvoted a paper about 1 year ago

MM-VID: Advancing Video Understanding with GPT-4V(ision)

Paper • 2310.19773 • Published Oct 30, 2023 • 20

upvoted 3 papers over 1 year ago

AlpaGasus: Training A Better Alpaca with Fewer Data

Paper • 2307.08701 • Published Jul 17, 2023 • 23

Retentive Network: A Successor to Transformer for Large Language Models

Paper • 2307.08621 • Published Jul 17, 2023 • 171

Training Models to Generate, Recognize, and Reframe Unhelpful Thoughts

Paper • 2307.02768 • Published Jul 6, 2023 • 14