Bootstrapping World Models from Dynamics Models in Multimodal Foundation Models Paper • 2506.06006 • Published 6 days ago • 11
Inference-Time Hyper-Scaling with KV Cache Compression Paper • 2506.05345 • Published 7 days ago • 25
Self-Training Large Language Models for Tool-Use Without Demonstrations Paper • 2502.05867 • Published Feb 9
The Ultra-Scale Playbook 🌌 The ultimate guide to training LLMs on large GPU clusters • Running • 2.68k
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering Paper • 2410.15999 • Published Oct 21, 2024 • 20
A-NeSI: A Scalable Approximate Method for Probabilistic Neurosymbolic Inference Paper • 2212.12393 • Published Dec 23, 2022
IntelliGraphs: Datasets for Benchmarking Knowledge Graph Generation Paper • 2307.06698 • Published Jul 13, 2023
Prompting as Probing: Using Language Models for Knowledge Base Construction Paper • 2208.11057 • Published Aug 23, 2022 • 3