Enhancing Automated Interpretability with Output-Centric Feature Descriptions Paper • 2501.08319 • Published 4 days ago • 10
Sometimes I am a Tree: Data Drives Unstable Hierarchical Generalization Paper • 2412.04619 • Published Dec 5, 2024 • 1
Towards scientific discovery with dictionary learning: Extracting biological concepts from microscopy foundation models Paper • 2412.16247 • Published 29 days ago • 1
Interpretability as Compression: Reconsidering SAE Explanations of Neural Activations with MDL-SAEs Paper • 2410.11179 • Published Oct 15, 2024 • 1
Incremental Sentence Processing Mechanisms in Autoregressive Transformer Language Models Paper • 2412.05353 • Published Dec 6, 2024 • 1
The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant Units Paper • 2411.02280 • Published Nov 4, 2024 • 1
Inferring Functionality of Attention Heads from their Parameters Paper • 2412.11965 • Published Dec 16, 2024 • 2
LatentQA: Teaching LLMs to Decode Activations Into Natural Language Paper • 2412.08686 • Published Dec 11, 2024 • 1
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published Dec 9, 2024 • 74
NLI Eval Datasets Collection A curated collection of NLI evaluation datasets. Each dataset is exactly as originally proposed • 19 items • Updated Nov 12, 2024 • 3
🇮🇹👓 LLaVA-NDiNO Collection HF Collection for the models of the paper "LLaVA-NDiNO: Empowering LLMs with Multimodality for the Italian Language" • 7 items • Updated Oct 20, 2024 • 3
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper • 2411.17465 • Published Nov 26, 2024 • 78
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated 27 days ago • 31
view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka • Nov 19, 2024 • 104
Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models Paper • 2411.14257 • Published Nov 21, 2024 • 9
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models Paper • 2411.12580 • Published Nov 19, 2024 • 2
Controllable Context Sensitivity and the Knob Behind It Paper • 2411.07404 • Published Nov 11, 2024 • 1