Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem Paper β’ 2506.03295 β’ Published 7 days ago β’ 17
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly Paper β’ 2505.10610 β’ Published 26 days ago β’ 53
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly Paper β’ 2505.10610 β’ Published 26 days ago β’ 53
view article Article π¦Έπ»#1: Open-endedness and AI Agents β A Path from Generative to Creative AI? By Kseniase β’ Dec 25, 2024 β’ 13
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression Paper β’ 2503.02812 β’ Published Mar 4 β’ 10
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression Paper β’ 2503.02812 β’ Published Mar 4 β’ 10
Q-Filters Collection Pre-computed Q-Filters for efficient KV cache compression. β’ 15 items β’ Updated Mar 3 β’ 7
Pre-Trianing Data Packing Collection [ACL'24] Analysing the Impact of Sequence Composition on Language Model Pre-Training. https://github.com/yuzhaouoe/pretraining-data-packing β’ 10 items β’ Updated Mar 3
SAE-Based Representation Engineering Collection [NAACL'25] SAE-Based RepE github.com/yuzhaouoe/SAE-based-representation-engineering β’ 5 items β’ Updated Mar 3
SAE-Based Representation Engineering Collection [NAACL'25] SAE-Based RepE github.com/yuzhaouoe/SAE-based-representation-engineering β’ 5 items β’ Updated Mar 3
Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering Paper β’ 2410.15999 β’ Published Oct 21, 2024 β’ 20 β’ 3
Analysing the Residual Stream of Language Models Under Knowledge Conflicts Paper β’ 2410.16090 β’ Published Oct 21, 2024 β’ 7 β’ 2