NeuralOS: Towards Simulating Operating Systems via Neural Generative Models Paper β’ 2507.08800 β’ Published 18 days ago β’ 74
Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem Paper β’ 2506.03295 β’ Published Jun 3 β’ 17
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly Paper β’ 2505.10610 β’ Published May 15 β’ 54
MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly Paper β’ 2505.10610 β’ Published May 15 β’ 54
view article Article π¦Έπ»#1: Open-endedness and AI Agents β A Path from Generative to Creative AI? By Kseniase β’ Dec 25, 2024 β’ 16
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression Paper β’ 2503.02812 β’ Published Mar 4 β’ 10
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression Paper β’ 2503.02812 β’ Published Mar 4 β’ 10
Q-Filters Collection Pre-computed Q-Filters for efficient KV cache compression. β’ 15 items β’ Updated Mar 3 β’ 7
Pre-Trianing Data Packing Collection [ACL'24] Analysing the Impact of Sequence Composition on Language Model Pre-Training. https://github.com/yuzhaouoe/pretraining-data-packing β’ 10 items β’ Updated Mar 3
SAE-Based Representation Engineering Collection [NAACL'25] SAE-Based RepE github.com/yuzhaouoe/SAE-based-representation-engineering β’ 5 items β’ Updated Mar 3
SAE-Based Representation Engineering Collection [NAACL'25] SAE-Based RepE github.com/yuzhaouoe/SAE-based-representation-engineering β’ 5 items β’ Updated Mar 3