A Technical Study into Small Reasoning Language Models Paper • 2506.13404 • Published 22 days ago • 9
Uncovering Cultural Representation Disparities in Vision-Language Models Paper • 2505.14729 • Published May 20 • 1
ReEx-SQL: Reasoning with Execution-Aware Reinforcement Learning for Text-to-SQL Paper • 2505.12768 • Published May 19 • 3
Improving Multilingual Capabilities with Cultural and Local Knowledge in Large Language Models While Enhancing Native Performance Paper • 2504.09753 • Published Apr 13 • 5
SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning Paper • 2504.08600 • Published Apr 11 • 29
Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Paper • 2504.07072 • Published Apr 9 • 9
LLM Post-Training: A Deep Dive into Reasoning Large Language Models Paper • 2502.21321 • Published Feb 28 • 1
Self-Evolving Multi-Agent Simulations for Realistic Clinical Interactions Paper • 2503.22678 • Published Mar 28 • 1
Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More Paper • 2502.07490 • Published Feb 11 • 9
RKadiyala at SemEval-2024 Task 8: Black-Box Word-Level Text Boundary Detection in Partially Machine Generated Texts Paper • 2410.16659 • Published Oct 22, 2024
Large Language Models for Cross-lingual Emotion Detection Paper • 2410.15974 • Published Oct 21, 2024 • 1
1024m at SMM4H 2024: Tasks 3, 5 & 6 -- Ensembles of Transformers and Large Language Models for Medical Text Classification Paper • 2410.15998 • Published Oct 21, 2024 • 1
Augmenting Legal Decision Support Systems with LLM-based NLI for Analyzing Social Media Evidence Paper • 2410.15990 • Published Oct 21, 2024 • 1
Falcon Mamba: The First Competitive Attention-free 7B Language Model Paper • 2410.05355 • Published Oct 7, 2024 • 36
view post Post 4234 Falcon Mamba now available now in llama.cpp !Check out GGUF files uploaded here: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a 3 replies · 👍 5 5 ❤️ 3 3 🚀 2 2 + Reply