Fine-Tune Meta Llama 3.2-Vision-Instruct Multimodal LLM on Intel Accelerators By bconsolvo • 1 day ago • 8
Provence: efficient and robust context pruning for retrieval-augmented generation By nadiinchi • 1 day ago • 3
SILMA Kashif v1.0: A Specialized Model for RAG Tasks By karimouda • 2 days ago • 1
Is Attention Interpretable in Transformer-Based Large Language Models? Let’s Unpack the Hype By royswastik • 2 days ago • 3
🌁#85: Curiosity, Open Source, and Timing: The Formula Behind DeepSeek’s Phenomenal Success By Kseniase • 2 days ago • 6
Crowd-sourced Open Preference Dataset for Text-to-Image Generation By RapidataAI • 23 days ago • 18
Synthetic Data Generation with FastData and Hugging Face By asoria • 23 days ago • 14
🐺🐦⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • 28 days ago • 39
✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use By Ziyang • 27 days ago • 13
Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • 27 days ago • 32
Superposition in Transformers: A Novel Way of Building Mixture of Experts By BenChaliah • 26 days ago • 14
AI in 2025: A Combinatorial Explosion of Possibilities, but NOT AGI By Kseniase • 26 days ago • 3
Building Effective Agents with Anthropic’s Best Practices and smolagents ❤️ By Sri-Vigneshwar-DJ • 25 days ago • 5
How to Automate Reddit Comment Generation with AI Agents in KaibanJS By darielnoel • 23 days ago • 4
Announcing NVIDIA Cosmos World Foundation Models By mingyuliutw • 23 days ago • 23