RoFormer: Enhanced Transformer with Rotary Position Embedding Paper • 2104.09864 • Published Apr 20, 2021 • 12
Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective Paper • 2503.01933 • Published 7 days ago • 10
Phantom: Subject-consistent video generation via cross-modal alignment Paper • 2502.11079 • Published 22 days ago • 52
Have We Designed Generalizable Structural Knowledge Promptings? Systematic Evaluation and Rethinking Paper • 2501.00244 • Published Dec 31, 2024 • 1
Training language models to follow instructions with human feedback Paper • 2203.02155 • Published Mar 4, 2022 • 17
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 200