
SakanaAI/TinySwallow-1.5B
Text Generation · 35.9k downloads · 24 likes
Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models"
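
Since this is a plain text-generation checkpoint, it can be loaded with the standard Hugging Face `transformers` API. Below is a minimal usage sketch; the prompt, dtype, and generation settings are illustrative assumptions, not taken from the model card.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "SakanaAI/TinySwallow-1.5B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumption: half precision is sufficient for 1.5B weights
    device_map="auto",
)

# Japanese prompt meaning "The capital of Japan is" (illustrative).
prompt = "日本の首都は"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Greedy continuation; as a base (non-instruct) model it simply continues the text.
outputs = model.generate(**inputs, max_new_tokens=32, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```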