Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Paper β’ 2504.08685 β’ Published Apr 11 β’ 126
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 β’ 11 items β’ Updated Apr 28 β’ 484
Retentive Network: A Successor to Transformer for Large Language Models Paper β’ 2307.08621 β’ Published Jul 17, 2023 β’ 171