Learn: LLM Architecture 2025
RoFormer: Enhanced Transformer with Rotary Position Embedding • Paper • 2104.09864 • Published Apr 20, 2021 • 13
DeepSeek-V3 Technical Report • Paper • 2412.19437 • Published Dec 27, 2024 • 66
Learn: Vision Language Models
What matters when building vision-language models? • Paper • 2405.02246 • Published May 3, 2024 • 104
Building and better understanding vision-language models: insights and future directions • Paper • 2408.12637 • Published Aug 22, 2024 • 132