Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 14 days ago • 345
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism Paper • 2407.10457 • Published Jul 15, 2024 • 24
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer Paper • 2403.13570 • Published Mar 20, 2024 • 3
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20, 2024 • 91
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training Paper • 2309.10400 • Published Sep 19, 2023 • 26
Evaluating Text-to-Visual Generation with Image-to-Text Generation Paper • 2404.01291 • Published Apr 1, 2024 • 6
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper • 2312.00752 • Published Dec 1, 2023 • 143
Preference Datasets for KTO Collection This collection contains a list of curated preference datasets for KTO fine-tuning for intent alignment of LLMs through signals. • 5 items • Updated Dec 11, 2024 • 15
NER in Spanish Collection Fine-tuned models to perform NER in Spanish using the framework SpanMarker and different encoders and datasets • 3 items • Updated Sep 2, 2024 • 4
Seamless Communication Collection A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16, 2024 • 154
Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation Paper • 2403.06164 • Published Mar 10, 2024 • 2