Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation Paper • 2505.18842 • Published May 24 • 37
VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms Paper • 2503.14427 • Published Mar 18 • 19
seunbite/20250208_trainset_0.1_Qwen2.5-3B-Instruct-step-001000 Text Generation • 3B • Updated Feb 9 • 3
seunbite/20250208_trainset_0.5_Qwen2.5-3B-Instruct-step-002000 Text Generation • 3B • Updated Feb 8 • 3
seunbite/20250208_trainset_0.3_Qwen2.5-3B-Instruct-step-002000 Text Generation • 3B • Updated Feb 8 • 4
seunbite/20250208_trainset_0.3_Qwen2.5-3B-Instruct-step-003000 Text Generation • 3B • Updated Feb 8 • 3
seunbite/20250208_trainset_0.3_Qwen2.5-3B-Instruct-step-001000 Text Generation • 3B • Updated Feb 8 • 5
seunbite/20250208_trainset_0.1_Qwen2.5-3B-Instruct-step-002000 Text Generation • 3B • Updated Feb 8 • 3
seunbite/20250208_trainset_0.5_Qwen2.5-3B-Instruct-step-001000 Text Generation • 3B • Updated Feb 8 • 3
seunbite/20250208_trainset_0.3_Qwen2.5-3B-Instruct-step-002000 Text Generation • 3B • Updated Feb 8 • 4
seunbite/20250208_trainset_0.5_Qwen2.5-3B-Instruct-step-002000 Text Generation • 3B • Updated Feb 8 • 3
seunbite/20250208_trainset_0.1_Qwen2.5-3B-Instruct-step-001000 Text Generation • 3B • Updated Feb 9 • 3
seunbite/20250208_trainset_0.1_Qwen2.5-3B-Instruct-step-002000 Text Generation • 3B • Updated Feb 8 • 3
seunbite/20250208_trainset_0.3_Qwen2.5-3B-Instruct-step-003000 Text Generation • 3B • Updated Feb 8 • 3
seunbite/20250208_trainset_0.5_Qwen2.5-3B-Instruct-step-001000 Text Generation • 3B • Updated Feb 8 • 3
seunbite/20250208_trainset_0.3_Qwen2.5-3B-Instruct-step-001000 Text Generation • 3B • Updated Feb 8 • 5
Do LLMs Have Distinct and Consistent Personality? TRAIT: Personality Testset designed for LLMs with Psychometrics Paper • 2406.14703 • Published Jun 20, 2024 • 2