GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published Jan 14 • 64
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning Paper • 2501.04698 • Published Jan 8 • 14
TOMG-Bench: Evaluating LLMs on Text-based Open Molecule Generation Paper • 2412.14642 • Published Dec 19, 2024 • 4
view post Post 1389 News! ChemVLM Codes Opensource Now! https://github.com/AI4Chem/ChemVlm See translation 1 reply · 🤗 4 4 + Reply
StyleMaster: Stylize Your Video with Artistic Generation and Translation Paper • 2412.07744 • Published Dec 10, 2024 • 19
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints Paper • 2412.07760 • Published Dec 10, 2024 • 50
3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation Paper • 2412.07759 • Published Dec 10, 2024 • 18
view post Post 1793 ChemVLM has been accepted by AAAI2025! Seeing and Understanding: Bridging Vision with Chemical Knowledge Via ChemVLM (2408.07246)Try have a chat wiht him🤗. AI4Chem/ChemVLM-26B-1-2 See translation 🚀 4 4 + Reply
view post Post 3067 The first version of LLaMA-O1 has been uploaded to HF now!Here We Come!Supervised: SimpleBerry/LLaMA-O1-Supervised-1129Base(Pretrain): SimpleBerry/LLaMA-O1-Base-1127Supervised Finetune Dataset: SimpleBerry/OpenLongCoT-SFTPretraining Dataset: SimpleBerry/OpenLongCoT-Pretrain-1202RLHF is on the way! View our GitHub Repo:https://github.com/SimpleBerry/LLaMA-O1Our ongoing related researches: Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394) LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884) Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning (2411.18203) @AdinaY @akhaliq @jwu323 ------GGUF:https://huggingface.co/Lyte/LLaMA-O1-Supervised-1129-Q4_K_M-GGUFonline Demo (CPU-only): SimpleBerry/LLaMA-O1-Supervised-1129-Demo See translation 3 replies · 🚀 13 13 🤗 3 3 🔥 1 1 + Reply
view post Post 1361 LLaMA-O1 Base and SFT model will be uploaded to HF today.RLHF pipeline already ready, still waiting for data sampling. See translation 1 reply · 🚀 5 5 + Reply
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning Paper • 2411.18203 • Published Nov 27, 2024 • 34
Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning Paper • 2411.18203 • Published Nov 27, 2024 • 34