AI & ML interests

None defined yet.

AdinaY 
posted an update 1 day ago
view post
Post
328
The tech report of RoboBrain 2.0 is now available on the Daily Papers page🔥

It's an embedded brain model that sees, thinks, and plans for many robots.

Leave your insights or questions, the authors are happy to respond.
RoboBrain 2.0 Technical Report (2507.02029)
AdinaY 
posted an update 2 days ago
AdinaY 
posted an update 2 days ago
view post
Post
181
POLAR🐻‍❄️ New reward modeling by Shanghai AI Lab

internlm/polar-68693f829d2e83ac5e6e124a

✨ 1.8B/7B - Apache 2.0
✨ Scalable policy discriminative pretraining
✨ Easy RLHF with minimal preference data
AdinaY 
posted an update 7 days ago
view post
Post
1911
The Chinese Open Source Heatmap is live 🔥
You can now track the companies/ research labs/ communities powering China’s open source AI movement.

zh-ai-community/model-release-heatmap-zh

Some highlights:

✨Giant Tech are investing more in open source.
-Alibaba: Full stack open ecosystem
-Tecent: Hunyuan image/video/3D
-Bytedance: Catching up fast in 2025
-Baidu: New player in open LLM

✨New players emerging post–DeepSeek moment.
-Xiaomi
-Red Note
-Bilibili
-MiniMax
-Moonshot AI

✨Startup list is shifting fast! Those who find a direction aligned with their strengths are the ones who endure.
-DeepSeek
-MiniMax
-StepFun
-Moonshot AI
-Zhipu AI
-OpenBMB

✨Research Lab & Community are making key contributions.
-BAAI
-Shanghai AI Lab
-OpenMOSS
-MAP
AdinaY 
posted an update 8 days ago
view post
Post
3301
🔥 June highlights from China’s open source ecosystem.

zh-ai-community/june-2025-open-works-from-the-chinese-community-683d66c188f782dc5570ba15

✨Baidu & MiniMax both launched open foundation models
- Baidu: Ernie 4.5 ( from 0.3B -424B ) 🤯
- MiniMax: MiniMax -M1 ( Hybrid MoE reasoning model )

✨Multimodal AI is moving from fusion to full-stack reasoning: unified Any-to-Any pipelines across text, vision, audio, and 3D
- Baidu: ERNIE-4.5-VL-424B
- Moonshot AI: Kimi-VL-A3B
- Alibaba: Ovis-U1
- BAAI: Video-XL-2/OmniGen2
- AntGroup: Ming-Lite-Omni
- Chinese Academy of Science: Stream-Omni
- Bytedance: SeedVR2-3B
- Tencent: Hunyuan 3D 2.1/ SongGeneration
- FishAudio: Openaudio-s1-mini

✨Domain specific models are rapidly emerging
- Alibaba DAMO: Lingshu-7B (medical MLLM)
- BAAI: RoboBrain (Robotics)

✨ So many small models!
- OpenBMB: MiciCPM4 ( on device )
- Qwen: Embedding/Reranker (0.6B)
- Alibaba: Ovis-U1-3B
- Moonshot AI: Kimi-VL-A3B
- Bytedance: SeedVR2-3B
AdinaY 
posted an update 8 days ago
view post
Post
300
MTVCraft 🔥 Veo3 style Audio-Video model by BAAI

Model:
BAAI/MTVCraft
Demo:
BAAI/MTVCraft

✨ Text > [Speech + SFX + BGM] > Synchronized Video
✨ Built with Qwen3 + ElevenLabs + MTV
AdinaY 
posted an update 8 days ago
view post
Post
2278
GLM-4.1V-Thinking 🔥 New open vision reasoning model by Zhipu AI

THUDM/glm-41v-thinking-6862bbfc44593a8601c2578d

✨ 9B base & Thinking - MIT license
✨ CoT + RL with Curriculum Sampling
✨ 64k context, 4K image, any aspect ratio
✨ Support English & Chinese
✨ Outperforms GPT 4O -2024/11/20
AdinaY 
posted an update 10 days ago
AdinaY 
posted an update 10 days ago
view post
Post
328
Baidu kept its promise, releasing 10 open models on the very last day of June🚀 Let's meet ERNIE 4.5 🔥

baidu/ernie-45-6861cd4c9be84540645f35c9

✨ From 0.3B to 424B total params
✨ Includes 47B & 3B active param MoE models + a 0.3B dense model
✨ Apache 2.0
✨ 128K context length
✨ Text+Vision co-training with ViT & UPO
AdinaY 
posted an update 13 days ago
view post
Post
3088
Hunyuan-A13B 🔥 New MoE LLM by TencentHunyuan

tencent/Hunyuan-A13B-Instruct

✨80B total / 13B active params
✨256K context window
✨Dual-mode reasoning: fast & slow thinking
✨Efficient inference (GQA + quantization)
AdinaY 
posted an update 16 days ago
AdinaY 
posted an update 16 days ago
view post
Post
303
MOSS-TTSD 🔊 Bilingual text-to-spoken dialogue model by Fudan University - Open MOSS team.

Model:
fnlp/MOSS-TTSD-v0
Demo:
fnlp/MOSS-TTSD

✨ Supports Chinese & English
✨ Zero-shot 2-speaker voice cloning
✨ Long-form generation (up to 960s)
✨ Built on Qwen 3
AdinaY 
posted an update 17 days ago
view post
Post
269
Skywork-SWE 🔥 New code agent model by Skywork 天工

Skywork/Skywork-SWE-32B

✨ 32B - Apache 2.0
✨ 38.0% pass@1 on SWE-bench Verified
✨ Up to 47.0% with test-time scaling
✨ Shows clear data scaling law (8K+ demos)
✨ Built on Qwen2.5-Coder-32B + OpenHands
AdinaY 
posted an update 23 days ago
AdinaY 
posted an update 24 days ago
view post
Post
4042
Kimi-Dev 💻 New coding model by Moonshot AI

moonshotai/Kimi-Dev-72B

✨ 72B - MIT license
✨ 60.4% on SWE-bench Verified
✨ RL-trained to patch real repos in Docker
✨ Only rewarded if full test suite passes
AdinaY 
posted an update 24 days ago
view post
Post
653
MiniMax-M1 🔥 The First reasoning model by MiniMax.

MiniMaxAI/minimax-m1-68502ad9634ec0eeac8cf094

✨ 40k/80k thinking budget
✨ Powered by Hybrid MoE + Lightning Attention 👀
✨ 1M context length 🤯
✨ Apache 2.0
✨ RL-trained for math, coding & real-world software
AdinaY 
posted an update 24 days ago
view post
Post
454
Hunyuan 3D 2.1 🔥 Industrial-grade 3D model just released by Tencent Hunyuan

tencent/Hunyuan3D-2.1
tencent/Hunyuan3D-2.1

✨ PBR materials: leather, bronze & more, breathtaking realism under any light
✨ Consumer GPU-ready: good for developers and small teams

AdinaY 
posted an update 30 days ago
AdinaY 
posted an update about 1 month ago
view post
Post
3183
RoboBrain 2.0🔥 OPEN embedded brain model by BAAIBeijing

BAAI/RoboBrain2.0-7B

✨ 7B - Apache 2.0 / 32B coming soon
✨ Supports multiple images, long videos, and high-resolution visuals
✨ Spatial + temporal reasoning
✨ Real-time memory & scene graphs
AdinaY 
posted an update about 1 month ago
view post
Post
2685
RedNote 小红书 just released their first LLM 🔥

dots.llm1.base 🪐 a 142B MoE model with only 14B active params.

rednote-hilab/dotsllm1-68246aaaaba3363374a8aa7c
✨ Base & Instruct - MIT license
✨ Trained on 11.2T non-synthetic high-quality data
✨ Competitive with Qwen2.5/3 on reasoning, code, alignment