ICCV2023

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

tedlasai authored a paper 11 days ago

Multispectral Demosaicing via Dual Cameras

tedlasai authored a paper 11 days ago

Learning to Refocus with Video Diffusion Models

tedlasai authored a paper 11 days ago

Generating the Past, Present and Future from a Motion-Blurred Image

View all activity

AdinaY

posted an update about 7 hours ago

Post

Chinese open source AI in December 2025 was about the stack coming together: open, end to end, and ready to ship 🔥

https://huggingface.co/collections/zh-ai-community/december-2025-china-open-source-highlights

✨ Big wave of foundation models: still scaling, but efficiency, reasoning, and deployment now matter more than size
- DeepSeek-V3.2
- Z.ai GLM-4.7
- MiniMax-M2.1
- Xiaomi: MiMo-V2-Flash

✨ Multimodal reasoning is now default
- Z.ai GLM-4.6V
- Z.ai AutoGLM-Phone 9B
- Bytedance: Dolphin-v2

✨ Image & video: editable assets and real workflows
- Qwen-Image-Layered / Image-2512
- Meituan: LongCat-Image & Image Edit
- AIDC: Ovis-Image-7B
- Live-Avatar / LongCat-Video-Avatar
- HY-WorldPlay / RealVideo

✨ Audio goes edge ready
- GLM-ASR-Nano / Fun-ASR-Nano
- GLM-TTS / VoxCPM1.5
- CosyVoice 0.5B

✨ The quiet backbone: data & infrastructure
- Finch (FinWorkBench)
- Tencent ARC: TimeLens-100K
- BIGAI: TongSIM-Asset
- MiniMax: VTP-Base

✨ Also congrats on Minimax and Z.ai announced their IPOs and Moonshot announced a new $500M funding round 🔥

Like everyone else, I was OOO at the end of December, so feel free to share (in comments or PR) any I missed in this list!

AdinaY

posted an update about 15 hours ago

Post

423

MiniMax M2.1 blog is out🔥
https://huggingface.co/blog/MiniMaxAI/multilingual-and-multi-task-coding-with-strong-gen

Only a year into open source, MiniMax is already making a great impact. Not only through solid models/products, but also by how well the team uses community platforms like Hugging Face.

HF Teams, blogs, Daily Papers, Spaces as project pages, and always experimenting with new ways to engage. Super impressive!

AdinaY

posted an update 1 day ago

Post

2389

2025.1 - DeepSeek entered the scene, backed by High Flyer Quant
2026.1 - IQuest enters the game, backed by Uniquant Quant 📈 and launching IQuest-Coder on huggingface
https://huggingface.co/collections/IQuestLab/iquest-coder

✨ 40B models: Instruct / Thinking / Loop
✨ Loop = MoE-level performance with only ~5% extra training cost
✨ Native 128K context

anyirao

authored a paper 3 days ago

Pretraining Frame Preservation in Autoregressive Video Memory Compression

Paper • 2512.23851 • Published 7 days ago • 19

akhaliq

submitted 2 papers to Daily Papers 3 days ago

FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation

Paper • 2512.24724 • Published 6 days ago • 4

Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow

Paper • 2512.24766 • Published 6 days ago • 7

AdinaY

submitted a paper to Daily Papers 5 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 5 days ago • 201

AdinaY

posted an update 18 days ago

Post

714

Following up on LLaDA 2.0 , the paper is now out on Daily Papers🔥
It has sparked a lot of discussion in the community for showing how discrete diffusion LLMs can scale to 100B and run faster than traditional AR models.
LLaDA2.0: Scaling Up Diffusion Language Models to 100B (2512.15745)

Nymbo

posted an update 18 days ago

Post

1915

🚨 New tool for the Nymbo/Tools MCP server: The new Agent_Skills tool provides full support for Agent Skills (Claude Skills but open-source).

How it works: The tool exposes the standard discover/info/resources/validate actions. Skills live in /Skills under the same File_System root, and any bundled scripts run through Shell_Command, no new infrastructure required.

Agent_Skills(action="discover")  # List all available skills
Agent_Skills(action="info", skill_name="music-downloader")  # Full SKILL.md
Agent_Skills(action="resources", skill_name="music-downloader")  # Scripts, refs, assets

I've included a music-downloader skill as a working demo, it wraps yt-dlp for YouTube/SoundCloud audio extraction.

Caveat: On HF Spaces, Shell_Command works for most tasks, but some operations (like YouTube downloads) are restricted due to the container environment. For full functionality, run the server locally on your machine.

Try it out ~ https://www.nymbo.net/nymbot

akhaliq

submitted a paper to Daily Papers 20 days ago

What matters for Representation Alignment: Global Information or Spatial Structure?

Paper • 2512.10794 • Published 25 days ago • 8

AdinaY

posted an update 21 days ago

Post

4572

Finch 💰 an enterprise-grade benchmark that measures whether AI agents can truly handle real world finance & accounting work.

FinWorkBench/Finch

✨ Built from real enterprise data (Enron + financial institutions), not synthetic tasks
✨ Tests end-to-end finance workflows
✨ Multimodal & cross-file reasoning
✨ Expert annotated (700+ hours) and genuinely challenging hard

AdinaY

authored a paper 21 days ago

Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows

Paper • 2512.13168 • Published 22 days ago • 49

BryanW

authored a paper 21 days ago

RecTok: Reconstruction Distillation along Rectified Flow

Paper • 2512.13421 • Published 21 days ago • 4

susunghong

submitted a paper to Daily Papers 21 days ago

DiffusionBrowser: Interactive Diffusion Previews via Multi-Branch Decoders

Paper • 2512.13690 • Published 21 days ago • 2

akhaliq

submitted a paper to Daily Papers 25 days ago

Towards a Science of Scaling Agent Systems

Paper • 2512.08296 • Published 28 days ago • 14

anyirao

authored a paper 26 days ago

Composing Concepts from Images and Videos via Concept-prompt Binding

Paper • 2512.09824 • Published 26 days ago • 27

akhaliq

submitted a paper to Daily Papers 27 days ago

ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models

Paper • 2512.07843 • Published Nov 24, 2025 • 21

DavidVivancos

posted an update about 1 month ago

Post

265

Need a new challenging Dataset? Now that #NeurIPS2025 is almost over.

DavidVivancos/NeuraxonLife2-1M

1 Million #Neuraxon Artificial Lives, from almost 10000 Research Game runs, with more than 21 Million Neurons and almost 4 years of Simulated Life.

Read the preprint here https://www.researchgate.net/publication/397331336_Neuraxon

And here you have all the code: https://github.com/DavidVivancos/Neuraxon

risashinoda

authored a paper about 1 month ago

AlignBench: Benchmarking Fine-Grained Image-Text Alignment with Synthetic Image-Caption Pairs

Paper • 2511.20515 • Published Nov 25, 2025 • 3

kaanakan

authored a paper about 1 month ago

Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Paper • 2511.20649 • Published Nov 25, 2025 • 47

AI & ML interests

Recent Activity

Team members 212

ICCV2023's activity