2 43 39

huy

bui

AI & ML interests

None yet

Recent Activity

liked a model 26 days ago

moonshotai/Kimi-K2.6

liked a model about 2 months ago

unsloth/GLM-4.1V-9B-Thinking-GGUF

liked a model about 2 months ago

zai-org/AutoGLM-Phone-9B

View all activity

Organizations

upvoted 3 papers 4 months ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 525

OmniGAIA: Towards Native Omni-Modal AI Agents

Paper • 2602.22897 • Published Feb 26 • 53

Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control

Paper • 2602.18422 • Published Feb 20 • 30

upvoted an article 6 months ago

Article

CryptGPT: Privacy-Preserving Language Models Using Vigenere Cipher (Part 1)

diwank

•

Jun 16, 2024

• 7

upvoted a paper 6 months ago

ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement

Paper • 2512.13303 • Published Dec 15, 2025 • 17

upvoted an article 6 months ago

Article

MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier

prem-research

•

Dec 12, 2025

• 22

upvoted a collection 7 months ago

Qwen3-VL

Collection

37 items • Updated Dec 31, 2025 • 746

upvoted 4 papers 7 months ago

NVIDIA Nemotron Parse 1.1

Paper • 2511.20478 • Published Nov 25, 2025 • 24

Adaptive Multi-Agent Response Refinement in Conversational Systems

Paper • 2511.08319 • Published Nov 11, 2025 • 42

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 140

TiDAR: Think in Diffusion, Talk in Autoregression

Paper • 2511.08923 • Published Nov 12, 2025 • 129

upvoted 2 papers 9 months ago

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

Paper • 2509.24006 • Published Sep 28, 2025 • 119

StableToken: A Noise-Robust Semantic Speech Tokenizer for Resilient SpeechLLMs

Paper • 2509.22220 • Published Sep 26, 2025 • 66

upvoted an article 9 months ago

Article

Code a simple RAG from scratch

ngxson

•

Oct 29, 2024

• 349

upvoted a paper 9 months ago

Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR

Paper • 2509.18174 • Published Sep 17, 2025 • 134

upvoted a paper 10 months ago

Diffusion Language Models Know the Answer Before Decoding

Paper • 2508.19982 • Published Aug 27, 2025 • 27

upvoted an article 10 months ago

Article

Introducing smolagents: simple agents that write actions in code.

m-ric, merve, thomwolf

•

Dec 31, 2024

• 1.2k

upvoted 2 papers 10 months ago

OmniTry: Virtual Try-On Anything without Masks

Paper • 2508.13632 • Published Aug 19, 2025 • 15

Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory

Paper • 2508.09736 • Published Aug 13, 2025 • 58

upvoted a paper about 1 year ago

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Paper • 2506.07044 • Published Jun 8, 2025 • 114

huy

AI & ML interests

Recent Activity

Organizations

bui's activity

CryptGPT: Privacy-Preserving Language Models Using Vigenere Cipher (Part 1)

MiniGuard-v0.1: Prem's Guardrail Model Redefining the Pareto Frontier

Code a simple RAG from scratch

Introducing smolagents: simple agents that write actions in code.