nDimensional

AI & ML interests

Computer Vision, Diffusers, Transformers, ML, NLP, Diffusion Models, Unsupervised Learning, JAX, Neural Networks

Recent Activity

liked a Space about 8 hours ago

ilcve21/Sparc3D

liked a Space about 8 hours ago

Qwen/Qwen3-Demo

Organizations

None yet

upvoted a paper about 7 hours ago

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published 4 days ago • 42

upvoted a paper 4 days ago

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

Paper • 2506.19848 • Published 5 days ago • 24

upvoted 2 papers about 1 month ago

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published May 14 • 70

Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models

Paper • 2505.10554 • Published May 15 • 119

upvoted a paper about 2 months ago

Describe Anything: Detailed Localized Image and Video Captioning

Paper • 2504.16072 • Published Apr 22 • 61

upvoted 4 papers 3 months ago

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published Mar 20 • 73

upvoted 3 papers 4 months ago

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13 • 51

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 145

Large Language Diffusion Models

Paper • 2502.09992 • Published Feb 14 • 120

upvoted 2 papers 5 months ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 115

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 72

upvoted a paper 6 months ago

Explanatory Instructions: Towards Unified Vision Tasks Understanding and Zero-shot Generalization

Paper • 2412.18525 • Published Dec 24, 2024 • 76

upvoted a paper 7 months ago

BLIP3-KALE: Knowledge Augmented Large-Scale Dense Captions

Paper • 2411.07461 • Published Nov 12, 2024 • 24

upvoted a paper 8 months ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published Nov 4, 2024 • 52

upvoted a paper 9 months ago

Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models

Paper • 2410.02740 • Published Oct 3, 2024 • 55

upvoted 2 papers 10 months ago

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Paper • 2409.01322 • Published Sep 2, 2024 • 97

Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders

Paper • 2408.15998 • Published Aug 28, 2024 • 88
