In a Training Loop 🔄

53 115 186

Dmitry Ryumin

DmitryRyumin

https://dmitryryumin.github.io

DmitryRyumin

AI & ML interests

Machine Learning and Applications, Multi-Modal Understanding

Recent Activity

upvoted a paper 10 days ago

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

liked a Space 28 days ago

huggingface/ai-deadlines

liked a Space about 1 month ago

black-forest-labs/FLUX.2-dev

View all activity

Organizations

upvoted a paper 10 days ago

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published 15 days ago • 47

upvoted an article 2 months ago

Article

Fine-Tuning MetaCLIP-2 for Image Classification on Downstream Tasks

Nov 15, 2025

•

upvoted 10 papers 3 months ago

Orion-MSP: Multi-Scale Sparse Attention for Tabular In-Context Learning

Paper • 2511.02818 • Published Nov 4, 2025 • 15

SelectMix: Enhancing Label Noise Robustness through Targeted Sample Mixing

Paper • 2509.11265 • Published Sep 14, 2025 • 1

Intra-Cluster Mixup: An Effective Data Augmentation Technique for Complementary-Label Learning

Paper • 2509.17971 • Published Sep 22, 2025 • 1

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28, 2025 • 101

Token Activation Map to Visually Explain Multimodal LLMs

Paper • 2506.23270 • Published Jun 29, 2025 • 5

LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models

Paper • 2504.14032 • Published Apr 18, 2025 • 7

Knocking-Heads Attention

Paper • 2510.23052 • Published Oct 27, 2025 • 30

E^2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker

Paper • 2510.22733 • Published Oct 26, 2025 • 32

Heavy Labels Out! Dataset Distillation with Label Space Lightening

Paper • 2408.08201 • Published Aug 15, 2024 • 21

AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders

Paper • 2510.19779 • Published Oct 22, 2025 • 61

upvoted 3 papers 4 months ago

Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding

Paper • 2510.06308 • Published Oct 7, 2025 • 55

Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer

Paper • 2510.06590 • Published Oct 8, 2025 • 76

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3, 2025 • 98

upvoted 3 collections 4 months ago

upvoted 2 papers 4 months ago

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 100

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

Paper • 2509.17177 • Published Sep 21, 2025 • 13

Dmitry Ryumin

AI & ML interests

Recent Activity

Organizations

DmitryRyumin's activity

Fine-Tuning MetaCLIP-2 for Image Classification on Downstream Tasks