Gyanateet Dutta

Ryukijano · https://ryukijano.github.io

AI & ML interests

Computer Vision, Robotics, Generative modelling, ML in the browser, healthcare applications, and the intersection of art and ML.

Recent Activity

liked a dataset 6 days ago

builddotai/Egocentric-10K

liked a model 14 days ago

CompVis/DisMo

upvoted an article 18 days ago

Why You Should Care About Partial Differential Equations (PDEs)

upvoted an article 18 days ago

Article

Why You Should Care About Partial Differential Equations (PDEs)

19 days ago • 35

upvoted 2 papers about 2 months ago

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31 • 28

π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Paper • 2510.25889 • Published Oct 29 • 64

upvoted a paper 3 months ago

Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training

Paper • 2510.12586 • Published Oct 14 • 108

upvoted an article 4 months ago

Article

SAIR: Accelerating Pharma R&D with AI-Powered Structural Intelligence

Sep 2 • 35

upvoted 4 papers 4 months ago

Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies

Paper • 2508.20072 • Published Aug 27 • 31

Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels

Paper • 2508.17437 • Published Aug 20 • 38

MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Paper • 2508.14879 • Published Aug 20 • 68

Do What? Teaching Vision-Language-Action Models to Reject the Impossible

Paper • 2508.16292 • Published Aug 22 • 9

upvoted a collection 4 months ago

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 7 days ago • 100

upvoted an article 4 months ago

Article

Introducing Pivotal Token Search (PTS): Targeting Critical Decision Points in LLM Training

May 17 • 11

upvoted a paper 5 months ago

Reinforcement Learning in Vision: A Survey

Paper • 2508.08189 • Published Aug 11 • 29

upvoted a collection 5 months ago

The Well

A 15TB collection of physics simulation datasets. • 18 items • Updated Mar 24 • 41

upvoted a paper 5 months ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4 • 133

upvoted an article 5 months ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23, 2024 • 185

upvoted a collection 5 months ago

Cosmos-Transfer1-DiffusionRenderer

High-quality video de-lighting and re-lighting based on Cosmos video diffusion framework • 2 items • Updated Oct 2 • 2

upvoted a paper 6 months ago

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

Paper • 2506.15681 • Published Jun 18 • 39

upvoted an article 6 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Jul 9 • 745

upvoted a collection 7 months ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 177

upvoted a paper 7 months ago

Hybrid 3D-4D Gaussian Splatting for Fast Dynamic Scene Representation

Paper • 2505.13215 • Published May 19 • 29