Prince Canuma's picture

Prince Canuma PRO

prince-canuma

·

AI & ML interests

None yet

Recent Activity

new activity 3 days ago

Marvis-AI/marvis-tts-250m-v0.1-transformers:Updated the inference code. Added a notebook and a demo audio.

updated a model 3 days ago

mlx-community/embeddinggemma-300m-qat-q4_0-unquantized-bf16

updated a model 3 days ago

mlx-community/embeddinggemma-300m-qat-q8_0-unquantized-bf16

View all activity

Organizations

upvoted a collection 3 days ago

EmbeddingGemma

7 items • Updated 3 days ago • 2

upvoted 2 collections 24 days ago

Gemma 3-270m

20 items • Updated 24 days ago • 4

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory. • 29 items • Updated 24 days ago • 31

upvoted 2 collections 4 months ago

Perception Encoder

17 items • Updated Jul 11 • 67

LLaMA-Omni

13 items • Updated May 17 • 16

upvoted 2 collections 5 months ago

VideoChat-R1

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning • 3 items • Updated Apr 22 • 5

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10 • 209

upvoted 7 papers 5 months ago

DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models

Paper • 2504.02882 • Published Apr 2 • 8

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 199

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 110

ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers

Paper • 2504.00502 • Published Apr 1 • 25

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published Apr 3 • 57

Improved Visual-Spatial Reasoning via R1-Zero-Like Training

Paper • 2504.00883 • Published Apr 1 • 66

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Paper • 2504.02587 • Published Apr 3 • 33

upvoted a collection 5 months ago

ModernBert

16 items • Updated Apr 3 • 2

upvoted 5 papers 5 months ago

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published Mar 31 • 39

JudgeLRM: Large Reasoning Models as a Judge

Paper • 2504.00050 • Published Mar 31 • 62

Self-Consistency Improves Chain of Thought Reasoning in Language Models

Paper • 2203.11171 • Published Mar 21, 2022 • 5

Self-Consistency Preference Optimization

Paper • 2411.04109 • Published Nov 6, 2024 • 19

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features

Paper • 2504.00557 • Published Apr 1 • 15