Qwen2.5-VL • Collection • Vision-language model series based on Qwen2.5 • 3 items • Updated Jan 2025 • 287 upvotes
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario • Paper • arXiv 2501.10132 • Published Jan 2025 • 17 upvotes
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos • Paper • arXiv 2501.12375 • Published Jan 2025 • 22 upvotes
UI-TARS: Pioneering Automated GUI Interaction with Native Agents • Paper • arXiv 2501.12326 • Published Jan 2025 • 47 upvotes
Introducing smolagents: simple agents that write actions in code • Article • Dec 31, 2024 • 536 upvotes
Train 400x faster Static Embedding Models with Sentence Transformers • Article • Jan 2025 • 129 upvotes
Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging • Article • By akjindal53244 • Aug 19, 2024 • 76 upvotes
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking • Paper • arXiv 2501.09751 • Published Jan 2025 • 47 upvotes
FAST: Efficient Action Tokenization for Vision-Language-Action Models • Paper • arXiv 2501.09747 • Published Jan 2025 • 23 upvotes
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them • Paper • arXiv 2501.08292 • Published Jan 2025 • 17 upvotes
Diffusion Adversarial Post-Training for One-Step Video Generation • Paper • arXiv 2501.08316 • Published Jan 2025 • 32 upvotes
MiniMax-01: Scaling Foundation Models with Lightning Attention • Paper • arXiv 2501.08313 • Published Jan 2025 • 271 upvotes
Multimodal LLMs Can Reason about Aesthetics in Zero-Shot • Paper • arXiv 2501.09012 • Published Jan 2025 • 10 upvotes
Towards Best Practices for Open Datasets for LLM Training • Paper • arXiv 2501.08365 • Published Jan 2025 • 51 upvotes
MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents • Paper • arXiv 2501.08828 • Published Jan 2025 • 30 upvotes
Graph Mamba: Towards Learning on Graphs with State Space Models • Paper • arXiv 2402.08678 • Published Feb 13, 2024 • 15 upvotes
Granite Time Series Models • Collection • Time series models trained by IBM, licensed under Apache 2.0 • 5 items • Updated Dec 18, 2024 • 26 upvotes