Eugene Oskin

eoskin

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use

upvoted a paper 4 days ago

Hidden Dynamics of Massive Activations in Transformer Training

upvoted a paper 4 days ago

InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization

View all activity

Organizations

None yet

upvoted 4 papers 4 days ago

OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use

Paper • 2508.04482 • Published 9 days ago • 9

Hidden Dynamics of Massive Activations in Transformer Training

Paper • 2508.03616 • Published 10 days ago • 17

InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization

Paper • 2508.05731 • Published 8 days ago • 24

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published 7 days ago • 139

upvoted 2 articles 6 days ago

Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

and 2 others •

Mar 22, 2024

• 100

Article

You could have designed state of the art positional encoding

•

Nov 25, 2024

• 340

upvoted a collection 8 days ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 149

upvoted an article 11 days ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

and 14 others •

Dec 19, 2024

• 679

upvoted 3 papers 2 months ago

upvoted 2 papers 5 months ago

RoFormer: Enhanced Transformer with Rotary Position Embedding

Paper • 2104.09864 • Published Apr 20, 2021 • 14

Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective

Paper • 2503.01933 • Published Mar 3 • 12

upvoted 7 papers 6 months ago

Phantom: Subject-consistent video generation via cross-modal alignment

Paper • 2502.11079 • Published Feb 16 • 60

PAFT: Prompt-Agnostic Fine-Tuning

Paper • 2502.12859 • Published Feb 18 • 15

Have We Designed Generalizable Structural Knowledge Promptings? Systematic Evaluation and Rethinking

Paper • 2501.00244 • Published Dec 31, 2024 • 1

Training language models to follow instructions with human feedback

Paper • 2203.02155 • Published Mar 4, 2022 • 21

Retrofitting Word Vectors to Semantic Lexicons

Paper • 1411.4166 • Published Nov 15, 2014 • 1

StarCoder: may the source be with you!

Paper • 2305.06161 • Published May 9, 2023 • 31

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 241

Eugene Oskin

AI & ML interests

Recent Activity

Organizations

eoskin's activity

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

You could have designed state of the art positional encoding

Finally, a Replacement for BERT: Introducing ModernBERT