Article SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • 2 days ago • 412
Article Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders By thomwolf and 1 other • 1 day ago • 331
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents Paper • 2507.04009 • Published 5 days ago • 19
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published 9 days ago • 71
Energy-Based Transformers are Scalable Learners and Thinkers Paper • 2507.02092 • Published 7 days ago • 43
Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs Paper • 2507.02778 • Published 7 days ago • 9
Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 By tomaarsen and 1 other • 9 days ago • 85
Is There a Case for Conversation Optimized Tokenizers in Large Language Models? Paper • 2506.18674 • Published 17 days ago • 8
Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models Paper • 2506.19697 • Published 16 days ago • 44
Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test Paper • 2506.21551 • Published 14 days ago • 27
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 15 items • Updated about 15 hours ago • 203
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning Paper • 2506.10521 • Published 28 days ago • 70
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published 24 days ago • 252
Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training Paper • 2506.10952 • Published 28 days ago • 23
Through the Valley: Path to Effective Long CoT Training for Small Language Models Paper • 2506.07712 • Published about 1 month ago • 18
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text Paper • 2506.05209 • Published Jun 5 • 42