ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations • Paper 2505.02819 • Published May 5, 2025
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders • Paper 2503.18878 • Published Mar 24, 2025
Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders • Paper 2503.03601 • Published Mar 5, 2025
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers • Paper 2502.15007 • Published Feb 20, 2025
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing • Paper 2406.10601 • Published Jun 15, 2024
🔍 Interpretability & Analysis of LMs • Collection • Outstanding research in LM interpretability and evaluation, summarized • 116 items
Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian • Paper 2405.13929 • Published May 22, 2024
The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models • Paper 2311.05928 • Published Nov 10, 2023
MEKER: Memory Efficient Knowledge Embedding Representation for Link Prediction and Question Answering • Paper 2204.10629 • Published Apr 22, 2022