sergicalsix's picture

1 252

sergicalsix

sergicalsix

·

AI & ML interests

None yet

Recent Activity

updated a collection 3 days ago

2025 LLM Papers on Hugging Face with Japanese Memos

upvoted a paper 3 days ago

Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion Attacks

upvoted a paper 3 days ago

Inside-Out: Hidden Factual Knowledge in LLMs

View all activity

Organizations

None yet

sergicalsix's activity

upvoted 4 papers 3 days ago

Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion Attacks

Paper • 2503.11514 • Published 12 days ago • 15

Inside-Out: Hidden Factual Knowledge in LLMs

Paper • 2503.15299 • Published 5 days ago • 35

Tokenize Image as a Set

Paper • 2503.16425 • Published 4 days ago • 12

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published 6 days ago • 127

upvoted an article 6 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

13 days ago

• 342

upvoted 9 papers 6 days ago

Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching

Paper • 2503.05179 • Published 18 days ago • 43

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Paper • 2503.05132 • Published 18 days ago • 51

CoSTAast: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing

Paper • 2503.10613 • Published 11 days ago • 73

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published 14 days ago • 80

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published 19 days ago • 215

A Survey on Post-training of Large Language Models

Paper • 2503.06072 • Published 17 days ago • 4

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published Feb 20 • 95

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published 26 days ago • 80

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 164

upvoted 4 papers 7 days ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 21 days ago • 77

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published 14 days ago • 40

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published 11 days ago • 46

Transformers without Normalization

Paper • 2503.10622 • Published 11 days ago • 133

upvoted a paper 10 days ago

Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

Paper • 2502.17424 • Published 28 days ago • 4

upvoted a paper 22 days ago

Ask in Any Modality: A Comprehensive Survey on Multimodal Retrieval-Augmented Generation

Paper • 2502.08826 • Published Feb 12 • 17