Melisa Russak

melisa

melisa-writer

AI & ML interests

I love definitions

Recent Activity

liked a Space 3 days ago

Writer/Financial_LLM_Performance_Leaderboard

new activity 9 days ago

Writer/FailSafeQA:Fix paper link

upvoted a paper 10 days ago

Expect the Unexpected: FailSafe Long Context QA for Finance

melisa's activity

upvoted a paper 10 days ago

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published 12 days ago • 123

upvoted a paper 13 days ago

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Paper • 2502.03544 • Published 17 days ago • 42

upvoted a paper about 1 month ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 257

upvoted a paper 2 months ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 78

upvoted 2 papers 3 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 46

Adaptive Decoding via Latent Preference Optimization

Paper • 2411.09661 • Published Nov 14, 2024 • 10

upvoted an article 3 months ago

Fine-tuning LLMs with Singular Value Decomposition

Article • Jun 2, 2024 • 11

upvoted a paper 3 months ago

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 46

upvoted 3 papers 4 months ago

Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse

Paper • 2410.21333 • Published Oct 27, 2024 • 10

Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation

Paper • 2410.18565 • Published Oct 24, 2024 • 46

Movie Gen: A Cast of Media Foundation Models

Paper • 2410.13720 • Published Oct 17, 2024 • 93

upvoted a paper 5 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 54

upvoted 5 papers 6 months ago

upvoted an article 6 months ago

Using Writer Framework with Hugging Face Spaces

Article • Aug 20, 2024 • 30

upvoted a paper 8 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 89

upvoted a paper 9 months ago

Block Transformer: Global-to-Local Language Modeling for Fast Inference

Paper • 2406.02657 • Published Jun 4, 2024 • 38