2 9

Amirhossein Kazemnejad

kazemnejad

kazemnejad

AI & ML interests

None yet

Recent Activity

authored a paper about 24 hours ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

upvoted a paper 1 day ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

upvoted a paper 5 days ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

View all activity

Organizations

kazemnejad's activity

authored a paper about 24 hours ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

Paper • 2504.08942 • Published 5 days ago • 19

upvoted a paper 1 day ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

Paper • 2504.08942 • Published 5 days ago • 19

upvoted a paper 5 days ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published 14 days ago • 72

authored a paper 5 days ago

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published 14 days ago • 72

updated a model 11 days ago

McGill-NLP/nano-aha-moment-3b

Text Generation • Updated 11 days ago • 97 • 2

published a model 16 days ago

McGill-NLP/nano-aha-moment-3b

Text Generation • Updated 11 days ago • 97 • 2

updated a dataset 27 days ago

McGill-NLP/MultiDigit-20

Viewer • Updated 27 days ago • 16k • 79

published a dataset 27 days ago

McGill-NLP/MultiDigit-20

Viewer • Updated 27 days ago • 16k • 79

upvoted 2 papers about 1 month ago

Exploiting Instruction-Following Retrievers for Malicious Information Retrieval

Paper • 2503.08644 • Published Mar 11 • 16

SafeArena: Evaluating the Safety of Autonomous Web Agents

Paper • 2503.04957 • Published Mar 6 • 19

upvoted a paper about 2 months ago

How to Get Your LLM to Generate Challenging Problems for Evaluation

Paper • 2502.14678 • Published Feb 20 • 17

commented a paper 6 months ago

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Paper • 2410.01679 • Published Oct 2, 2024 • 25 •

authored 3 papers 6 months ago

The Impact of Positional Encoding on Length Generalization in Transformers

Paper • 2305.19466 • Published May 31, 2023 • 2

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Paper • 2410.01679 • Published Oct 2, 2024 • 25

Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models

Paper • 2305.14775 • Published May 24, 2023

upvoted a paper 7 months ago

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Paper • 2410.01679 • Published Oct 2, 2024 • 25

updated a dataset 9 months ago

MathMindsAGI/MATH-openai-split

Viewer • Updated Jul 10, 2024 • 12.5k • 256 • 1

upvoted a paper 9 months ago

Learning Action and Reasoning-Centric Image Editing from Videos and Simulations

Paper • 2407.03471 • Published Jul 3, 2024 • 32

updated a dataset 10 months ago

MathMindsAGI/konkur-1403-math-v1.5

Viewer • Updated Jun 24, 2024 • 116 • 18

upvoted a paper about 1 year ago

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Paper • 2404.05961 • Published Apr 9, 2024 • 66