OSU NLP Group

university

https://twitter.com/osunlp

osunlp

AI & ML interests

Natural language processing, language models, language agents

Recent Activity

yuexiang96 authored a paper 3 days ago

Small Models Struggle to Learn from Strong Reasoners

yuexiang96 authored a paper 3 days ago

Evaluating Vision-Language Models as Evaluators in Path Planning

yuexiang96 authored a paper 3 days ago

VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

View all activity

yuexiang96

authored 10 papers 3 days ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17 • 38

Evaluating Vision-Language Models as Evaluators in Path Planning

Paper • 2411.18711 • Published Nov 27, 2024

VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

Paper • 2503.10582 • Published Mar 13 • 23

Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators

Paper • 2503.19877 • Published Mar 25 • 1

VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge

Paper • 2504.10342 • Published Apr 14 • 11

Speculative Thinking: Enhancing Small-Model Reasoning with Large Model Guidance at Inference Time

Paper • 2504.12329 • Published Apr 12

Overtrained Language Models Are Harder to Fine-Tune

Paper • 2503.19206 • Published Mar 24 • 2

The CoT Encyclopedia: Analyzing, Predicting, and Controlling how a Reasoning Model will Think

Paper • 2505.10185 • Published May 15 • 25

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

Paper • 2506.03930 • Published Jun 4 • 24

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published 5 days ago • 50

huangtom

authored a paper 8 days ago

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published 9 days ago • 45

nnnyt

in osunlp/Mind2Web-2 8 days ago

Update README.md

#1 opened 8 days ago by

BoyuNLP

in osunlp/Mind2Web-2 8 days ago

Update README.md

#1 opened 8 days ago by

nnnyt

updated a dataset 8 days ago

osunlp/Mind2Web-2

Preview • Updated 8 days ago • 11

BoyuNLP

updated a collection 8 days ago

Mind2Web 2

Evaluating Agentic Search with Agent-as-a-Judge • 2 items • Updated 8 days ago

nnnyt

published a dataset 8 days ago

osunlp/Mind2Web-2

Preview • Updated 8 days ago • 11

BoyuNLP

authored a paper 9 days ago

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published 9 days ago • 45

nnnyt

authored a paper 9 days ago

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published 9 days ago • 45

yhshu

authored a paper 9 days ago

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published 9 days ago • 45

btyu

in osunlp/SMolInstruct 19 days ago

Inquiry on Data Correction Methodology in SMolInstruct

#2 opened 4 months ago by