Yu Gu's picture

8 5

Yu Gu

entslscheia

·

entslscheia

AI & ML interests

None yet

Recent Activity

upvoted an article about 1 month ago

Open-source DeepResearch – Freeing our search agents

authored a paper 5 months ago

Beyond I.I.D.: Three Levels of Generalization for Question Answering on Knowledge Bases

authored a paper 5 months ago

Mind2Web: Towards a Generalist Agent for the Web

View all activity

Organizations

entslscheia's activity

upvoted an article about 1 month ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.14k

authored 9 papers 5 months ago

Beyond I.I.D.: Three Levels of Generalization for Question Answering on Knowledge Bases

Paper • 2011.07743 • Published Nov 16, 2020

Mind2Web: Towards a Generalist Agent for the Web

Paper • 2306.06070 • Published Jun 9, 2023 • 19

KoLA: Carefully Benchmarking World Knowledge of Large Language Models

Paper • 2306.09296 • Published Jun 15, 2023 • 19

A Systematic Investigation of KB-Text Embedding Alignment at Scale

Paper • 2106.01586 • Published Jun 3, 2021

Bringing Back the Context: Camera Trap Species Identification as Link Prediction on Multimodal Knowledge Graphs

Paper • 2401.00608 • Published Dec 31, 2023 • 2

Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments

Paper • 2402.14672 • Published Feb 22, 2024 • 1

Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments

Paper • 2212.09736 • Published Dec 19, 2022

HippoRAG: Neurobiologically Inspired Long-Term Memory for Large Language Models

Paper • 2405.14831 • Published May 23, 2024 • 4

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Paper • 2408.06327 • Published Aug 12, 2024 • 17

upvoted 3 papers 5 months ago

Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents

Paper • 2410.05243 • Published Oct 7, 2024 • 19

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Paper • 2410.05080 • Published Oct 7, 2024 • 21

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 62

liked a dataset 6 months ago

AI-MO/NuminaMath-TIR

Viewer • Updated Nov 25, 2024 • 72.5k • 22.4k • 115

updated 4 models 7 months ago

code-world-model/llama7b_math_pot_trace

Feature Extraction • Updated Aug 25, 2024 • 6

code-world-model/llama7b_math_codegen

Feature Extraction • Updated Aug 25, 2024 • 12

code-world-model/llama7b_math_cot

Feature Extraction • Updated Aug 25, 2024 • 10

code-world-model/llama7b_math_pot

Feature Extraction • Updated Aug 25, 2024 • 11

upvoted a paper 7 months ago

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Paper • 2408.06327 • Published Aug 12, 2024 • 17

updated a dataset about 1 year ago

osunlp/KBQA-Agent

Viewer • Updated Feb 27, 2024 • 500 • 97 • 10