Evaluating Agentic Search with Agent-as-a-Judge
AI & ML interests
Natural language processing, language models, language agents
Towards Generalist Agents for the Web (NeurIPS'23 Spotlight)
Navigating GUIs as Humans Do: Universal Visual Grounding for GUI Agents (ICLR'25 Oral)
LLMs tuned on the SMolInstruct dataset for chemistry tasks.
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
SAEs for vision models like CLIP or DINOv2
Generative models to produce GCG-like adversarial suffixes
- osunlp/AmpleGCG-llama2-sourced-llama2-7b-chat
  Text Generation • 7B • Updated • 6 • 4
- osunlp/AmpleGCG-llama2-sourced-vicuna-7b
  Text Generation • 7B • Updated • 6 • 1
- osunlp/AmpleGCG-llama2-sourced-vicuna-7b13b-guanaco-7b13b
  Text Generation • 7B • Updated • 4 • 1
- osunlp/AmpleGCG-plus-llama2-sourced-llama2-7b-chat
  Text Generation • 7B • Updated • 102 • 1