ZWProj

university

https://huggingface.co/ZhaoweiWang

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

ZhaoweiWang authored a paper 24 days ago

KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection

ZhaoweiWang authored a paper 24 days ago

CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning

ZhaoweiWang authored a paper 24 days ago

DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects

View all activity

ZWProj's activity

ZhaoweiWang

authored 4 papers 24 days ago

KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection

Paper • 2310.09044 • Published Oct 13, 2023

CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning

Paper • 2401.07286 • Published Jan 14, 2024

DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects

Paper • 2410.02730 • Published Oct 3, 2024

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published 28 days ago • 53

yuzhaouoe

authored a paper 24 days ago

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published 28 days ago • 53

yuzhaouoe

authored a paper 3 months ago

Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression

Paper • 2503.02812 • Published Mar 4 • 10

wyu1

authored a paper 4 months ago

OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas

Paper • 2501.15427 • Published Jan 26 • 6

wyu1

authored 2 papers 8 months ago

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Paper • 2410.10813 • Published Oct 14, 2024 • 12

LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks

Paper • 2410.01744 • Published Oct 2, 2024 • 26

wyu1

authored a paper 9 months ago

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 68

wyu1

authored a paper 12 months ago

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22, 2024 • 48

yuzhaouoe

authored a paper 12 months ago

A Simple and Effective $L_2$ Norm-Based Strategy for KV Cache Compression

Paper • 2406.11430 • Published Jun 17, 2024 • 24

yuzhaouoe

authored 2 papers about 1 year ago

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models

Paper • 2404.05904 • Published Apr 8, 2024 • 9

Are We Done with MMLU?

Paper • 2406.04127 • Published Jun 6, 2024 • 39

wyu1

authored a paper over 1 year ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 147

yuzhaouoe

authored 3 papers over 1 year ago

wyu1

authored 2 papers over 1 year ago

WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

Paper • 2401.13919 • Published Jan 25, 2024 • 32

Creative Robot Tool Use with Large Language Models

Paper • 2310.13065 • Published Oct 19, 2023 • 9

AI & ML interests

Recent Activity

Team members 5

ZWProj's activity