Dongjie Yang

Mutonix

AI & ML interests

Large Language Models & Multimodality

Recent Activity

liked a model about 2 months ago

BriLLM/BriLLM0.5

liked a Space 4 months ago

mteb/leaderboard

upvoted an article 5 months ago

Mixture of Experts Explained

authored 2 papers about 1 year ago

KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing

Paper • 2410.18517 • Published Oct 24, 2024 • 1

Are LLMs Aware that Some Questions are not Open-ended?

Paper • 2410.00423 • Published Oct 1, 2024 • 1

authored 2 papers over 1 year ago

PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference

Paper • 2405.12532 • Published May 21, 2024

Vript: A Video Is Worth Thousands of Words

Paper • 2406.06040 • Published Jun 10, 2024 • 28

authored 3 papers almost 2 years ago

Learning Better Masking for Better Language Model Pre-training

Paper • 2208.10806 • Published Aug 23, 2022

BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer

Paper • 2307.00360 • Published Jul 1, 2023

RefGPT: Reference -> Truthful & Customized Dialogues Generation by GPTs and for GPTs

Paper • 2305.14994 • Published May 24, 2023