
Dongjie Yang

Mutonix

AI & ML interests

Large Language Models & Multimodality

Recent Activity

upvoted an article 5 days ago
Mixture of Experts Explained
liked a model 7 days ago
meituan-longcat/LongCat-Flash-Chat
liked a dataset 3 months ago
v3ucn/chinese-novel-dataset

Organizations

Blog-explorers, Social Post Explorers

authored 2 papers 10 months ago

KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing

Paper • 2410.18517 • Published Oct 24, 2024 • 1

Are LLMs Aware that Some Questions are not Open-ended?

Paper • 2410.00423 • Published Oct 1, 2024 • 1
authored 2 papers about 1 year ago

PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference

Paper • 2405.12532 • Published May 21, 2024

Vript: A Video Is Worth Thousands of Words

Paper • 2406.06040 • Published Jun 10, 2024 • 30
authored 3 papers over 1 year ago

Learning Better Masking for Better Language Model Pre-training

Paper • 2208.10806 • Published Aug 23, 2022

BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer

Paper • 2307.00360 • Published Jul 1, 2023

RefGPT: Reference -> Truthful & Customized Dialogues Generation by GPTs and for GPTs

Paper • 2305.14994 • Published May 24, 2023