Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Xuezhe Ma's picture
3

Xuezhe Ma

maxma1987
HectorRen's profile picture 21world's profile picture
·
https://xuezhemax.github.io/
  • MaxMa1987

AI & ML interests

None yet

Organizations

LLM360's profile picture

authored 8 papers over 1 year ago

Mega: Moving Average Equipped Gated Attention

Paper • 2209.10655 • Published Sep 21, 2022 • 1

LIMA: Less Is More for Alignment

Paper • 2305.11206 • Published May 18, 2023 • 26

Towards a Unified View of Parameter-Efficient Transfer Learning

Paper • 2110.04366 • Published Oct 8, 2021 • 3

Better May Not Be Fairer: A Study on Subgroup Discrepancy in Image Classification

Paper • 2212.08649 • Published Dec 16, 2022

LightSeq: Sequence Level Parallelism for Distributed Training of Long Context Transformers

Paper • 2310.03294 • Published Oct 5, 2023 • 2

Evaluating Large Language Models on Controlled Generation Tasks

Paper • 2310.14542 • Published Oct 23, 2023

Look-back Decoding for Open-Ended Text Generation

Paper • 2305.13477 • Published May 22, 2023

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Paper • 2404.08801 • Published Apr 12, 2024 • 68
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs