Dong Zhang

nutation
https://0nutation.github.io/
  • 0nutation

AI & ML interests

None yet

Organizations

  • OpenMOSS (SII, Fudan NLP)
  • NSGPT
  • jlbbq
  • sgcr

Authored 9 papers (over 1 year ago)

SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities

Paper • 2305.11000 • Published May 18, 2023 • 4

SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models

Paper • 2308.16692 • Published Aug 31, 2023 • 1

LEGO: Language Enhanced Multi-modal Grounding Model

Paper • 2401.06071 • Published Jan 11, 2024 • 13

SeqXGPT: Sentence-Level AI-Generated Text Detection

Paper • 2310.08903 • Published Oct 13, 2023 • 1

GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators

Paper • 2402.06894 • Published Feb 10, 2024

InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance

Paper • 2401.11206 • Published Jan 20, 2024 • 1

SpeechGPT-Gen: Scaling Chain-of-Information Speech Generation

Paper • 2401.13527 • Published Jan 24, 2024

SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems

Paper • 2401.03945 • Published Jan 8, 2024

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling

Paper • 2402.12226 • Published Feb 19, 2024 • 46