mrfakename's picture

mrfakename PRO

mrfakename

·

https://mrfake.name/

AI & ML interests

LLMs, TTS, & Open Source

Recent Activity

published a dataset about 2 hours ago

LAION-Voice/LAION-Voice-WIP

new activity 1 day ago

llamafy/README:Model Request: ByteDance-Seed/Seed-OSS-36B-Instruct

new activity 1 day ago

llamafy/README:Model Request: Qwen 3 Series

View all activity

Organizations

upvoted a collection 20 days ago

DeepSeek-V3.1

3 items • Updated 18 days ago • 222

upvoted a collection 24 days ago

NextStep-1

7 items • Updated 21 days ago • 27

upvoted a collection 2 months ago

ERNIE 4.5

collection of ERNIE 4.5 models. "-Paddle" models use PaddlePaddle weights, while "-PT" models use Transformer-style PyTorch weights. • 25 items • Updated Jul 11 • 159

upvoted an article 3 months ago

Article

DO THEY SEE WHAT WE SEE?

By

•

Jun 22

• 5

upvoted a paper 3 months ago

EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection

Paper • 2506.09827 • Published Jun 11 • 18

upvoted 2 collections 4 months ago

NextCoder

NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated Jul 9 • 70

Qwen3

84 items • Updated Aug 6 • 1.21k

upvoted 3 collections 7 months ago

DeepSeek-R1

10 items • Updated May 29 • 794

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 535

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Jul 21 • 121

upvoted a collection 8 months ago

Reasoning Datasets

Reasoning datasets that are trending 🔥 • 10 items • Updated Jan 3 • 25

upvoted a paper 9 months ago

Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis

Paper • 2410.23320 • Published Oct 30, 2024 • 8

upvoted 4 papers 11 months ago

MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer

Paper • 2409.00750 • Published Sep 1, 2024 • 4

StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion

Paper • 2409.10058 • Published Sep 16, 2024 • 2

YODAS: Youtube-Oriented Dataset for Audio and Speech

Paper • 2406.00899 • Published Jun 2, 2024 • 3

F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching

Paper • 2410.06885 • Published Oct 9, 2024 • 47

upvoted 2 papers about 1 year ago

Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15, 2024 • 61

Artist: Aesthetically Controllable Text-Driven Stylization without Training

Paper • 2407.15842 • Published Jul 22, 2024 • 14

upvoted 2 papers over 1 year ago

Diffusion On Syntax Trees For Program Synthesis

Paper • 2405.20519 • Published May 30, 2024 • 1

"Teach AI How to Code": Using Large Language Models as Teachable Agents for Programming Education

Paper • 2309.14534 • Published Sep 25, 2023 • 2