Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
2970404.1
TFLOPS
320
365
610
Yatharth Sharma
YaTharThShaRma999
Follow
couragestrong's profile picture
Jongsim's profile picture
thomaskalnik's profile picture
19 followers
Ā·
26 following
AI & ML interests
None yet
Recent Activity
updated
a model
1 day ago
YaTharThShaRma999/voices
reacted
to
Ruurd
's
post
with š„
2 days ago
The past year I have been trying to get diffusion models to work for language generation, without having to retrain a LLM from scratch. And recently, we finally succeeded: We introduce "LAD: LoRA-Adapted Denoiser", a method to convert a LLaMA model into a text diffusion model using LoRA finetuning and structured input corruption. šÆ Try the demo and read the write-up here! https://ruurdkuiper.github.io/tini-lad/ Unlike autoregressive (word-for-word) models like ChatGPT, diffusion models iteratively refine a noised sequence. However, most current diffusion approaches rely on all-parameter retraining and repeatedly remasking tokens, which is costly and slow during both training and inference! š§ With LAD: - We can finetune an autoregressive model for diffusive generation in just 10 hours on a single GPU. - Test-time compute is fully adjustable: fewer steps means faster outputs while more steps improve output quality. - Due to our unique noising schedule, remasking is not always needed during inference. All tokens are attended to in each iteration! š LAD is built using: ā A frozen LLaMA-8B backbone ā Structured noising: token swaps, duplications, replacements, span shifts ā Modified attention masks for bidirectional decoding š” We show that even small, fast-trained models can perform diffusive generation ā with competitive benchmark performance, perplexity and more flexible test-time behavior than traditional transformers.
liked
a model
3 days ago
fluxions/vui
View all activity
Organizations
None yet
spaces
11
Sort:Ā Recently updated
pinned
Running
1
Gemini Image Edit
š
Generate edited images with text prompts
pinned
Running
5
Octopus v2
š
chat with octopus 2 gguf model
Running
Space
š
Sleeping
1
Space
š
Runtime error
Real-Time Text-to-Image SDXL Lightning
ā”
Sleeping
JupyterLab
š»
Expand 11 spaces
models
34
Sort:Ā Recently updated
YaTharThShaRma999/voices
Updated
1 day ago
ā¢
1
YaTharThShaRma999/csm_decoder_casual
Text Generation
ā¢
Updated
18 days ago
ā¢
42
YaTharThShaRma999/csm_test
Text-to-Audio
ā¢
Updated
22 days ago
ā¢
18
YaTharThShaRma999/csm_decoder
Text Generation
ā¢
Updated
23 days ago
ā¢
41
YaTharThShaRma999/csm_backbone
Feature Extraction
ā¢
Updated
26 days ago
ā¢
10
YaTharThShaRma999/oute_tts_awq
Updated
28 days ago
ā¢
6
YaTharThShaRma999/muyan_awq_auto
Updated
28 days ago
ā¢
15
YaTharThShaRma999/TestCalibration
Updated
29 days ago
YaTharThShaRma999/muyan_awq
Updated
May 3
ā¢
10
ā¢
1
YaTharThShaRma999/orpheus_multilingual_awq
Updated
Apr 26
ā¢
32
Expand 34 models
datasets
4
Sort:Ā Recently updated
YaTharThShaRma999/calibration_audio
Viewer
ā¢
Updated
29 days ago
ā¢
128
ā¢
59
YaTharThShaRma999/Physics_dataset
Viewer
ā¢
Updated
Sep 24, 2023
ā¢
1k
ā¢
34
ā¢
3
YaTharThShaRma999/autotrain-data-flant5finetune
Preview
ā¢
Updated
Aug 10, 2023
ā¢
10
YaTharThShaRma999/ImageCaptioningDataset
Updated
May 13, 2023
ā¢
10