Avishai Elmakies's picture

1 26 9

Avishai Elmakies

avishai-elmakies

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Discrete Audio Tokens: More Than a Survey!

liked a dataset about 1 month ago

niveck/LLMafia

upvoted a paper about 1 month ago

Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games

View all activity

Organizations

upvoted 3 papers about 1 month ago

Discrete Audio Tokens: More Than a Survey!

Paper • 2506.10274 • Published Jun 12 • 32

Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games

Paper • 2506.05309 • Published Jun 5 • 14

Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation

Paper • 2506.08570 • Published Jun 10 • 33

upvoted 2 papers about 2 months ago

StressTest: Can YOUR Speech LM Handle the Stress?

Paper • 2505.22765 • Published May 28 • 18

WHISTRESS: Enriching Transcriptions with Sentence Stress Detection

Paper • 2505.19103 • Published May 25 • 13

upvoted a paper 2 months ago

Fast Text-to-Audio Generation with Adversarial Post-Training

Paper • 2505.08175 • Published May 13 • 23

upvoted 3 papers 3 months ago

I-Con: A Unifying Framework for Representation Learning

Paper • 2504.16929 • Published Apr 23 • 30

Follow the Flow: On Information Flow Across Textual Tokens in Text-to-Image Models

Paper • 2504.01137 • Published Apr 1 • 21

Scaling Analysis of Interleaved Speech-Text Language Models

Paper • 2504.02398 • Published Apr 3 • 32

upvoted 7 papers 4 months ago

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Paper • 2504.00595 • Published Apr 1 • 37

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published Mar 31 • 63

Single Image Iterative Subject-driven Generation and Editing

Paper • 2503.16025 • Published Mar 20 • 14

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20 • 93

AudioX: Diffusion Transformer for Anything-to-Audio Generation

Paper • 2503.10522 • Published Mar 13 • 27

Charting and Navigating Hugging Face's Model Atlas

Paper • 2503.10633 • Published Mar 13 • 86

RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling

Paper • 2503.09601 • Published Mar 12 • 15

upvoted a collection 5 months ago

Slam

All resources for SpeechLMs from "Slamming: Training a Speech Language Model on One GPU in a Day". We provide tokeniser, lm, and datasets • 7 items • Updated May 22 • 13

upvoted 3 papers 5 months ago

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published Feb 19 • 70

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published Feb 13 • 36

Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12 • 49