Iman Barati

Iman998

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models

liked a Space 3 months ago

lmarena-ai/lmarena-leaderboard

liked a dataset 4 months ago

raia-center/khayyam-challenge

View all activity

Organizations

upvoted a paper about 2 months ago

Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models

Paper • 2506.18945 • Published Jun 23 • 39

upvoted a collection 5 months ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 162

upvoted an article 5 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

and 3 others •

Mar 12

• 449

upvoted 2 articles 7 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

•

May 28, 2024

• 240

Article

🪆 Introduction to Matryoshka Embedding Models

and 2 others •

Feb 23, 2024

• 154

upvoted a paper 8 months ago

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published Dec 19, 2024 • 90

upvoted a paper 10 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 55

upvoted 5 papers 11 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 64

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Paper • 2409.12183 • Published Sep 18, 2024 • 40

upvoted an article 11 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

and 5 others •

Sep 18, 2024

• 264

upvoted an article 12 months ago

Article

Welcome FalconMamba: The first strong attention-free 7B model

and 5 others •

Aug 12, 2024

• 112

upvoted 4 papers about 1 year ago

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 105

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 167

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 95

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12, 2024 • 70

Iman Barati

AI & ML interests

Recent Activity

Organizations

Iman998's activity

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Training and Finetuning Embedding Models with Sentence Transformers v3

🪆 Introduction to Matryoshka Embedding Models

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Welcome FalconMamba: The first strong attention-free 7B model