Ma's picture

2 15

Ma

heurainbow

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs

upvoted a paper about 1 month ago

ShiQ: Bringing back Bellman to LLMs

liked a dataset 5 months ago

WizzF/Heap-Forge

View all activity

Organizations

upvoted 2 papers about 1 month ago

Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs

Paper • 2503.14286 • Published Mar 18 • 2

ShiQ: Bringing back Bellman to LLMs

Paper • 2505.11081 • Published May 16 • 2

liked a dataset 5 months ago

WizzF/Heap-Forge

Viewer • Updated Dec 28, 2024 • 10.5M • 701 • 3

liked a dataset 9 months ago

lightblue/kurage_training_data

Viewer • Updated Sep 16, 2024 • 61.6k • 224 • 6

liked 8 datasets over 1 year ago

teknium/OpenHermes-2.5

Viewer • Updated Apr 15, 2024 • 1M • 3.22k • 735

selfrag/selfrag_train_data

Viewer • Updated Oct 31, 2023 • 146k • 139 • 71

allenai/MADLAD-400

Updated Sep 9, 2024 • 190k • 143

UCSD26/medical_dialog

Updated Sep 18, 2023 • 642 • 162

Open-Orca/FLAN

Viewer • Updated Aug 2, 2023 • 378M • 4.58k • 179

LDJnr/Puffin

Viewer • Updated Jun 7, 2024 • 3k • 792 • 94

dim/camel_ai_chemistry

Viewer • Updated Oct 12, 2023 • 20k • 41 • 1

defunct-datasets/the_pile_books3

Updated Jan 18, 2024 • 246 • 152

liked a dataset almost 2 years ago

totally-not-an-llm/EverythingLM-data

Viewer • Updated Aug 3, 2023 • 1.08k • 31 • 22

liked 2 datasets about 2 years ago

jondurbin/airoboros-gpt4

Preview • Updated Jun 22, 2023 • 76 • 15

tiiuae/falcon-refinedweb

Viewer • Updated Jun 20, 2023 • 968M • 10k • 858

liked a Space about 2 years ago

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots

liked a dataset over 2 years ago

laion/relaion2B-en-research-safe

Viewer • Updated Jul 2, 2024 • 2.1B • 3.78k • 200