Evaluate Measurement

https://github.com/huggingface/evaluate/

Recent Activity

lvwerra authored a paper 10 days ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published 11 days ago • 57

evaluate-bot updated 8 Spaces 16 days ago

Toxicity

Honest

Word Length

Text Duplicates

Label Distribution

Regard

Perplexity

Word Count

lvwerra authored a paper 3 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 192

sasha authored a paper 3 months ago

Fully Autonomous AI Agents Should Not be Developed

Paper • 2502.02649 • Published Feb 4 • 34

lvwerra authored a paper 5 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 235

lvwerra authored a paper 6 months ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14 • 64

lvwerra authored a paper 8 months ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 25

lvwerra updated 6 Spaces 10 months ago

Regard

Word Count

Word Length

Perplexity

Toxicity

Text Duplicates