FineData

Team
community
Activity Feed

AI & ML interests

We release large pre-training datasets to accelerate open LLM development. Part of the Hugging Face Science team (hf.co/science)

Recent Activity

megΒ 
posted an update 5 days ago
view post
Post
3441
πŸ€– Did you know your voice might be cloned without your consent from just *one sentence* of audio?
That's not great. So with @frimelle , we brainstormed a new idea for developers who want to curb malicious use: ✨The Voice Consent Gate.✨
Details, code, here: https://huggingface.co/blog/voice-consent-gate
  • 3 replies
Β·

docs: fix typo

1
#2 opened 13 days ago by
stefan-it
guipenedoΒ 
updated a Space 13 days ago

datatrove

#2 opened 13 days ago by
hynky
hynkyΒ 
in HuggingFaceFW/finepdfs 14 days ago

Deciding on extraction path

4
#10 opened about 2 months ago by
Mdspike

Were the original PDFs saved?

11
#2 opened about 2 months ago by
staghado

Docling output

1
#4 opened about 2 months ago by
akreal