FineData

community
Activity Feed

AI & ML interests

We release large pre-training datasets to accelerate open LLM development. Part of the Hugging Face Science team (hf.co/science)

Recent Activity

hynkyΒ 
in HuggingFaceFW/finepdfs about 1 month ago

Dataset broken by latest update?

5
#27 opened about 1 month ago by
Rijgersberg
megΒ 
posted an update 2 months ago
view post
Post
3874
πŸ€– Did you know your voice might be cloned without your consent from just *one sentence* of audio?
That's not great. So with @frimelle , we brainstormed a new idea for developers who want to curb malicious use: ✨The Voice Consent Gate.✨
Details, code, here: https://huggingface.co/blog/voice-consent-gate
  • 3 replies
Β·
megΒ 
posted an update 4 months ago
view post
Post
2911
πŸ€– As AI-generated content is shared in movies/TV/across the web, there's one simple low-hanging fruit πŸ‡ to help know what's real: Visible watermarks. With the Gradio team, I've made sure it's trivially easy to add this disclosure to images, video, chatbot text. See how: https://huggingface.co/blog/watermarking-with-gradio
Thanks to the code collab in particular from @abidlabs and Yuvraj Sharma.
davanstrienΒ 
posted an update 4 months ago
eliebakΒ 
posted an update 4 months ago
view post
Post
3896
Super excited to announce that our research team at Hugging Face will be doing an AMA on reddit r/LocalLLaMA.

Come ask any questions to the team behind SmolLM, FineWeb and more! And who knows, maybe there’ll be a shiny new release to talk about?

Thursday 4th September, 8AM-11AM PST πŸ€—

science