Jan 30, 2025 at 4:50 PM

Hugging Face is building Open R1, a fully open-source version of the DeepSeek R1 AI model

Hugging Face has announced that it is building Open R1, an open reproduction of the recently launched DeepSeek R1 model. DeepSeek R1's model weights are publicly available, but its datasets and training code have not been released. Open R1 aims to fill these gaps so that the research and industry communities can build similar or better models.

The initiative focuses on reconstructing DeepSeek R1's data and training processes, validating its claims, and advancing open reasoning models. Hugging Face also wants to make the role of reinforcement learning in reasoning more transparent and to give the open-source community reproducible insights.

Open R1 is not only about replicating results but also about sharing what is learned along the way. By documenting which strategies work and which do not, Hugging Face hopes to spare others from wasting time and compute on unproductive approaches.

Jan 30, 2025 by Paul

Open R1

Open R1 is a community-driven, open-source AI chatbot initiative focused on replicating the advanced AI capabilities of DeepSeek-R1 through transparent methodologies. It offers an open platform for developers and researchers to explore and enhance AI interactions. Top alternatives to Open R1 include ChatGPT, HuggingChat, and Google Gemini, each providing unique AI-driven conversational experiences.

External links

Open-R1: a fully open reproduction of DeepSeek-R1 (Hugging Face, official source)
Hugging Face researchers are trying to build a more open version of DeepSeek's AI 'reasoning' model (TechCrunch)
Hugging Face wants to make DeepSeek R1 fully open by filling closed source gaps (Neowin)
Hugging Face wants to reverse-engineer DeepSeek's R1 reasoning model (SiliconANGLE)
Hugging Face Is Trying to Build a Fully Open-Source Version of DeepSeek-R1 AI Model (Gadgets 360)
Open-R1: The first DeepSeek R1 AI clone, with a big twist (BGR)

Comments

UserPower

Comment, Jan 30, 2025

Nice to see that the $200B invested in AI wasn't wasted after all, since it will allow the myriad gigantic datacenters with millions of ludicrous GPUs to train the R1 model in a few hours instead of a few weeks.