The Historical Archivesfloodgates have opened for building AI reasoning models on the cheap.
Researchers at Stanford and the University of Washington have developed a model that performs comparably to OpenAI o1 and DeepSeek R1 models in math and coding — for less than $50 of cloud compute credits.
What's more, the model was trained on only 1,000 questions, and took just 26 minutes and 16 Nvidia H100 GPUs. Stanford researcher Niklas Muennighoff said in a email to Mashable that the cost is an estimate based on the GPU runtime and number of H100 GPUs used.
The AI industry of late is all about how new approaches to the pre and post training process can massively save computing costs, as evidenced by DeepSeek's disruptive impact. On top of that, developers are now able to build on top of existing AI models at little or no cost, through APIs, open-source access, and even closed-source models by distilling their data, bringing the costs down even more.
According to the team's research paper which was published last Friday, s1 was trained on a dataset consisting of "1,000 carefully curated questions paired with reasoning traces and answers distilled from Gemini Thinking Experimental." Google's Gemini Thinking Experimental model is accessible with daily limits through AI Studio. While it's a closed-source model, that clearly hasn't stopped researchers from making use of its responses.
SEE ALSO: OpenAI launches 'deep research' AI agent for ChatGPTNext, the researchers used an "off the shelf" pretrained model from Alibaba-owned lab, Qwen, and performed supervised fine-tuning of its curated dataset. Then, the team created a token budget to control the amount of compute time for testing the model. If s1 went over budget on thinking tokens, it was cut off and forced to generate whatever answer it came up with. If the researchers wanted the model to spend more "test-time compute" on a problem, they would simply tell the model to "wait," which extended its thinking time and led to more accurate results.
By controlling the amount of time and compute spent on a problem, the researchers were able to show how increased thinking team leads to improved performance.
S1 is one example of open-source reasoning models that have been developed for a fraction of the cost of flagship models from Google and OpenAI. In January, UC Berkeley researchers released an open-source reasoning model called Sky-T1 that cost $450, "demonstrating that it is possible to replicate high-level reasoning capabilities affordably and efficiently," per its blog post. There's also the open-source rStar-Math reasoning model from Microsoft Asia researchers, Tulu 3 from non profit research institute Ai2, and HuggingFace has its own initiative to replicate DeepSeek's R1.
As high-quality models become more accessible and cheaper, we're starting to see a power shift from the few AI heavy hitters, to the many.
Topics Artificial Intelligence OpenAI
Solange Knowles wraps herself in yarn, still inspires outfit envyPlayStation Classic reviews are in: Here's what the critics thinkYouTube rolls out Stories to creators with over 10,000 subscribersSolange Knowles wraps herself in yarn, still inspires outfit envySurprise! Apple Music is coming to the Amazon EchoHere's how to bring the majesty and mystery of space to your phonePotter fans battle it out over J.K. Rowling Albus Potter tweetStarved for attention, banned Twitter troll handcuffs herself to the company’s NY office doorsFirst Indigenous female MP sworn in amid traditional songHow to make group texting suck lessWhat's coming to Hulu in December 2018Social senior dog walks 4 miles every day to catch up with all his friendsWhat we learned from George R. R. Martin's new book 'Fire and Blood'Hidden rainbow hair makes a bold trend safe for any officeAll the 'Game of Thrones' theories from 'Fire and Blood'Another major U.S. climate report dropped. But you may have missed it.You can now buy a talking fish with Amazon’s Alexa voice assistantCritics love 'SpiderThese countries' mobile internet speeds are way faster than WiFiJordan Peele is working on a 'Candyman' sequel and frankly we need this now MLB sees first tie game in a very, very long time Antonio Brown will honor Arnold Palmer with an awesome pair of custom cleats Trump jokes about kicking non Teddy Ruxpin is back and creepier than ever Not an illusion: Lady Gaga to headline Super Bowl halftime show in 2017 The truth behind your favourite hipster 'craft' brands Mark Wahlberg's 'Deepwater Horizon' will have you crying on the edge of your seat Condolences to Gordon Ramsay, whose penis was stung by a jellyfish 'Overwatch' Open grand finals recap: Misfits defeat Team EnVyUs Beyoncé is the tech industry's newest celebrity investor Hundreds gather for Bon Iver show, greeted by cassette player instead Adventurer Johnny Strange to be remembered with skateparks in Malibu and Bhutan Polite guy wins $1 million in lottery store after letting someone cut in front First look at the sweet new 'Rogue One' Star Wars toys coming out Young former politician labelled a 'wanker' after visiting ISIS frontline People are inserting Donald Trump's sex tape comment into previous presidential speeches Another Miss Universe contestant recalls being body 'USA Today's' first endorsement in history is for anyone but Donald Trump Mercedes just unveiled its first all Rosetta bids farewell tweeting cute cartoons in different languages
2.542s , 10133.6875 kb
Copyright © 2025 Powered by 【Historical Archives】,Wisdom Convergence Information Network