This startup wants to spark a DeepSeek moment in the United States

Photo of author

By [email protected]


since then Deep Sick It came onto the scene in January, and momentum around Chinese open source has been building artificial intelligence Models. Some researchers are pushing for a more open approach to building AI, which would allow the model-making process to be distributed around the world.

Prime Minister Thoughta decentralized AI startup, is currently training a large parametric language model, called INTELLECT-3, using a new type of distributed reinforcement learning for fine-tuning. Vincent Weisser, the company’s CEO, says that the model will demonstrate a new way to build competitive open artificial intelligence models using a range of devices in different locations in a way that does not depend on big technology companies.

The AI ​​world is currently divided between those who rely on closed American models and those who use open Chinese offerings, Weiser says. Prime Intellect advances AI by allowing more people to build and modify advanced AI for themselves.

Improving AI models is no longer just a matter of condensing training data and computation. Today’s parametric models use reinforcement learning to improve after the pre-training process is complete. Do you want your model to excel at math, answer legal questions, or play Sudoku? Ask him to improve himself through practice in an environment where you can measure success and failure.

“These reinforcement learning environments are now a bottleneck to actually scaling capabilities,” Weisser tells me.

Prime Intellect has created a framework that allows anyone to create a reinforcement learning environment tailored to a specific task. The company combines the best environments created by its team and the community to fine-tune INTELLECT-3.

I tried running a Wordle puzzle-solving environment, created by Will Brown, a researcher at Prime Intellect, and watched a small model solve Wordle puzzles (it was more methodical than I was, to be honest). If I were an AI researcher trying to improve a model, I would spin up a bunch of GPUs and run the model over and over while the reinforcement learning algorithm adjusted its weights, thus turning the model into a Wordle master.



https://media.wired.com/photos/68e564624bf4fef4e1ddd7ad/191:100/w_1280,c_limit/AI-Lab-America-Needs-DeepSeek-Moment-Business.jpg

Source link

Leave a Comment