Generative AI News

DeepSeek R1: A Reasoning Model to Rival OpenAI o1

On January 20, Chinese AI lab DeepSeek launched a new reasoning model called DeepSeek R1, designed to improve problem-solving and analytical skills.

The company also introduced six smaller open-source models based on R1, two of which perform similarly to OpenAI’s o1-mini.

Key Points

  • DeepSeek R1 is fully open-source and claims to match the performance of OpenAI’s most advanced model, OpenAI o1.
  • In mathematics, DeepSeek-R1 achieved 79.8% on the American Invitational Mathematics Examination (AIME 2024), comparable to OpenAI o1.
  • On MATH-500, another mathematics benchmark, DeepSeek-R1 achieved 97.3 per cent accuracy.
  • On Codeforces, a competitive-coding benchmark, DeepSeek-R1 ranked in the 96.3rd percentile of human participants.
  • On the general-knowledge benchmarks MMLU and GPQA Diamond, DeepSeek-R1 scored 90.8 per cent and 71.5 per cent accuracy, respectively.
  • On AlpacaEval 2.0, a benchmark that tests an AI model’s writing and question answering, DeepSeek-R1 secured an 87.6 per cent win rate.
  • DeepSeek R1 is designed to not only provide answers but also reason through problems like a human.
  • R1 is available on the AI platform Hugging Face under an MIT license, which permits commercial use.
  • DeepSeek also released “distilled” versions of R1 ranging in size from 1.5 billion parameters to 70 billion parameters. The smallest can run on a laptop.
  • DeepSeek R1 is available through DeepSeek’s API at prices 90%-95% cheaper than OpenAI’s o1.
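
To put the distilled model sizes above in perspective, here is a rough back-of-the-envelope sketch of how much memory the weights alone would need. The function name and the precision table are illustrative assumptions, not anything published by DeepSeek, and the estimate ignores activations, caches, and runtime overhead.

```python
# Rough weights-only memory estimate for a model of a given parameter count.
# Assumption: memory ~= parameters x bytes per parameter. Real usage is higher
# because of activations, caches, and framework overhead.

# Assumed storage precisions (illustrative, not tied to any specific release).
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(num_params: float, precision: str = "fp16") -> float:
    """Return approximate weight memory in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    # The smallest and largest distilled sizes mentioned in the article.
    for label, params in [("1.5B", 1.5e9), ("70B", 70e9)]:
        for precision in BYTES_PER_PARAM:
            gb = weight_memory_gb(params, precision)
            print(f"{label} @ {precision}: ~{gb:.2f} GB")
```

Under these assumptions, the 1.5-billion-parameter model needs about 3 GB for 16-bit weights, which fits within a typical laptop's memory, while the 70-billion-parameter model needs about 140 GB at the same precision.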

Transforming AI Accessibility with DeepSeek R1

DeepSeek R1 is important because it makes advanced AI technology available to everyone.

It’s open-source and free to use, so developers, researchers, and businesses can adopt it without paying licensing fees.

This can help drive innovation, especially in areas with fewer resources.

DeepSeek R1 performs well in tasks like reasoning, math, coding, and general knowledge, meaning it can solve complex problems in various fields.

It’s also cost-effective, offering API access at lower prices than competitors, making it a good choice for organizations wanting powerful AI without high costs.

By creating versions that can run on smaller devices, DeepSeek R1 makes advanced AI more accessible, even in places with limited computing power, promoting inclusivity and equal access to technology.

News Gist

Chinese AI lab DeepSeek released DeepSeek R1, an open-source reasoning model rivaling OpenAI o1 in performance.

It excels in mathematics, coding, and general knowledge benchmarks, with distilled versions ranging from 1.5 to 70 billion parameters, and is available on Hugging Face under an MIT license.

It offers cost-effective AI solutions, accessible even on laptops, at significantly lower API costs.
