Nous Research Unleashes Hermes 4: Open-Source AI Models

August 30, 2025 Ai Binger News Desk

Nous Research, a pioneering group in open-source and decentralized AI, has launched Hermes 4, their most advanced family of large language models to date.

Built for transparency and user steerability, Hermes 4 stands out for its reasoning capabilities, performance, and minimal content restrictions.

Key Technical Features

Hybrid Reasoning Architecture: Hermes 4 introduces “hybrid reasoning” capabilities, allowing users to toggle between quick responses and thorough, step-by-step cognitive processes.

When activated, models articulate their internal reasoning within special <think> tags before delivering final responses, similar to OpenAI’s o1 reasoning models but with complete transparency regarding the AI’s thought process.

Function Calling and Tool Use: Hermes 4 supports advanced function calling within single assistant turns, produced after reasoning processes.

The system handles tool definitions through system prompts or direct message fields, with automatic parsing built into VLLM and SGLang frameworks.

Advanced Training Infrastructure: Hermes 4 leverages two revolutionary training systems: DataForge for synthetic data generation and Atropos for reinforcement learning.

DataForge operates through directed graphs, transforming basic pre-training data into complex instruction-following examples through “random walks”.

For instance, it can convert a Wikipedia article into a rap song before generating related questions and answers.

Atropos functions as specialized training environments where AI models practice specific skills mathematics, coding, tool utilization, and creative writing receiving feedback only for correct responses.

This “rejection sampling” method ensures only verified, high-quality responses enter training data.

Model Specifications and Performance

Available Model Sizes: Hermes 4 comes in three configurations: 14B, 70B, and 405B parameters, all built on Llama 3.1 architecture (except 14B using Qwen3 14B checkpoint).

Each model offers hybrid reasoning capabilities with toggleable thinking modes.

Benchmark Performance

Hermes 4’s performance achievements are remarkable:

405B model: 96.3% on MATH-500 benchmark in reasoning mode, 81.9% on AIME’24 mathematics competition.
70B model: 95.6% on MATH-500, 73.5% on AIME’24.
14B model: 92.6% on MATH-500, 52.7% on AIME’24.
Most significantly, Hermes 4 achieved 57.1% on RefusalBench in reasoning mode, vastly outperforming GPT-4o (17.67%) and Claude Sonnet 4 (17%).

Availability and Access Options

All model weights are freely available on Hugging Face following Nous Research’s open-source philosophy.

Interactive Platforms

Users can access Hermes 4 through multiple channels:

Nous Chat: Revamped interface featuring parallel interactions and memory systems.
API Access: Available through OpenRouter, Nebius, and Luminal.
Direct Download: Complete model weights on Hugging Face platform.

Pricing Structure

Hermes 4 maintains cost-effective pricing compared to proprietary alternatives:

Input tokens: $0.000093 per 1,000 tokens.
Output tokens: $0.000373 per 1,000 tokens.
Context length: 131,072 tokens.

This pricing significantly undercuts competitors like Claude 3.5 Sonnet ($0.003/$0.015) and GPT-4o while providing comparable or superior performance.

News Gist

Nous Research has launched Hermes 4, a family of open-weight hybrid reasoning AI models (14B, 70B, 405B).

With benchmark-leading performance, hybrid reasoning mode, and transparent training, Hermes 4 challenges Big Tech dominance while promoting openness, affordability, and user control.

FAQs

Q1. What is Hermes 4?

Hermes 4 is a family of open-weight hybrid reasoning AI models (14B, 70B, 405B) released by Nous Research in August 2025.

Q2. What makes Hermes 4 unique?

It features a hybrid reasoning mode, benchmark-topping math performance, fewer restrictions on responses, and complete transparency in training methods and datasets.

Q3. How does Hermes 4 perform in benchmarks?

The 405B version achieved 96.3% on MATH-500 and 81.9% on AIME’24, rivaling or surpassing top closed-source AI models.

Q4. Where can Hermes 4 be accessed?

It is available on Hugging Face, Nous Chat, and OpenRouter, with both free model weights and paid inference services.

Q5. What is the pricing for Hermes 4 usage?

On OpenRouter, pricing is about $0.20 per million input tokens and $0.80 per million output tokens for the 405B variant.

Q6. Why is Hermes 4 significant?

Hermes 4 challenges Big Tech by showing that open-source AI can be powerful, affordable, transparent, and user-controlled, empowering developers and researchers worldwide.

Cookie	Domain	Description	Duration	Type
_ga_*	.aibinger.com	Google Analytics sets this cookie to store and count page views.	1 year 1 month 4 days	Analytics
_ga	.aibinger.com	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.	1 year 1 month 4 days	Analytics

AI Binger

Nous Research Unleashes Hermes 4: Open-Source AI Models

Key Technical Features

Model Specifications and Performance

Benchmark Performance

Availability and Access Options

Interactive Platforms

Pricing Structure

News Gist

FAQs

Figure AI Introduces Figure 03: New Humanoid Robot

Google Rolls Out Gemini Enterprise

OpenAI Launches ChatGPT Apps SDK — A Full App Platform

Google DeepMind Launches CodeMender

Perplexity Expands with Acquisition of AI Design Startup

Fujitsu and NVIDIA Join Forces to Build “Physical AI” Platform

Leave a Reply Cancel reply