Tools your agents can actually run: Mellea v0.7.0

Mellea v0.7.0 ships a sandboxed code interpreter, a shell tool, and a library of executable requirements, plus context compaction and plugin-based telemetry, so agents can run code and stay grounded.

Mellea Contributors14 July 2026

release
v0.7

See Inside Your LLM Pipeline with Mellea Debug Plugins

Trace generation, validation, and sampling in detail. Built-in plugins reveal model calls, requirement failures, repair events, and loop iterations—all without boilerplate.

Akihiko Kuroda13 July 2026

debugging
plugins
observability

Every layer of Granite 4.1 in one conversation

At the IBM Booth at THINK 2026, we ran a demo using every model family from the Granite 4.1 release

Paul Schweigert1 July 2026

granite
speech
demo
switch

Why Mellea?

Agents are just programs, patterns of control flow around generative AI. So why the 80–90% failure rate? Because they're built out of prompts, not code. Mellea is a different approach.

David Cox, Paul Schweigert19 June 2026

generative-computing
agents
prompt-engineering
IVR
reliability
control-flow

Granite Switch in Mellea: one checkpoint, every adapter function

With Granite Switch, adding validation to a Mellea program — checking that an answer is grounded, that a requirement is met, that nothing in the response was hallucinated — is a single function call against the backend you're already using. One checkpoint, a dozen drop-in validations, no second pipeline to stand up.

Nigel Jones16 June 2026

granite
adapters
switch
vllm

The Loop Needs a Gate

The industry just spent a fortnight agreeing you should write loops, not prompts. Everyone also agrees on the catch: a loop is only as good as the gate that can fail its work.

Nigel Jones11 June 2026

loop-engineering
harness-engineering
verification
IVR
generative-programming
requirements

From Linting to Tests: Doubling Functional Correctness in Qiskit IVR

Wiring functional tests into Mellea's Instruct-Validate-Repair loop nearly doubled functional correctness on the Qiskit Human Eval benchmark, on top of what static validation already provided.

Alex Bozarth27 May 2026

qiskit
IVR
validation
benchmark
LLM
code-generation

Using MCP Server Tools in Mellea

Mellea now supports MCP server tools. Discover any MCP server's tools and call them directly from a Mellea agent.

Alex Bozarth22 May 2026

v0.6
mcp
tools

Making Small Models Rock with Mellea

Small open-weight models can handle production-shaped work when the harness decomposes the task, validates outputs, and routes each step to the right local model.

Paul Schweigert, Nathan Fulton20 May 2026

granite
rag
adapters
small-models
docling
local-llm

What Mellea Brings to DSPy: Structured Validation for Reliable AI Programs

Add semantic validation and quality guarantees to DSPy programs with Mellea's integration for structured prompting and runtime verification.

Akihiko Kuroda6 May 2026

dspy
generative-programming
llm
validation
reliability

Validate Every CrewAI Agent Output: Automatic Retry with Mellea

Mellea brings structured validation and automatic repair to CrewAI multi-agent systems through the instruct-validate-repair pattern.

Akihiko Kuroda4 May 2026

crewai
multi-agent
validation
integration

Cut LLM Costs Without Sacrificing Quality: The SOFAI Pattern in Mellea

Route most requests to a small model and escalate only hard cases to a larger one — Mellea's SOFAISamplingStrategy makes the dual-model pattern a one-line strategy swap.

Nigel Jones1 May 2026

sofai
sampling
cost
ollama

What Mellea Brings to LangChain: Structured Generative Programming for Reliable AI Applications

Learn how Mellea's generative programming patterns add structured validation, automatic retry, and inference-time scaling to LangChain applications.

Akihiko Kuroda30 April 2026

langchain
generative-programming
llm
validation
reliability

Getting Started with Mellea in Five Minutes

Install uv, pull a local model with Ollama, and build your first Mellea pipeline from scratch — no API key, no cloud, fully private.

Angelo Danducci II27 April 2026

getting-started
ollama
tutorial

Mellea Meets AI Frameworks: Structured Validation for LangChain, CrewAI, and DSPy

How Mellea brings structured validation and automatic retry to LangChain, CrewAI, and DSPy

Akihiko Kuroda24 April 2026

integration
framework

Your LLM Provider is Down. Now What?

Use mellea's provider-agnostic backend abstraction to build LLM applications that automatically survive outages through three layers of failover: validation retries, capability escalation (SOFAI), and infrastructure switching across providers.

Paul Schweigert22 April 2026

backends
reliability

Automatically Fixing Deprecated Qiskit Code with Instruct-Validate-Repair

How we used Mellea's Instruct-Validate-Repair pattern with flake8-qiskit-migration to automatically catch and fix deprecated Qiskit APIs in LLM-generated code.

Alex Bozarth20 April 2026