I spent most of last year bolting AI features onto products the wrong way.
Direct provider SDK. Hardcoded model string. A streaming response handler I copy-pasted from a blog post. It worked. It also meant that every time a new model came out, switching took half a day of untangling types and rewriting stream parsers that I did not remember writing in the first place.
The AI SDK v6 is the fix I wish I had a year ago.
If you have been putting off building AI features into your product because the ecosystem felt chaotic, or if you tried the AI SDK a year ago and bounced off it, this is the update worth coming back to. The abstractions finally match how people actually build. The streaming story is coherent. Provider switching is a one-line change. And the tool-use and agent patterns are good enough that you can ship real features instead of demos.
Here is what actually changed, what it unlocks, and the patterns I use day to day now.
What the AI SDK Actually Is
Before the v6 specifics, a quick grounding for anyone who has heard the name and never used it.
The AI SDK is a TypeScript library that gives you one API for talking to every major model provider. You write your code once against the SDK. Underneath, it talks to OpenAI, Anthropic, Google, Mistral, open-source models via Ollama or Together, and anything else with a compatible adapter. Switching providers is a string change, not a rewrite.
It also handles the parts of AI work that are annoying to get right: streaming tokens to the browser, tool calls, structured output, chat state, retries, tracing. You can write those yourself. I have. You are not going to do a better job than the SDK for most products, and you will spend weeks on plumbing that does not differentiate your product from anyone else’s.
The v6 release refined all of that and added a few things that change what is reasonable to build solo.
What Is New in AI SDK v6
No single headline change is dramatic. Individually, each improvement looks like a polish pass. Together, they change the shape of what feels worth building.
First, the provider abstraction got simpler. You used to import a provider package and configure it. Now you can pass a plain "provider/model" string through the AI Gateway and the SDK handles the wiring. Switching from "anthropic/claude-opus-4-7" to "openai/gpt-5" is a string edit. Fallbacks between providers are first-class. If you have been watching the model landscape thrash around and wanted to avoid committing to one, this is the feature that matters.
Second, tools and agents are real primitives. The tool() helper and the agent loop are good enough that you can build agentic features without importing a separate framework. I used to reach for LangChain for anything with more than two steps. I do not anymore. The SDK’s agent pattern is simpler, more debuggable, and stays out of your way.
Third, streaming got cleaner. The streamText, streamObject, and streamUI APIs converged on a consistent shape. The React hooks (useChat, useObject, useCompletion) work against the same streaming protocol the server sends. If you are using Next.js or any other framework, the end-to-end flow is the most boring it has ever been, which is the highest praise an AI streaming story has ever deserved.
Fourth, structured output is actually trustworthy. generateObject and streamObject with a Zod schema produce outputs that match your types. Not “probably match.” Match. The SDK retries and reprompts if the model drifts. You get validated TypeScript objects out the other side, and you can rely on them in your product code without a second layer of defensive parsing.
Fifth, observability is built in. OpenTelemetry tracing is not an afterthought. You can see every prompt, every model call, every tool invocation, and every retry in a tracing UI without writing your own logger. When something goes wrong in production, you can actually see what happened.
These are the load-bearing changes. Everything else is cleanup around the edges.
The Mental Model That Makes the SDK Click
The thing that took me too long to internalize is that the AI SDK is not trying to be a framework. It is trying to be a standard library for AI features, the way fetch is a standard library for HTTP.
Once you look at it that way, the API makes sense. There are four core functions you use 90% of the time:
- generateText when you want a string back
- streamText when you want to stream a string back
- generateObject when you want a typed object back
- streamObject when you want to stream a typed object back
Everything else is a variation on those four themes. Tools attach to any of them. Agents are streamText in a loop. Chat is streamText with message history threaded through. Structured output is generateObject with a schema.
If you understand the four core functions, you understand the SDK. The rest is ergonomics.
Starting From Scratch: The Minimum Viable Setup
Here is the smallest example that does something real. A Next.js App Router route that streams a chat response.
```ts
// app/api/chat/route.ts
import { streamText } from 'ai';

export async function POST(req: Request) {
  const { messages } = await req.json();

  const result = streamText({
    model: 'anthropic/claude-opus-4-7',
    messages,
  });

  return result.toDataStreamResponse();
}
```

That is the whole backend, about a dozen lines. On the client side:
```tsx
'use client';
import { useChat } from 'ai/react';

export function Chat() {
  const { messages, input, handleInputChange, handleSubmit } = useChat();

  return (
    <form onSubmit={handleSubmit}>
      {messages.map(m => <div key={m.id}>{m.role}: {m.content}</div>)}
      <input value={input} onChange={handleInputChange} />
    </form>
  );
}
```

This is the part that makes the SDK worth using. The hook knows the protocol. The protocol is standardized. The server streams. The client renders. No custom fetch logic, no SSE parser, no state machine to debug.
You can build a working chat UI in about ten minutes. That is not hype. It is the first time I have been able to write that sentence honestly.
Provider Switching Without the Wincing
One of the realities of shipping AI features in 2026 is that the best model for your use case changes every few weeks. GPT leads on one benchmark, Claude leads on another, and a Chinese open-source model nobody had heard of last month is suddenly competitive for half the price.
If your code is coupled to one provider, you either ignore those changes and fall behind or you eat the rewrite cost every time you switch. Both options are bad.
The AI SDK solves this by making model selection a configuration value rather than a structural dependency. With the Vercel AI Gateway, you can do this:
```ts
const result = streamText({
  model: process.env.CHAT_MODEL ?? 'anthropic/claude-opus-4-7',
  messages,
});
```

Now switching models is an environment variable change. No code deploy. No provider package swap. If you want A/B testing between providers, you can set the model per request based on user ID, cohort, or feature flag.
This is more important than it sounds. It means your product is not tied to the fortunes of any single lab. If Anthropic raises prices or OpenAI ships a better model, you can move without an engineering project. That is the main reason I default to gateway-style model strings now rather than direct provider packages, even though both work.
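For A/B testing, that per-request selection can be as small as a helper like this sketch. The 10% experiment split is arbitrary, and the model IDs are the ones used elsewhere in this post:

```typescript
// Pick a model per request based on a stable hash of the user ID.
// The 10% split and both model strings are illustrative values.
function pickModel(
  userId: string,
  defaultModel = 'anthropic/claude-opus-4-7',
): string {
  // Cheap stable hash: sum of character codes modulo 100.
  let bucket = 0;
  for (const ch of userId) bucket = (bucket + ch.charCodeAt(0)) % 100;

  // Send ~10% of users to the experimental model, everyone else to the default.
  return bucket < 10 ? 'openai/gpt-5' : defaultModel;
}
```

You would then pass `pickModel(user.id)` as the `model` value in the streamText call above. Because the hash is stable, each user always lands in the same cohort.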
The only time I reach for a provider-specific package is when I need a feature that is not in the gateway abstraction yet, like specific fine-tuning hooks. For 95% of product work, the gateway is the right default.
Tools: Where the SDK Stops Being a Demo Library
The real leap for products comes from tools. If you have only used the SDK for chat completions, you have seen the easy half. Tools are what turn a language model into something that can do work in your application.
The pattern is:
```ts
import { tool, streamText } from 'ai';
import { z } from 'zod';

const searchOrders = tool({
  description: 'Find orders for the current user',
  parameters: z.object({
    status: z.enum(['pending', 'shipped', 'delivered']),
  }),
  execute: async ({ status }) => {
    return db.orders.findMany({ where: { status, userId: currentUser.id } });
  },
});

const result = streamText({
  model: 'anthropic/claude-opus-4-7',
  messages,
  tools: { searchOrders },
  maxSteps: 5,
});
```

The model decides when to call the tool. You provide the schema and the implementation. The SDK handles the back-and-forth protocol, validates the arguments, calls your function, feeds the result back to the model, and continues the conversation.
The maxSteps parameter is important. Without it, the model calls tools exactly once and stops. With it, you get multi-step reasoning. The model can call a tool, see the result, decide to call another tool, and keep going until it has what it needs to answer.
This is where the line between “chatbot with API calls” and “agent” starts to blur. If you set maxSteps to 10 and give the model a few well-designed tools, you have built an agent. There is no separate agent framework to learn. The surface area is the same as the chat surface area, with tools attached.
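If it helps to see why "agents are streamText in a loop" is not a metaphor, here is a toy version of the loop the SDK runs for you. The types and the stub model are mine, not the SDK's internals:

```typescript
// A stripped-down agent loop: ask the model for a step, run any tool it
// requested, feed the result back, and stop at a final answer or at maxSteps.
type ModelStep = { toolCall?: { name: string; args: unknown }; answer?: string };
type ModelFn = (history: ModelStep[]) => ModelStep;
type ToolFn = (args: unknown) => unknown;

function runAgent(
  model: ModelFn,
  tools: Record<string, ToolFn>,
  maxSteps: number,
): string {
  const history: ModelStep[] = [];
  for (let i = 0; i < maxSteps; i++) {
    const step = model(history);
    history.push(step);
    if (step.answer !== undefined) return step.answer; // model is done
    if (step.toolCall) tools[step.toolCall.name]?.(step.toolCall.args); // run tool, loop again
  }
  return 'stopped: maxSteps reached';
}
```

The real loop also streams tokens, validates tool arguments against the schema, and threads tool results through the message history, but the control flow is the same shape.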
I covered the broader question of what agent memory and state management looks like in my guide on AI agent memory if you want to go deeper on the stateful side.
Structured Output: The Feature That Changes What You Build
Most of the AI features I see in products do not need chat at all. They need a structured result. Classify this ticket. Extract the invoice fields. Summarize this page into three bullet points. Generate a title, a description, and three tags for this upload.
For those, generateObject is the function you want:
```ts
import { generateObject } from 'ai';
import { z } from 'zod';

const { object } = await generateObject({
  model: 'anthropic/claude-opus-4-7',
  schema: z.object({
    title: z.string().max(80),
    tags: z.array(z.string()).max(5),
    summary: z.string().max(200),
  }),
  prompt: `Summarize this article: ${article}`,
});

// object is typed as { title: string; tags: string[]; summary: string }
```

You pass a Zod schema. You get back a validated object that matches it. The SDK handles the prompting, retries invalid outputs, and gives you something your TypeScript compiler is happy with.
This changes what is worth building. A year ago, adding an AI feature to a product meant writing a prompt, parsing freeform text, and dealing with edge cases where the model wrapped its response in markdown or added commentary. Today, it means writing a schema and a prompt.
The reliability question matters here. If you have tried this before and been burned by the model returning invalid output, the v6 retry logic is meaningfully better. The SDK reprompts with the validation error included, and modern models are good at correcting themselves on the second pass. In my experience, structured output with a reasonable schema succeeds on the first try over 95% of the time, and the retry catches most of the rest.
If your schema is extremely strict or the task is ambiguous, you can still get failures. Keep schemas lenient where they can be, and design prompts so the model has room to succeed.
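One way to build that leniency in, sketched in plain TypeScript. With Zod you would express the same idea directly on the schema with .optional() and .default():

```typescript
// If a field is genuinely optional, default it after validation instead of
// letting its absence fail the whole call. Plain TS stand-in for Zod's
// .optional() / .default() pattern.
type Summary = { title: string; tags?: string[]; summary?: string };

function withDefaults(raw: Summary): Required<Summary> {
  return {
    title: raw.title,
    tags: raw.tags ?? [],       // missing tags becomes an empty list, not a failure
    summary: raw.summary ?? '', // missing summary becomes an empty string
  };
}
```

The principle is the same either way: only the fields your product truly cannot proceed without should be able to reject the object.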
Streaming UI: When You Want More Than Text
For a long time, AI output in apps meant streaming text into a <div>. That is still the right answer for chat. For more structured experiences, the SDK gives you two better options.
streamObject streams a structured object as it is being generated. You see partial data arrive as JSON fields fill in. If you are generating a form, a table, or a card layout, this is the right primitive. The user sees the skeleton fill in rather than waiting for the whole thing.
```ts
const { partialObjectStream } = await streamObject({
  model: 'anthropic/claude-opus-4-7',
  schema: z.object({
    title: z.string(),
    sections: z.array(z.object({ heading: z.string(), body: z.string() })),
  }),
  prompt: 'Generate a blog post about...',
});

for await (const partial of partialObjectStream) {
  console.log(partial); // { title: "The ...", sections: [{ heading: "Why..." }] }
}
```

streamUI (in frameworks that support it) lets you stream actual React components. The model picks which component to render and what props to pass. You define the components. This is the shape of the experience if you have used v0 or similar tools. It is powerful and it is niche. For 90% of products, streamObject plus your own rendering layer is simpler and easier to reason about.
The Patterns That Keep Working
After a year of building features with the SDK, a few patterns have settled.
Put the model string in config, not code. Do this even if you have no plans to switch. Future-you will thank present-you when a better model ships and you want to try it in five minutes instead of an afternoon.
Start with generateObject, not generateText. Every time I have written generateText in product code, I have eventually rewritten it as generateObject because I needed structure. Skip the intermediate step. If the output is going anywhere other than a chat bubble, it should be structured.
Use tools sparingly and name them well. A model with three well-named tools outperforms a model with fifteen confusingly named ones. Each tool is a decision point for the model. More tools means more chances to pick wrong.
Set maxSteps on every agentic call. The default is 1, which is safe. Pick a number that matches your use case. Higher means more capability and more cost. I usually start at 5 and adjust from there based on traces.
Add tracing before you need it. Enable OpenTelemetry from day one. The cost of setup is an hour. The value the first time something goes wrong in production is weeks. Observability for AI features is not optional if you care about reliability. I go deeper on this in my production observability guide for solo developers.
Treat model output as untrusted input. Sanitize anything you send to the browser, the database, or another system. The model will sometimes return something weird. That is not a bug in the model. It is the nature of the work. Validate at the boundary.
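A minimal sketch of that boundary, assuming the output is headed for HTML. A real app might reach for a dedicated sanitizer library; this version just neutralizes the metacharacters:

```typescript
// Escape model output before it touches the DOM. Ampersand must be replaced
// first so the other escapes are not double-encoded.
function escapeHtml(untrusted: string): string {
  return untrusted
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;')
    .replace(/"/g, '&quot;')
    .replace(/'/g, '&#39;');
}
```

The same principle applies at every other boundary: parameterized queries before the database, schema validation before another service.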
What It Costs, and How to Keep That Under Control
The question that kills more AI features than anything else is not “does it work?” It is “does it pay for itself?”
The AI SDK does not change the cost of the models you call. A Claude request costs what a Claude request costs. What it does give you is the tooling to keep those costs visible and manageable.
Three things keep my costs predictable.
Caching identical prompts. If the same prompt is going to run many times, cache the result. The SDK has integrations for this. You can also do it yourself with a hash of the input. For any feature where the input space is bounded, caching is free performance and free money saved.
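A hand-rolled version of that cache is a few lines. The `generate` parameter here is a stand-in for whatever SDK call you are wrapping:

```typescript
import { createHash } from 'node:crypto';

// Cache results keyed by a hash of the prompt, so identical prompts
// only pay for one model call.
const cache = new Map<string, string>();

async function cachedGenerate(
  prompt: string,
  generate: (prompt: string) => Promise<string>,
): Promise<string> {
  const key = createHash('sha256').update(prompt).digest('hex');
  const hit = cache.get(key);
  if (hit !== undefined) return hit; // same prompt seen before: skip the model

  const result = await generate(prompt);
  cache.set(key, result);
  return result;
}
```

In production you would put the map behind Redis or your edge cache and add an expiry, but the shape is the same.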
Using cheap models for cheap tasks. Not every AI call needs the biggest model. Classification tasks, simple extraction, and routing logic work fine on smaller, cheaper models. I default to the big model only for tasks that need real reasoning. Everything else goes to the cheap tier.
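A small routing helper makes that default explicit. The task names and the cheap-tier model ID here are illustrative, not prescriptions:

```typescript
// Route each task type to a model tier. Only real reasoning work goes
// to the expensive model; the tier IDs are hypothetical examples.
type Task = 'classify' | 'extract' | 'route' | 'reasoning';

function modelFor(task: Task): string {
  return task === 'reasoning'
    ? 'anthropic/claude-opus-4-7'   // big model for tasks that need real reasoning
    : 'anthropic/claude-haiku-4-5'; // cheap tier for everything else
}
```

Because the model is already a string in your code, this helper slots straight into any generateObject or streamText call.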
Rate limiting and spend caps per user. If your product has AI features available to users, set limits. A single user burning through your budget because they found a prompt injection or a runaway loop is a pattern I have seen too many times. The AI Gateway has spend caps built in. Use them. I wrote about this in more detail in LLM cost optimization for production.
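Even before you wire up the gateway's spend caps, a per-user cap can be a small guard in front of every model call. This in-memory sketch shows the idea; production would back it with Redis or the gateway itself:

```typescript
// Minimal in-memory per-user request cap. Reset the map daily (or use
// time-bucketed keys) in a real deployment.
const usage = new Map<string, number>();
const DAILY_LIMIT = 100; // requests per user per day; pick a number for your product

function allowRequest(userId: string): boolean {
  const used = usage.get(userId) ?? 0;
  if (used >= DAILY_LIMIT) return false; // over budget: reject before calling the model
  usage.set(userId, used + 1);
  return true;
}
```

The point is where the check sits: before the model call, so a runaway loop or prompt-injection spree hits your limit instead of your bill.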
Once you have those three patterns in place, AI features become a predictable line item rather than a variance risk. That is the state you want to be in if you are shipping anything that runs in production.
When Not to Use the AI SDK
It is a boring take, but worth saying out loud. The SDK is not the right choice for everything.
If you are building a pure chatbot against one provider and you are certain you will never switch, you can get by with that provider’s SDK directly. You will save one layer of indirection and lose some TypeScript ergonomics. For most products this is a wash. For extremely minimal integrations, it is simpler.
If you need a feature that only one provider has and the SDK has not abstracted it yet, use the provider SDK. You can still use the AI SDK for 90% of calls and drop down to the raw SDK for the 10% that need the specific capability.
If you are doing heavy fine-tuning, custom inference, or deploying your own models, the SDK is not really the layer you need. It is a client library, not a model ops platform.
For everything else, which is most of what anyone is shipping, the AI SDK is the right default.
The Reason This Matters More in 2026 Than It Did Last Year
The economics of AI features changed in the last twelve months.
A year ago, shipping a real AI feature meant spending two weeks on plumbing for every week spent on the feature itself. Streaming, tool calls, retries, observability, switching providers, handling structured output. Each one was a small project. Together they added up to a tax that made AI features feel expensive relative to what they delivered.
Today, the plumbing is solved. You write a prompt, a schema, maybe a tool, and you ship. The SDK absorbs the infrastructure work. That means the economics tilt back toward the feature itself. You can prototype in a day. You can ship in a week. You can iterate on prompts and models without touching architecture.
This is the unglamorous version of the AI revolution, and it is the one that actually changes what gets built. Not the demos. The features that ship in products your users never think of as “AI features” because they just work.
If you have been watching the AI space and waiting for the right moment to actually build, the tooling has caught up. The remaining blocker is picking a problem worth solving. That part is on you.
For what it is worth, the problems worth solving right now are boring ones. Automated tagging. Smarter search. Better onboarding. Things that were impossible or too expensive to build with traditional code, now trivial. Pick one of those, ship it in a week, and see what it does for your product before you try to build anything more ambitious.
The tools are ready. The cost is manageable. The patterns are clear. The only thing left is building the thing.