Model Context Protocol

ByAbhi Panseriya— Fullstack Engineer at Carousell

Pub 26 Jun 20263 min read

Isometric technical diagram of an AI protocol host-server architecture

In plain words

Any AI application that does something useful has to reach outside the model: read a file, query a database, call an API, pull a ticket. Before MCP, each of those connections was bespoke. A team wrote custom glue for every tool, and that glue only worked inside their own app.

MCP replaces the glue with a protocol. A server wraps a capability, such as your database, your filesystem, or a SaaS API, and any MCP-compatible client can connect to it and use it. The spec's own analogy is a USB-C port for AI applications: one standard plug instead of a drawer full of adapters.

How it works

MCP runs on JSON-RPC 2.0 over a host-client-server architecture. The host is the AI application a person actually uses: Claude Desktop, an IDE like VS Code or Cursor, or a custom agent. Inside the host, one or more clients each hold a 1:1 connection to a server. The server is the program that exposes context and actions.

Servers expose three primitives:

Tools are functions the model can call, like searchFlights or send_email. Model-controlled.
Resources are read-only data identified by a URI, like file:///report.md. Application-controlled.
Prompts are reusable templates a user invokes deliberately. User-controlled.

Clients can expose three capabilities of their own back to the server: sampling lets the server ask the model to generate text, roots scope which files or URIs the server may touch, and elicitation lets the server ask the user for input mid-task.

A connection is stateful. It opens with an initialize handshake where client and server negotiate which capabilities each side supports, then stays open for the session. Once connected, the model invokes a tool with a plain JSON-RPC request:

{
  "jsonrpc": "2.0",
  "id": 3,
  "method": "tools/call",
  "params": {
    "name": "weather_current",
    "arguments": {
      "location": "San Francisco",
      "units": "imperial"
    }
  }
}

Two transports carry these messages. stdio launches the server as a local subprocess and talks over standard input and output, the usual choice for a server on your own machine. Streamable HTTP handles remote servers over HTTP POST and GET, with optional Server-Sent Events for streaming. It replaced the older HTTP+SSE transport from the original 2024-11-05 revision, and that shift toward remote-first servers is recent.

Why it matters

The problem MCP solves is combinatorial. With N AI clients and M tools, bespoke integrations scale as N×M. A shared protocol collapses that to N+M: write a server once and every compliant client can use it, build a client once and it reaches every server.

Portability is the point. A tool you expose over MCP works in Claude, in ChatGPT, in an IDE, or in your own agent harness without reimplementing it per provider. Multi-agent setups, like a team of Claude Code agents, lean on MCP so every agent reaches the same tools, and which SDKs and providers wire it up is worth tracking on its own.

Origin

Anthropic introduced MCP on November 25, 2024, created by engineers David Soria Parra and Justin Spahr-Summers. The design borrows from the Language Server Protocol (LSP), which solved the same N×M problem for code editors: instead of every editor implementing every language, a language server speaks one protocol that every editor understands. MCP applies that idea to models and tools, and OpenAI and others adopted it through 2025.

The spec versions by date. The current revision is 2025-11-25, and the version string (YYYY-MM-DD) marks the last backwards-incompatible change rather than a semantic version. A recent experimental primitive, Tasks, wraps long-running requests in durable execution so results can be retrieved later.

Common confusions

Confused with	How it differs from MCP
Function calling	The model emitting a structured request to invoke a tool defined inline in the LLM request. MCP is the protocol that standardizes how those tools are discovered (`tools/list`) and executed across providers. They stack; they do not compete.
RAG	Retrieval-augmented generation pulls passages from an index to ground an answer in knowledge. MCP connects the model to live tools and data so it can take action. Knowledge versus action.
OpenAPI / GPT Actions	A static API spec consumed per provider and not portable. MCP adds runtime discovery and a shared transport, so a server you build once works across model providers.
WebMCP	A browser-side W3C Community Group draft that exposes a page's own JavaScript functions as MCP-style tools via `navigator.modelContext`. It builds on MCP but runs client-side, and it is not a W3C Standard.

Frequently asked

Questions & answers

What is the Model Context Protocol (MCP)?

MCP is an open protocol, introduced by Anthropic in November 2024, that standardizes how AI applications connect to external tools and data. It uses JSON-RPC 2.0 over a host-client-server architecture, where servers expose tools, resources, and prompts that clients can call.

How is MCP different from function calling?

Function calling is the model deciding to invoke a tool defined inline in the LLM request. MCP is the open client-server protocol that standardizes how tools are discovered, transported, and executed across providers. They are complementary layers, not competitors.

What transports does MCP support?

Two: stdio, where the client launches the server as a local subprocess and exchanges messages over standard input and output, and Streamable HTTP, for remote servers over HTTP with optional Server-Sent Events. Streamable HTTP replaced the older HTTP+SSE transport from the 2024-11-05 revision.

What are MCP's primitives?

Servers expose three: tools (model-controlled actions), resources (application-controlled read-only data), and prompts (user-controlled templates). Clients can expose sampling, roots, and elicitation back to the server. A recent experimental primitive, Tasks, adds durable execution wrappers for long-running requests.

Is MCP the same as WebMCP?

No. MCP is the base protocol, with servers speaking JSON-RPC over stdio or HTTP. WebMCP is a separate W3C Community Group draft browser API that exposes a page's own JavaScript functions as MCP-style tools via navigator.modelContext; it builds on MCP but is not a W3C Standard.

Research & sources

Primary references reviewed while compiling this glossary.

01
Specification - Model Context Protocol (current revision)modelcontextprotocol.io
02
Architecture - MCP Specification 2025-11-25modelcontextprotocol.io
03
Versioning - Model Context Protocolmodelcontextprotocol.io
04
Architecture overview - Model Context Protocolmodelcontextprotocol.io
05
Understanding MCP servers (Tools, Resources, Prompts)modelcontextprotocol.io
06
What is the Model Context Protocol (MCP)?modelcontextprotocol.io
07
Introducing the Model Context Protocol - Anthropicanthropic.com
08
modelcontextprotocol/modelcontextprotocol - spec and schema repogithub.com
09
Model Context Protocol - Wikipediaen.wikipedia.org
10
WebMCP - W3C Web Machine Learning CG Draftwebmachinelearning.github.io

About the author

Abhi Panseriya

Fullstack Engineer at Carousell

Fullstack developer publishing daily blogs on fullstack, frontend, and backend engineering.

Permanent companion pieces - guides, comparisons, glossary entries, and live trackers.

Keep reading

A curated selection of engineering blogs recommended for you next.

supabase21 May 2026

Supabase shipped a handler. The line about agents is the news.

Supabase shipped @supabase/server, collapsing Edge Function boilerplate to one wrapper - and admitting its shape was designed for coding agents.

9 min read

google-io22 May 2026

I/O 2026 turned agents into an API line item

Three days after the I/O 2026 keynote, the line that matters is not a model. It is pay-per-run agent pricing.

9 min read

mcp4 Jun 2026

Your agent spends 55,000 tokens before it reads your prompt

GitHub's MCP server costs 55k tokens before the first message. How AGENTS.md, Skills, and MCP split the agent context problem and converged on lazy loading.

11 min read

tanstack15 May 2026

TanStack Start hit 1.0. It is the first real non-Vercel React framework in years.

TanStack Start tagged v1.0 with no keynote. It is the first credible non-Vercel option for new React apps in roughly five years.

11 min read

Model Context Protocol

In plain words

How it works

Why it matters

Origin

Common confusions

Questions & answers

Research & sources

About the author

AI SDK provider support

WebMCP

AI Harness

Durable Execution

Keep reading

Supabase shipped a handler. The line about agents is the news.

I/O 2026 turned agents into an API line item

Your agent spends 55,000 tokens before it reads your prompt

TanStack Start hit 1.0. It is the first real non-Vercel React framework in years.

In plain words

How it works

Why it matters

Origin

Common confusions

Questions & answers

Research & sources

About the author

Related references

AI SDK provider support

WebMCP

AI Harness

Durable Execution

Keep reading

Supabase shipped a handler. The line about agents is the news.

I/O 2026 turned agents into an API line item

Your agent spends 55,000 tokens before it reads your prompt

TanStack Start hit 1.0. It is the first real non-Vercel React framework in years.