MCP = Major Context Problem

John Tay

Apr 27

One Apollo MCP call ate 10% of my session

Read →

2 Comments

Sandesh Bagmare

Apr 30

I might sound naive sorry for that but genuinely want to learn this, could you help me understand

When an MCP tool or API call returns a large payload — say, 1000 data points intended for chart rendering or analysis — how is that data handled in the context window?

Specifically:

1. Does the full tool result get injected verbatim into the context, consuming tokens proportionally to the payload size?

2. Or is it expected for claude to have any built-in summarization/truncation layer that condenses large tool responses before they're counted against the context limit?bloa 3. As an architectural consideration:: is it better to have the MCP server return raw data (and let the client/Claude process it), or should the server return a pre-rendered artifact but in that case would other ai tools accepttto show it..

Reply (1)

John Tay

May 4

Yes, the full result gets pushed into Claude's context window verbatim. You can summarise or truncate after the fact, but the tokens are already consumed by then.

10xPlaybooks

MCP = Major Context Problem