LogoBellatrix One

Reduced Context

Bellatrix One optimizes AI performance by intelligently managing tool context. Instead of sending full tool specifications with every chat prompt, only relevant tools are included when needed.

The Problem with Full Context

Traditional MCP implementations send complete tool specifications to the AI with every request:

  • Hundreds or thousands of lines of tool definitions
  • Detailed parameter schemas for every available tool
  • Example values and documentation for all functions
  • This information repeated with every single message

This creates several problems:

  • Increased token usage and higher costs
  • Slower response times due to larger prompts
  • Context window filled with irrelevant tool definitions
  • Reduced space for actual conversation and task context
  • Lower quality responses as the AI struggles with information overload

How Reduced Context Works

Bellatrix One's MCP Broker intelligently filters tool definitions:

  1. The AI receives a lightweight list of available tool names and brief descriptions
  2. When the AI needs to use a specific tool, it requests the full specification
  3. Only the relevant tool details are loaded into context at that moment
  4. After use, the detailed specification is removed from context
  5. The next request starts fresh with minimal context overhead

Benefits

Lower Costs

By reducing token usage by up to 90% on tool definitions, you save significantly on API costs. This is especially important for teams with many tools or high message volumes.

Faster Responses

Smaller prompts mean faster processing by the AI model. The AI spends less time parsing irrelevant tool definitions and more time on your actual request.

Better Results

With more context window available for your conversation, the AI can maintain better understanding of your task. It's not distracted by dozens of unrelated tool specifications.

Scalability

You can connect hundreds of tools without degrading performance. Traditional approaches break down when you have more than a handful of tools.

Technical Implementation

The reduced context system works through the MCP Broker:

  • Tool discovery returns only names and one-line descriptions
  • Full schemas are fetched on-demand when tools are invoked
  • Intelligent caching reduces redundant specification requests
  • Natural language tool search helps the AI find relevant tools without seeing all definitions

Real-World Impact

For a typical setup with 50 available tools:

  • Traditional approach: ~15,000 tokens per request for tool definitions
  • Reduced context: ~500 tokens for tool list, ~300 tokens when a tool is used
  • Result: 95% reduction in tool-related token usage

Learn More

Learn about the MCP Broker →