> ## Documentation Index
> Fetch the complete documentation index at: https://docs.ctxprotocol.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Query and Chat Runtime

> The public contract behind `/api/v1/query` and `/api/chat`: inputs, response shapes, and the guarantees the managed runtime provides.

## What this page is for

Use this page as the **public contract** for Context's managed query runtime. Both of Context's public Query surfaces route through the same runtime:

* `/api/v1/query` — the SDK / API surface for programmatic callers
* `/api/chat` — the first-party conversational surface

You should expect identical answer behavior across both. Transport, streaming, and session state differ by surface; the managed runtime that produces the answer does not.

***

## What the runtime guarantees

<CardGroup cols={2}>
  <Card title="Grounded answers" icon="anchor">
    Every paid response is assembled from tool results the runtime actually retrieved. If the marketplace can't answer, the runtime returns `capability_miss` instead of fabricating numbers.
  </Card>

  <Card title="Honest ambiguity" icon="question">
    If a prompt is under-specified, the agentic runtime proceeds with the best grounded interpretation and discloses any assumption it made in the answer. It returns `capability_miss` only when the marketplace genuinely can't answer.
  </Card>

  <Card title="Full collection preservation" icon="list-check">
    Comparison queries return the complete relevant set rather than a cherry-picked subset, so your model can reason over the whole collection.
  </Card>

  <Card title="Per-response settlement" icon="coins">
    Paid responses settle in USDC on Base after the answer is delivered. Failed executions do not trigger a charge.
  </Card>
</CardGroup>

***

## Public controls

Both surfaces expose the same underlying runtime through different wire shapes. The public controls that actually affect answer behavior are:

| Control                 | Surface         | Description                                                                                       |
| ----------------------- | --------------- | ------------------------------------------------------------------------------------------------- |
| `query`                 | `/api/v1/query` | Natural-language question to answer                                                               |
| `tools`                 | `/api/v1/query` | Optional pinned tool shortlist (overrides marketplace discovery)                                  |
| `responseShape`         | `/api/v1/query` | `answer_with_evidence` (default) or `evidence_only`                                               |
| `includeData`           | `/api/v1/query` | Optional bounded inline execution data. Large payloads return a preview plus full-data references |
| `agentModelId`          | `/api/v1/query` | Public model knob for the main librarian agent loop                                               |
| `selectedChatModel`     | `/api/chat`     | Legacy chat wire name for the same user-selected agent model                                      |
| `includeDeveloperTrace` | `/api/v1/query` | Opt in to structured diagnostics for your own debugging                                           |

All other runtime behavior — discovery, selection, ambiguity resolution, execution, retrieval, and post-execution computation via the managed Python `code_interpreter` (see [Code Interpreter](/architecture/code-interpreter)) — is managed by the server. Callers do not pick between internal lanes or budgets.

***

## Response shapes

| Shape                  | Who it's for                                  | Contents                                                                                                |
| ---------------------- | --------------------------------------------- | ------------------------------------------------------------------------------------------------------- |
| `answer_with_evidence` | First-party chat, human-facing apps (default) | Natural-language answer **plus** structured evidence, freshness, confidence, and optional view payloads |
| `evidence_only`        | External agents that don't need prose         | Bounded evidence, computed artifacts, and full-data references without a synthesized prose answer       |

In `evidence_only` mode the agent harness receives the structured evidence package and computed artifacts without prose synthesis. Full execution data is referenced by `artifacts.canonicalDataRef` and, when requested, `dataUrl`; `includeData: true` adds a bounded preview rather than unbounded raw rows.

**Artifacts vs. computed artifacts.** Structured charts and dataframes the runtime computes (for example, via the managed Python `code_interpreter`) are returned in `computedArtifacts`. The `artifacts` field carries provenance metadata — source references and dataset handles for the evidence behind the answer.

When the runtime returns `capability_miss`, the response still carries typed fields your client can branch on — no prose-parsing required.

***

## How the two surfaces differ

The runtime contract is identical across surfaces. What differs:

| Property      | `/api/v1/query` | `/api/chat`                |
| ------------- | --------------- | -------------------------- |
| Transport     | JSON response   | Streamed UI events         |
| Session state | Caller-managed  | Server-managed chat thread |
| Auth          | API key         | User wallet session        |

If an answer renders correctly through `/api/v1/query` but breaks in chat, the chat adapter is usually the first place to look — the shared runtime is the same on both sides.

***

## Debugging with developer traces

Pass `includeDeveloperTrace: true` to `/api/v1/query` to receive a structured diagnostics payload alongside the answer. Traces are the supported public way to inspect what the runtime did on a given request.

The chat app exposes a sibling surface: enable **Developer Mode** in Settings → Developer Settings, and a **Developer Logs** card appears at the bottom of each response with the same iterative-runtime signals (orchestration mode, tool call history, execution trace, verification, tool-registry stats). The two surfaces are kept conceptually aligned — the chat card and the SDK `developerTrace` tell the same story for the same turn class — but the SDK trace additionally includes a `timeline`, `summary` rollups, `cost`, and `contributorSearches` that the chat card does not surface. See the [Debugging Guide](/guides/troubleshooting#using-developer-mode-for-debugging) for the full section list.

The trace is intentionally a typed, stable surface — internal intermediate diagnostics can change across rollouts without breaking it.

## Chat streaming UI extensions (first-party `/chat`)

Beyond message text, the chat stream emits structured UI payloads used by the web client:

| Data part                                                                                   | Purpose                                                                                                                  |
| ------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------ |
| `data-reasoningStep`                                                                        | Incremental librarian reasoning chunks, optionally keyed with `relatedExecutionCallId` to align with tool execution.     |
| `data-executionToolCalls`                                                                   | Final per-call timeline (arguments + truncated results + `executionCallId`) for the docked execution sidecar.            |
| `data-chartDelta`, `data-codeDelta`, `data-sheetDelta`, `data-textDelta`, `data-imageDelta` | Artifact document streaming payloads; buffered client-side next to structural `data-id`/`data-kind`/`data-finish` parts. |

Users can wipe all server-side conversations with `DELETE /api/history` (authenticated).

***

## Related docs

* [Protocol Architecture](/architecture/protocol)
* [The Agentic Pattern](/sdk/agentic-pattern)
* [SDK Reference](/sdk/reference)
* [Handshake Architecture](/guides/handshake-architecture)
* [Grounded vs. Synthetic: Free LLM Comparison](/why-context/free-llm-comparison)