Skip to content

Commit 8b9b1c6

Browse files
committed
ci(conformance): add 2026-07-28 carried-forward leg + bump harness to 0.2.0-alpha.4
Adds a third server step and second client step that run `--suite all --spec-version 2026-07-28` against a separate baseline file. The existing `--suite draft` step only runs scenarios introduced in 2026-07-28; this new leg re-runs every applicable scenario at the 2026 wire so regressions in 2025-era behaviour under the stateless protocol are caught. The 2026 leg needs its own expected-failures.2026-07-28.yml because baseline entries are keyed by scenario name only — a scenario that passes at its default version but fails when forced to 2026 cannot share a file with the 2025 legs (the passing leg flags it stale). Baseline derived empirically from a local alpha.4 build. The alpha.3 -> alpha.4 bump required no entry changes to the existing expected-failures.yml. client.py now reads MCP_CONFORMANCE_PROTOCOL_VERSION (set by the harness on every invocation) so handlers can branch on it once the stateless client path lands; today it is logged only.
1 parent 364b762 commit 8b9b1c6

4 files changed

Lines changed: 179 additions & 4 deletions

File tree

.github/actions/conformance/client.py

Lines changed: 11 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -6,6 +6,9 @@
66
Contract:
77
- MCP_CONFORMANCE_SCENARIO env var -> scenario name
88
- MCP_CONFORMANCE_CONTEXT env var -> optional JSON (for client-credentials scenarios)
9+
- MCP_CONFORMANCE_PROTOCOL_VERSION env var -> spec version the harness mock
10+
server is speaking (e.g. "2025-11-25", "2026-07-28"). Always set; defaults
11+
to the harness's LATEST_SPEC_VERSION when --spec-version is omitted.
912
- Server URL as last CLI argument (sys.argv[1])
1013
- Must exit 0 within 30 seconds
1114
@@ -50,6 +53,13 @@
5053
)
5154
logger = logging.getLogger(__name__)
5255

56+
#: Spec version the harness is running this scenario at (e.g. "2025-11-25",
57+
#: "2026-07-28"). The harness always sets this (it falls back to its own
58+
#: LATEST_SPEC_VERSION when --spec-version is omitted), so None means we were
59+
#: invoked outside the harness. Handlers that need to take the stateless 2026
60+
#: path will branch on this once the SDK has one; today it is logged only.
61+
PROTOCOL_VERSION: str | None = os.environ.get("MCP_CONFORMANCE_PROTOCOL_VERSION")
62+
5363
# Type for async scenario handler functions
5464
ScenarioHandler = Callable[[str], Coroutine[Any, None, None]]
5565

@@ -347,6 +357,7 @@ def main() -> None:
347357

348358
server_url = sys.argv[1]
349359
scenario = os.environ.get("MCP_CONFORMANCE_SCENARIO")
360+
logger.debug(f"Conformance protocol version: {PROTOCOL_VERSION!r}")
350361

351362
if scenario:
352363
logger.debug(f"Running explicit scenario '{scenario}' against {server_url}")
Lines changed: 149 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,149 @@
1+
# Expected failures for the carried-forward 2026-07-28 legs
2+
# (`--suite all --spec-version 2026-07-28` for both server and client).
3+
#
4+
# This baseline is separate from expected-failures.yml because entries are
5+
# keyed by scenario name only: a scenario that passes at its default version
6+
# in the 2025 legs but fails when forced to 2026-07-28 (or vice versa) cannot
7+
# be expressed in a shared file (the passing leg would flag the entry as
8+
# stale). Like expected-failures.yml, this single file covers both
9+
# directions: the client 2026 leg reads the `client:` section and the server
10+
# 2026 leg reads the `server:` section. Both burn down independently of the
11+
# 2025 legs.
12+
#
13+
# Baseline established against @modelcontextprotocol/conformance pinned in
14+
# .github/workflows/conformance.yml (CONFORMANCE_VERSION = 0.2.0-alpha.4).
15+
# New conformance releases are adopted by deliberately bumping that pin and
16+
# reconciling both this file and expected-failures.yml in the same change.
17+
#
18+
# Entries are grouped by what unblocks them. As each gap closes the
19+
# corresponding scenarios start passing and MUST be removed from this list
20+
# (the runner fails on stale entries), so the baseline burns down per
21+
# milestone.
22+
23+
client:
24+
# --- No stateless client path on main yet ---
25+
# client.py drives the 2025 stateful lifecycle (initialize handshake +
26+
# session). The 2026-mode mock server is stateless, so the call sequence
27+
# never reaches the assertion. Unblocks when client.py's is_modern_protocol()
28+
# branch takes the per-request _meta path.
29+
- tools_call
30+
31+
# --- SEP-837 (application_type during DCR) ---
32+
# The sep-837-application-type-present check only fires on 2026-version
33+
# runs; the client omits application_type during Dynamic Client
34+
# Registration, so every auth scenario that reaches DCR fails it on this
35+
# leg (the same scenarios pass at their default version in the 2025 legs).
36+
- auth/metadata-default
37+
- auth/metadata-var1
38+
- auth/metadata-var2
39+
- auth/metadata-var3
40+
- auth/scope-from-www-authenticate
41+
- auth/scope-from-scopes-supported
42+
- auth/scope-omitted-when-undefined
43+
- auth/token-endpoint-auth-basic
44+
- auth/token-endpoint-auth-post
45+
- auth/token-endpoint-auth-none
46+
- auth/offline-access-not-supported
47+
48+
# --- Auth scenarios cut short by the 2026 connection lifecycle ---
49+
# The auth fixture flow drives the 2025 stateful lifecycle; the 2026-mode
50+
# mock rejects the MCP POST before the scope-escalation behaviour these
51+
# scenarios measure, so no authorization requests are observed. Unblocks
52+
# when client.py's auth flow speaks the 2026 per-request lifecycle.
53+
- auth/scope-step-up
54+
- auth/scope-retry-limit
55+
56+
# --- Same gaps as the 2025 baseline (fail identically when forced to 2026-07-28) ---
57+
# SEP-2575 (request metadata / _meta envelope): client does not populate the
58+
# _meta envelope or the MCP-Protocol-Version header semantics yet.
59+
- request-metadata
60+
# SEP-2322 (multi-round-trip requests): client does not echo requestState /
61+
# handle IncompleteResult yet.
62+
- sep-2322-client-request-state
63+
# SEP-2243 (HTTP standardization): no fixture handler / client header support yet.
64+
- http-custom-headers
65+
- http-invalid-tool-headers
66+
# SEP-2106 (JSON Schema $ref handling): client still dereferences network $refs.
67+
- json-schema-ref-no-deref
68+
# SEP-2468 (authorization response iss parameter): not implemented in the client.
69+
- auth/iss-supported
70+
- auth/iss-not-advertised
71+
- auth/iss-supported-missing
72+
- auth/iss-wrong-issuer
73+
- auth/iss-unexpected
74+
- auth/iss-normalized
75+
- auth/metadata-issuer-mismatch
76+
# SEP-2352 (authorization server migration): client does not re-register when
77+
# PRM authorization_servers changes.
78+
- auth/authorization-server-migration
79+
# auth/enterprise-managed-authorization (SEP-990) is in the 2025 baseline but
80+
# NOT here: the harness skips it as inapplicable at --spec-version 2026-07-28
81+
# (it is an extension scenario not carried into the 2026 wire), so it is
82+
# neither run nor evaluated on this leg.
83+
84+
server:
85+
# --- No stateless server path on main yet (carried-forward 2025-era scenarios) ---
86+
# mcp-everything-server only runs in 2025 stateful mode. With
87+
# --spec-version 2026-07-28 the harness sends stateless requests
88+
# (MCP-Protocol-Version: 2026-07-28, _meta envelope, no initialize), which
89+
# the server rejects before the handler runs. These scenarios all pass on
90+
# the 2025 legs; they unblock once mcp-everything-server routes 2026
91+
# requests through a stateless path.
92+
- completion-complete
93+
- tools-list
94+
- tools-call-simple-text
95+
- tools-call-image
96+
- tools-call-audio
97+
- tools-call-embedded-resource
98+
- tools-call-mixed-content
99+
- tools-call-error
100+
- tools-call-with-progress
101+
- server-sse-multiple-streams
102+
- resources-list
103+
- resources-read-text
104+
- resources-read-binary
105+
- resources-templates-read
106+
- prompts-list
107+
- prompts-get-simple
108+
- prompts-get-with-args
109+
- prompts-get-embedded-resource
110+
- prompts-get-with-image
111+
- dns-rebinding-protection
112+
# SEP-2106 (JSON Schema 2020-12 in tool inputSchema): the fixture tool's
113+
# schema has none of the 2020-12 keywords the scenario checks. The scenario
114+
# is in `--suite all` but not `--suite active`, so this is the only leg that
115+
# runs it; it fails identically at 2025-11-25 (not a 2026-path regression).
116+
- json-schema-2020-12
117+
118+
# --- Draft scenarios (same failures and reasons as the `--suite draft` leg) ---
119+
# SEP-2575 (stateless HTTP / _meta envelope): server has no stateless mode,
120+
# _meta-derived capabilities, error-code mappings, or server/discover yet.
121+
- server-stateless
122+
# SEP-2322 (multi-round-trip requests / IncompleteResult): not implemented.
123+
- input-required-result-basic-elicitation
124+
- input-required-result-basic-sampling
125+
- input-required-result-basic-list-roots
126+
- input-required-result-request-state
127+
- input-required-result-multiple-input-requests
128+
- input-required-result-multi-round
129+
- input-required-result-non-tool-request
130+
- input-required-result-result-type
131+
- input-required-result-tampered-state
132+
- input-required-result-capability-check
133+
# SEP-2549 (caching): no ttlMs/cacheScope support.
134+
- caching
135+
# SEP-2243 (HTTP header standardization): -32001 HeaderMismatch handling and
136+
# case-insensitive/whitespace-trimmed header validation not implemented.
137+
- http-header-validation
138+
- http-custom-header-server-validation
139+
140+
# --- WARNING-only entries ---
141+
# These scenarios emit no FAILURE checks, only SHOULD-level WARNINGs, but
142+
# the expected-failures evaluator counts WARNINGs as failures. Same entries
143+
# as the draft suite in expected-failures.yml.
144+
# SEP-2164: server returns -32600 (not -32602) and omits error.data.uri.
145+
- sep-2164-resource-not-found
146+
# SEP-2322 SHOULD-level behaviours (re-request missing inputResponses,
147+
# ignore unrecognized inputResponses keys).
148+
- input-required-result-missing-input-response
149+
- input-required-result-ignore-extra-params

.github/actions/conformance/expected-failures.yml

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,9 +2,10 @@
22
# CI exits 0 if only these fail, exits 1 on unexpected failures or stale entries.
33
#
44
# Baseline established against @modelcontextprotocol/conformance pinned in
5-
# .github/workflows/conformance.yml (CONFORMANCE_VERSION = 0.2.0-alpha.3).
5+
# .github/workflows/conformance.yml (CONFORMANCE_VERSION = 0.2.0-alpha.4).
66
# New conformance releases are adopted by deliberately bumping that pin and
7-
# reconciling this file in the same change.
7+
# reconciling both this file and expected-failures.2026-07-28.yml in the same
8+
# change.
89
#
910
# Entries are grouped by SEP. As each SEP lands in the SDK the corresponding
1011
# scenarios start passing and MUST be removed from this list (the runner fails

.github/workflows/conformance.yml

Lines changed: 16 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -15,8 +15,9 @@ permissions:
1515

1616
env:
1717
# Pinned conformance harness version. Bump deliberately and reconcile
18-
# .github/actions/conformance/expected-failures.yml in the same change.
19-
CONFORMANCE_VERSION: "0.2.0-alpha.3"
18+
# both .github/actions/conformance/expected-failures*.yml files in the
19+
# same change.
20+
CONFORMANCE_VERSION: "0.2.0-alpha.4"
2021

2122
jobs:
2223
server-conformance:
@@ -43,6 +44,12 @@ jobs:
4344
./.github/actions/conformance/run-server.sh
4445
--suite draft
4546
--expected-failures ./.github/actions/conformance/expected-failures.yml
47+
- name: Run server conformance (2026-07-28 wire, all suite)
48+
run: >-
49+
./.github/actions/conformance/run-server.sh
50+
--suite all
51+
--spec-version 2026-07-28
52+
--expected-failures ./.github/actions/conformance/expected-failures.2026-07-28.yml
4653
4754
client-conformance:
4855
runs-on: ubuntu-latest
@@ -64,3 +71,10 @@ jobs:
6471
--command 'uv run --frozen python .github/actions/conformance/client.py'
6572
--suite all
6673
--expected-failures ./.github/actions/conformance/expected-failures.yml
74+
- name: Run client conformance (2026-07-28 wire, all suite)
75+
run: >-
76+
npx --yes @modelcontextprotocol/conformance@"$CONFORMANCE_VERSION" client
77+
--command 'uv run --frozen python .github/actions/conformance/client.py'
78+
--suite all
79+
--spec-version 2026-07-28
80+
--expected-failures ./.github/actions/conformance/expected-failures.2026-07-28.yml

0 commit comments

Comments
 (0)