fix(attestation): include Azure vTPM AK intermediates#49
Conversation
Fetch Azure vTPM AK issuer certificates from the leaf certificate's AIA CA Issuers URLs during attestation generation and serialize them into the Azure evidence document. During verification, treat evidence-supplied intermediates as untrusted chain material, combine them with the legacy bundled intermediate for backwards compatibility, and continue pinning Azure vTPM roots. Bound the number of serialized intermediates during deserialization and keep stripping TPM NV zero padding before WebPKI verification. Add offline AK-chain fixtures for the observed Azure Cloud Virtual TPM CA - 24 -> Azure Cloud Virtual TPM CA 2025 chain.
ameba23
left a comment
There was a problem hiding this comment.
Thank you for the PR! We have let support for Azure slip, because we currently have no production Azure instances. So wonderful to have this fix contributed.
Very good idea to do the intermediary certificate fetching on the 'attester' side rather than the verifier side. One thing we could consider for a followup would be to cache these to avoid re-fetching on subsequent attestation generation.
Great to have the test assets with observed intermediaries. I think it could be useful to have a complete observed attestation payload to test against. We could spin up an Azure instance and get one if you don't want to add this yourself.
We use stable rust but nightly for formatting / clippy, which is why CI fails. This is unconventional and not documented (sorry). Could you please run cargo +nightly fmt && cargo +nightly clippy. Thanks.
| pub(crate) fn fetch_ak_intermediates_from_aia( | ||
| ak_cert: &X509Certificate<'_>, | ||
| ) -> Result<Vec<Vec<u8>>, MaaError> { | ||
| const MAX_AIA_DEPTH: usize = 6; |
There was a problem hiding this comment.
This seems to be too arbitrary. Why hardcode the depth? why not fetch all intermediates to verify the full chain? what happens if there were slightly more than 6 for example? would the full chain verification up to the Azure Root CA anchor still succeed?
There was a problem hiding this comment.
Good catch! Refactored the fetching logic quite a bit in bd465dd, in part to fetch from all possible URLs, not only the first one; let me know what you think.
As for your question, there was a big inconsistency. I combined MAX_AIA_DEPTH with the more important MAX_EVIDENCE_AK_INTERMEDIATE_CERTIFICATES which is used on the verifier path. The main idea is that we need to protect from DDoS attacks on the verifier side, so we need some bound. 6 is indeed very arbitrary... but I somehow doubt Microsoft would have chains greater than 2/3.... pure hunch though I really have no idea. Happy to increase if you think we should.
There was a problem hiding this comment.
Agree that there has to be a limit to prevent attacks, but i'm not sure what it should be. 6 or 8 sounds like plenty to me.
|
@samlaf , thank you for surfacing this new change from Azure vTPM attestation process side as well as for your contribution to solve the issue. |
8b1bc40 to
94a74a3
Compare
fix output dir for fixtures commit fixtures update test to use new yaml fixtures fix fixture generating code (serde_saphyr bug?) actually fix serializer make fixture match new serialization format
make from maaerror box fix clippy: only box MaaError::AiaFetch
Consolidate AK intermediate bounds into a single MAX_EVIDENCE_AK_INTERMEDIATE_CERTIFICATES limit shared by generation and verification, so generated evidence cannot exceed what the verifier accepts. Fail closed when the AK issuer chain is incomplete or exceeds that limit instead of silently returning a partial chain. When following AIA CA Issuers URLs, try alternate URLs sequentially as fallbacks if earlier URLs fail, and trim fetched DER to the parsed certificate before serializing it as evidence. Box the large ureq AIA fetch error inside MaaError to keep the error enum size small enough for clippy.
Add local HTTP server tests for AK issuer fetching to verify that AIA caIssuers URLs are tried sequentially when earlier URLs fail, and that explicit PEM responses are accepted as a lenient fallback to DER. Keep the tests deterministic by using a local server instead of depending on Azure or Microsoft CDN availability.
94a74a3 to
01e60d4
Compare
|
Thanks for the quick feedback guys! Always fun to work with responsive people. :) Sorry had pushed some unsigned commits from the TDX box... so had to force-push to squash them. Will address comments now.
@ameba23 refactored the tests to follow your previous yaml file convention. Let me know what you think! 74a7881
@MoeMahhouk good question... both options are possible. I went with this option because it felt better in a TEE world to have verification be deterministic, not require a network connection, and be totally self-contained. But if you think otherwise we could support verifier fallbacking to fetching from network. Depends how flexible (but also bug-prone!) we want this library to be I guess. Added a comment on create_azure_attestation in 2ef3e85, let me know what you think. |
| let mut serializer_options = serde_saphyr::SerializerOptions::default(); | ||
| // With compact list indentation enabled, serde_saphyr can emit an | ||
| // indentless empty sequence after a nested block sequence, e.g. | ||
| // `event_log:\n[]`, which its parser rejects. Disable compact list | ||
| // indentation for fixture output so nested/empty sequences are always | ||
| // indented under their mapping keys. | ||
| serializer_options.compact_list_indent = false; |
There was a problem hiding this comment.
Seems like this bug was fixed: bourumir-wyngs/serde-saphyr#126
Prob worth bumping your serde-saphyr dep?
Great! Yes nice to see a full verification working, and thanks for also adding the code to generate more test assets later should things change.
I think it depends on the use-case which is better, and if we want the
On the other hand, @MoeMahhouk 's concern is that some of our CVMs run in constrained network environments where the endpoints for fetching intermediaries would have to be explicitly allow-listed, which increases the attack surface and could create problems if Microsoft were to change the hostname providing them. |
ameba23
left a comment
There was a problem hiding this comment.
🥇 Approving as i think this is a much needed and well-crafted fix.
But will not merge until @MoeMahhouk is happy with the idea of doing attester-side intemediary fetching, due to the potential issues with flashbox.
Would be good to also get a thumbs up from @0x416e746f6e
| } | ||
|
|
||
| fn fetch_certificate_der(url: &str) -> Result<Vec<u8>, MaaError> { | ||
| if !(url.starts_with("http://") || url.starts_with("https://")) { |
There was a problem hiding this comment.
Not sure if we should have extra checks here on the hostname/IP. A malicious vTPM could cause a network call to arbitrary host here. But i can't see how they would turn that into a meaningful attack.
There was a problem hiding this comment.
Were you thinking a blacklist (eg. disallow localhost) or a whitelist? Something like this perhaps?
fn is_allowed_azure_aia_host(host: &str) -> bool {
let host = host.trim_end_matches('.').to_ascii_lowercase();
host == "www.microsoft.com"
|| host == "crl.microsoft.com"
|| host == "primary-cdn.pki.core.windows.net"
|| host == "secondary-cdn.pki.core.windows.net"
|| host.ends_with(".pki.core.windows.net")
} But then again I'm not sure this is even true and whether azure won't introduce other names...
| } | ||
|
|
||
| issuer_urls = fetched_issuer.ca_issuers_urls; | ||
| intermediates.push(fetched_issuer.der); |
There was a problem hiding this comment.
I think it might be better to remove the check: if fetched_issuer.is_self_signed { (above) and instead, do:
| intermediates.push(fetched_issuer.der); | |
| intermediates.push(fetched_issuer.der); | |
| if verify_ak_cert_with_azure_roots(ak_cert_der, &intermediates, now_secs).is_ok() { | |
| return Ok(intermediates); | |
| } |
This will avoid an unneeded fetch for the root cert which we already have, and make us sure right away that we have a complete chain which verifies against the known root.
| where | ||
| D: serde::Deserializer<'de>, | ||
| { | ||
| let certificates = Vec::<String>::deserialize(deserializer)?; |
There was a problem hiding this comment.
Maybe worth adding a maximum string length check here to avoid attempting to parse giant blobs as pem / der. But actually it would be better to cap the size of the whole payload because attempting to de-serialize in prepare_azure_attestation. Which is kind of needed anyway unrelated to this PR.
There was a problem hiding this comment.
Agree that this should be done in a follow-up small self-contained PR. Created #50
@ameba23 , no blockers from my side. We can merge to unblock if this is needed elsewhere. |
Verify the AK certificate against pinned Azure roots after each AIA-fetched issuer is added, and stop once the fetched chain is complete. This avoids fetching the root certificate in normal Azure vTPM chains. Keep the bundled Azure intermediate as a legacy verification-only fallback for older evidence, but exclude it while generating new AIA-based evidence so the serialized intermediates are self-contained. Also convert Azure wall-clock lookup failures into MaaError instead of panicking.
This fixes Azure TDX/vTPM attestation verification failures caused by newer Azure vTPM AK certificate chains that are not covered by the currently bundled
Global Virtual TPM CA - 03intermediate.On a current Azure TDX CVM, the AK leaf certificate chain can look like:
The TPM NV AK cert index on this VM contained only:
so parsing additional intermediates from the TPM AK cert payload is not sufficient for this chain.
This PR changes Azure attestation generation to fetch the AK issuer chain from the leaf certificate’s AIA
CA IssuersURLs and serialize those intermediates into the attestation evidence. Verification remains offline: the verifier treats these intermediates as untrusted chain material and still pins the Azure vTPM roots.Microsoft guidance indicates that AIA should be used to discover Azure vTPM intermediate CAs, since public docs can lag behind production intermediate rotation:
https://learn.microsoft.com/en-us/answers/questions/5897616/download-intermediate-ca-cert-for-azure-cloud-virt
Changes
Adds optional evidence field:
During Azure attestation generation:
CA IssuersURLs up to a bounded depth;During verification:
Keeps verification network-free/deterministic.
Keeps
#[serde(default)]on the new field so old evidence still deserializes.Adds offline fixture tests for the observed Azure chain:
Azure Cloud Virtual TPM CA - 24Azure Cloud Virtual TPM CA 2025Security notes
The fetched intermediates are not trusted anchors. They are serialized as untrusted evidence only. Verification still requires the AK certificate chain to terminate at a pinned Azure vTPM root.
The number of evidence-supplied intermediates is capped to avoid unbounded peer-controlled chain material.
Behavior change
Azure attestation generation now makes outbound HTTP(S) requests to Microsoft PKI/AIA endpoints in order to fetch issuer certificates.
Azure attestation verification does not make network requests.
Validation
Tested with this code on an Azure Standard_DC4eds_v6 TDX box on westus3. Before this change, verification failed with:
After this change:
The generated evidence includes two serialized AK intermediates:
This subsumes #48.
PR #48 improves handling for cases where the TPM AK cert payload contains concatenated DER intermediates. However, on the Azure TDX VM tested here, the TPM AK cert NV index contained only:
and no intermediate certificates. Therefore #48 does not fix the observed
UnknownIssuerfailure for this chain.This PR handles that case by following the AIA issuer URLs from the AK leaf certificate and including the resulting issuer chain in the evidence.