[FEAT] Support JSON Schema in Responses #1177
Conversation
```python
text_format = None
if is_json_response:
    if conversation[-1].message_pieces[0].prompt_metadata.get("json_schema"):
```
I am not, to put it mildly, a fan of smuggling in the schema this way. Two reasons:
- The schema can be loaded as a Python object. However, doing so would require a change to `MessagePiece`
- Even if leaving it as a string, it should be extracted by the `is_response_format_json()` method

However, both of these have a rather large blast radius, so I wanted to consult first. @romanlutz @rlundeen2
```python
"input": input_items,
# Correct JSON response format per Responses API
"response_format": {"type": "json_object"} if is_json_response else None,
"text": text_format,
```
This is the 'bug fix' part; response_format is from the Chat completions API. See:
https://platform.openai.com/docs/api-reference/responses/create#responses_create-text
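To make the difference concrete, here is a minimal sketch of how the two APIs spell out JSON output. The request bodies are illustrative (model name and messages are made up); only the JSON-output fields matter:

```python
# Chat Completions API: JSON mode is a top-level "response_format" field.
chat_body = {
    "model": "gpt-4o",
    "messages": [{"role": "user", "content": "Give me JSON"}],
    "response_format": {"type": "json_object"},
}

# Responses API: there is no "response_format"; the equivalent setting is
# nested under "text" -> "format" instead.
responses_body = {
    "model": "gpt-4o",
    "input": [{"role": "user", "content": "Give me JSON"}],
    "text": {"format": {"type": "json_object"}},
}
```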
Do you expect a different schema per message sent, or just one schema for one target? I think that's the main decision point. If it's the latter, then we can pass it to the target. If it's the former, that makes it trickier, and I would probably suggest something like what you're doing here.
Schema is per-response. You might want different JSONs as a conversation develops, and you also probably don't want a JSON every time. This came out of finding that the SelfAskScorer wasn't working with an

I was wondering if I could store the schema as a Python object (which would be serialisable to JSON), but I've not dug through things enough to know if
Re: Response target not working with the scorer

We don't currently enforce a json format, we just use the other option (I don't recall the name offhand, but switching to schema was definitely on our list of things to do). What exactly is failing currently? Maybe this should be a new parameter on a message piece... curious what @rlundeen2 thinks.
The specific bug is that the Responses API doesn't use

Yes we support
@romanlutz @rlundeen2, I've just pushed changes which modify

I think I've caught all the places this could have affected...
Thanks @riedgar-ms for helping fix this issue. However, I do not agree with this approach because it violates the principle of least surprise and does not provide a clean design. I believe all of this can be encapsulated much more cleanly (this is just to demonstrate an approach). We could define a class to encapsulate JSON response configuration, such as follows:

```python
from __future__ import annotations
...

@dataclass
class JsonResponseConfig:
    enabled: bool = False
    schema: Optional[Dict[str, Any]] = None
    schema_name: str = "CustomSchema"
    strict: bool = True

    @classmethod
    def from_metadata(cls, metadata: Optional[Dict[str, Any]]) -> "JsonResponseConfig":
        if not metadata:
            return cls(enabled=False)
        response_format = metadata.get("response_format")
        if response_format != "json":
            return cls(enabled=False)
        schema_val = metadata.get("json_schema")
        if schema_val:
            if isinstance(schema_val, str):
                try:
                    schema = json.loads(schema_val) if schema_val else None
                except json.JSONDecodeError:
                    raise ValueError(f"Invalid JSON schema provided: {schema_val}")
            else:
                schema = schema_val
            return cls(
                enabled=True,
                schema=schema,
                schema_name=metadata.get("schema_name", "CustomSchema"),
                strict=metadata.get("strict", True),
            )
        return cls(enabled=True)
```

We also need to add a new method to the prompt chat target to handle retrieving the json config:

```python
class PromptChatTarget(PromptTarget):
    ...

    def is_response_format_json(self, *, message_piece: MessagePiece) -> bool:
        config = self.get_json_response_config(message_piece=message_piece)
        return config.enabled

    def get_json_response_config(self, *, message_piece: MessagePiece) -> JsonResponseConfig:
        config = JsonResponseConfig.from_metadata(message_piece.prompt_metadata)
        if config.enabled and not self.is_json_response_supported():
            target_name = self.get_identifier()["__type__"]
            raise ValueError(f"This target {target_name} does not support JSON response format.")
        return config
```

We also need to change the base target for OpenAI to update the signature with the json config class:

```python
class OpenAIChatTargetBase(OpenAITarget, PromptChatTarget):
    ...

    async def _construct_request_body(
        self,
        *,
        conversation: MutableSequence[Message],
        json_config: JsonResponseConfig,
    ) -> dict:
        raise NotImplementedError
```

OpenAI chat target becomes:

```python
class OpenAIChatTarget(OpenAIChatTargetBase):
    ...

    async def _construct_request_body(
        self,
        *,
        conversation: MutableSequence[Message],
        json_config: JsonResponseConfig,
    ) -> dict:
        messages = await self._build_chat_messages_async(conversation)
        response_format = self._build_response_format(json_config)
        body_parameters = {
            "model": self._model_name,
            "max_completion_tokens": self._max_completion_tokens,
            "temperature": self._temperature,
            "top_p": self._top_p,
            "frequency_penalty": self._frequency_penalty,
            "presence_penalty": self._presence_penalty,
            "logit_bias": self._logit_bias,
            "stream": False,
            "seed": self._seed,
            "n": self._n,
            "messages": messages,
            "response_format": response_format,
        }
        if self._extra_body_parameters:
            body_parameters.update(self._extra_body_parameters)
        return {k: v for k, v in body_parameters.items() if v is not None}

    def _build_response_format(self, json_config: JsonResponseConfig) -> Optional[Dict[str, Any]]:
        if not json_config.enabled:
            return None
        if json_config.schema:
            return {
                "type": "json_schema",
                "json_schema": {
                    "name": json_config.schema_name,
                    "schema": json_config.schema,
                    "strict": json_config.strict,
                },
            }
        return {"type": "json_object"}
```

OpenAI response target:

```python
class OpenAIResponseTarget(OpenAIChatTargetBase):
    ...

    async def _construct_request_body(
        self,
        *,
        conversation: MutableSequence[Message],
        json_config: JsonResponseConfig,
    ) -> dict:
        input_items = await self._build_input_for_multi_modal_async(conversation)
        text_format = self._build_text_format(json_config)
        body_parameters = {
            "model": self._model_name,
            "max_output_tokens": self._max_output_tokens,
            "temperature": self._temperature,
            "top_p": self._top_p,
            "stream": False,
            "input": input_items,
            "text": text_format,
        }
        if self._extra_body_parameters:
            body_parameters.update(self._extra_body_parameters)
        return {k: v for k, v in body_parameters.items() if v is not None}

    def _build_text_format(self, json_config: JsonResponseConfig) -> Optional[Dict[str, Any]]:
        if not json_config.enabled:
            return None
        if json_config.schema:
            return {
                "format": {
                    "type": "json_schema",
                    "json_schema": {
                        "name": json_config.schema_name,
                        "schema": json_config.schema,
                        "strict": json_config.strict,
                    },
                }
            }
        logger.info("Using json_object format without schema - consider providing a schema for better results")
        return {"format": {"type": "json_object"}}
```

OpenAI chat target base:

```python
class OpenAIChatTargetBase(OpenAITarget, PromptChatTarget):
    ...

    async def send_prompt_async(self, *, message: Message) -> Message:
        self._validate_request(message=message)
        conversation = self._get_conversation(message=message)
        json_config = JsonResponseConfig(enabled=False)
        if message.message_pieces:
            last_piece = message.message_pieces[-1]
            json_config = self.get_json_response_config(message_piece=last_piece)
        request_body = await self._construct_request_body(
            conversation=conversation,
            json_config=json_config,
        )
        ...
        # rest of the method
```
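To make the metadata decision table concrete, here is a condensed, self-contained restatement of the proposed `JsonResponseConfig.from_metadata` (error handling trimmed for brevity; this is a sketch of the suggestion above, not PyRIT code), together with the three metadata shapes it distinguishes:

```python
from __future__ import annotations

import json
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class JsonResponseConfig:
    enabled: bool = False
    schema: Optional[Dict[str, Any]] = None
    schema_name: str = "CustomSchema"
    strict: bool = True

    @classmethod
    def from_metadata(cls, metadata: Optional[Dict[str, Any]]) -> JsonResponseConfig:
        # No metadata, or no JSON request: JSON mode stays off.
        if not metadata or metadata.get("response_format") != "json":
            return cls(enabled=False)
        schema_val = metadata.get("json_schema")
        # JSON requested but no schema: plain json_object mode.
        if not schema_val:
            return cls(enabled=True)
        # Schema may arrive as a JSON string or as an already-parsed dict.
        schema = json.loads(schema_val) if isinstance(schema_val, str) else schema_val
        return cls(
            enabled=True,
            schema=schema,
            schema_name=metadata.get("schema_name", "CustomSchema"),
            strict=metadata.get("strict", True),
        )


# The three shapes: absent, JSON without schema, JSON with schema.
off = JsonResponseConfig.from_metadata(None)
plain = JsonResponseConfig.from_metadata({"response_format": "json"})
typed = JsonResponseConfig.from_metadata(
    {"response_format": "json", "json_schema": '{"type": "object"}'}
)
```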
@bashirpartovi happy to make changes along those lines
Agreed with the approach Bashir suggested. I know it's a bigger change, but I think it will be a lot more maintainable.
Working to handle the merge conflicts
bashirpartovi left a comment:
Looks great @riedgar-ms, thanks for adding the changes. It is probably good to get someone else's blessing as well :)
```python
AttackOutcome
AttackResult
DecomposedSeedGroup
JsonResponseConfig
```
I did this because one of the unit tests whinged at me, but I'm not sure that this really ought to be public, @romanlutz?
Ah yes, test_api_documentation.py 😆 You can definitely add exclusions there. It wouldn't be the first.
```python
    strict: bool = True

    @classmethod
    def from_metadata(cls, *, metadata: Optional[Dict[str, Any]]) -> "JsonResponseConfig":
```
nit: should be possible with the imported annotations.
```diff
-def from_metadata(cls, *, metadata: Optional[Dict[str, Any]]) -> "JsonResponseConfig":
+def from_metadata(cls, *, metadata: Optional[Dict[str, Any]]) -> JsonResponseConfig:
```
```python
}

prompt = "Create a JSON object that describes a mystical cat "
prompt += "with the following properties: name, age, colour."
```
```diff
-prompt += "with the following properties: name, age, colour."
+prompt += "with the following properties: name, age, color."
```
😜
```diff
 Returns:
-    bool: True if the response format is JSON and supported, False otherwise.
+        bool: True if the response format is JSON, False otherwise.
```
what prompted the extra indentation?
```python
enabled=True,
schema=schema,
schema_name=metadata.get("schema_name", "CustomSchema"),
strict=metadata.get("strict", True),
```
I have to say this worries me. You also pointed that out in a comment earlier, @riedgar-ms. The keys used here are basically now reserved for JSON schema. I would be less concerned if they were prefixed with `json_schema__` or something like that, but that makes it potentially somewhat harder to work with. Perhaps it depends on how you're planning to use it, too.
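A minimal sketch of the prefixing idea: JSON-schema settings live under a `json_schema__` namespace inside `prompt_metadata`, so plain keys like `strict` stay free for other uses. The helper name and keys here are illustrative, not PyRIT API:

```python
# Hypothetical namespace prefix for JSON-schema-related metadata keys.
PREFIX = "json_schema__"


def extract_namespaced(metadata: dict) -> dict:
    """Pull out the keys under PREFIX and strip the prefix."""
    return {
        key[len(PREFIX):]: value
        for key, value in metadata.items()
        if key.startswith(PREFIX)
    }


metadata = {
    "response_format": "json",
    "json_schema__name": "CatSchema",
    "json_schema__strict": True,
    "strict": "unrelated value",  # untouched: not in the namespace
}
```

The trade-off noted above is visible here: the namespaced keys are unambiguous, but anyone writing metadata by hand has to remember the prefix.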
Description

The OpenAI Responses API provides for structured output with JSON schema. This change:
- Fixes the JSON request format in `OpenAIResponsesTarget` when JSON output is requested without a schema
- Supports passing a JSON schema via `prompt_metadata`

Tests and Documentation

I have added a test for both scenarios (i.e. with and without schema). However, this is not yet documented.
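For reference, a sketch of the `text` payloads the two tested scenarios should produce under the Responses API. The shapes follow the OpenAI Responses API reference, which places `name`, `schema`, and `strict` directly on the `format` object; the cat schema itself is a made-up example, not taken from the PR:

```python
# Illustrative example schema (hypothetical, mirroring the mystical-cat prompt).
cat_schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"},
        "colour": {"type": "string"},
    },
    "required": ["name", "age", "colour"],
    "additionalProperties": False,
}

# Scenario 1 - with a schema: structured output, validated when strict is True.
with_schema = {
    "format": {
        "type": "json_schema",
        "name": "MysticalCat",
        "schema": cat_schema,
        "strict": True,
    }
}

# Scenario 2 - without a schema: any syntactically valid JSON object.
without_schema = {"format": {"type": "json_object"}}
```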