ChatInferenceRequest

Properties

Name	Type	Description	Notes
messages	List[ChatInferenceRequestMessagesInner]	Array of chat messages. Content can be a simple string or an array of content blocks for multimodal input.
model_id	str	Model ID. Use Nova models for multimodal support.
temperature	float		[optional] [default to 0.7]
max_tokens	int	Max tokens. Claude 4.5 supports up to 64k.	[optional] [default to 4096]
top_p	float		[optional]
stream	bool	Ignored in buffered mode, always returns complete response	[optional]
system_prompt	str	Optional custom system prompt. When tools are enabled, this is prepended with tool usage guidance.	[optional]
stop_sequences	List[str]	Custom stop sequences	[optional]
response_format	ChatInferenceRequestResponseFormat		[optional]
tool_config	ChatInferenceRequestToolConfig		[optional]
session_id	str	Optional session ID for conversation continuity. Omit to use stateless mode, include to continue an existing session.	[optional]
var_async	bool	Enable async/durable execution mode. When true, returns 202 with pollUrl instead of waiting for completion. Use for long-running inference, client-executed tools, or operations >30 seconds.	[optional] [default to False]

Example

from quantcdn.models.chat_inference_request import ChatInferenceRequest

# TODO update the JSON string below
json = "{}"
# create an instance of ChatInferenceRequest from a JSON string
chat_inference_request_instance = ChatInferenceRequest.from_json(json)
# print the JSON string representation of the object
print(ChatInferenceRequest.to_json())

# convert the object into a dict
chat_inference_request_dict = chat_inference_request_instance.to_dict()
# create an instance of ChatInferenceRequest from a dict
chat_inference_request_from_dict = ChatInferenceRequest.from_dict(chat_inference_request_dict)

[Back to Model list] [Back to API list] [Back to README]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ChatInferenceRequest

Properties

Example

FilesExpand file tree

ChatInferenceRequest.md

Latest commit

History

ChatInferenceRequest.md

File metadata and controls

ChatInferenceRequest

Properties

Example