Invoke Model With Response Stream
bedrockruntime_invoke_model_with_response_stream | R Documentation |
Invoke the specified Amazon Bedrock model to run inference using the prompt and inference parameters provided in the request body¶
Description¶
Invoke the specified Amazon Bedrock model to run inference using the prompt and inference parameters provided in the request body. The response is returned in a stream.
To see if a model supports streaming, call
GetFoundationModel
and check the responseStreamingSupported
field in the response.
The CLI doesn't support streaming operations in Amazon Bedrock,
including invoke_model_with_response_stream
.
For example code, see Invoke model with streaming code example in the Amazon Bedrock User Guide.
This operation requires permissions to perform the
bedrock:InvokeModelWithResponseStream
action.
To deny all inference access to resources that you specify in the
modelId field, you need to deny access to the bedrock:InvokeModel
and
bedrock:InvokeModelWithResponseStream
actions. Doing this also denies
access to the resource through the Converse API actions (converse
and
converse_stream
). For more information see Deny access for inference
on specific
models.
For troubleshooting some of the common errors you might encounter when
using the invoke_model_with_response_stream
API, see Troubleshooting
Amazon Bedrock API Error
Codes
in the Amazon Bedrock User Guide
Usage¶
bedrockruntime_invoke_model_with_response_stream(body, contentType,
accept, modelId, trace, guardrailIdentifier, guardrailVersion,
performanceConfigLatency)
Arguments¶
body |
The prompt and inference parameters in the format specified in
the |
contentType |
The MIME type of the input data in the request. You must specify
|
accept |
The desired MIME type of the inference body in the response. The
default value is |
modelId |
[required] The unique identifier of the model to invoke to run inference. The
|
trace |
Specifies whether to enable or disable the Bedrock trace. If enabled, you can see the full Bedrock trace. |
guardrailIdentifier |
The unique identifier of the guardrail that you want to use. If you don't provide a value, no guardrail is applied to the invocation. An error is thrown in the following situations.
|
guardrailVersion |
The version number for the guardrail. The value can also be
|
performanceConfigLatency |
Model performance settings for the request. |
Value¶
A list with the following syntax:
list(
body = list(
chunk = list(
bytes = raw
),
internalServerException = list(
message = "string"
),
modelStreamErrorException = list(
message = "string",
originalStatusCode = 123,
originalMessage = "string"
),
validationException = list(
message = "string"
),
throttlingException = list(
message = "string"
),
modelTimeoutException = list(
message = "string"
),
serviceUnavailableException = list(
message = "string"
)
),
contentType = "string",
performanceConfigLatency = "standard"|"optimized"
)
Request syntax¶
svc$invoke_model_with_response_stream(
body = raw,
contentType = "string",
accept = "string",
modelId = "string",
trace = "ENABLED"|"DISABLED",
guardrailIdentifier = "string",
guardrailVersion = "string",
performanceConfigLatency = "standard"|"optimized"
)