Create Endpoint
sagemaker_create_endpoint | R Documentation |
Creates an endpoint using the endpoint configuration specified in the request¶
Description¶
Creates an endpoint using the endpoint configuration specified in the request. SageMaker uses the endpoint to provision resources and deploy models. You create the endpoint configuration with the create_endpoint_config API.
Use this API to deploy models using SageMaker hosting services.
You must not delete an EndpointConfig that is in use by an endpoint that is live or while the update_endpoint or create_endpoint operations are being performed on the endpoint. To update an endpoint, you must create a new EndpointConfig.
The endpoint name must be unique within an Amazon Web Services Region in your Amazon Web Services account.
When it receives the request, SageMaker creates the endpoint, launches the resources (ML compute instances), and deploys the model(s) on them.
When you call create_endpoint, a load call is made to DynamoDB to verify that your endpoint configuration exists. When you read data from a DynamoDB table supporting Eventually Consistent Reads, the response might not reflect the results of a recently completed write operation. The response might include some stale data. If the dependent entities are not yet in DynamoDB, this causes a validation error. If you repeat your read request after a short time, the response should return the latest data. Retry logic is therefore recommended to handle these possible issues. We also recommend that customers call describe_endpoint_config before calling create_endpoint to minimize the potential impact of a DynamoDB eventually consistent read.
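The retry recommendation above can be sketched as follows. This is a minimal illustration, not part of the package: with_retry and create_endpoint_with_retry are hypothetical helper names, svc is assumed to be a client created with paws::sagemaker(), and the attempt count and pause are arbitrary. The sketch retries on any error; a production version would likely retry only on validation errors.

```r
# Hypothetical helper: call a zero-argument function `f`, retrying with a
# fixed pause whenever it signals an error.
with_retry <- function(f, max_attempts = 5, wait_seconds = 2) {
  last_error <- NULL
  for (attempt in seq_len(max_attempts)) {
    result <- tryCatch(
      list(ok = TRUE, value = f()),
      error = function(e) list(ok = FALSE, error = e)
    )
    if (result$ok) {
      return(result$value)
    }
    last_error <- result$error
    if (attempt < max_attempts) {
      Sys.sleep(wait_seconds)
    }
  }
  stop("All ", max_attempts, " attempts failed: ", conditionMessage(last_error))
}

# Hypothetical wrapper: confirm the endpoint configuration is readable
# first, then create the endpoint. `svc` is assumed to be paws::sagemaker().
create_endpoint_with_retry <- function(svc, endpoint_name, config_name,
                                       max_attempts = 5, wait_seconds = 2) {
  with_retry(
    function() svc$describe_endpoint_config(EndpointConfigName = config_name),
    max_attempts, wait_seconds
  )
  with_retry(
    function() svc$create_endpoint(
      EndpointName = endpoint_name,
      EndpointConfigName = config_name
    ),
    max_attempts, wait_seconds
  )
}
```

A fixed pause keeps the sketch short; exponential backoff would be a reasonable substitute if many callers retry at once.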
When SageMaker receives the request, it sets the endpoint status to Creating. After it creates the endpoint, it sets the status to InService. SageMaker can then process incoming requests for inferences. To check the status of an endpoint, use the describe_endpoint API.
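One way to wait for the status change described above is a simple polling loop. The sketch below is illustrative only: wait_for_endpoint is a hypothetical helper, the polling interval and limit are arbitrary, and describe stands for any function that returns the describe_endpoint response as a list with an EndpointStatus element.

```r
# Hypothetical helper: poll until the endpoint is no longer "Creating" and
# return the status that was observed. `describe` is a zero-argument
# function, for example:
#   function() svc$describe_endpoint(EndpointName = "my-endpoint")
# where `svc` is assumed to be a client from paws::sagemaker().
wait_for_endpoint <- function(describe, poll_seconds = 30, max_polls = 60) {
  for (poll in seq_len(max_polls)) {
    status <- describe()$EndpointStatus
    if (!identical(status, "Creating")) {
      return(status)
    }
    Sys.sleep(poll_seconds)
  }
  stop("Endpoint still in 'Creating' state after ", max_polls, " polls")
}
```

The helper returns whatever status follows Creating, so the caller should still check that the result is InService before sending inference requests.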
If any of the models hosted at this endpoint get model data from an Amazon S3 location, SageMaker uses Amazon Web Services Security Token Service to download model artifacts from the S3 path you provided. Amazon Web Services STS is activated in your Amazon Web Services account by default. If you previously deactivated Amazon Web Services STS for a region, you need to reactivate Amazon Web Services STS for that region. For more information, see Activating and Deactivating Amazon Web Services STS in an Amazon Web Services Region in the Amazon Web Services Identity and Access Management User Guide.
To add the IAM role policies for using this API operation, go to the IAM console, and choose Roles in the left navigation pane. Search for the IAM role that you want to grant access to use the create_endpoint and create_endpoint_config API operations, and add the following policies to the role.
- Option 1: For full SageMaker access, search for and attach the AmazonSageMakerFullAccess policy.
- Option 2: To grant limited access to an IAM role, paste the following Action elements manually into the JSON file of the IAM role:
"Action": ["sagemaker:CreateEndpoint", "sagemaker:CreateEndpointConfig"],
"Resource": [
  "arn:aws:sagemaker:region:account-id:endpoint/endpointName",
  "arn:aws:sagemaker:region:account-id:endpoint-config/endpointConfigName"
]
For more information, see SageMaker API Permissions: Actions, Permissions, and Resources Reference.
Usage¶
Arguments¶
EndpointName
[required] The name of the endpoint. The name must be unique within an Amazon Web Services Region in your Amazon Web Services account. The name is case-insensitive in create_endpoint, but the case is preserved and must be matched in InvokeEndpoint.
EndpointConfigName
[required] The name of an endpoint configuration. For more information, see create_endpoint_config.
Tags
An array of key-value pairs. You can use tags to categorize your Amazon Web Services resources in different ways, for example, by purpose, owner, or environment. For more information, see Tagging Amazon Web Services Resources.
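The Tags argument is a list of Key/Value lists, which is tedious to write by hand. As a small sketch, the hypothetical helper below (make_tags is not part of the package) converts a named character vector into that shape; the tag names and values are placeholders.

```r
# Hypothetical helper: turn c(key = "value", ...) into the list-of-lists
# structure used by the Tags argument.
make_tags <- function(tags) {
  stopifnot(
    is.character(tags),
    !is.null(names(tags)),
    all(nzchar(names(tags)))
  )
  unname(Map(function(k, v) list(Key = k, Value = v), names(tags), tags))
}

# Placeholder tags for illustration.
example_tags <- make_tags(c(purpose = "demo", owner = "ml-team"))
```

The result can be passed directly as Tags = example_tags in a create_endpoint call.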
Value¶
A list with the following syntax:
Request syntax¶
svc$create_endpoint(
  EndpointName = "string",
  EndpointConfigName = "string",
  DeploymentConfig = list(
    BlueGreenUpdatePolicy = list(
      TrafficRoutingConfiguration = list(
        Type = "ALL_AT_ONCE"|"CANARY"|"LINEAR",
        WaitIntervalInSeconds = 123,
        CanarySize = list(
          Type = "INSTANCE_COUNT"|"CAPACITY_PERCENT",
          Value = 123
        ),
        LinearStepSize = list(
          Type = "INSTANCE_COUNT"|"CAPACITY_PERCENT",
          Value = 123
        )
      ),
      TerminationWaitInSeconds = 123,
      MaximumExecutionTimeoutInSeconds = 123
    ),
    RollingUpdatePolicy = list(
      MaximumBatchSize = list(
        Type = "INSTANCE_COUNT"|"CAPACITY_PERCENT",
        Value = 123
      ),
      WaitIntervalInSeconds = 123,
      MaximumExecutionTimeoutInSeconds = 123,
      RollbackMaximumBatchSize = list(
        Type = "INSTANCE_COUNT"|"CAPACITY_PERCENT",
        Value = 123
      )
    ),
    AutoRollbackConfiguration = list(
      Alarms = list(
        list(
          AlarmName = "string"
        )
      )
    )
  ),
  Tags = list(
    list(
      Key = "string",
      Value = "string"
    )
  )
)
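In the syntax above, "A"|"B" means that exactly one of the listed strings is supplied, and 123 stands for any number. The sketch below fills in one possible combination. Every name and number is a placeholder chosen for illustration and has not been checked against service limits. The call itself is left commented out because it needs the paws package and valid AWS credentials.

```r
# Placeholder values only; replace them with your own resources.
create_endpoint_args <- list(
  EndpointName = "my-endpoint",
  EndpointConfigName = "my-endpoint-config",
  DeploymentConfig = list(
    BlueGreenUpdatePolicy = list(
      TrafficRoutingConfiguration = list(
        Type = "CANARY",                 # one of the three allowed strings
        WaitIntervalInSeconds = 300,
        CanarySize = list(Type = "CAPACITY_PERCENT", Value = 10)
      ),
      TerminationWaitInSeconds = 120
    ),
    AutoRollbackConfiguration = list(
      Alarms = list(list(AlarmName = "my-endpoint-alarm"))
    )
  ),
  Tags = list(list(Key = "environment", Value = "test"))
)

# Assumed usage with a paws client:
# svc <- paws::sagemaker()
# resp <- do.call(svc$create_endpoint, create_endpoint_args)
```

Building the arguments as a plain list first makes them easy to inspect or log before the request is sent.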