本文属于机器翻译版本。若本译文内容与英语原文存在差异,则一律以英文原文为准。
查询端点自动扩缩历史记
您可以使用以下方式查看从终端节点扩展活动的状态。DescribeScalingActivities.DescribeScalingActivities提供了有关过去六周指定命名空间中的扩展活动的描述性信息。
如何查询端点自动扩缩操作
使用查询自动扩缩终端节点DescribeScalingActivities. 为此,请指定ServiceNameSpace参数。ServiceNameSpace的名称是Amazon提供资源的服务。
有效的服务名称值包括:
ecs | elasticmapreduce | ec2 | appstream | dynamodb | rds | sagemaker | custom-resource | comprehend | lambda | cassandra
在这种情况下你需要设置ServiceNameSpace到sagemaker.
使用以下命令Amazon CLI命令以查看有关您的所有详细信息sagemaker具有扩展策略的终端节点:
aws application-autoscaling describe-scaling-activities \ --service-namespace sagemaker
您可以使用搜索特定终端节点。ResourceId:
aws application-autoscaling describe-scaling-activities \ --service-namespace sagemaker \ --resource-id endpoint/<endpoint_name>/variant/<variant_name>
运行此命令时,它会返回以下输出:
{ "ActivityId": "activity-id", "ServiceNamespace": "sagemaker", "ResourceId": "endpoint/<endpoint_name>/variant/<variant_name>", "ScalableDimension": "sagemaker:variant:DesiredInstanceCount", "Description": "string", "Cause": "string", "StartTime": timestamp, "EndTime": timestamp, "StatusCode": "string", "StatusMessage": "string" }
如何识别已阻止 AutoScaling 由于实例配额
当您扩展或添加更多实例时,您可能会达到账户级别的实例配额。您可以使用DescribeScalingActivities以检查您是否已达到实例配额。当你超出配额时,自动扩展将被阻止。
要检查您是否已达到实例配额,请使用Amazon CLI命令,如前面的示例中所示,其中您指定ResourceId:
aws application-autoscaling describe-scaling-activities \ --service-namespace sagemaker \ --resource-id endpoint/<endpoint_name>/variant/<variant_name>
在返回语法中,检查StatusCode和StatusMessage键及其关联的值。StatusCode回报Failed. 在StatusMessage有一条消息表明已达到账户级别的服务配额。以下为该消息具体形式的示例:
{ "ActivityId": "activity-id", "ServiceNamespace": "sagemaker", "ResourceId": "endpoint/<endpoint_name>/variant/<variant_name>", "ScalableDimension": "sagemaker:variant:DesiredInstanceCount", "Description": "string", "Cause": "minimum capacity was set to 110", "StartTime": timestamp, "EndTime": timestamp, "StatusCode": "Failed", "StatusMessage": "Failed to set desired instance count to 110. Reason: The account-level service limit 'ml.xx.xxxxxx for endpoint usage' is 1000 Instances, with current utilization of 997 Instances and a request delta of 20 Instances. Please contact Amazon support to request an increase for this limit. (Service: AmazonSageMaker; Status Code: 400; Error Code: ResourceLimitExceeded; Request ID: request-id)." }