Prerequisites for inference profiles
Before you can use an inference profile, check that you've fulfilled the following prerequisites:
-
Your role has access to the inference profile API actions. If your role has the AmazonBedrockFullAccess AWS-managed policy attached, you can skip this step. Otherwise, do the following:
-
Follow the steps at Creating IAM policies and create the following policy, which allows a role to do inference profile-related actions and run model inference using all foundation models and inference profiles.
(Optional) You can restrict the role's access in the following ways:
-
To restrict the API actions that the role can make, modify the list in the
Action
field to contain only the API operations that you want to allow access to. -
To restrict the role's access to specific inference profiles, modify the
Resource
list to contain only the inference profiles and foundation models that you want to allow access to. System-defined inference profiles begin withinference-profile
and application inference profiles begin withapplication-inference-profile
.Important
When you specify an inference profile in the
Resource
field in the first statement, you must also specify the foundation model in each Region associated with it. -
To restrict user access such that they can invoke a foundation model only through an inference profile, add a
Condition
field and use theaws:InferenceProfileArn
condition key. Specify the inference profile that you want to filter access on. This condition can be included in a statement that scopes to thefoundation-model
resources. -
For example, you can attach the following policy to a role to allow it to invoke the Anthropic Claude 3 Haiku model only through the US Anthropic Claude 3 Haiku inference profile in the account
111122223333
in us-west-2:
-
-
Follow the steps at Adding and removing IAM identity permissions to attach the policy to a role to grant the role permissions to view and use all the inference profiles.
-
-
You've requested access to the model defined in the inference profile that you want to use, in the Region from which you want to call the inference profile.