NetApp Closes Acquisition of Instaclustr Read the announcement

Use IAM roles and policies to provide the Kafka Connect cluster with access to AWS S3 bucket

In this document we will look at how to set up IAM roles and policies to provide access for your Kafka Connect cluster to your AWS S3 bucket.

Provide access to S3 bucket using permission policy

Recommended method for RIYOA account where both cluster instances and S3 bucket are in the same AWS account. This method will add a permission policy to the instance role. It is useful when you want to provide the Kafka Connect cluster with direct access to the S3 bucket.

  1. Follow Custom Kafka Connect Connectors until step 9 to create a Kafka Connect cluster with custom connectors
    • If you use the Instaclustr Console, in the Custom Connector Configuration section, choose “Add permission policy to instance role later”
    • If you use the Provisioning API, specify the S3 bucket name without any further access detail in the body of the POST request. For example
  2. After the Kafka Connect cluster gets to PROVISIONING state, go to the cluster, Details tab and copy the Data Center Id. This Id is also the name of the AWS instance role that the cluster is using
  3. Add a permission to allow access to the S3 bucket
    If you use AWS console

    • From the dashboard, go to IAM services, Roles
    • Find the instance role by the Data Center Id, click on the role
    • From here you can either edit the existing policy (s3-access-policy) and add the permission or add a new policy to the role
    • To edit the existing policy, click on s3-access-policy

      You can use the visual editor to add the permission by clicking on Add additional permissions, then add 2 additional permissions:

      • Service: S3; Action: List – ListBucket, Read – GetBucketLocation
      • Service: S3, Action: Read → GetObject
        and provide the bucket name similar to the screenshots below


      OR you can go to JSON tab and add the statements below into the JSON policy

    If you use AWS CLI

    • List current existing policy:
      aws iam get-role-policy --role-name <Data Center Id> --policy-name s3-access-policy
    • Copy the policy part from the output of the command to a text editor. The policy should be similar to:
    • Edit the policy, add the statements that allow bucket access, and save it as a JSON file at FILE_PATH
    • Delete the old policy
      aws iam delete-role-policy --role-name $CDC_ID --policy-name s3-access-policy
    • Add the edited policy
      aws iam put-role-policy --role-name $CDC_ID --policy-name new-s3-access-policy --policy-document file://FILE_PATH

After the policies are set up correctly, the Kafka Connect cluster should get to RUNNING state within 5 minutes. If the cluster does not get to RUNNING state after 5 minutes, please contact [email protected].

Provide access to S3 bucket using role

Recommended method for customers who use a RIIA account and have an S3 bucket on their own AWS account, but can also be used for RIYOA clusters. This method uses a separate role with access to the S3 bucket, let’s call it S3 access role, and allows the instance role to assume the S3 access role and gain access to the bucket. It is useful when you want to manage the S3 access role separately from the instance. You can do this using the AWS CLI or the AWS Console.

  • Using AWS CLI
    • Create a policy that allow access to the S3 bucket
      • Prepare a JSON file that contains the policy. It should be similar to:
      • Create the policy:
        aws iam create-policy --policy-name <policy-name> --policy-document file://FILE_PATH
      • Copy the ARN of the policy from the output of the command, which should be similar to:
        "arn": "arn:aws:iam::<aws-account-id>:policy/<policy-name>"
    • Prepare a trust policy document file that allows assumption to S3 access role
    • Create a role with the trust policy
      aws iam create-role --role-name <role-name> --assume-role-policy-document file://FILE_PATH
    • Attach the S3 access policy:
      aws iam attach-role-policy --role-name <role-name> --policy-arn <policy-arn>
    • Provision the Kafka Connect cluster using your preferred method:
      • If you use the Instaclustr Console, in the Custom Connector Configuration section,
        choose “Use IAM role”. Input the S3 access role ARN
      • If you use the Provisioning API, specify the S3 bucket name with the S3 access role ARN.
        For example:
    • After the cluster gets to PROVISIONING state, copy the Data Center Id
    • In your trust policy document, add the Data Center Id of the cluster to allow the instance role to assume the S3 access role
    • Update the trust policy of S3 access role
      aws iam update-assume-role-policy --role-name <role-name> --policy-document file://FILE_PATH

    After the roles and policies are set up correctly, the Kafka Connect cluster should get to RUNNING state within 5 minutes. If the cluster does not get to RUNNING state after 5 minutes, please contact our support team.

  • Using AWS Console
    • Create a S3 access policy
      • Go to the IAM dashboard, switch to Policies and click on Create Policy
      • You can either use the Visual editor to specify 2 permissions:
        • Service: S3; Action: List – ListBucket, Read – GetBucketLocation; Resource: <S3-bucket-arn>
        • Service: S3; Action: Read – GetObject, Resource: <S3-bucket-arn>/*

      • OR you can switch to the JSON tab and specify the permission using JSON format. For example:

      • Then click Next:Tags and optionally provide tags for the policy
      • Then review, name the policy and create it
    • Create the S3 access role with the S3 access policy and copy its ARN
      • Go to the IAM dashboard, switch to Roles and click on Create Role
      • Select Custom trust policy as trusted entity
      • Click Next, then find and tick select the S3 access policy we just created
      • Then click Next to review and create the role
    • Provision the Kafka Connect cluster using your preferred method with option “Use IAM role” and copy its Data Center Id
      If you use the Instaclustr Console, in the Custom Connector Configuration section,
      choose “Use IAM role”. Input the S3 access role ARN


      If you use the Provisioning API, specify the S3 bucket name with the S3 access role ARN.
      For example:
    • Update the trust policy of S3 access role
      • Go to the role and go to the Trust relationships tab. Click Edit trust policy
      • Change the policy statement to allow the Kafka Connect cluster to assume the S3 access role

      • Click Update policy

    After the roles and policies are set up correctly, the Kafka Connect cluster should get to RUNNING state within 5 minutes. If the cluster does not get to RUNNING state after 5 minutes, please contact [email protected].

By Instaclustr Support
Need Support?
Experiencing difficulties on the website or console?
Already have an account?
Need help with your cluster?
Contact Support
Why sign up?
To experience the ease of creating and managing clusters via the Instaclustr Console
Spin up a cluster in minutes