Amazon Athena Connection
See the Connector Marketplace topic. Ask your administrator to start a trial of, or subscribe to, the Premium Amazon Athena connector.
Amazon Athena is an analytics service that lets users run SQL queries over large datasets spanning multiple files, which makes it well suited to processing large amounts of data quickly.
This topic describes how to authenticate to Amazon Athena and configure the connection properties in the Amazon Athena connector.
Prerequisite
For Amazon Athena Data Sources and Emitters to run successfully in Gathr applications, assign the following permissions (policy actions) in AWS:
s3:ListBucket
s3:GetObject
s3:GetBucketLocation
s3:ListBucketMultipartUploads
s3:AbortMultipartUpload
s3:PutObject
s3:ListMultipartUploadParts
s3:DeleteObject
athena:ListDataCatalogs
athena:GetDataCatalog
athena:GetTableMetadata
athena:ListTableMetadata
athena:StartQueryExecution
athena:GetQueryResults
athena:GetQueryExecution
glue:GetTables
glue:GetDatabases
glue:GetTable
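The permissions above can be collected into an IAM policy document. The following is a minimal sketch rather than a Gathr-provided policy: the function name, the statement IDs, and the wildcard resource are illustrative assumptions, and in practice the resource should be narrowed to the buckets, catalogs, and databases you actually use.

```python
import json

# Actions taken from the prerequisite list, grouped by AWS service.
S3_ACTIONS = [
    "s3:ListBucket",
    "s3:GetObject",
    "s3:GetBucketLocation",
    "s3:ListBucketMultipartUploads",
    "s3:AbortMultipartUpload",
    "s3:PutObject",
    "s3:ListMultipartUploadParts",
    "s3:DeleteObject",
]
ATHENA_ACTIONS = [
    "athena:ListDataCatalogs",
    "athena:GetDataCatalog",
    "athena:GetTableMetadata",
    "athena:ListTableMetadata",
    "athena:StartQueryExecution",
    "athena:GetQueryResults",
    "athena:GetQueryExecution",
]
GLUE_ACTIONS = ["glue:GetTables", "glue:GetDatabases", "glue:GetTable"]


def build_policy(resource="*"):
    """Return an IAM policy document (as a dict) allowing the listed actions.

    The default resource "*" is a placeholder assumption; replace it with
    specific ARNs before using the policy.
    """
    groups = (
        ("GathrAthenaS3", S3_ACTIONS),          # hypothetical statement IDs
        ("GathrAthenaQueries", ATHENA_ACTIONS),
        ("GathrAthenaGlue", GLUE_ACTIONS),
    )
    statements = [
        {"Sid": sid, "Effect": "Allow", "Action": actions, "Resource": resource}
        for sid, actions in groups
    ]
    return {"Version": "2012-10-17", "Statement": statements}


if __name__ == "__main__":
    print(json.dumps(build_policy(), indent=2))
```

The printed JSON can be pasted into the AWS policy editor and attached to the user or role that the connection authenticates as.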
Connection Configuration
Each connection property available in the Amazon Athena connector is explained below.
Connection Name
Provide a name for the connection. This name is displayed in the list of available connections.
Auth Scheme
The type of authentication to use when connecting to Amazon Athena.
Possible Values
AWS Keys
If AWS Keys is selected, additional configuration parameters are listed.
AWS Access Key
The access key associated with the AWS account.
AWS Secret Key
The secret key associated with the AWS account.
EC2 Roles
If EC2 Roles is selected, additional configuration parameters are listed.
AWS Role ARN
Specify the Role ARN for the role you’d like to authenticate with. This will retrieve credentials for the specified role.
AWS IAM Roles
If AWS IAM Roles is selected, additional configuration parameters are listed.
AWS Access Key
The access key associated with the AWS account.
AWS Secret Key
The secret key associated with the AWS account.
AWS Role ARN
Specify the Role ARN for the role you’d like to authenticate with. This will retrieve credentials for the specified role.
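The three Auth Scheme options differ only in which credential fields they require. The sketch below restates that mapping and checks a set of values against it. It is illustrative only: the dictionary, the function name, and the sample ARN are hypothetical and are not part of the Gathr product or any AWS library.

```python
# Required fields per Auth Scheme, as described in this topic.
REQUIRED_FIELDS = {
    "AWS Keys": ["AWS Access Key", "AWS Secret Key"],
    "EC2 Roles": ["AWS Role ARN"],
    "AWS IAM Roles": ["AWS Access Key", "AWS Secret Key", "AWS Role ARN"],
}


def missing_fields(auth_scheme, values):
    """Return the required field names that are absent or empty in `values`."""
    if auth_scheme not in REQUIRED_FIELDS:
        raise ValueError(f"Unknown Auth Scheme: {auth_scheme!r}")
    return [name for name in REQUIRED_FIELDS[auth_scheme] if not values.get(name)]


if __name__ == "__main__":
    # Placeholder ARN for illustration only.
    supplied = {"AWS Role ARN": "arn:aws:iam::123456789012:role/ExampleRole"}
    print(missing_fields("EC2 Roles", supplied))
    print(missing_fields("AWS IAM Roles", supplied))
```

A check like this can be run over connection settings before they are entered in the form, so that an incomplete set of credentials is caught before the connection test.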
AWS Region
Set this to the region where your Amazon Athena data is hosted.
S3 Staging Directory
Set this to a folder in S3 where you would like to store the results of queries.
Metadata Discovery Method
The API used to retrieve metadata for Amazon Athena.
When Metadata Discovery Method is set to Glue, the connector queries AWS Glue to retrieve Amazon Athena metadata.
When Metadata Discovery Method is set to Athena, the connector queries Amazon Athena itself to retrieve the metadata.
If Glue is selected, additional configuration parameters are listed.
CrossAccount Id
The account ID to use when retrieving metadata from Glue. The default is the current account ID. When Metadata Discovery Method is set to Glue, you can specify the ID of another account that you have permission to access, and metadata is retrieved from that account.
VPC End Point
Your Athena VPC endpoint. It overrides the default athena.{region}.amazonaws.com endpoint.
Enable FIPS Mode
Federal Information Processing Standard (FIPS) endpoints are separate endpoints in AWS that certain government organizations may be required to use.
Set Enable FIPS Mode to true to switch to using these endpoints.
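How AWS Region, VPC End Point, and Enable FIPS Mode might combine into a single endpoint can be sketched as follows. This is an assumption-laden illustration, not the connector's actual logic: the function name is hypothetical, the precedence of the VPC endpoint over FIPS mode is assumed, and the athena-fips.{region}.amazonaws.com hostname follows AWS's usual naming for FIPS endpoints and should be confirmed against the AWS endpoint list for your region.

```python
def resolve_athena_endpoint(region, vpc_endpoint=None, fips=False):
    """Return the Athena hostname implied by the connection settings.

    Assumptions: an explicit VPC endpoint takes precedence over everything
    else, and FIPS endpoints use the "athena-fips" service prefix.
    """
    if vpc_endpoint:
        # A VPC endpoint overrides the default regional endpoint.
        return vpc_endpoint
    service = "athena-fips" if fips else "athena"
    return f"{service}.{region}.amazonaws.com"


if __name__ == "__main__":
    print(resolve_athena_endpoint("us-east-1"))
    print(resolve_athena_endpoint("us-east-1", fips=True))
```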
After entering all the details, click the TEST button.
If the connection and authentication details are correct, a success message stating “connection available” is displayed.
Click on the CREATE button to save the changes.
If the details are incorrect or the server is down, you will get a message “Connection unavailable”.