Gathr Deployment on GCP - BYOL

Gathr can be deployed on the Google Cloud Platform (GCP) to leverage the capabilities of Dataproc clusters for efficient and scalable data processing.

Create, manage and use Google Cloud Dataproc clusters from Gathr.

BYOL (Bring Your Own License) solution lets you run Gathr applications on Dataproc Clusters while using licenses purchased directly from Gathr.

Read step-by-step instructions to deploy Gathr (BYOL) on GCP.

You can access the Gathr (BYOL) page in Google Cloud Platform and Launch the application by performing the below steps.

Steps to Launch Gathr (BYOL) at Google Cloud Console

  1. Login to the Google Cloud Console portal using your Google account credentials.

  2. Search for “Gathr (BYOL)” in the search bar of the marketplace console and select Gathr (BYOL) from the listed items.

  1. Click on Get Started button to proceed further to accept the terms and agreements of GCP Marketplace and Gathr Data, Inc.

LandingPage_byol

  1. You wil be redirected to the Agreements page.

Accept the agreement, click Agree and then Deploy to proceed further.

agreement_byol

deploy_byol

You will be redirected to the Gathr (BYOL) deployment page.

Gathr (BYOL) Deployment

  1. Under the Gathr (BYOL) Deployment page, provide the below details.

launch_byol

Deployment name

Provide the product deployment name for BYOL services.

Deployment Service Account

Select an existing service account or create a new one. The roles given below are required in a service account being used to deploy Gathr on GCP:

  • roles/config.agent

  • roles/compute.networkAdmin

  • roles/dataproc.editor

  • roles/compute.admin

  • roles/iam.serviceAccountUser

If an existing service account already has these roles, it will be listed and can be selected for deployment.

If there are no service accounts with these roles, choose the option to create a new service account for deployment.

Zone

Select the zone in which the services need to be deployed. Example: us-east1-b, us-east4-a. The zone determines the computing resources that are available and the details of where the data is stored and used.

Machine type

The options available are General Purpose, Compute Optimized and Memory Optimized that can be selected based on different licenses for Cloud Marketplace virtual machine image solution with billing services.

Upon selecting General Purpose option under Machine Type (which is used for common workloads, optimized for cost and flexibility) provide the below details.

Series

CPU platform selection based on availability. Example: Second Generation: E2, N2 or Third Generation C3.

Machine type

Select Machine type. Option to select Standard types example: e2-standard 8 (8 vCPU 4 cores 32 GB memory) or Custom vCPU cores and memory are available.

GCS bucket Name

Provide an existing GCS bucket name where Gathr’s configuration and pipeline metadata is to be stored.

Provide the Boot Disk details:

Boot disk type

Select the boot disk type. Available options are: Balanced Persistent Disk, Extreme Persistent Disk, SSD Persistent Disk and Standard Persistent Disk. For details, click here.

Boot disk size in GB

Provide the boot disk size in GB. For details, click here.

Under Networking > Networking Interface provide the below details.

Network

The network determines which network traffic will the instance access.

Subnetwork

Assigns the IPv4 address from the sub network’s range. Instances in different sub-networks can communicate with each other using their internal IP’s as long as they belong to the same network.

External IP

An external IP address associated with this instance. Selecting None will result in the instance having no external internet access.

Option to add a network interface is available based on the available number of network interface.

Firewall

By default, all incoming traffic from outside the network is blocked except the 8090 port for Gathr. You can modify the source IP range to enable access only from specified IP range.

Example: You can specify IP range as shown below:

firewall-port-example

Select the type of network traffic you want to allow. Add tags and firewall rules to allow specific network traffic from the Internet. For example: Allow HTTP traffic from the Internet or Allow TCP port 22 traffic from the Internet etc. For details on VPC Firewall rules, click here.

  1. Once the required details are provided at Gathr (BYOL) Deployment page, click DEPLOY button.

The deployment process will take a couple of minutes to complete. Once done, you can get started with Gathr (BYOL).

deployment_in_progress_byol

Once the Deployment is done, the Details tab will show deployment information:

successfull_deployment_byol

  1. Click on Gathr URL to start Gathr onboarding process.

Gathr Onboarding Steps

  1. You will be redirected to Gathr Software End User License Agreement.

Accept the terms of licence agreement and click CONTINUE.

eula_byol

  1. Upload License that you received on your registered email ID from Gathr’s support team.

license_upload_byol

  1. Once the license is uploaded and validated successfully, click CONFIRM button.

license_upload_byol02

  1. You will be redirected to the welcome page of Gathr requesting you to save the essential details including Gathr application’s URL, Username and Password. Click CONTINUE.

welcome_page_byol

  1. On the next page you will see the links to Quick Start Guide, Demo Video and Overview Videos. Click the START HERE button to proceed further.

StartHere06

  1. Sign-in using Superuser credentials.

gathr_login_byol

  1. At first sign-in you will be required to change the password.

ChangePass08

  1. Once you change the password, click Back to Login button to login using the changed password.

BackToLogin09

You will now successfully land at the Gathr UI dashboard.

From here, you can further see details of Create Workspace for GCP in Gathr →, Manage Connections →, Manage Configuration →, Create Project →, Manage Users and Roles →, Create GCP Cluster →, Create Pipeline →, Manage Pipeline →.

Top