Getting Started

Gathr is an enterprise-grade visual data analytics platform for unified streaming and batch data processing, built on best-of-breed open source technologies. It supports end-to-end data ingestion, enrichment, machine learning, action triggers, and visualization. Gathr offers an intuitive drag-and-drop visual interface to build and operationalize data applications five to ten times faster, across industries, data formats, and use cases.

Supported Browsers:

- Google Chrome 74.0 and above

- Mac Safari 12.0.0 and above

- Firefox 66 and above

Note: Internet Explorer is not supported.

This topic familiarizes the user with the Gathr landing page. A user can log in to the Gathr application either as a Superuser or as a Workspace user, based on the roles and permissions assigned.

A Superuser is responsible for monitoring and administration of Gathr.

Monitoring is done through the Superuser dashboard, and administration is done through the features displayed in the main menu.

Once the Superuser logs in, the Superuser Dashboard is displayed as the landing page.

The illustration below shows the landing page. Each numbered component in the illustration is described after it.

[Illustration: Superuser landing page]

1. Gathr Logo: Logo of the Gathr product.

2. Summary: Name of the page that is displayed in the application.

3. Dashboard: Dashboard to monitor widgets such as Pipeline Summary, Metrics, Gathr Web Health, Connections, Alerts, License Summary, and Inspect Sessions.

4. Workspace: Drop-down option that lets the Superuser display the dashboard of either the entire Gathr application or an individual workspace.

5. Refresh Interval: Refresh interval (in minutes or seconds) for the log monitoring.

6. Alerts: System alerts with a brief description and timestamp.

7. Superuser Settings: Options to manage the Superuser settings and to log out of the application.

8. Gathr main menu: Side navigation panel in Gathr (main menu).

9. Gathr application footer: Links to Gathr Online Documentation, License details, Web Application Logs, Tomcat Catalina Logs, and Audit Logs.

Dashboard

The Superuser dashboard consists of a combination of widgets: Pipeline Summary, Metrics, Gathr Web Health, Connections, Alerts, License Summary, and Inspect Sessions.

The Workspace dashboard also consists of all the widgets given above except Connections and License Summary. All the widgets that appear in the Workspace dashboard will contain relevant details specific to the Workspace alone.

Pipeline Summary

This widget provides a summary of all the pipelines throughout the workspaces.

[Illustration: Pipeline Summary widget]

The Pipeline Summary components are described below:

Pipelines: Number of pipelines created in the Spark engine across all workspaces.

Active: Number of active pipelines across all workspaces.

Starting: Number of pipelines in starting mode across all workspaces.

Error: Number of pipelines in error state across all workspaces.

Stopped: Number of pipelines that have stopped across all workspaces.

Node Managers: Number of node managers for the pipelines.

Cores: Status of the allocated cores.

Memory: Status of the allocated memory.

Spark: Supported engine for the Gathr application.

Metrics

Metrics provides the count of different entities throughout the workspaces.

[Illustration: Metrics widget]

The Metrics components are described below:

Users: Number of users across all workspaces.

Tables: Number of tables across all workspaces.

Models: Number of models registered across all workspaces.

Components: Number of components across all workspaces.

Functions: Number of functions (system and custom) registered within all the workspaces.

Transformations: Total number of transformations created across all workspaces.

Gathr Web Health

This section helps to monitor the overall health of the Gathr Web Application through the memory and CPU utilization.

Application

Application Web Health shows the utilization of the CPU and Memory by the Gathr Application. Choose Application from the drop-down menu provided at the top right corner of the widget.

[Illustration: Gathr Web Health widget]

CPU

CPU shows the average usage, in percent, of all the cores available on the machine. CPU usage is reported node by node. For example, if there are 4 node managers on a machine and the Application uses 2 nodes, the widget may show 100% for one node and 90% for the other (as shown in the illustration above).

Memory

Memory shows the total memory utilization on the machine, in percent. The Application draws on the total available memory, irrespective of node.

System

This widget provides details about System Health. System Web Health describes the machine on which the Gathr application is running.

CPU

The cumulative usage of all cores available on the machine, in percent, irrespective of the CPU utilization of the Gathr Application.

Memory

The total memory utilization on the machine, in percent. This is also irrespective of the memory used by the Application.

Connections

A connection is an end-point configuration for an infrastructure component. This widget provides the total number of connections available in Gathr, that is, the total count of all connections at the Superuser level and in all the workspaces.

In the illustration below, each component type has a corresponding number, which signifies the number of different end-point configurations available.

Gathr can connect to infrastructure components like Cassandra, RabbitMQ, Elasticsearch, HDFS, HBase, Kafka, Solr, and Hive.

Alerts

This widget shows system alerts, each with a brief description and timestamp. The generated alerts can be checked on the UI in addition to email notifications.

On the Alerts widget, click the drop-down arrow and select Application or System.

Application

On the Alerts widget, click the drop-down arrow and select Application. Alerts can also be applied to a streaming pipeline. The widget shows the description of each alert and its timestamp. The alert description can be customized.

[Illustration: Application alerts]

System

This widget shows the alerts generated by a pipeline when it goes into error mode or is terminated from YARN.

System alerts show two types of alerts.

Pipeline Stopped Alerts: Alerts thrown when a pipeline is terminated from YARN.

Error Mode Alerts: Alerts thrown when a pipeline goes into error mode.

[Illustration: System alerts]

License Summary

This is the summary of the license and the license owner:

[Illustration: License Summary widget]

Note: License type is dependent on the license validity.

License Validity

License validity falls into three categories:

A. Time-based: The license cannot be used after the expiry date. It provides early warning alerts.

B. Total number of cores: The license does not permit usage of more than the total number of cores.

C. Total number of nodes and cores: The license is valid for a specific number of nodes and a specific number of cores on each node. The system does not permit usage of more than the specified number of nodes or cores per node. For example, a license for 2 nodes and 2 cores per node covers 2 nodes and 4 cores in total. Therefore, if you have 2 nodes but each node has 3 cores, the license will not be valid.
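The node-and-core rule for category C can be illustrated with a minimal sketch. This is not Gathr's actual license check; the class and function names below are hypothetical and only mirror the rule as described above.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class NodeCoreLicense:
    """Hypothetical model of a category C license (not a Gathr API)."""
    max_nodes: int
    cores_per_node: int


def within_license(lic: NodeCoreLicense, cores_on_each_node: List[int]) -> bool:
    """Return True if the cluster fits the node and per-node core limits."""
    # Too many nodes violates the license regardless of core counts.
    if len(cores_on_each_node) > lic.max_nodes:
        return False
    # Every node must stay within the per-node core limit.
    return all(cores <= lic.cores_per_node for cores in cores_on_each_node)


lic = NodeCoreLicense(max_nodes=2, cores_per_node=2)
print(within_license(lic, [2, 2]))  # 2 nodes with 2 cores each
print(within_license(lic, [3, 3]))  # 2 nodes with 3 cores each
```

The second call corresponds to the example in the text: the node count is within the limit, but 3 cores per node exceeds the per-node limit of 2.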

Any single installation of Gathr will support only one of the following license types:

1. A

2. B

3. C

4. A and B

5. A and C

The same information is available by clicking License in the bottom left corner of the dashboard page.

The supported module is the supported engine, Spark.

License Type: The license types are Enterprise, Developer, and Trial.

License owner: Name of the license owner.

Email: Email id of the license owner.

Start date: Start date of the license duration.

Expiration date: End date of the license duration.

Supported Engines: The supported processing engine.

Max Cores: Number of cores allowed by the license.

Max Nodes: Number of nodes allowed by the license.

Cores/Node: Maximum number of cores allowed per node.

Version: The release version of Gathr.

If the license duration needs to be extended, the license owner can contact support@Gathr.tech.

You will get an alert 14 days before the expiry date, both by email and as periodic notifications on the Gathr UI.
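As a rough illustration of the 14-day warning, the date arithmetic can be sketched as follows. The function names are hypothetical, and treating the expiry date itself as part of the warning window is an assumption, not documented Gathr behavior.

```python
from datetime import date, timedelta

WARNING_DAYS = 14  # taken from the text: alerts start 14 days before expiry


def warning_start(expiry: date) -> date:
    """Return the first day on which an expiry alert would be due."""
    return expiry - timedelta(days=WARNING_DAYS)


def in_warning_window(today: date, expiry: date) -> bool:
    """Assumption: the window runs from warning_start through the expiry date."""
    return warning_start(expiry) <= today <= expiry


expiry = date(2025, 3, 31)
print(warning_start(expiry))                        # 2025-03-17
print(in_warning_window(date(2025, 3, 20), expiry))  # True
```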

Violation

Three codes show the status of the license. The Violation tab is at the top right corner of the widget.

[Illustration: License violation status]

GOOD - License not expired and not violated.

Expired - License expired.

Violated - License in violated state.

In case of an expired or violated license, contact the Gathr support team at support@Gathr.tech.

Inspect Sessions

All the running Inspect sessions are listed in this widget.

Sessions for all users in all of the Superuser's workspaces can be viewed, with the details explained below.

These inspect sessions can be filtered on the basis of:

- Workspace

- Session Type: Local

You can view live streaming of logs corresponding to a local session and download logs.

[Illustration: Inspect Sessions widget]

The fields of the widget are described below:

Workspace: The name of the workspace under the Superuser for which the inspect session was created.

User: The name of the user under the workspace for which the inspect session was created.

Session Type: Local.

Session ID: The ID number of the inspect session.

Process ID: The process ID number of the inspect session.

Running Since: The duration of the inspect session in hours, minutes, and seconds.

Actions: Logs (view, download, and delete the inspect session logs) and Stop (stop the inspect session).

Superuser can stop active session(s) of workspace(s) by using the stop button.

Superuser Settings

Go to Superuser on the top right corner of the superuser landing page and click on Settings.

This option gives the Superuser access to the settings and a way to log out of Gathr.

[Illustration: Superuser Settings option]

Under Superuser settings, you can change the password and regenerate the token value. Regenerating the token is useful when the previous token expires. A success message is displayed when the Superuser updates the token value.

[Illustration: Superuser Settings]

Superuser Settings

User Name: Superuser name of the Gathr application.

Email Id: Superuser’s email id to receive notifications.

Language: Superuser’s choice of language. English (US) by default.

Change Password: Checkbox option to change the existing password. Note: Once the checkbox is selected, three additional fields appear in the settings: Old Password, New Password, and Confirm Password.

Token Value: Generated token value.

Regenerate Token: Generate token without updating password.

With the token value, the Superuser can perform actions such as creating workspaces and viewing the cluster configuration directly through REST calls.

[Illustration: REST call using the token value]
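The general shape of a token-authenticated REST call can be sketched as below. This is only an illustration: the host, the endpoint path, and the header name `token` are placeholders invented for the example, not documented Gathr values. Consult the Gathr REST documentation for the actual endpoints and for how the token must be passed. The sketch builds the request without sending it.

```python
from urllib.request import Request


def build_gathr_request(base_url: str, path: str, token: str,
                        header_name: str = "token") -> Request:
    """Build (but do not send) a GET request that carries the token.

    header_name and path are hypothetical; the real values depend on
    the Gathr REST API and are not given in this guide.
    """
    url = base_url.rstrip("/") + "/" + path.lstrip("/")
    return Request(url, headers={header_name: token}, method="GET")


# Placeholder host, path, and token value, for illustration only.
req = build_gathr_request("https://gathr.example.com",
                          "/hypothetical/endpoint",
                          "my-token-value")
print(req.full_url)
```

Passing the resulting object to `urllib.request.urlopen` would send the call; that step is left out here because the real endpoint is not known.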

Gathr Main Menu

[Illustration: Gathr homepage]

The main menu can be accessed from the left navigation panel. As explained earlier, the Superuser manages Gathr administration with the features available in the main menu.

For more details on Gathr Administration, see Administration.

Live Logs

Live streaming of logs is available in Gathr so that you can debug issues from the browser itself without the need to access systems where Gathr Webstudio is running. The live logs available on the UI are mentioned below:

Application logs can be viewed from the footer of the Gathr webpage, as shown below:

[Illustration: Log links in the footer]

- Web Logs

- Tomcat Logs

- Audit Logs

The logs for local inspect sessions and data pipelines can be viewed on the relevant pages of the Gathr application:

- To view the Local Inspect Session logs, see the Livy Connect window or use the Log button in Inspect Sessions.

- To view the Data Pipeline log details, see Pipeline Submission Logs.

Web Logs

Web logs can be viewed by clicking the Web Logs option in the footer of the Gathr application.

[Illustration: Web logs]

Tomcat Logs

Tomcat Catalina logs can be viewed by clicking the Tomcat Logs option in the footer of the Gathr application.

[Illustration: Tomcat logs]

Audit Logs

Application audit logs can be viewed by clicking the Audit Logs option in the footer of the Gathr application.

Available Actions on Logs

The logs are generated with timestamps. Available actions on logs are:

Pin to another tab: The log window can be pinned to a new tab.

Download: Download the logs.gz file. For Web Application logs, the Gathr logs and the ten latest archived sax.log files (if available) are listed; click a file to download it.

Close window: Close the logs window.

Delete logs: Delete the logs and clear the console.

Get last N lines: View the last N log lines by specifying the number of lines.

Filter: Apply a keyword filter. The filters are not case sensitive.
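The behavior of the Get last N lines and Filter actions can be approximated offline, for example on a downloaded log file. The sketch below is not Gathr's implementation; the function names are hypothetical, and it only mirrors the two behaviors described above (last N lines, and a keyword match that ignores case).

```python
from collections import deque
from typing import Iterable, List


def last_n_lines(lines: Iterable[str], n: int) -> List[str]:
    """Return the last n log lines, keeping at most n lines in memory."""
    if n <= 0:
        return []
    return list(deque(lines, maxlen=n))


def filter_lines(lines: Iterable[str], keyword: str) -> List[str]:
    """Keep lines that contain the keyword, ignoring case."""
    needle = keyword.casefold()
    return [line for line in lines if needle in line.casefold()]


log = ["INFO start", "ERROR disk", "info done", "Error net"]
print(last_n_lines(log, 2))        # ['info done', 'Error net']
print(filter_lines(log, "error"))  # ['ERROR disk', 'Error net']
```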