Quick Start Guide

The aim of this quick start guide is to help you achieve below objectives:


Before you Begin

  • File Upload connector is the source used to create ETL and Ingestion applications.

  • To help you begin, a sample CSV can be downloaded and used in the File Upload connector.

    Download Sample CSV File

  • Gathr store is the target used to create ETL and Ingestion applications.

    Gathr store is a temporary storage service provided by Gathr. It allows you to access processed data results by downloading them from the Gathr Store output.


Quick Start with Data Ingestion

Create and run an Ingestion application in four simple steps:

  • Step 1: Configure the data source for the Ingestion application.

  • Step 2: Configure the target for ingesting data.

  • Step 3: Save the application with preferred deployment configuration settings.

  • Step 4: Run the application and oversee its performance.

Explore detailed walkthrough of each step below:


Step 1 - Configure Source

  1. Choose Create a data flow from the Gathr’s landing page.

    default-landing-page-ingestion

  2. On the dashboard, switch to Ingestion tab, and click on CREATE INGESTION APPLICATION.

    dashboard-ingestion

  3. Select File Upload as data source.

    quickstart-design-ingestion-01

  4. Update the Data Format and Header parameters. It allows the application to read the source data accurately.

    quickstart-design-ingestion-02

  5. Upload the sample file supermarket-sales-data.csv or any other file of your choice and click on SCHEMA button to populate the source schema for preview.

  6. Click on TRANSFORMATIONS button to complete the configuration of the file data source.

    quickstart-design-ingestion-03

    You can see the Profile Pane. To know more, refer to Data Preparation.

    quickstart-design-ingestion-04

    To know more, refer to Gathr supported Transformations in Data Ingestion.


Step 2 - Configure Target

  1. Select the Gathr Store as target.

    quickstart-design-ingestion-05

  2. Select the delimiter type, output fields, and configure any other parameters to complete the target configuration.

    quickstart-design-ingestion-06

  3. Click on APP CONFIGURATION to complete the target configuration.


Step 3 - Save Ingestion Application

  1. Update runtime configuration preferences.

    quickstart-design-ingestion-07

  2. Click on SAVE button to save the application and exit to the Ingestion listing page.

    To know about various options available on the Ingestion applications listing page, click here.


Step 4 - Run Ingestion Application

Run the saved Ingestion application from the listing page.

quickstart-design-ingestion-08

With Gathr Store as target, the emitted data after running this application will be available in the Gathr Store Output.

quickstart-design-ingestion-09

Explore the following features to manage Ingestion application:


Quick Start with ETL

Create and run an ETL application in five simple steps:

  • Step 1: Configure the data source for the ETL application.

  • Step 2: Add transformations based on specific data processing requirements.

  • Step 3: Configure the target for emitting the processed data.

  • Step 4: Save the application with preferred deployment configuration settings.

  • Step 5: Run the application and oversee its performance.

Explore detailed walkthrough of each step below:


Step 1 - Configure Source

  1. Choose Create a data flow from the Gathr’s landing page.

    default-landing-page-etl

  2. On the dashboard, switch to Advanced ETL tab, and click on CREATE ETL APPLICATION.

    dashboard-etl

  3. Click on Upload a file to get started to utilize File Connector as a data source.

    quickstart-design-etl-01

    There are several features available on the ETL canvas. To know more about them, click here.

  4. Update the Header and Data Format parameters. It allows the application to read the source data accurately.

    quickstart-design-etl-02

  5. Upload the sample file supermarket-sales-data.csv or any other file of your choice.

  6. Proceed to next tabs till you complete the configuration of the file data source.

    quickstart-design-etl-02-1

    Explore tab-specific details to configure any other data source by following the links below:

    With this step the data source configuration is completed.

    quickstart-design-etl-03


Step 2 - Add Transformations

Data transformation using AI Assistant

After configuring a data source, utilize Gathr’s AI Assistant to add transformations.

Switch to AI Assistant and instruct using natural language to transform source data.

Use the example below to guide the AI Assistant in processing sales data for your unique needs.

Fetch InvoiceID of all the purchases made in the City of Yangon under ProductLine "Home and lifestyle".

quickstart-design-etl-04

Following your instructions, the AI Assistant generates tasks by breaking down your requirements.

You can apply the recommended tasks individually or all at once to achieve your desired results.

quickstart-design-etl-04

After completing the tasks, you can transform the data further or return to the operators section to add data emitters.

quickstart-design-etl-04


User-driven data transformation

Effortlessly tailor your data transformation with the user-driven control in just a few simple steps:

  1. Navigate to the Transformations section, drag and drop the Dedup processor next to the File conector.

    quickstart-design-etl-04

  2. Click on Dedup transformation to see its configuration fields.

  3. Select the reference column for removing duplicate records, and proceed to finalize the configuration.

    Once the transformation configuraion is done, you can see the Profile Pane. To know more, refer to Data Preparation.

    Similarly, you can add more transformations as required. To know more, refer to Gathr supported Transformations.


Step 3 - Configure Target

  1. Navigate to the Emitters section, search for “Gathr Store” and drag and drop it next to the Dedup transformation.

    quickstart-design-etl-05

  2. Click on Gathr Store to see its configuration fields.

    quickstart-design-etl-06

  3. Select the delimiter type, output fields, and configure any other parameters in the target.

  4. Continue to complete the target configuration.

    Click on component inspection button (the eye icon) - placed right above the configured target, to preview the processed data.

    quickstart-design-etl-07


Step 4 - Save ETL Application

  1. Click on Save button above the Components pallet to load the Pipeline Definition page.

    quickstart-design-etl-09

  2. Update runtime configuration preferences.

  3. Save the application and exit to the ETL listing page.

    To know about various options available on the ETL applications listing page, click here.


Step 5 - Run ETL Application

Run the saved ETL application from the listing page.

quickstart-design-etl-10

With Gathr Store as target, the emitted data after running this application will be available in the Gathr Store Output.

quickstart-design-etl-11

Explore the following features to manage ETL application:

Top