Data Ingestion Applications

Data Ingestion is one of the most crucial steps of the data handling process. It involves transferring data from a source to a target after applying any desired transformations.

Gathr offers a user-friendly interface for creating data ingestion applications on its SaaS platform and triggering them on either a Gathr cluster or user-preferred clusters that are registered with Gathr using Compute Setup →


Useful references for Data Ingestion Applications:


Ingestion Applications Listing Page

All the data ingestion applications you create in your project are listed on the project's Ingestion page.

Listed Applications Details

Details shown for each listed application are:

  • Source and Target components.

  • Application name and status.

  • Time since last modification and next sync schedule information.

  • Assigned cluster and scheduling information.

  • Application description and custom tags added by the user.
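To make these fields concrete, the sketch below models a listed application as a plain record and shows the kind of keyword search and sorting that the listing page offers. It is only an illustration: the class name, field names, the fields that search matches against, and the sample values are all assumptions, not Gathr's actual data model or API.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Optional

# Sort keys mirroring the listing page: name, created date, next scheduled date.
SORT_KEYS = ("name", "created", "next_run")


@dataclass
class ListedApplication:
    # Hypothetical field names based on the details shown for each application.
    name: str
    status: str
    source: str
    target: str
    created: datetime
    next_run: Optional[datetime] = None  # None when nothing is scheduled
    cluster: str = ""
    description: str = ""
    tags: List[str] = field(default_factory=list)


def search(apps, keyword):
    """Case-insensitive match on name, description, and tags (assumed fields)."""
    needle = keyword.lower()
    return [
        app for app in apps
        if needle in app.name.lower()
        or needle in app.description.lower()
        or any(needle in tag.lower() for tag in app.tags)
    ]


def sort_by(apps, key, descending=False):
    """Sort by one of SORT_KEYS; applications with no value for the key go last."""
    if key not in SORT_KEYS:
        raise ValueError(f"unsupported sort key: {key}")
    present = [app for app in apps if getattr(app, key) is not None]
    missing = [app for app in apps if getattr(app, key) is None]
    ordered = sorted(present, key=lambda app: getattr(app, key), reverse=descending)
    return ordered + missing


# Made-up sample data, for illustration only.
apps = [
    ListedApplication("orders_to_s3", "RUNNING", "MySQL", "S3",
                      created=datetime(2024, 1, 5),
                      next_run=datetime(2024, 2, 1, 6, 0),
                      tags=["sales"]),
    ListedApplication("Clickstream", "STOPPED", "Kafka", "Snowflake",
                      created=datetime(2024, 1, 2)),
]

oldest_first = sort_by(apps, "created")
sales_apps = search(apps, "SALES")
```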


Actions on Listed Applications

The actions available on the listing page for applications are described below:

Action: Description

Search: Enter keywords in the search bar to find an application in the existing list of applications.

Sort By: Sort the listed applications by name, created date, or next scheduled date, in either ascending or descending order.

Refresh Applications: Refresh all the listed applications in a single click.

Create Application: Create a new data ingestion application. The steps to create data ingestion applications are explained in the subsequent topics.

Start Application: Trigger any existing application.

Stop Application: Stop any running application directly.

View Application: View the application details. This option is further explained in the View Applications → topic.

Edit Application: Edit the source, transformation, target, or configuration details of an existing application, then update to save the changes.

Delete Application: Delete an ingestion application and send it to Gathr's Recycle Bin.

Schedule Application: Schedule an application to trigger automatically according to the following options:
- Start Time.
- End Date and Time: defines when the scheduled executions should stop. No further runs are scheduled after this date and time. This is useful for tasks that have a finite duration or are only needed for a certain period.
- Frequency: minutes, hourly, daily, weekly, monthly, or yearly. The minutes frequency can be set to 15, 20, or 30 minutes.
- Every: the interval for the selected frequency, in hour(s), week(s), month(s), and so on. For the Hourly frequency, an additional Start Time option is available to provide the start time in hours and minutes.
- Time zone: the time zone in which the schedule is evaluated. This ensures that the application runs at the desired time in the specified time zone.
- An auto-generated Cron Expression.

Change Cluster: Change the cluster on which an ingestion or ETL application is triggered. The options are Extra Small, Small, Medium, and Large.
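The Schedule Application options above end in an auto-generated cron expression. As a rough illustration of how a frequency, an interval, and a start time could map to one, here is a minimal sketch that builds a standard five-field Unix cron string. The function name `build_cron`, its parameters, and the five-field format are assumptions made for this example; the exact expression format Gathr generates is not described here and may differ.

```python
from datetime import time

# Minute intervals listed for the "minutes" frequency.
ALLOWED_MINUTE_INTERVALS = (15, 20, 30)


def build_cron(frequency: str, every: int = 1, start: time = time(0, 0)) -> str:
    """Return a cron string: minute hour day-of-month month day-of-week.

    Hypothetical helper for illustration; not Gathr's implementation.
    """
    if every < 1:
        raise ValueError("interval must be a positive integer")
    if frequency == "minutes":
        if every not in ALLOWED_MINUTE_INTERVALS:
            raise ValueError("minutes frequency must be 15, 20 or 30")
        return f"*/{every} * * * *"
    if frequency == "hourly":
        # Begin at the start hour and repeat every N hours until 23:xx each day.
        return f"{start.minute} {start.hour}-23/{every} * * *"
    if frequency == "daily":
        # Cron steps restart on day 1 of each month, so this only
        # approximates "every N days" when N > 1.
        return f"{start.minute} {start.hour} */{every} * *"
    raise ValueError(f"unsupported frequency in this sketch: {frequency}")


every_15_minutes = build_cron("minutes", every=15)
every_2_hours_from_0930 = build_cron("hourly", every=2, start=time(9, 30))
daily_at_0600 = build_cron("daily", start=time(6, 0))
```

Weekly, monthly, and yearly frequencies are left out to keep the sketch short. The end date, end time, and time zone are not part of a five-field cron string, so a scheduler would need to track them separately.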

You can also run Data Ingestion applications on clusters that are registered with Gathr. As a prerequisite, set up compute environments in Gathr from the User Settings > Compute Setup → tab.


For account upgrade information, see Subscription →