Salesforce Data Source

Parse Salesforce data directly from the source, or import a data file in any supported format except Parquet. The Salesforce channel lets you read data from a Salesforce account.

Salesforce is a CRM application built on the Force.com platform. It manages an organization’s customer interactions across different media, such as phone calls, site email inquiries, communities, and social media. The Data Source reads the Salesforce objects specified by a Salesforce Object Query Language (SOQL) query.
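
To illustrate what such a query looks like, the following minimal sketch assembles a SOQL string in Python. The `build_soql` helper is hypothetical and not part of the product; the standard `Account` object with its `Id`, `Name`, and `Industry` fields is used only as an example.

```python
def build_soql(sobject, fields, where=None, limit=None):
    """Assemble a simple SOQL SELECT statement (illustrative helper only)."""
    if not fields:
        # SOQL has no "SELECT *"; the fields must be listed explicitly.
        raise ValueError("SOQL requires an explicit field list")
    query = f"SELECT {', '.join(fields)} FROM {sobject}"
    if where:
        query += f" WHERE {where}"
    if limit is not None:
        query += f" LIMIT {int(limit)}"
    return query


# Example: fetch the Id and Name of up to 100 accounts in one industry.
print(build_soql("Account", ["Id", "Name"], where="Industry = 'Banking'", limit=100))
```

The resulting string is the kind of text that would be entered in the Query field of the Data Source.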

First, create a Salesforce connection. To do so, you need the following:

  • A valid Salesforce account.

  • The user name of the Salesforce account.

  • The password of the Salesforce account.

  • The security token of the Salesforce account.
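
The prerequisites above can be captured as a small record that a script checks before a connection is created. This is a minimal sketch: the `SalesforceCredentials` class and its field names are hypothetical and not part of the product. The `login_password` method reflects the Salesforce convention of appending the security token to the password for API logins from outside trusted networks; whether the connection applies it in exactly this way is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SalesforceCredentials:
    """Hypothetical container for the connection prerequisites."""
    username: str
    password: str
    security_token: str

    def __post_init__(self):
        # Reject empty values early, before any connection attempt.
        for name in ("username", "password", "security_token"):
            if not getattr(self, name):
                raise ValueError(f"missing Salesforce credential: {name}")

    def login_password(self) -> str:
        # Salesforce API logins typically expect password + security token.
        return self.password + self.security_token


creds = SalesforceCredentials("user@example.com", "secret", "TOKEN123")
print(creds.username)
```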

Configuring Salesforce Data Source

To add a Salesforce Data Source to your pipeline, select the Data Source and click it to open its configuration.

Under the Schema Type tab, select Fetch From Source or Upload Data File.

Next, select a Salesforce connection, write a query to fetch a Salesforce object, and provide an API version. The configuration fields are described below:

Field: Description
Connection Name: Select the Salesforce connection.
Override Credentials: Unchecked by default. Check this option to override the connection's credentials for user-specific actions.
Username: Available once Override Credentials is checked. Provide the user name under which the Salesforce service runs.
Password: Provide the password for the Salesforce account.
Security Token: Provide the security token for the Salesforce account.
Table Name: Select the source table whose schema is to be detected.
Query: Use the Salesforce Object Query Language (SOQL) to search your organization’s Salesforce data for specific information. SOQL is similar to the SELECT statement in the widely used Structured Query Language (SQL) but is designed specifically for Salesforce data.
Infer Schema: Infers the schema from the query results to determine the data type of each field specified in the SOQL query. This works only if the query returns 5 or more records. (Optional)
Bulk: Flag to enable bulk queries, the preferred method for loading large sets of data. The Bulk API is based on REST principles and is optimized for large data volumes. You can use it to query many records asynchronously by submitting batches, which Salesforce processes in the background. The default value is false. (Optional)
Pk Chunking: Flag to enable automatic primary key (PK) chunking for a bulk query job. PK chunking splits a bulk query into separate batches of the size defined by the ChunkSize option, which makes large queries manageable when using the Bulk API. PK stands for primary key, that is, the object’s record ID, which is always indexed. The feature is supported for all custom objects, many standard objects, and their sharing tables. The default value is false, and the default chunk size is 100,000. (Optional)
ChunkSize: The number of records to include in each batch. The default value is 100,000 and the maximum is 250,000. This option can be used only when Pk Chunking is true.
Timeout: The maximum time, in milliseconds, spent polling for completion of the bulk query job. This option can be used only when Bulk is true.
Version: Select the Salesforce API version from the drop-down list.
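
The dependencies between the Bulk, Pk Chunking, ChunkSize, and Timeout fields described above can be sketched as a small validation routine. The `BulkQueryOptions` class and its attribute names are hypothetical; only the defaults and limits come from the field descriptions. Treating Pk Chunking as valid only when Bulk is enabled is an assumption drawn from its description as a bulk query feature. `Sforce-Enable-PKChunking` is the request header the Salesforce Bulk API uses to turn on PK chunking; whether this Data Source sends it in exactly this form is also an assumption.

```python
from dataclasses import dataclass
from typing import Optional

DEFAULT_CHUNK_SIZE = 100_000  # default stated in the field descriptions
MAX_CHUNK_SIZE = 250_000      # maximum stated in the field descriptions


@dataclass
class BulkQueryOptions:
    """Hypothetical mirror of the bulk-related configuration fields."""
    bulk: bool = False
    pk_chunking: bool = False
    chunk_size: Optional[int] = None
    timeout_ms: Optional[int] = None

    def validate(self) -> None:
        if self.pk_chunking and not self.bulk:
            # Assumption: PK chunking is a feature of bulk query jobs.
            raise ValueError("Pk Chunking requires Bulk to be true")
        if self.chunk_size is not None:
            if not self.pk_chunking:
                raise ValueError("ChunkSize requires Pk Chunking to be true")
            if not 1 <= self.chunk_size <= MAX_CHUNK_SIZE:
                raise ValueError(f"ChunkSize must be between 1 and {MAX_CHUNK_SIZE}")
        if self.timeout_ms is not None and not self.bulk:
            raise ValueError("Timeout requires Bulk to be true")

    def pk_chunking_header(self) -> dict:
        """Return the Bulk API header that requests PK chunking, if enabled."""
        self.validate()
        if not self.pk_chunking:
            return {}
        size = self.chunk_size or DEFAULT_CHUNK_SIZE
        return {"Sforce-Enable-PKChunking": f"chunkSize={size}"}


options = BulkQueryOptions(bulk=True, pk_chunking=True, chunk_size=250_000, timeout_ms=600_000)
print(options.pk_chunking_header())
```

Checking the combination up front surfaces an invalid setting, such as a ChunkSize without Pk Chunking, before a bulk job is submitted.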
