ADLS Data Source - Batch and Streaming

Add an ADLS batch or streaming data source to create a pipeline. Click the component to configure it.

Under the Schema Type tab, select Fetch From Source, Upload Data File or Use Existing Dataset option. Edit the schema if required and click next to configure.

The options to Provide Schema field are also available of Upload Data File option under the Schema Type tab.

FieldDescription
Connection Name

Connections are the Service identifiers.

Select the connection name from the available list of connections, from where you would like to read the data.

ContainerProvide connection name in Azure Blob storage.
ADLS Directory Path

Provide directory path for ADLS file system.

ADD CONFIGURATIONSUser can add further configurations (Optional).
Environment ParamsUser can add further environment parameters. (Optional)

Provide the below fields to configure ADLS data source:

Click Next for Incremental Read option.

FieldDescription
Enable Incremental ReadUnchecked by default, check mark this option to enable incremental read support.
Read ByOption to read data incrementally either by choosing the File Modification Time option or Column Partition option.

Upon selecting the File Modification Time option, provide the below detail:

Offset

Specifies the last modified time of the file.

Records with timestamp value greater than the specified datetime (in UTC) will be fetched. After each pipeline run the datetime configuration will set to the most recent timestamp value from the last fetched records. The given value should be in UTC with ISO Date format as yyyy-MM-dd’T’HH:mm:ss.SSSZZZ. Example: 2021-12-24T13:20:54.825+0000.

Upon selecting the Column Partition option, provide the below details:

Read Control Type

Options to control data fetch:

Limit by Value: All records in reference column with values greater than the start value but less than/equal to the max value that you set will be read.

Limit by Incremental Size:  

All records in reference column with values greater than the start value with specified incremental size that you set will be selected.

Inclusive Start Offset

Check the checkbox for enabling the Inclusive Start Offset option to include the start value for incrementally reading the schema. Supports the integer, date and timestamp data types.

Upon selecting the Inclusive Start Offset option the schema from Start value to the Max value will be incrementally read.

Top