Neo4j

This component is supported in Gathr on-premise.

Field	Description
Connection Name	Connection name of the created connection for Neo4j.
Batch Read	Check the option to enable the batch processing.
Database Name	Name of the database from which data will be read.
Read Mode	Select one of the below option to fetch the records from Neo4j database. The available options are explained below.
Cypher Query	Options to provide a cypher query. Example: CREATE (n:Person {fullName: event.name + event.surname}).

Nodes	You can read nodes by specifying a single label, or multiple labels. Label list can be specified with starting colon. Example, :Person:Customer

Upon selecting Relationship option, the below fields are available:


Relationship	Option to define type of relationship. Specify the mapping detail, the source node, and the target node label as explained below.
Mapping	Check the option to control the result format by the mapping option. The result format can be controlled by the relationship.nodes.map Default is false. 👉 When it is set to false, source and target nodes properties are returned in separate columns prefixed with source or target. (i.e., source.name, target.price). When it is set to true, the source and target nodes properties are returned as Map[String, String] in two columns named source and target.
Source Nodes	Provide source nodes column. Example: MATCH (source:Person)-[rel: BOUGHT]->(target:Product) RETURN source, rel, target. Here, source:Person.
Target Nodes	Provide target nodes column. Example: MATCH (source:Person)-[rel: BOUGHT]->(target:Product) RETURN source, rel, target. Here target:Product.

Remaining common fields are as below:


Schema Flatten Limit	Number of records to be used to create the schema.
Schema Strategy	Strategy used by the connector in order to compute the schema definition for the dataset. Possible values are String and Sample.
Partitions	This defines the parallelization level while pulling data from Neo4j.
ADD CONFIGURATION	Option to add further configurations by providing the key-value pair.

If you have unchecked Batch option, the below fields will appear in case of streaming dataset case:


Streaming From	This option is used to trigger the connector from where to send data to the stream. You can select NOW (Starts reading from the current timestamp.) or All (Sends all the data to the database to the stream before reading the data).
Incremental Read Property	The timestamp property name used for incremental reading.

You have an option to add Environment parameters as mentioned below:

ENVIRONMENT PARAMS


ADD PARAMS	Option to add further environment parameters by providing the key-value pair.

Rename Column Name

In certain scenarios if the Column Name has special characters, you can use the Rename processor available in gathr.

Under the Detect Schema screen, select the Rename processor and join it with Neo4j source. Configure it by providing the desired column name as against the existing column name that contains special characters. Click Next for to see the renamed column name.

As shown below, the column name is updated.

If you have any feedback on Gathr documentation, please email us!