Cluster Configuration for Apache

💡 The component versions that are mentioned in this sub-topic are for representational purpose only. For the Gathr supported component version details, see Component Versions Supported →

Login to Gathr using Superuser credentials and go to Configuration from the main menu.

  1. Select the Web Studio tile and click on the Zookeeper tab.

    Webstudio_Zookeeper

    Provide value for the following property:

    FieldDescription
    Host ListThe comma separated list of all the nodes of Zookeeper cluster. This zookeeper cluster will be used to store Gathr configuration.
    For ex: hostname1:2181,hostname2:2181

    Save the changes by clicking on Save.

  2. Select Processing Engine tab and click on Spark tab.

    Provide values for the following properties to point Gathr to an external cluster:

    FieldDescription
    Spark cluster managerDefines Spark Cluster Manager i.e. ‘yarn’ or ‘standalone’.
    spark.history.serverDefines spark history server URL.
    Resource Manager HostDefines resource manager host name.
    Resource Manager Webapp PortDefines resource manager webapp port.
    Resource Manager PortDefines resource manager RPC port.
    ResourceManager High AvailabilityCheck this if Resource Manager is HA enabled.
    ResourceManager HA Logical NamesResource Manager HA logical IDs as defined in HA configuration.
    ResourceManager HA HostsResource Manager HA hostnames.
    ResourceManager HA ZK AddressResource Manager HA zookeeper quorum.

    Save the changes by clicking on Save.

Top