Sagemaker Processor

Amazon SageMaker allows to build, train, and deploy machine learning models quickly.

Amazon SageMaker is a fully-managed service that covers the entire machine learning workflow to label and prepare your data.

Choose an algorithm, train the algorithm, tune and optimize it for deployment, make predictions, and take action.

Your models get to production faster with much less effort and lower cost.

Once you have trained the model over Sagemaker, you can deploy them as endpoint which can be further utilized in Gathr for prediction over incoming data.

Sagemaker Processor Configuration

To add a SageMaker processor to the pipeline, drag the processor onto the canvas and right-click on it to configure as explained below:

FieldDescription
Connection NameMention the SageMaker Connection Name.
RegionSpecify the region name where end point is served. For.e.g., us-west-2.
End PointIt lists all the endpoint created in the region mentioned above.
Feature Column

All the columns, which serves as input to the model deployed as endpoint over SageMaker.

Algorithm NameName of the model algorithm, which is getting served as endpoint over Sagemaker.
ScoreSelect the score.
Predicted LabelSelect the predicted label.

Out of all built in supported algorithms from Sagemaker, user can utilize below mentioned algorithms endpoints for scoring over incoming data using Gathr.

  1. Linear Learner Binary Classification

  2. Linear Learner Regression

  3. Kmeans

  4. PCA

  5. Tensorflow Classification

  6. XGBoost

Gathr SageMaker supports Custom model as well.

For example, one can train model-using scikit-learn API and then deploy that model as endpoint over SageMaker for prediction.

Once endpoint is in In-service state, it can be used for prediction in Gathr.

The below table has Algorithms with their corresponding output field tabs:

FieldDescription
Linear Learner Binary ClassificationScore, Predicted Label.
Linear Learner RegressionScore.
KMeansClusterLabel, DistanceToCluster.
PCAProjection.
XGBoostPrediction.

Tensorflow

Classification

Score, OutputClass.
CustomPrediction.
Top