org.apache.spark.mllib.classification
Train a logistic regression model given an RDD of (label, features) pairs.
Train a logistic regression model given an RDD of (label, features) pairs. We run a fixed number of iterations of gradient descent using a step size of 1.0. We use the entire data set to update the gradient in each iteration.
RDD of (label, array of features) pairs.
Number of iterations of gradient descent to run.
a LogisticRegressionModel which has the weights and offset from training.
Labels used in Logistic Regression should be {0, 1}
Train a logistic regression model given an RDD of (label, features) pairs.
Train a logistic regression model given an RDD of (label, features) pairs. We run a fixed number of iterations of gradient descent using the specified step size. We use the entire data set to update the gradient in each iteration.
RDD of (label, array of features) pairs.
Number of iterations of gradient descent to run.
Step size to be used for each iteration of Gradient Descent.
a LogisticRegressionModel which has the weights and offset from training.
Labels used in Logistic Regression should be {0, 1}
Train a logistic regression model given an RDD of (label, features) pairs.
Train a logistic regression model given an RDD of (label, features) pairs. We run a fixed
number of iterations of gradient descent using the specified step size. Each iteration uses
miniBatchFraction
fraction of the data to calculate the gradient.
RDD of (label, array of features) pairs.
Number of iterations of gradient descent to run.
Step size to be used for each iteration of gradient descent.
Fraction of data to be used per iteration.
Labels used in Logistic Regression should be {0, 1}
Train a logistic regression model given an RDD of (label, features) pairs.
Train a logistic regression model given an RDD of (label, features) pairs. We run a fixed
number of iterations of gradient descent using the specified step size. Each iteration uses
miniBatchFraction
fraction of the data to calculate the gradient. The weights used in
gradient descent are initialized using the initial weights provided.
RDD of (label, array of features) pairs.
Number of iterations of gradient descent to run.
Step size to be used for each iteration of gradient descent.
Fraction of data to be used per iteration.
Initial set of weights to be used. Array should be equal in size to the number of features in the data.
Labels used in Logistic Regression should be {0, 1}
Top-level methods for calling Logistic Regression using Stochastic Gradient Descent.
(Since version 2.0.0) Use ml.classification.LogisticRegression or LogisticRegressionWithLBFGS
Labels used in Logistic Regression should be {0, 1}