Register a JavaDStream
of values for significance testing.
Register a JavaDStream
of values for significance testing.
stream of BinarySample(isExperiment,value) pairs where the isExperiment denotes group (true = experiment, false = control) and the value is the numerical metric to test for significance
stream of significance testing results
Register a DStream
of values for significance testing.
Register a DStream
of values for significance testing.
stream of BinarySample(key,value) pairs where the key denotes group membership (true = experiment, false = control) and the value is the numerical metric to test for significance
stream of significance testing results
Set the number of initial batches to ignore.
Set the number of initial batches to ignore. Default: 0.
Set the statistical method used for significance testing.
Set the statistical method used for significance testing. Default: "welch"
Set the number of batches to compute significance tests over.
Set the number of batches to compute significance tests over. Default: 0. A value of 0 will use all batches seen so far.
Performs online 2-sample significance testing for a stream of (Boolean, Double) pairs. The Boolean identifies which sample each observation comes from, and the Double is the numeric value of the observation.
To address novelty affects, the
peacePeriod
specifies a set number of initial org.apache.spark.rdd.RDD batches of theDStream
to be dropped from significance testing.The
windowSize
sets the number of batches each significance test is to be performed over. The window is sliding with a stride length of 1 batch. Setting windowSize to 0 will perform cumulative processing, using all batches seen so far.Different tests may be used for assessing statistical significance depending on assumptions satisfied by data. For more details, see StreamingTestMethod. The
testMethod
specifies which test will be used.Use a builder pattern to construct a streaming test in an application, for example: