public abstract class Transformer extends PipelineStage
Constructor and Description |
---|
Transformer() |
Modifier and Type | Method and Description |
---|---|
abstract Transformer |
copy(ParamMap extra)
Creates a copy of this instance with the same UID and some extra params.
|
abstract Dataset<Row> |
transform(Dataset<?> dataset)
Transforms the input dataset.
|
Dataset<Row> |
transform(Dataset<?> dataset,
ParamMap paramMap)
Transforms the dataset with provided parameter map as additional parameters.
|
Dataset<Row> |
transform(Dataset<?> dataset,
ParamPair<?> firstParamPair,
ParamPair<?>... otherParamPairs)
Transforms the dataset with optional parameters
|
Dataset<Row> |
transform(Dataset<?> dataset,
ParamPair<?> firstParamPair,
scala.collection.Seq<ParamPair<?>> otherParamPairs)
Transforms the dataset with optional parameters
|
params, transformSchema
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
clear, copyValues, defaultCopy, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, set, set, set, setDefault, setDefault, shouldOwn
toString, uid
$init$, initializeForcefully, initializeLogIfNecessary, initializeLogIfNecessary, initializeLogIfNecessary$default$2, initLock, isTraceEnabled, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning, org$apache$spark$internal$Logging$$log__$eq, org$apache$spark$internal$Logging$$log_, uninitialize
public abstract Transformer copy(ParamMap extra)
Params
defaultCopy()
.copy
in interface Params
copy
in class PipelineStage
extra
- (undocumented)public Dataset<Row> transform(Dataset<?> dataset, ParamPair<?> firstParamPair, ParamPair<?>... otherParamPairs)
dataset
- input datasetfirstParamPair
- the first param pair, overwrite embedded paramsotherParamPairs
- other param pairs, overwrite embedded paramspublic Dataset<Row> transform(Dataset<?> dataset, ParamPair<?> firstParamPair, scala.collection.Seq<ParamPair<?>> otherParamPairs)
dataset
- input datasetfirstParamPair
- the first param pair, overwrite embedded paramsotherParamPairs
- other param pairs, overwrite embedded paramspublic Dataset<Row> transform(Dataset<?> dataset, ParamMap paramMap)
dataset
- input datasetparamMap
- additional parameters, overwrite embedded params