public interface Word2VecBase extends Params, HasInputCol, HasOutputCol, HasMaxIter, HasStepSize, HasSeed
Word2Vec
and Word2VecModel
.Modifier and Type | Method and Description |
---|---|
int |
getMaxSentenceLength() |
int |
getMinCount() |
int |
getNumPartitions() |
int |
getVectorSize() |
int |
getWindowSize() |
IntParam |
maxSentenceLength()
Sets the maximum length (in words) of each sentence in the input data.
|
IntParam |
minCount()
The minimum number of times a token must appear to be included in the word2vec model's
vocabulary.
|
IntParam |
numPartitions()
Number of partitions for sentences of words.
|
StructType |
validateAndTransformSchema(StructType schema)
Validate and transform the input schema.
|
IntParam |
vectorSize()
The dimension of the code that you want to transform from words.
|
IntParam |
windowSize()
The window size (context words from [-window, window]).
|
getInputCol, inputCol
getOutputCol, outputCol
getMaxIter, maxIter
getStepSize, stepSize
clear, copy, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, paramMap, params, set, set, set, setDefault, setDefault, shouldOwn
toString, uid
int getMaxSentenceLength()
int getMinCount()
int getNumPartitions()
int getVectorSize()
int getWindowSize()
IntParam maxSentenceLength()
maxSentenceLength
size.
Default: 1000IntParam minCount()
IntParam numPartitions()
StructType validateAndTransformSchema(StructType schema)
schema
- (undocumented)IntParam vectorSize()
IntParam windowSize()