HadoopRDD.HadoopMapPartitionsWithSplitRDD (Spark 1.3.1 JavaDoc)

Object
- org.apache.spark.rdd.RDD
- - org.apache.spark.rdd.HadoopRDD.HadoopMapPartitionsWithSplitRDD<U,T>

All Implemented Interfaces:

java.io.Serializable, Logging

Enclosing class:

HadoopRDD<K,V>
```
public static class HadoopRDD.HadoopMapPartitionsWithSplitRDD<U,T>
extends RDD
```
Analogous to MapPartitionsRDD, but passes in an InputSplit to the given function rather than the index of the partition.

See Also:
Serialized Form

Constructor Summary

Constructors
Constructor and Description
`HadoopRDD.HadoopMapPartitionsWithSplitRDD(RDD<T> prev, scala.Function2<org.apache.hadoop.mapred.InputSplit,scala.collection.Iterator<T>,scala.collection.Iterator<U>> f, boolean preservesPartitioning, scala.reflect.ClassTag<U> evidence$2, scala.reflect.ClassTag<T> evidence$3)`

Method Summary

Methods
Modifier and Type	Method and Description
`scala.collection.Iterator<U>`	`compute(Partition split, TaskContext context)` :: DeveloperApi :: Implemented by subclasses to compute a given partition.
`Partition[]`	`getPartitions()` Implemented by subclasses to return the set of partitions in this RDD.
`scala.Option<Partitioner>`	`partitioner()` Optionally overridden by subclasses to specify how they are partitioned.

Methods inherited from class org.apache.spark.rdd.RDD
aggregate, cache, cartesian, checkpoint, checkpointData, coalesce, collect, collect, collectPartitions, computeOrReadCheckpoint, conf, context, count, countApprox, countApproxDistinct, countApproxDistinct, countByValue, countByValueApprox, creationSite, dependencies, distinct, distinct, doCheckpoint, doubleRDDToDoubleRDDFunctions, elementClassTag, filter, filterWith, first, flatMap, flatMapWith, fold, foreach, foreachPartition, foreachWith, getCheckpointFile, getCreationSite, getNarrowAncestors, getStorageLevel, glom, groupBy, groupBy, groupBy, id, intersection, intersection, intersection, isCheckpointed, isEmpty, iterator, keyBy, map, mapPartitions, mapPartitionsWithContext, mapPartitionsWithIndex, mapPartitionsWithSplit, mapWith, markCheckpointed, max, min, name, numericRDDToDoubleRDDFunctions, partitions, persist, persist, pipe, pipe, pipe, preferredLocations, randomSplit, rddToAsyncRDDActions, rddToOrderedRDDFunctions, rddToPairRDDFunctions, rddToSequenceFileRDDFunctions, reduce, repartition, retag, retag, sample, saveAsObjectFile, saveAsTextFile, saveAsTextFile, setName, sortBy, sparkContext, subtract, subtract, subtract, take, takeOrdered, takeSample, toArray, toDebugString, toJavaRDD, toLocalIterator, top, toString, treeAggregate, treeReduce, union, unpersist, zip, zipPartitions, zipPartitions, zipPartitions, zipPartitions, zipPartitions, zipPartitions, zipWithIndex, zipWithUniqueId

Methods inherited from class Object
equals, getClass, hashCode, notify, notifyAll, wait, wait, wait

Methods inherited from interface org.apache.spark.Logging
initializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning

Constructor Detail

HadoopRDD.HadoopMapPartitionsWithSplitRDD

public HadoopRDD.HadoopMapPartitionsWithSplitRDD(RDD<T> prev,
                                         scala.Function2<org.apache.hadoop.mapred.InputSplit,scala.collection.Iterator<T>,scala.collection.Iterator<U>> f,
                                         boolean preservesPartitioning,
                                         scala.reflect.ClassTag<U> evidence$2,
                                         scala.reflect.ClassTag<T> evidence$3)

Method Detail
- partitioner
```
public scala.Option<Partitioner> partitioner()
```
 Description copied from class: RDD
 
 Optionally overridden by subclasses to specify how they are partitioned.
 
 Overrides:
 
 partitioner in class RDD
- getPartitions
```
public Partition[] getPartitions()
```
 Description copied from class: RDD
 
 Implemented by subclasses to return the set of partitions in this RDD. This method will only be called once, so it is safe to implement a time-consuming computation in it.
- compute
```
public scala.collection.Iterator compute(Partition split,
 TaskContext context)
```
 Description copied from class: RDD
 
 :: DeveloperApi :: Implemented by subclasses to compute a given partition.
 
 Specified by:
 
 compute in class RDD

Class HadoopRDD.HadoopMapPartitionsWithSplitRDD<U,T>

Constructor Summary

Method Summary

Methods inherited from class org.apache.spark.rdd.RDD

Methods inherited from class Object

Methods inherited from interface org.apache.spark.Logging

Constructor Detail

HadoopRDD.HadoopMapPartitionsWithSplitRDD

Method Detail

partitioner

getPartitions

compute