pyspark.streaming.DStream.groupByKey#

DStream.groupByKey(numPartitions=None)[source]#

Return a new DStream by applying groupByKey on each RDD.