Getting Started
User Guide
API Reference
Development
Migration Guide
Spark SQL
Pandas API on Spark
Structured Streaming
MLlib (DataFrame-based)
Spark Streaming
MLlib (RDD-based)
Spark Core
Resource Management
pyspark.streaming.DStream.groupByKey
¶
DStream.
groupByKey
(
numPartitions
=
None
)
[source]
¶
Return a new DStream by applying groupByKey on each RDD.
pyspark.streaming.DStream.glom
pyspark.streaming.DStream.groupByKeyAndWindow