pyspark.streaming.DStream.transform#

DStream.transform(func)[source]#

Return a new DStream in which each RDD is generated by applying a function on each RDD of this DStream.

func can have one argument of rdd, or have two arguments of (time, rdd)