Scala API Extensions

    If you want to enjoy the full Scala experience, you can choose to opt in to extensions that enhance the Scala API via implicit conversions.

    To use all the available extensions, you can just add a simple import for the DataSet API

        import org.apache.flink.api.scala.extensions._

    or for the DataStream API

        import org.apache.flink.streaming.api.scala.extensions._

    Alternatively, you can import individual extensions à la carte to only use those you prefer.

    Normally, both the DataSet and DataStream APIs don’t accept anonymous pattern matching functions to deconstruct tuples, case classes or collections, like the following:

        val data: DataSet[(Int, String, Double)] = // [...]
        data.map {
          case (id, name, temperature) => // [...]
          // The previous line causes the following compilation error:
          // "The argument types of an anonymous function must be fully known. (SLS 8.5)"
        }
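    To see why the compiler rejects such a literal, here is a minimal plain-Scala sketch (no Flink required; `process` and `processWith` are hypothetical names). The overloads mimic Flink's `map`, which accepts both a Scala function and a `MapFunction`, so the argument type of a lone pattern-matching function cannot be inferred; a wrapper with a single, fully known parameter type, which is the approach the extensions take, makes the same literal compile:

    ```scala
    object Sls85Demo {
      // Two overloads, like Flink's map(fun: T => R) and map(mapper: MapFunction[T, R]):
      def process(f: ((Int, String)) => String): String = f((1, "one"))
      def process(f: Int => String): String = f(1)

      // process { case (_, name) => name }
      // would not compile: "missing parameter type for expanded function" (SLS 8.5)

      // A single non-overloaded method with a fully known parameter type works:
      def processWith(fun: ((Int, String)) => String): String = process(fun)
    }

    // Sls85Demo.processWith { case (_, name) => name } compiles and returns "one"
    ```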

    DataStream API

    Method: mapWith (original: map on DataStream)

        data.mapWith { case (_, value) => value.toString }

    Method: flatMapWith (original: flatMap on DataStream)

        data.flatMapWith { case (_, name, visits) => visits.map(name -> _) }

    Method: filterWith (original: filter on DataStream)

        data.filterWith { case Train(_, isOnTime) => isOnTime }

    Method: keyingBy (original: keyBy on DataStream)

        data.keyingBy { case (id, _, _) => id }

    Method: mapWith (original: map on ConnectedDataStream)

        data.mapWith(
          map1 = case (_, value) => value.toString,
          map2 = case (_, _, value, _) => value + 1)

    Method: flatMapWith (original: flatMap on ConnectedDataStream)

        data.flatMapWith(
          flatMap1 = case (_, json) => parse(json),
          flatMap2 = case (_, _, json, _) => parse(json))

    Method: keyingBy (original: keyBy on ConnectedDataStream)

        data.keyingBy(
          key1 = case (_, timestamp) => timestamp,
          key2 = case (id, _, _) => id)

    Method: reduceWith (original: reduce on KeyedStream, WindowedStream)

        data.reduceWith { case ((_, sum1), (_, sum2)) => sum1 + sum2 }

    Method: foldWith (original: fold on KeyedStream, WindowedStream)

        data.foldWith(User(bought = 0)) { case (User(b), (_, items)) => User(b + items.size) }

    Method: applyWith (original: apply on WindowedStream)

        data.applyWith(0)(
          foldFunction = case (sum, amount) => sum + amount,
          windowFunction = case (k, w, sum) => // [...])

    Method: projecting (original: apply on JoinedStream)

        data1.join(data2).
          whereClause(case (pk, _) => pk).
          isEqualTo(case (_, fk) => fk).
          projecting { case ((pk, tx), (products, fk)) => tx -> products }
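    Under the hood, these methods are implicit-conversion wrappers whose parameters are plain total functions, so a pattern-matching anonymous function passed to them has a fully known argument type. A minimal sketch of the idea on plain Scala collections (the `OnSeq` wrapper and its methods are illustrative, not Flink code):

    ```scala
    object AcceptPartialFunctionsSketch {
      // An implicit class adds *With variants that take a Function1;
      // a pattern-matching literal then has a known input type T.
      implicit class OnSeq[T](val self: Seq[T]) extends AnyVal {
        def mapWith[R](fun: T => R): Seq[R] = self.map(fun)
        def filterWith(fun: T => Boolean): Seq[T] = self.filter(fun)
      }

      def main(args: Array[String]): Unit = {
        val data = Seq((1, "a"), (2, "b"))
        println(data.mapWith { case (_, v) => v.toUpperCase })  // List(A, B)
        println(data.filterWith { case (id, _) => id > 1 })     // List((2,b))
      }
    }
    ```

    Flink's real extensions follow the same pattern, wrapping DataSet and DataStream instead of Seq.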

      For more information on the semantics of each method, please refer to the DataSet and DataStream API documentation.

      To use this extension exclusively, you can add the following import:

          import org.apache.flink.api.scala.extensions.acceptPartialFunctions

      for the DataSet extensions and

          import org.apache.flink.streaming.api.scala.extensions.acceptPartialFunctions

      for the DataStream extensions.

      The following snippet shows a minimal example of how to use these extension methods together (with the DataSet API):