
Type Spark’s Structured Streaming #232

Open
OlivierBlanvillain opened this issue Jan 20, 2018 · 2 comments

Comments

@OlivierBlanvillain
Contributor

We are currently missing these two Dataset methods:

  • DataStreamWriter writeStream()
  • Dataset withWatermark(String eventTime, String delayThreshold)

These require some understanding of Spark streaming to be properly typed and tested. Here is the relevant documentation for anyone interested in getting started on this:

https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html
https://databricks.com/blog/2017/05/08/event-time-aggregation-watermarking-apache-sparks-structured-streaming.html
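To illustrate what "properly typed" could mean for `withWatermark`, here is a minimal sketch of the idea: the event-time column reference carries the schema and the column type as type parameters, so only an existing `Timestamp` column of the dataset's schema can be passed. The `TypedColumn`/`TypedDataset` stand-ins below are simplified, illustrative stubs, not Frameless's actual API.

```scala
import java.sql.Timestamp

// A phantom-typed column reference: U is the schema type, V the column's type.
final case class TypedColumn[U, V](name: String)

// A stand-in for a typed dataset, keeping only what this sketch needs.
final case class TypedDataset[U](watermark: Option[(String, String)] = None) {
  // withWatermark only accepts a Timestamp column belonging to schema U,
  // so passing a non-timestamp column fails at compile time.
  def withWatermark(
      eventTime: TypedColumn[U, Timestamp],
      delayThreshold: String): TypedDataset[U] =
    copy(watermark = Some((eventTime.name, delayThreshold)))
}

final case class Click(userId: String, eventTime: Timestamp)

val clicks = TypedDataset[Click]()
val marked = clicks.withWatermark(TypedColumn[Click, Timestamp]("eventTime"), "10 minutes")
// clicks.withWatermark(TypedColumn[Click, String]("userId"), "10 minutes") // does not compile
```

In real Frameless, the column witness would presumably be derived from the case class (as `TypedDataset` does elsewhere) rather than constructed from a string, but the compile-time constraint is the same.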

@etspaceman
Contributor

etspaceman commented Jul 30, 2019

+1 - This was a big blocker to our adopting Frameless, as most of our jobs are structured streaming jobs.

@kyprifog

kyprifog commented Sep 18, 2019

I'm curious why this never took off. My guess is that most Typelevel people are using fs2 instead of Spark streaming, but fs2 is still limited in that it can't do distributed streaming out of the box. Maybe Typelevel people are using Flink instead, but that seems doubtful given how Flink is engineered.

This article is interesting; has anyone tried to extend this approach to the fs2/Frameless world?

http://mandubian.com/2014/02/13/zpark/
