Making Structured Streaming Ready for Production Updates: Spark Summit East talk by Tathagata Das

4,626 views

Spark Summit

Comments: 4
@JoHeN1990 5 years ago
Superb questions from the audience. JSON schema evolution and marking the processed files. 2 very common problems and yet still hard to solve. Great answer from TD though!
@shijiema6466 6 years ago
It's interesting to see how easily it converts ETL from batch mode to real-time mode. But what I really get from this is confirmation of the bright future of the relational model and SQL. You can invent new ways to arrange and move the data, but when it comes to analyzing the data, so far it still has to be flattened (and joined).
@Namelessdad83 7 years ago
What if we don't have a distributed filesystem? What if we use a plain Spark cluster along with Kafka? Can we use ZooKeeper instead of HDFS/S3 for the WAL?