Streaming Stock Market Data with Apache Spark and Kafka

  Рет қаралды 26,404

John O'Neill

John O'Neill

Күн бұрын

Пікірлер: 11
@mwandulu
@mwandulu 5 жыл бұрын
Probably the best talk about Kafka and use cases. Amazing..!
@usafa2000
@usafa2000 7 жыл бұрын
this talk was AMAZING!!
@Bonero7
@Bonero7 2 жыл бұрын
Great talk. Thank you
@swayam.
@swayam. 6 жыл бұрын
Why don't you store the offsets in a write optimized store that is indexed on timestamps? Then all you have to do is two read queries: start and end while searching for your offsets.
@SalilPitkar
@SalilPitkar 5 жыл бұрын
If inbound data is parallelized across nodes, how is the original sequence per symbol maintained? e.g. If there are 2 parallel nodes processing the inbound messages and two AAPL trades get split across nodes, but the one with later timestamp reaches the "AAPL" topic first, the consumer will receive the more recent trade first and then the older trade. AFAIK, Kafka maintains the sequence within a topic partition.
@patrickbike7908
@patrickbike7908 6 жыл бұрын
Thanks for sharing
@Justicewarrior795
@Justicewarrior795 3 жыл бұрын
wuhan?
@pajeetsingh
@pajeetsingh 3 жыл бұрын
You don't have rice grain of information about what goes in the underworld.
@Justicewarrior795
@Justicewarrior795 3 жыл бұрын
@@pajeetsingh 👻👻😱😱😱
@pajeetsingh
@pajeetsingh 3 жыл бұрын
Italian accent.
Developing Real-Time Data Pipelines with Apache Kafka
1:30:40
SpringDeveloper
Рет қаралды 157 М.
Get Rid of Traditional ETL, Move to Spark! (Bas Geerdink)
32:18
Spark Summit
Рет қаралды 96 М.
Жездуха 42-серия
29:26
Million Show
Рет қаралды 2,6 МЛН
Tuning and Debugging Apache Spark
47:14
Databricks
Рет қаралды 60 М.
"Apache Kafka and the Next 700 Stream Processing Systems" by Jay Kreps
28:56
Strange Loop Conference
Рет қаралды 44 М.
Introduction to Apache Kafka by James Ward
49:48
Devoxx
Рет қаралды 280 М.