Distributed Scheduling with Spring Boot: the challenges & pitfalls of implementing a background job

  Рет қаралды 13,514

Spring I/O

Spring I/O

Күн бұрын

Spring I/O 2024 - 30-31 May, Barcelona
Speaker: Rafael Ponte
Slides: speakerdeck.co...
Sooner or later a developer will implement his/her first background job using Java and Spring Boot, and what usually is a simple task for the majority of systems might become a nightmare in scenarios that need to deal with high performance, parallelism, distributed systems and a large volume of data. Scenarios like those hide several issues which many developers are not used to, such as large volumes of data, network failures, data inconsistency, out-of-memory errors and even taking the whole system down.
Although it seems controversial, dealing with many of these problems does not require hype technologies or services, but solid distributed systems fundamentals. This talk will present how an experienced developer implements a background job with Java and Spring Boot taking into consideration the main challenges and pitfalls it brings along, and how he/she designs a solution for high-performance, resilience and horizontal scalability at the same time he/she takes advantage of many modules of Spring Boot, Hibernate and the relational database.
If you still believe that a background job is a simple task, so this talk is for you!

Пікірлер: 140
@linhvudev
@linhvudev 3 ай бұрын
Thanks Rafael! especially for the SKIP_LOCKED feature, new knowledge learnt
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thank you so much! I am glad the talk was helpful for you! 🥰 And yeah, SKIP LOCKED is fantastic!! 💪🏻
@eduardo120155
@eduardo120155 2 ай бұрын
Congratulations on your presentation! You absolutely nailed it. Your thorough research and confident delivery captivated everyone in the room. Your ability to explain complex ideas so clearly is truly impressive. Keep up the fantastic work!
@RafaelPonte
@RafaelPonte 2 ай бұрын
Thanks for the kind words, Eduardo! ❤
@terteseamos579
@terteseamos579 2 ай бұрын
this for me is the best presentation. Great job
@RafaelPonte
@RafaelPonte 2 ай бұрын
What a comment! Thanks for that! ❤️
@felipedossantos7246
@felipedossantos7246 Ай бұрын
I've seen this presentation in portuguese before of Rafael Pontes in Zup Channel, and I could implement something similar it in my job. Great work, Bro! Thank you so much
@RafaelPonte
@RafaelPonte Ай бұрын
Hi Felipe, Thanks for this comment and for having watched both versions of the talk. ❤
@RaphaelDeLio
@RaphaelDeLio 3 ай бұрын
Parabéns, Rafael! Foi um prazer assistir sua apresentação pessoalmente!
@RafaelPonte
@RafaelPonte 3 ай бұрын
Obrigado demais, Rapha! ❤ Você eh top!
@codeisma
@codeisma 3 ай бұрын
Great talk! There are a few Java libraries that already solve these challenges (db-scheduler, JobRunr or Quartz). At JobRunr we'd love to share your talk as it explains JobRunr's architecture well and can help our users understand the challenges of distributed scheduling even better!
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thanks for your comment! I'm glad you liked it! ☺ Please, I would appreciate it if you shared it! By the way, I received great feedback from Ronald, the creator of JobRunr-he watched my talk! He is a fantastic guy! ❤
@RonaldDehuysser
@RonaldDehuysser 3 ай бұрын
@@RafaelPonte You're too kind 🤩!
@marshall143
@marshall143 2 ай бұрын
What is your opinion on nflow Java library? Thank you for video
@RafaelPonte
@RafaelPonte 2 ай бұрын
⁠@@marshall143Thanks for the comment! 😊 I didn't know nFlow, but I understand that if your context allows your team or project to adopt a task scheduler or workflow engine, you should go with it. Usually, those libs and frameworks make the developer's life easier because they address very well all the issues discussed in the talk.
@paulorcv
@paulorcv Ай бұрын
Very good! thank your for the valuable content!
@RafaelPonte
@RafaelPonte Ай бұрын
You are welcome 🤗
@YZ-ix3dn
@YZ-ix3dn 2 ай бұрын
Thank you for clear and well-structured presentation. It's very useful and important information even for people with lots years of experience. I wish every developer should watch this video when every time they put @Transactional onto theirs method.
@RafaelPonte
@RafaelPonte 2 ай бұрын
Thanks for the kind words! I am glad you enjoyed the talk! ☺
@popastefan6542
@popastefan6542 28 күн бұрын
36:48 Actually in our example, each instance will fight for first 50 records, not one record as it is illustrated in the slide.
@RafaelPonte
@RafaelPonte 28 күн бұрын
Thanks for the comment. Yeah, you're right. The number of rows is unimportant in understanding how the SQL feature works. The idea was to be didactic and straightforward.
@bkavun
@bkavun 2 ай бұрын
Great presentation, great work. Thanks a lot for sharing this knowledge with us!
@RafaelPonte
@RafaelPonte 2 ай бұрын
Thanks so much! I am glad you liked it 🥳
@hirenpandit8499
@hirenpandit8499 2 ай бұрын
Great talk!! so much learnings and addressed real life problems I faced while writing background scheduled jobs... btw we used ShedLock library but this is real good insight.
@RafaelPonte
@RafaelPonte 2 ай бұрын
Thanks! Nice you liked it!! 😊 By the way, ShedLock is a very cool library! 👊🏻
@matthewmoore5934
@matthewmoore5934 Ай бұрын
Great talk! A couple of thoughts. Your statement about entity state and transactions is only true if Spring's "open session in view" is not enabled. I find there is a lot of confusion out there about the Hibernate session, transaction state, OSIV, and entity state. Along similar lines, the call to a repository save() method is unnecessary when updating an attached entity because of Hibernate change tracking, and calling save() leads people to assume that it persists changes, which (counterintuitively) it doesn't. (It adds/merges detached entities to the session/persistence context.) Regarding transaction scope, I would argue it is still too broad. Work for a single user/card should generally happen in its own transaction, at least in an OLTP context.
@RafaelPonte
@RafaelPonte Ай бұрын
You are welcome! Thanks for the comment ☺️ I am unsure if I followed your comment about Open Session in View. I mean, there's no OSIV relation to a job scheduled by Spring since OSIV has to do with web/MVC scope. The code in the talk is correct. You are right about the save() method; it wasn't needed, but the idea was to show a simple and didact code, not getting into details about how to persist entities or their state transitions. The transaction scope is broader because we are working in batches 😊
@matthewmoore5934
@matthewmoore5934 Ай бұрын
@@RafaelPonte Thanks for the response. I spend much time working in a WebMVC context that (unfortunately) uses OSIV and I'm too used to its oddities. 🙂
@danielponte3134
@danielponte3134 3 ай бұрын
Parabéns meu irmão , você deu um show na apresentação, impecável! show de top!
@RafaelPonte
@RafaelPonte 3 ай бұрын
Obrigado, meu irmão!
@ferlezcano
@ferlezcano 3 ай бұрын
Excellent topic! Have some background jobs running here and there and I definitely going to check them again.
@RafaelPonte
@RafaelPonte 3 ай бұрын
Nice! I am glad this talk was helpful to you! 👊🏻
@CodingForRealLife
@CodingForRealLife Ай бұрын
Amazing persentation, very usefull, thanks Rafael!
@RafaelPonte
@RafaelPonte Ай бұрын
You are welcome! 😊
@jesprotech
@jesprotech 3 ай бұрын
I really like the way you explained short running transactions. Nice addition to the jobs! Parabéns pela excelente apresentação! É muito útil!
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thanks so much! I am glad you liked it 🥰
@tahahajivand1843
@tahahajivand1843 28 күн бұрын
it was so good. thanks
@RafaelPonte
@RafaelPonte 28 күн бұрын
I am glad you liked 😊
@hamedalipour1012
@hamedalipour1012 2 ай бұрын
you are an amazing presenter thank you so much learned a lot
@RafaelPonte
@RafaelPonte 2 ай бұрын
Thank you so much!!! I am happy this talk was helpful for you 🥳
@pavanerbeck23
@pavanerbeck23 Ай бұрын
Nicely done @RafaelPonte.
@RafaelPonte
@RafaelPonte Ай бұрын
Thank you! ☺
@simongeard4824
@simongeard4824 20 күн бұрын
As a side note, the original example program has one further problem which wasn't discussed - if the job runs every 60 seconds, what happens if it takes more than 60 seconds to complete, giving you unintended parallel processing? I've been bitten by that one a few times...
@RafaelPonte
@RafaelPonte 20 күн бұрын
Thanks for the comment! 😊 In the context of the talk, this is not an issue. I mean, Spring will not allow running multiple jobs for the same task, even if it takes longer than 60 sec. But if the method is annotated with @Async then we can not say the same 😬
@simongeard4824
@simongeard4824 19 күн бұрын
@@RafaelPonte Fair enough... you're right that a good scheduler will avoid the problem (for sync operations, at least). My negative experiences have typically been with more naive scheduling tools...
@aleksandrS3894
@aleksandrS3894 2 ай бұрын
Does a single @Transactional annotation for Scheduled method (in case of JPA framework) fix the original code right away?
@RafaelPonte
@RafaelPonte 2 ай бұрын
Thanks for the comment. It depends on which problem you're talking about. In the talk’s context, it solves only part of the problem: it makes the whole operation atomic and recoverable but causes a few side effects.
@MrDaniloko23
@MrDaniloko23 Ай бұрын
Great content!
@RafaelPonte
@RafaelPonte Ай бұрын
Thanks 🙏🏻
@gjperes1
@gjperes1 3 ай бұрын
O Rafael é fera demais!! Great presentation
@RafaelPonte
@RafaelPonte 3 ай бұрын
Brigadão!! ☺
@duyetpham7924
@duyetpham7924 3 ай бұрын
Beautiful presentation, thank you
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thank you so much! That's very nice you liked it! 🥰
@HariharanIyer
@HariharanIyer 2 ай бұрын
Great talk and lot of cool new (for me) information about Spring/JPA semantics! But not much of this is specific to background jobs, and not much in the talk about generic background job processing. So I'd say the title is a bit misleading.
@RafaelPonte
@RafaelPonte 2 ай бұрын
Thanks for the comment 😊 I am glad the content was helpful for you! Out of curiosity, what do you understand as background jobs and job processing, and what do you expect from a talk about these subjects?
@user-un1um2vf3y
@user-un1um2vf3y 2 ай бұрын
what's the difference between reading and writing with a rabbit or kafka and reading and writing with a database? Usually i'm using REDIS for solve same problem, because it much faster than usual relation db
@RafaelPonte
@RafaelPonte 2 ай бұрын
Thanks for the comment. I will ignore the trade-offs of having a new component in the infrastructure now and focus only on the developer's perspective. There are differences, but how they can impact your solution depends on your context. I mean, using Kafka or RabbitMQ in the talk's job perspective may have little difference on the job's code, but in the application perspective, which produces events in the queue, we may have to deal with a dual write issue. The same is true for Redis: it depends on how you're using it, such as a distributed lock provider or a message queue.
@annabeatrizelias9689
@annabeatrizelias9689 3 ай бұрын
Parabéns, muito show!
@jessilyneh
@jessilyneh 2 ай бұрын
Congrats for your amazing presentation, Rafa!
@RafaelPonte
@RafaelPonte 2 ай бұрын
Thanks, Jess! ❤
@knoppix20
@knoppix20 29 күн бұрын
26:34, 31:59, 32:14, 36:04, 40:31 - key moments
@RafaelPonte
@RafaelPonte 29 күн бұрын
Thanks for the comment and for pointing out the key moments ☺️
@user-cw8lz4ml5u
@user-cw8lz4ml5u 3 ай бұрын
Great talk! Did not catch all the red flags in this :)
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thanks! I am glad you liked it!! ❤
@Cassitu
@Cassitu 2 ай бұрын
Parabeeens manooo! ficou top! sucesso
@RafaelPonte
@RafaelPonte 2 ай бұрын
Obrigado! Feliz que curtiu ❤
@YuliSlabko
@YuliSlabko 2 ай бұрын
Nice explanation! But did not cover very important case if your app has more than one job marked with @Scheduled annotation. Because it may be crucial moment of performance. May be it will be covered in next topics.
@RafaelPonte
@RafaelPonte 2 ай бұрын
Thanks for the comment 😊 Nice you liked it! I am not sure if I understood what you mean. Usually, a single application has multiple @Scheduled jobs running concurrently doing different things (sometimes at other times). Could you give more details?
@YuliSlabko
@YuliSlabko 2 ай бұрын
@@RafaelPonte If you do not specify in application.yml thread pool size for scheduler explicitly all jobs will be operated by one single thread.
@RafaelPonte
@RafaelPonte 2 ай бұрын
​@@YuliSlabko Thanks for the explanation. Now I got your point! ☺ You're right. If your application runs multiple jobs close together or jobs that take too long to finish, tuning the Scheduler's thread pool size is essential. 👊🏻
@gsledoux
@gsledoux 2 ай бұрын
Parabéns marajá! 😉
@RafaelPonte
@RafaelPonte 2 ай бұрын
Brigadão ☺️☺️
@lacerdaph23
@lacerdaph23 3 ай бұрын
Rafa is humble, Freak and beatifiul
@RafaelPonte
@RafaelPonte 3 ай бұрын
Hehe, you're very kind, my friend! ❤
@JuniorAdy10
@JuniorAdy10 3 ай бұрын
Braaabo de mais. Parabéns, príncipe do oceano kkk 👏👏👏
@RafaelPonte
@RafaelPonte 3 ай бұрын
Brigadão, Junior! 👊🏻
@benicioavila
@benicioavila 3 ай бұрын
@@RafaelPonte Parabéns Rafael! Compartilhando com todos do meu time! Abraço.
@RafaelPonte
@RafaelPonte 3 ай бұрын
@@benicioavila obrigado ☺️ E valeu por compartilhar!! ❤️
@metrocartao
@metrocartao 2 ай бұрын
Muito bom!
@RafaelPonte
@RafaelPonte 2 ай бұрын
que massa que gostou 😊
@zickzack987
@zickzack987 3 ай бұрын
Ummm... Distribution topic starts after 27 min. Using db locks is tricky and works differently for different databases, e.g. lock escalation. Better use an app level locking. All that had not really to do a lot with jobs. Just long running tasks in a distributed system.
@RaphaelSousa-or1dl
@RaphaelSousa-or1dl 3 ай бұрын
Do you have a resource recommendation on app level locking? I'm studying the topic and it would be awesome to see it more detailed. Thanks
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thanks for your comment ☺ Distributed systems are tricky, and database locks have worked well for over 30 years. Although some databases might differ, an exclusive row-level lock works similarly. By the way, a few RDBMS suffer from lock escalation, but not PostgreSQL (which was used in the talk's context); in addition to that, we used many approaches in the talk that mitigate the chances of lock escalation 💪🏻 Regarding application-level locking, PostgreSQL offers Advisory Locks as an excellent alternative to row-level locks. They're very light and are handled by the application side.
@DiegoFerreiradaSilva
@DiegoFerreiradaSilva 3 ай бұрын
mandou bem, parabéns!
@RafaelPonte
@RafaelPonte 3 ай бұрын
Obrigado, Diego ❤️
@lobaorn
@lobaorn 3 ай бұрын
Congrats Rafael! Parabéns Rafa!
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thanks so much!!! 🥰
@mindcontrolkmc.3286
@mindcontrolkmc.3286 2 ай бұрын
Really great talk! But I am curious that if 2 save statements already wrap in 1 small transaction how can it combine with the hibernate batch with another save statement process
@RafaelPonte
@RafaelPonte 2 ай бұрын
Thanks for the comment and feedback 😊 I am not sure if I understood your question correctly. Could you elaborate a little bit more on it?
@mindcontrolkmc.3286
@mindcontrolkmc.3286 2 ай бұрын
Hi Rafael, In the scenario of this video, we are using short-transaction to save data to the database so I think each transaction should be isolated so they can't be wrapped in one batch like your example INSERT INTO ... Values (A),(B)
@RafaelPonte
@RafaelPonte 2 ай бұрын
@@mindcontrolkmc.3286 Yeah, the idea is precisely that! For each batch (chunk) of 50 rows, Hibernate will group (and reorder if needed) each INSERT and UPDATE inside that short-running transaction and convert them into only two single statements right on the commit.
@victoralcantara8470
@victoralcantara8470 3 ай бұрын
Amazing! Congrats Rafa!
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thanks, I'm glad you liked it ☺
@rommelcosta6548
@rommelcosta6548 3 ай бұрын
excellent lecture 💚
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thanks, my friend!
@davi.mustafa
@davi.mustafa 2 ай бұрын
é o cara! boooraa!
@RafaelPonte
@RafaelPonte 2 ай бұрын
Valeu Mustafa 👊🏻
@DevMultitask
@DevMultitask 3 ай бұрын
Great job Rafael!
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thank you ☺
@顾清l
@顾清l 3 ай бұрын
In my understanding, `select ... limit 50 for update` would directly lock these 50 rows, instead of locking one row and processing one row at a time. But in the video, it seems to be the latter approach. Why is that?
@wukash999
@wukash999 3 ай бұрын
He just presents it like that for a purpose of presentation. Of course it will lock all 50 rows (as long as they meet select criteria and are not locked already). Overall this is a very basic presentation, not sure what was the point of that.
@RaphaelSousa-or1dl
@RaphaelSousa-or1dl 3 ай бұрын
@@wukash999 I think the point is to introduce to more unexperienced people the possibles problems one might encounter, so you can study further on it (at least for me it worked ,since I've never thought or knew about this problems), not to make a thourough implementation guide
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thanks for your comment ☺ As @wukash999 commented, the idea was to make it as didactic and accessible as possible so that junior and inexperienced developers could understand it. Do you think it got confused?
@RafaelPonte
@RafaelPonte 3 ай бұрын
@@wukash999 Thanks for your comment and helping them to understand my intention ☺ Do you think this was an introductory and basic talk? I'm afraid I have to disagree. The talk was designed to simplify the subject and make it accessible for everyone, but it's still a complex, tricky, and detailed theme.
@yonishachar1887
@yonishachar1887 16 күн бұрын
@@wukash999 How is that a very basic presentation? How would you implement it differently?
@brunobrasilweb
@brunobrasilweb 3 ай бұрын
Parabéns Rafael, Zerou game do Java.
@RafaelPonte
@RafaelPonte 3 ай бұрын
hahaha, valeu bruno!!!
@adyanto4043
@adyanto4043 Ай бұрын
The thing i hate the most in this video - "Conluding...", i was so engaged didnt wanted him to stop.
@RafaelPonte
@RafaelPonte Ай бұрын
hahaha, thank you so much for this lovely comment 🙏🏻😍 I am thrilled after reading it!!
@thiagonunes3619
@thiagonunes3619 3 ай бұрын
Nice!!
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thanks! ❤
@lhsantos89
@lhsantos89 3 ай бұрын
What a prince 💛🔥
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thanks, Luis ❤
@flavioecintia
@flavioecintia 2 ай бұрын
boa ponte!!!!!
@RafaelPonte
@RafaelPonte 2 ай бұрын
Valeu, Flávio 😊
@rabah4306
@rabah4306 3 ай бұрын
@Transactional Will this works if You have to call a mongoRepositoy and Kafka template ? All or nothing If Kafka call KO The mongo call also ?
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thanks for your comment ☺ Although MongoDB and Kafka support some level of transactions, I don't know how @Transactional annotation would work with MongoRepositories or KafkaTemplates. It's worth reading the Spring Data docs. But it's important to be aware that you do NOT have an atomic operation (all or nothing) when your code mixes different external service calls, like PostgreSQL, Mongo, and Kafka. When you do that, you hit a common issue in distributed systems called "dual write".
@asterixcode
@asterixcode 3 ай бұрын
@@RafaelPonte I have the same use case where i need to write to mongo, kafka and also to google cloud storage bucket within the same transaction. Do you by any chance know how to solve this problem so I get a all or nothing? Or if not possible, how we would solve this problem then….
@rabah4306
@rabah4306 3 ай бұрын
@@RafaelPonte obrigado :)
@MrKar18
@MrKar18 2 ай бұрын
For mongo, you can spin a new session with transaction as well, manually. However for Kafka if the produced records are idempotent, you can use the mongo transaction support above to achieve the same.
@mustafaabdsh
@mustafaabdsh 3 ай бұрын
thank you
@RafaelPonte
@RafaelPonte 3 ай бұрын
you're welcome! ☺
@eddwinpaz
@eddwinpaz 3 ай бұрын
Adorei a conversa, mas não sei se queria falar sobre Spring Boot ou se candidatar a político, hahaha.. brincadeira!
@RafaelPonte
@RafaelPonte 3 ай бұрын
Hahaha, valeu! 😊
@TJ-hs1qm
@TJ-hs1qm 2 ай бұрын
Is he describing Spark 😆?
@RafaelPonte
@RafaelPonte 2 ай бұрын
Thanks for the comment 😊 Do you mean Apache Spark? hehe
@knoppix20
@knoppix20 29 күн бұрын
Ola. Todo bem
@RafaelPonte
@RafaelPonte 28 күн бұрын
Tudo ótimo ☺️☺️
@gleicianylemos8824
@gleicianylemos8824 3 ай бұрын
👏👏👏
@RafaelPonte
@RafaelPonte 3 ай бұрын
thanks!!!
@ereboucas
@ereboucas 3 ай бұрын
Almost made me want to work with boring techs again ;)
@andreas_bergstrom
@andreas_bergstrom 3 ай бұрын
I’m moving back to Java/JVM after 15 years in Node/JS/Python
@RafaelPonte
@RafaelPonte 3 ай бұрын
Boring techs are amazing! 🙌🏻
@gabrielm6849
@gabrielm6849 Ай бұрын
sorry, the topic and the presentation has,without doubt, a high technical value, but the english of this guy, the accent and the way he tries-hard to emphasise almost each and every word in the sentence comes highly unnatural.. it really sounds tiring in the ear
@fbarrosoflf
@fbarrosoflf 3 ай бұрын
Congrats, nice job!
@RafaelPonte
@RafaelPonte 3 ай бұрын
Thanks, Barroso! 👊🏻
AWS Keynote at Siemens EDA User2User Europe 2024
27:34
ronen laviv
Рет қаралды 139
مسبح السرير #قصير
00:19
سكتشات وحركات
Рет қаралды 11 МЛН
Just Give me my Money!
00:18
GL Show Russian
Рет қаралды 1 МЛН
Matching Picture Challenge with Alfredo Larin's family! 👍
00:37
BigSchool
Рет қаралды 53 МЛН
A Beginner's Guide to Event-Driven Architecture
37:28
Software Developer Diaries
Рет қаралды 11 М.
Spring Tips: Distributed Job Scheduling with Jobrunr
19:03
SpringDeveloper
Рет қаралды 10 М.
Event-Driven Architecture (EDA) vs Request/Response (RR)
12:00
Confluent
Рет қаралды 138 М.
Bootiful Spring Boot 3.x by Josh Long @ Spring I/O 2024
54:26
Spring I/O
Рет қаралды 26 М.
CQS and CQRS: Command Query Responsibility Segregation
4:16
Drawing Boxes
Рет қаралды 23 М.
مسبح السرير #قصير
00:19
سكتشات وحركات
Рет қаралды 11 МЛН