1. About the Documentation
This section provides a brief overview of Reactor reference documentation. You do not need to read this guide in a linear fashion. Each piece stands on its own, though they often refer to other pieces.
1.1. Latest version & Copyright Notice
The Reactor reference guide is available as HTML documents. The latest copy is available at http://projectreactor.io/docs/core/release/reference/docs/index.html
Copies of this document may be made for your own use and for distribution to others, provided that you do not charge any fee for such copies and further provided that each copy contains this Copyright Notice, whether distributed in print or electronically.
1.2. Contributing to the Documentation
The reference guide is written in Asciidoc format and sources can be found at https://github.com/reactor/reactor-core/tree/master/src/docs/asciidoc.
If you have an improvement, we will be happy to get a pull-request from you!
We recommend that you check out a local copy of the repository, so that you can
generate the documentation using the asciidoctor
gradle task and check the
rendering. Some of the sections rely on included files, so GitHub rendering is
not always complete.
1.3. Getting help
There are several ways to reach out for help with Reactor.
-
Get in touch with the community on Gitter.
-
Ask a question on stackoverflow.com at
project-reactor
. -
Report bugs (or ask questions) in Github issues. The following repositories are most closely monitored: reactor-core (which covers the essential features) and reactor-addons (which covers reactor-test and adapters issues).
All of Reactor is open source, including this documentation. If you find problems with the docs or if you just want to improve them, please get involved. |
1.4. Where to go from here
-
Head to Getting started if you feel like jumping straight into the code.
-
If you are new to Reactive Programming though, you should probably better start with the Introduction to Reactive Programming.
-
In order to dig deeper into the core features of Reactor, head to Reactor Core Features:
-
If you are looking for the right tool for the job but cannot think of a relevant operator, try Which operator do I need?.
-
Learn more about Reactor’s reactive types in the "
Flux
, an Asynchronous Sequence of 0-n Items" and "Mono
, an Asynchronous 0-1 Result" sections. -
Switch threading contexts using a Scheduler.
-
Learn how to handle errors in the Handling Errors section.
-
-
Unit testing? Yes it is possible with the
reactor-test
project! See Testing. -
Programmatically creating a sequence is possible for more advanced creation of reactive sources.
2. Getting started
2.1. Introducing Reactor
Reactor is a fully non-blocking reactive programming foundation for the JVM,
with efficient demand management (in the form of managing "backpressure"). It
integrates directly with the Java 8 functional APIs, notably
CompletableFuture
, Stream
and Duration
. It offers composable asynchronous
sequence APIs Flux
(for [N] elements) and Mono
(for [0|1] elements),
extensively implementing the Reactive Extensions specification.
Reactor also supports non-blocking inter-process communication (IPC) with the
reactor-ipc
components. Suited for Microservices Architecture, Reactor IPC
offers backpressure-ready network engines for HTTP (including Websockets), TCP
and UDP. Reactive Encoding and Decoding are fully supported.
2.2. Pre-requisites
Reactor Core runs on Java 8
and above.
It has a transitive dependency to org.reactive-streams:reactive-streams:1.0.1
.
Android support:
|
2.3. The BOM
Reactor 3 uses a BOM (Bill of Materials - a standard Maven artifact)
model since reactor-core 3.0.4
, with the Aluminium
release train.
The BOM lets you group artifacts that are meant to work well together without having to wonder about the sometimes divergent versioning schemes of these artifacts.
The BOM is a list of versioned artifacts that is itself versioned, using a release train scheme with a codename followed by a qualifier. Here is an example:
Aluminium-RELEASE Carbon-BUILD-SNAPSHOT Aluminium-SR1 Carbon-SR32
The codenames represent what would traditionally be the MAJOR.MINOR number. They (mostly) come from the Periodic Table of Elements, in increasing alphabetical order.
The qualifiers are (in chronological order):
-
BUILD-SNAPSHOT
-
M1
..N
: Milestones or developer previews -
RELEASE
: The first GA (General Availability) release in a codename series -
SR1
..N
: The subsequent GA releases in a codename series (equivalent to PATCH number, SR stands for "Service Release").
2.4. Getting Reactor
As mentioned earlier, the easiest way to use Reactor in your core is to use the BOM and add the relevant dependencies to your project. Note that, when adding such a dependency, you must omit the version so that the version gets picked up from the BOM.
However, if you want to force the use of a specific artifact’s version, you can specify it when adding your dependency, as you usually would. You can also forgo the BOM entirely and specify dependencies by their artifact versions.
2.4.1. Maven installation
The BOM concept is natively supported by Maven. First, you need to import the
BOM by adding the following snippet to your pom.xml
. If the top section
(dependencyManagement
) already exists in your pom, add only the contents.
<dependencyManagement> (1)
<dependencies>
<dependency>
<groupId>io.projectreactor</groupId>
<artifactId>reactor-bom</artifactId>
<version>Aluminium-SR1</version>
<type>pom</type>
<scope>import</scope>
</dependency>
</dependencies>
</dependencyManagement>
1 | Notice the dependencyManagement tag. This is in addition to the regular
dependencies section. |
Next, add your dependencies to the relevant reactor projects, as usual, except
without a <version>
, as shown here:
<dependencies>
<dependency>
<groupId>io.projectreactor</groupId>
<artifactId>reactor-core</artifactId> (1)
(2)
</dependency>
<dependency>
<groupId>io.projectreactor.addons</groupId>
<artifactId>reactor-test</artifactId> (3)
<scope>test</scope>
</dependency>
</dependencies>
1 | Dependency on the core library |
2 | No version tag here |
3 | reactor-test provides facilities to unit test reactive streams |
2.4.2. Gradle installation
Gradle has no core support for Maven BOMs, but you can use Spring’s gradle-dependency-management plugin.
First, apply the plugin from the Gradle Plugin Portal:
plugins {
id "io.spring.dependency-management" version "1.0.1.RELEASE" (1)
}
1 | as of this writing, 1.0.1.RELEASE is the latest version of the plugin. Check for updates. |
Then use it to import the BOM:
dependencyManagement {
imports {
mavenBom "io.projectreactor:reactor-bom:Aluminium-SR1"
}
}
Finally add a dependency to your project without a version number:
dependencies {
compile 'io.projectreactor:reactor-core' (1)
}
1 | There is no third : separated section for the version. It is taken from
the BOM. |
2.4.3. Milestones and Snapshots
Milestones and developer previews are distributed through the Spring Milestones repository rather than Maven Central. To add it to your build configuration file, use the following snippet:
<repositories>
<repository>
<id>spring-milestones</id>
<name>Spring Milestones Repository</name>
<url>https://repo.spring.io/milestone</url>
</repository>
</repositories>
For Gradle, use the following snippet:
repositories {
maven { url 'http://repo.spring.io/milestone' }
mavenCentral()
}
Similarly, snapshots are also available in a separate dedicated repository:
<repositories>
<repository>
<id>spring-snapshots</id>
<name>Spring Snapshot Repository</name>
<url>https://repo.spring.io/snapshot</url>
</repository>
</repositories>
repositories {
maven { url 'http://repo.spring.io/snapshot' }
mavenCentral()
}
3. Introduction to Reactive Programming
Reactor is an implementation of the Reactive Programming paradigm, which can be summed up as:
Reactive programming is an asynchronous programming paradigm concerned with data streams and the propagation of change. This means that it becomes possible to express static (e.g. arrays) or dynamic (e.g. event emitters) data streams with ease via the employed programming language(s)
As a first step in the direction of reactive programming, Microsoft created the
Reactive Extensions (Rx) library in the .NET ecosystem. Then RxJava implemented
reactive programming on the JVM. As time went on, a standardization for Java
emerged through the Reactive Streams effort, a specification which defines a
set of interfaces and interaction rules for reactive libraries on the JVM. It
will be integrated into Java 9 (with the Flow
class).
The reactive programming paradigm is often presented in object-oriented
languages as an extension of the Observer design pattern. One can also compare
the main reactive streams pattern with the familiar Iterator design pattern, as
there is a duality to the Iterable
-Iterator
pair in all of these libraries.
One major difference is that while an Iterator is pull-based, reactive
streams are push-based.
Using an iterator is an imperative programming pattern, even though the method of
accessing values is solely the responsibility of the Iterable
. Indeed, it is up to the
developer to choose when to access the next()
item in the sequence. In
reactive streams, the equivalent of the above pair is Publisher-Subscriber
.
But it is the Publisher
that notifies the Subscriber of newly available values
as they come, and this push aspect is key to being reactive. Also, operations
applied to pushed values are expressed declaratively rather than imperatively:
the programmer expresses the expresses the logic of the computation rather than
describing its exact control flow.
In addition to pushing values, the error handling and completion aspects are
also covered in a well defined manner. A Publisher
can push new values to
its Subscriber
(by calling onNext
) but can also signal an error (by calling
onError
) or completion (by calling onComplete
). Both errors and completion
terminate the sequence. The following descriptor shows the possibilities:
onNext x 0..N [onError | onComplete]
This approach is very flexible. The pattern supports use cases where there is no value, one value, or n values (including an infinite sequence of values, such as the continuing ticks of a clock).
Consider why we would need such an asynchronous reactive library in the first place.
3.1. Blocking can be wasteful
Modern applications can reach huge numbers of concurrent users, and, even though the capabilities of modern hardware have continued to improve, performance of modern software is still a key concern.
There are broadly two ways one can improve a program’s performance:
-
parallelize: use more threads and more hardware resources.
-
seek more efficiency in how current resources are used.
Usually, Java developers write programs using blocking code. This practice is fine until there is a performance bottleneck, at which point the time comes to introduce additional threads, running similar blocking code. But this scaling in resource utilization can quickly introduce contention and concurrency problems.
Worse still, blocking wastes resources. If you look closely, as soon as a program involves some latency (notably I/O, such as a database request or a network call), resources are wasted because a thread (or many threads) now sits idle, waiting for data.
So the parallelization approach is not a silver bullet. It is necessary in order to access the full power of the hardware, but it is also complex to reason about and susceptible to resource wasting.
3.2. Asynchronicity to the rescue?
The second approach mentioned above, seeking more efficiency, can be a solution to the resource wasting problem. By writing asynchronous, non-blocking code, you let the execution switch to another active task using the same underlying resources and later come back to the current process when the asynchronous processing has finished.
But how can you produce asynchronous code on the JVM? Java offers two models of asynchronous programming:
-
Callbacks: Asynchronous methods do not have a return value but take an extra
callback
parameter (a lambda or anonymous class) that gets called when the result is available. A well known example is Swing’sEventListener
hierarchy. -
Futures: Asynchronous methods return a
Future<T>
immediately. The asynchronous process computes aT
value, but theFuture
object wraps access to it. The value is not immediately available, and the object can be polled until the value is available. For instance,ExecutorService
runningCallable<T>
tasks useFuture
objects.
Are these techniques good enough? Not for every use case, and both approaches have limitations.
Callbacks are hard to compose together, quickly leading to code that is difficult to read and maintain (known as "Callback Hell").
Consider an example: showing top five favorites from a user on the UI or suggestions if she doesn’t have a favorite. This goes through three services (one gives favorite IDs, the second fetches favorite details, while the third offers suggestions with details):
userService.getFavorites(userId, new Callback<List<String>>() { (1)
public void onSuccess(List<String> list) { (2)
if (list.isEmpty()) { (3)
suggestionService.getSuggestions(new Callback<List<Favorite>>() {
public void onSuccess(List<Favorite> list) { (4)
UiUtils.submitOnUiThread(() -> { (5)
list.stream()
.limit(5)
.forEach(uiList::show); (6)
});
}
public void onError(Throwable error) { (7)
UiUtils.errorPopup(error);
}
});
} else {
list.stream() (8)
.limit(5)
.forEach(favId -> favoriteService.getDetails(favId, (9)
new Callback<Favorite>() {
public void onSuccess(Favorite details) {
UiUtils.submitOnUiThread(() -> uiList.show(details));
}
public void onError(Throwable error) {
UiUtils.errorPopup(error);
}
}
));
}
}
public void onError(Throwable error) {
UiUtils.errorPopup(error);
}
});
1 | We have callback-based services: a Callback interface with a method invoked
when the async process was successful and one invoked in case of an error. |
2 | The first service invokes its callback with the list of favorite IDs. |
3 | If the list is empty, we must go to the suggestionService . |
4 | The suggestionService gives a List<Favorite> to a second callback. |
5 | Since we’re dealing with UI we need to ensure our consuming code will run in the UI thread. |
6 | We use Java 8 Stream to limit the number of suggestions processed to five,
and we show them in a graphical list in the UI. |
7 | At each level, we deal with errors the same way: show them in a popup. |
8 | Back to the favorite ID level. If the service returned a full list, then we
need to go to the favoriteService to get detailed Favorite objects. Since we
want only five, we first stream the list of IDs to limit it to five. |
9 | Once again, a callback. This time we get a fully-fledged Favorite object
that we push to the UI inside the UI thread. |
That is a lot of code, and it is a bit hard to follow and has repetitive parts. Consider its equivalent in Reactor:
userService.getFavorites(userId) (1)
.flatMap(favoriteService::getDetails) (2)
.switchIfEmpty(suggestionService.getSuggestions()) (3)
.take(5) (4)
.publishOn(UiUtils.uiThreadScheduler()) (5)
.subscribe(uiList::show, UiUtils::errorPopup); (6)
1 | We start with a flow of favorite IDs. |
2 | We asynchronously transform these into detailed Favorite objects
(flatMap ). We now have a flow of Favorite . |
3 | In case the flow of Favorite is empty, we switch to a fallback through the
suggestionService . |
4 | We are only interested in, at most, five elements from the resulting flow. |
5 | At the end, we want to process each piece of data in the UI thread. |
6 | We trigger the flow by describing what to do with the final form of the data (show it in a UI list) and what to do in case of an error (show a popup). |
What if you wanted to ensure the favorite IDs are retrieved in less than 800ms
or, if it takes longer, get them from a cache? In the callback-based code, that
is a complicated task. But in Reactor it becomes as easy as adding a timeout
operator in the chain:
userService.getFavorites(userId)
.timeout(Duration.ofMillis(800)) (1)
.onErrorResume(cacheService.cachedFavoritesFor(userId)) (2)
.flatMap(favoriteService::getDetails) (3)
.switchIfEmpty(suggestionService.getSuggestions())
.take(5)
.publishOn(UiUtils.uiThreadScheduler())
.subscribe(uiList::show, UiUtils::errorPopup);
1 | If the part above emits nothing for more than 800ms, propagate an error. |
2 | In case of any error, fall back to the cacheService . |
3 | The rest of the chain is similar to the previous example. |
Futures are a bit better, but they are still not good at composition, despite
the improvements brought in Java 8 by CompletableFuture
. Orchestrating
multiple futures together is doable but not easy. Also, Future
has other
problems: It is easy to end up with another blocking situation with Future
objects by calling the get()
method, and they lack support for multiple
values and advanced error handling.
Consider another example: We get a list of IDs from which we want to fetch a name and a statistic and combine these pair-wise, all of it asynchronously.
CompletableFuture
combinationCompletableFuture<List<String>> ids = ifhIds(); (1)
CompletableFuture<List<String>> result = ids.thenComposeAsync(l -> { (2)
Stream<CompletableFuture<String>> zip =
l.stream().map(i -> { (3)
CompletableFuture<String> nameTask = ifhName(i); (4)
CompletableFuture<Integer> statTask = ifhStat(i); (5)
return nameTask.thenCombineAsync(statTask, (name, stat) -> "Name " + name + " has stats " + stat); (6)
});
List<CompletableFuture<String>> combinationList = zip.collect(Collectors.toList()); (7)
CompletableFuture<String>[] combinationArray = combinationList.toArray(new CompletableFuture[combinationList.size()]);
CompletableFuture<Void> allDone = CompletableFuture.allOf(combinationArray); (8)
return allDone.thenApply(v -> combinationList.stream()
.map(CompletableFuture::join) (9)
.collect(Collectors.toList()));
});
List<String> results = result.join(); (10)
assertThat(results).contains(
"Name NameJoe has stats 103",
"Name NameBart has stats 104",
"Name NameHenry has stats 105",
"Name NameNicole has stats 106",
"Name NameABSLAJNFOAJNFOANFANSF has stats 121");
1 | We start off with a future that gives us a list of id values to process. |
2 | We want to start some deeper asynchronous processing once we get the list. |
3 | For each element in the list: |
4 | Asynchronously get the associated name. |
5 | Asynchronously get the associated task. |
6 | Combine both results. |
7 | We now have a list of futures that represent all the combination tasks. In order to execute these tasks, we need to convert the list to an array. |
8 | Pass the array to CompletableFuture.allOf , which outputs a Future that
completes when all tasks have completed. |
9 | The tricky bit is that allOf returns CompletableFuture<Void> , so we
reiterate over the list of futures, collecting their results via join()
(which here doesn’t block since allOf ensures the futures are all done). |
10 | Once the whole asynchronous pipeline has been triggered, we wait for it to be processed and return the list of results that we can assert. |
Since Reactor has more combination operators out of the box, this process can be simplified:
Flux<String> ids = ifhrIds(); (1)
Flux<String> combinations =
ids.flatMap(id -> { (2)
Mono<String> nameTask = ifhrName(id); (3)
Mono<Integer> statTask = ifhrStat(id); (4)
return nameTask.zipWith(statTask, (5)
(name, stat) -> "Name " + name + " has stats " + stat);
});
Mono<List<String>> result = combinations.collectList(); (6)
List<String> results = result.block(); (7)
assertThat(results).containsExactly( (8)
"Name NameJoe has stats 103",
"Name NameBart has stats 104",
"Name NameHenry has stats 105",
"Name NameNicole has stats 106",
"Name NameABSLAJNFOAJNFOANFANSF has stats 121"
);
1 | This time, we start from an asynchronously provided sequence of ids (a
Flux<String> ). |
2 | For each element in the sequence, we asynchronously process it
(inside the fuction that is the body flatMap call) twice. |
3 | Get the associated name. |
4 | Get the associated statistic. |
5 | Asynchronously combine the 2 values. |
6 | Aggregate the values into a List as they become available. |
7 | In production, we would continue working with the Flux asynchronously by further
combining it or subscribing to it. Most probably, we would return the result Mono .
Since we are in a test, we block, waiting for the processing to finish instead, and then
directly return the aggregated list of values. |
8 | Assert the result. |
These perils of Callback and Future are similar and are what Reactive
Programming addresses with the Publisher-Subscriber
pair.
3.3. From Imperative to Reactive Programming
Reactive libraries such as Reactor aim to address these drawbacks of "classic" asynchronous approaches on the JVM while also focusing on a few additional aspects:
-
Composability and readability
-
Data as a flow manipulated with a rich vocabulary of operators
-
Nothing happens until you subscribe
-
Backpressure or the ability for the consumer to signal the producer that the rate of emission is too high
-
High level but high value abstraction that is concurrency-agnostic
3.3.1. Composability and Readability
By composability, we mean the ability to orchestrate multiple asynchronous tasks, using results from previous tasks to feed input to subsequent ones or executing several tasks in a fork-join style, as well as reusing asynchronous tasks as discrete components in a higher-level system.
The ability to orchestrate tasks is tightly coupled to the readability and maintainability of code. As the layers of asynchronous processes increase in both number and complexity, being able to compose and read code becomes increasingly difficult. As we saw, the callback model is simple, but one of its main drawbacks is that, for complex processes, you need to have a callback executed from a callback, itself nested inside another callback, and so on. That mess is known as Callback Hell. And as you can guess (or know from experience), such code is pretty hard to go back to and reason about.
Reactor offers rich composition options, wherein code mirrors the organization of the abstract process, and everything is generally kept at the same level (nesting is minimized).
3.3.2. The assembly line analogy
You can think of data processed by a reactive application as moving through
an assembly line. Reactor is both the conveyor belt and the workstations. The
raw material pours from a source (the original Publisher
) and ends up as a
finished product ready to be pushed to the consumer (or Subscriber
).
The raw material can go through various transformations and other intermediary steps or be part of a larger assembly line that aggregates intermediate pieces together. If there is a glitch or clogging at one point (perhaps boxing the products takes a disproportionately long time), the afflicted workstation can signal upstream to limit the flow of raw material.
3.3.3. Operators
In Reactor, operators are the workstations in our assembly analogy. Each
operator adds behavior to a Publisher
and wraps the previous step’s
Publisher
into a new instance. The whole chain is thus linked, such that data
originates from the first Publisher
and moves down the chain, transformed by
each link. Eventually, a Subscriber
finishes the process. Remember that
nothing happens until a Subscriber
subscribes to a Publisher
, as we see
shortly.
Understanding this can help you avoid a common mistake that would lead you to believe that an operator you used in your chain is not being applied. See this item in the FAQ. |
While the Reactive Streams specification doesn’t specify operators at all, one of the best added values of reactive libraries like Reactor is the rich vocabulary of operators that they provide. These cover a lot of ground, from simple transformation and filtering to complex orchestration and error handling.
3.3.4. Nothing Happens Until You subscribe()
In Reactor, when you write a Publisher
chain, data doesn’t start pumping into
it by default. Instead, what you have is an abstract description of your
asynchronous process (which can help with reusability and composition).
By the act of subscribing, you tie the Publisher
to a Subscriber
, which
triggers the flow of data in the whole chain. This is achieved internally by a
single request
signal from the Subscriber
that is propagated upstream, all
the way back to the source Publisher
.
3.3.5. Backpressure
Propagating signals upstream is also used to implement backpressure, which we described in the assembly line analogy as a feedback signal sent up the line when a workstation processes more slowly than an upstream workstation.
The real mechanism defined by the Reactive Streams specification is pretty close
to the analogy: a subscriber can work in unbounded mode and let the source
push all the data at its fastest achievable rate or it can use the request
mechanism to signal the source that it is ready to process at most n
elements.
Intermediate operators can also change the request in-transit. Imagine a
buffer
operator that groups elements in batches of 10. If the subscriber
requests 1 buffer, then it is acceptable for the source to produce 10 elements.
Prefetching strategies can also be applied, if producing the elements before
they are requested is not too costly.
This transforms the push model into a push-pull hybrid where the downstream can pull n elements from upstream if they are readily available, but, if the elements are not ready, then they will get pushed by the upstream whenever they are produced.
3.3.6. Hot vs Cold
In the Rx family of reactive libraries, one can distinguish two broad categories of reactive sequences: hot and cold. This distinction mainly has to do with how the reactive stream reacts to subscribers:
-
A Cold sequence starts anew for each
Subscriber
, including at the source of data. If the source wraps an HTTP call, a new HTTP request is made for each subscription. -
A Hot sequence does not start from scratch for each
Subscriber
. Rather, late subscribers receive signals emitted after they subscribed. Note, however, that some hot reactive streams can cache or replay the history of emissions totally or partially. From a general perspective, a hot sequence can even emit when no subscriber is listening (an exception to the "nothing happens before you subscribe" rule).
For more information on hot vs cold in the context of Reactor, see this reactor-specific section.
4. Reactor Core Features
The Reactor project main artifact is reactor-core
, a reactive library that
focuses on the Reactive Streams specification and targets Java 8.
Reactor introduces composable reactive types that implement Publisher
but also
provide a rich vocabulary of operators, most notably Flux
and Mono
. A Flux
object represents a reactive sequence of 0..N items, while a Mono
object
represents a single-valued-or-empty (0..1) result.
This distinction carries a bit of semantic information into the type, indicating
the rough cardinality of the asynchronous processing. For instance, an HTTP
request only produces one response, so there wouldn’t be much sense in doing a
count
operation. Expressing the result of such an HTTP call as a
Mono<HttpResponse>
thus makes more sense than expressing it as a
Flux<HttpResponse>
, as it offers only operators that are relevant to a
context of zero items or one item.
Operators that change the maximum cardinality of the processing also switch to
the relevant type. For instance, the count
operator exists in Flux
, but it
returns a Mono<Long>
.
4.1. Flux
, an Asynchronous Sequence of 0-n Items
A Flux<T>
is a standard Publisher<T>
representing an asynchronous sequence
of 0 to N emitted items, optionally terminated by either a completion signal or
an error. Thus, the possible values of a flux are a value, a completion signal,
or an error. As in the Reactive Streams spec, these 3 types of signal translate
to calls to a downstream object’s onNext
, onComplete
or onError
methods.
With this large scope of possible signals, Flux
is the general-purpose
reactive type. Note that all events, even terminating ones, are optional: no
onNext
event but an onComplete
event represents an empty finite sequence,
but remove the onComplete
and you have an infinite empty sequence.
Similarly, infinite sequences are not necessarily empty. For example,
Flux.interval(Duration)
produces a Flux<Long>
that is infinite and emits
regular ticks from a clock.
4.2. Mono
, an Asynchronous 0-1 Result
A Mono<T>
is a specialized Publisher<T>
that emits at most one item then
optionally terminates with an onComplete
signal or an onError
.
As such it offers only a subset of the operators that are available for a
Flux
. For instance, combination operators can either ignore the right
hand-side emissions and return another Mono
or emit values from both sides,
in which case they’ll switch to a Flux
.
For example, Mono#concatWith(Publisher)
returns a Flux
while Mono#then(Mono)
returns another Mono
.
Note that a Mono
can be used to represent no-value asynchronous processes that
only have the concept of completion (such as Runnable
). To create one, use an
empty Mono<Void>
.
4.3. Simple Ways to Create a Flux/Mono and Subscribe to it
The easiest way to get started with Flux
and Mono
is to use one of the
numerous factory methods found in their respective classes.
For instance, to create a sequence of String
, you can either enumerate them
or put them in a collection and create the Flux from it:
Flux<String> seq1 = Flux.just("foo", "bar", "foobar");
List<String> iterable = Arrays.asList("foo", "bar", "foobar");
Flux<String> seq2 = Flux.fromIterable(iterable);
Other examples of factory methods include:
Mono<String> noData = Mono.empty(); (1)
Mono<String> data = Mono.just("foo");
Flux<Integer> numbersFromFiveToSeven = Flux.range(5, 3); (2)
1 | Notice the factory method honors the generic type even though there will be no value |
2 | The first parameter is the start of the range, while the second parameter is the number of items to produce. |
When it comes to subscribing, Flux
and Mono
make use of Java 8 lambdas. You
have a wide choice of .subscribe()
variants that take lambdas for different
combinations of callbacks:
Flux
subscribe(); (1)
subscribe(Consumer<? super T> consumer); (2)
subscribe(Consumer<? super T> consumer,
Consumer<? super Throwable> errorConsumer); (3)
subscribe(Consumer<? super T> consumer,
Consumer<? super Throwable> errorConsumer,
Runnable completeConsumer); (4)
subscribe(Consumer<? super T> consumer,
Consumer<? super Throwable> errorConsumer,
Runnable completeConsumer,
Consumer<? super Subscription> subscriptionConsumer); (5)
1 | Subscribe and trigger the sequence. |
2 | Do something with each produced value. |
3 | Deal with values but also react to an error. |
4 | Deal with values and errors but also execute some code when the sequence successfully completes. |
5 | Deal with values and errors and successful completion but also do something with
the Subscription produced by this subscribe call. |
These variants return a reference to the subscription that one can use to
cancel said subscription when no more data is needed. Upon cancellation, the
source should stop producing values and clean up any resources it created. This
cancel and clean-up behavior is represented in Reactor by the general-purpose
Disposable interface.
|
4.3.1. subscribe
Method Examples
This section contains minimal examples of each of the five signatures for the
subscribe
method. Here is an example of the basic method with no arguments:
Flux<Integer> ints = Flux.range(1, 3); (1)
ints.subscribe(); (2)
1 | Set up a Flux that produces three values when a subscriber attaches. |
2 | Subscribe in the simplest way. |
The preceding code produces no visible output, but it does work. The Flux
produces three values. If we provide a lambda, we can make
the values visible. The next example for the subscribe
method shows how:
Flux<Integer> ints = Flux.range(1, 3); (1)
ints.subscribe(i -> System.out.println(i)); (2)
1 | Set up a Flux that produces three values when a subscriber attaches. |
2 | Subscribe with a subscriber that will print the values. |
The preceding code produces the following output:
1 2 3
To demonstrate the next signature, we intentionally introduce an error, as shown here:
Flux<Integer> ints = Flux.range(1, 4) (1)
.map(i -> { (2)
if (i <= 3) return i; (3)
throw new RuntimeException("Got to 4"); (4)
});
ints.subscribe(i -> System.out.println(i), (5)
error -> System.err.println("Error: " + error));
1 | Set up a Flux that produces four values when a subscriber attaches. |
2 | We need a map so that we can handle some values differently. |
3 | For most values, return the value. |
4 | For one value, force an error. |
5 | Subscribe with a subscriber that includes an error handler. |
We now have two lambda expressions: one for the content we expect and one for errors. The preceding code produces the following output:
1 2 3 Error: java.lang.RuntimeException: Got to 4
The next signature of the subscribe
method includes both an error handler and
a handler for completion events, as shown here:
Flux<Integer> ints = Flux.range(1, 4); (1)
ints.subscribe(i -> System.out.println(i),
error -> System.err.println("Error " + error),
() -> {System.out.println("Done");}); (2)
1 | Set up a Flux that produces four values when a subscriber attaches. |
2 | Subscribe with a Subscriber that includes a handler for completion events. |
Error signals and completion signals are both terminal events, one excluding the other. To make the completion consumer work, we had to take care not to trigger and error.
The completion matcher is a pair of empty parentheses because it matches the run
method in the Runnable
interface, which has no parameters. The preceding code
produces the following output:
1 2 3 4 Done
The last signature of the subscribe
method includes a custom Subscriber
,
which we describe right after this bit of code, which shows how to attach a
custom Subscriber
:
SampleSubscriber<Integer> ss = new SampleSubscriber<Integer>();
Flux<Integer> ints = Flux.range(1, 4);
ints.subscribe(i -> System.out.println(i),
error -> System.err.println("Error " + error),
() -> {System.out.println("Done");},
s -> ss.request(10));
ints.subscribe(ss);
As you can see, we provide a custom Subscriber
as the last argument to the
subscribe
method. Here is that custom Subscriber
object, which is the
simplest possible implementation of a Subscriber
:
package io.projectreactor.samples;
import org.reactivestreams.Subscription;
import reactor.core.publisher.BaseSubscriber;
public class SampleSubscriber<T> extends BaseSubscriber<T> {
public void hookOnSubscribe(Subscription subscription) {
System.out.println("Subscribed");
request(1);
}
public void hookOnNext(T value) {
System.out.println(value);
request(1);
}
}
The SampleSubscriber class extends BaseSubscriber
, which is the recommended
abstract class for user-defined Subscribers
in Reactor. The class offers
hooks that can be overridden to tune the subscriber’s behavior. By default,
it will trigger an unbounded request and behave exactly like subscribe()
.
It is much more useful however when you want a custom request amount. To that
effect, the bare minimum implementation is to implement both hookOnSubscribe(Subscription subscription)
and hookOnNext(T value)
. In this case, the hookOnSubscribe
method prints a
statement to standard out and makes the first request. Then the hookOnNext
method prints a statement and processes each of the remaining requests, one
request at a time.
The SampleSubscriber
class produces the following output:
Subscribed 1 2 3 4
You almost certainly want to implement the hookOnError , hookOnCancel ,
and hookOnComplete methods. You may also want to implement the hookFinally
method. SampleSubscribe is the absolute minimum implementation of a
Subscriber that performs bounded requests.
|
We’ll see BaseSubscriber
again soon.
The Reactive Streams specification defines another variant of the subscribe
method. It allows the attachment of a custom Subscriber without all the other
options, as shown here:
subscribe(Subscriber<? super T> subscriber);
This variant of the subscribe
method is useful if you already have a
Subscriber
handy. More often, though, you need it because you want to do
something subscription-related in the other callbacks. Most probably, you need
to deal with backpressure and triggering the requests yourself.
In that case, you can make things easier by using the BaseSubscriber
abstract
class, which offers convenience methods for handling backpressure:
BaseSubscriber
to fine tune backpressureFlux<String> source = someStringSource();
source.map(String::toUpperCase)
.subscribe(new BaseSubscriber<String>() { (1)
@Override
protected void hookOnSubscribe(Subscription subscription) {
(2)
request(1); (3)
}
@Override
protected void hookOnNext(String value) {
request(1); (4)
}
(5)
});
1 | The BaseSubscriber is an abstract class, so we create an anonymous
implementation and specify the generic type. |
2 | BaseSubscriber defines hooks for the various signal handling you can
implement in a Subscriber . It also deals with the boilerplate of capturing the
Subscription object so that you can manipulate it in other hooks. |
3 | request(n) is such a method. It propagates backpressure request to the
capture subscription from any of the hooks. Here we start the stream by
requesting 1 element from the source. |
4 | Upon receiving a new value, we continue requesting new items from the source one by one. |
5 | Other hooks are hookOnComplete , hookOnError , hookOnCancel , and
hookFinally (which is always called when the sequence terminates, with the
type of termination passed in as a SignalType parameter). |
When manipulating a request, you must be careful to produce enough
demand for the sequence to advance or your Flux will get "stuck". That is why
BaseSubscriber forces you to implement the subscription and onNext hooks,
where you should usually call request at least once.
|
BaseSubscriber
also offers a requestUnbounded()
method to switch to unbounded
mode (equivalent to request(Long.MAX_VALUE)
).
4.4. Programmatically creating a sequence
In this section, we introduce the creation of a Flux
or a Mono
by
programmatically defining its associated events (onNext
, onError
, and
onComplete
). All these methods share the fact that they expose an API to
trigger the events that we call a sink. There are actually a few sink
variants, which we’ll get to shortly.
4.4.1. Generate
The simplest form of programmatic creation of a Flux
is through the generate
method, which takes a generator function.
This is for synchronous and one-by-one emissions, meaning that
the sink is a SynchronousSink
and that its next()
method can only be called
at most once per callback invocation. You can then additionally call error(Throwable)
or complete()
, but this is optional.
The most useful variant is probably the one that also lets you keep a state
that you can refer to in your sink usage to decide what to emit next. The generator
function then becomes a BiFunction<S, SynchronousSink<T>, S>
, with <S>
the
type of the state object. You have to provide a Supplier<S>
for the initial
state, and your generator function now returns a new state on each round.
For instance, you could use an int
as the state:
generate
Flux<String> flux = Flux.generate(
() -> 0, (1)
(state, sink) -> {
sink.next("3 x " + state + " = " + 3*state); (2)
if (state == 10) sink.complete(); (3)
return state + 1; (4)
});
1 | We supply the initial state value of 0. |
2 | We use the state to choose what to emit (a row in the multiplication table of 3). |
3 | We also use it to choose when to stop. |
4 | We return a new state that we use in the next invocation (unless the sequence terminated in this one). |
The code above generates the table of 3, as the following sequence:
3 x 0 = 0 3 x 1 = 3 3 x 2 = 6 3 x 3 = 9 3 x 4 = 12 3 x 5 = 15 3 x 6 = 18 3 x 7 = 21 3 x 8 = 24 3 x 9 = 27 3 x 10 = 30
You can also use a mutable <S>
. The example above could for instance be
rewritten using a single AtomicLong
as the state, mutating it on each round:
Flux<String> flux = Flux.generate(
AtomicLong::new, (1)
(state, sink) -> {
long i = state.getAndIncrement(); (2)
sink.next("3 x " + i + " = " + 3*i);
if (i == 10) sink.complete();
return state; (3)
});
1 | This time, we generate a mutable object as the state. |
2 | We mutate the state here. |
3 | We return the same instance as the new state. |
If your state object needs to clean up some resources, use the
generate(Supplier<S>, BiFunction, Consumer<S>) variant to clean up the last
state instance.
|
Here is an example of using the generate method that includes a Consumer
:
Flux<String> flux = Flux.generate(
AtomicLong::new,
(state, sink) -> { (1)
long i = state.getAndIncrement(); (2)
sink.next("3 x " + i + " = " + 3*i);
if (i == 10) sink.complete();
return state; (3)
}, (state) -> System.out.println("state: " + state)); (4)
}
1 | Again, we generate a mutable object as the state. |
2 | We mutate the state here. |
3 | We return the same instance as the new state. |
4 | We see the last state value (11) as the output of this Consumer lambda. |
In the case of the state containing a database connection or other resource
that needs to be handled at the end of the process, the Consumer
lambda could
close the connection or otherwise handle any tasks that should be done at the
end of the process.
4.4.2. Create
The more advanced form of programmatic creation of a Flux
, create
can work
asynchronously or synchronously and is suitable for multiple emissions per
round.
It exposes a FluxSink
, with its next
, error
, and complete
methods.
Contrary to generate
, it doesn’t have a state-based variant. On the other
hand, it can trigger multiple events in the callback (even from any thread
at a later point in time).
create can be very useful to bridge an existing API with the reactive
world - such as an asynchronous API based on listeners.
|
Imagine that you use a listener-based API. It processes data by chunks
and has two events: (1) a chunk of data is ready and (2) the processing is
complete (terminal event), as represented in the MyEventListener
interface:
interface MyEventListener<T> {
void onDataChunk(List<T> chunk);
void processComplete();
}
You can use create
to bridge this into a Flux<T>
:
Flux<String> bridge = Flux.create(sink -> {
myEventProcessor.register( (4)
new MyEventListener<String>() { (1)
public void onDataChunk(List<String> chunk) {
for(String s : chunk) {
sink.next(s); (2)
}
}
public void processComplete() {
sink.complete(); (3)
}
});
});
1 | Bridge to the MyEventListener API |
2 | Each element in a chunk becomes an element in the Flux . |
3 | The processComplete event is translated to onComplete . |
4 | All of this is done asynchronously whenever the myEventProcessor executes. |
Additionally, since create
can be asynchronous and manages backpressure, you
can refine how to behave backpressure-wise, by indicating an OverflowStrategy
:
-
IGNORE
to Completely ignore downstream backpressure requests. This may yieldIllegalStateException
when queues get full downstream. -
ERROR
to signal anIllegalStateException
when the downstream can’t keep up. -
DROP
to drop the incoming signal if the downstream is not ready to receive it. -
LATEST
to let downstream only get the latest signals from upstream. -
BUFFER
(the default) to buffer all signals if the downstream can’t keep up. (this does unbounded buffering and may lead toOutOfMemoryError
).
Mono also has a create generator. The MonoSink of Mono’s create
doesn’t allow several emissions. It will drop all signals after the first one.
|
Push model
A variant of create
is push
, which is suitable for processing events
from a single producer. Similar to create
, push
can also be asynchronous
and can manage backpressure using any of the overflow strategies supported
by create
. Only one producing thread may invoke next
, complete
, or
error
at a time.
Flux<String> bridge = Flux.push(sink -> {
myEventProcessor.register(
new SingleThreadEventListener<String>() { (1)
public void onDataChunk(List<String> chunk) {
for(String s : chunk) {
sink.next(s); (2)
}
}
public void processComplete() {
sink.complete(); (3)
}
public void processError(Throwable e) {
sink.error(e); (4)
}
});
});
1 | Bridge to the SingleThreadEventListener API. |
2 | Events are pushed to the sink using next from a single listener thread. |
3 | complete event generated from the same listener thread. |
4 | error event also generated from the same listener thread. |
Hybrid push/pull model
Unlike push
, create
may be used in push
or pull
mode, making it suitable
for bridging with listener-based APIs where data may be delivered asynchronously
at any time. An onRequest
callback can be registered on FluxSink
to track
requests. The callback may be used to request more data from the source if
required and to manage backpressure by delivering data to the sink only when
requests are pending. This enables a hybrid push/pull model where downstream can
pull data that is already available from upstream and upstream can push data to
downstream when data becomes available at a later time.
Flux<String> bridge = Flux.create(sink -> {
myMessageProcessor.register(
new MyMessageListener<String>() {
public void onMessage(List<String> messages) {
for(String s : messages) {
sink.next(s); (3)
}
}
});
sink.onRequest(n -> {
List<String> messages = myMessageProcessor.request(n); (1)
for(String s : message) {
sink.next(s); (2)
}
});
1 | Poll for messages when requests are made. |
2 | If messages are available immediately, push them to the sink. |
3 | The remaining messages that arrive asynchronously later are also delivered. |
Cleaning up
Two callbacks, onDispose
and onCancel
, perform any cleanup on cancellation
or termination. onDispose
can be used to perform cleanup when the Flux
completes, errors out, or is cancelled. 'onCancel can be used to perform any
action specific to cancellation prior to cleanup with onDispose
.
Flux<String> bridge = Flux.create(sink -> {
sink.onRequest(n -> channel.poll(n))
.onCancel(() -> channel.cancel()) (1)
.onDipose(() -> channel.close()) (2)
});
1 | onCancel is invoked for cancel signal. |
2 | onDispose is invoked for complete, error, or cancel. |
4.4.3. Handle
The handle
method is a bit different. It is present in both Mono
and
Flux
. Also, it is an instance method, meaning that it is chained on an
existing source (as are the common operators).
It is close to generate
, in the sense that it uses a SynchronousSink
and
only allows one-by-one emissions. However, handle
can be used to generate an
arbitrary value out of each source element, possibly skipping some elements. In
this way, it can serve as a combination of map
and filter
. The signature of
handle is as follows:
handle(BiConsumer<T, SynchronousSink<R>>)
Let’s consider an example. The reactive streams specification disallows null
values in a sequence. What if you want to perform a map
but you want to use
a preexisting method as the map function, and that method sometimes returns null?
For instance, the following method can be applied safely to a source of integers:
public String alphabet(int letterNumber) {
if (letterNumber < 1 || letterNumber > 26) {
return null;
}
int letterIndexAscii = 'A' + letterNumber - 1;
return "" + (char) letterIndexAscii;
}
We can then use handle
to remove any nulls:
handle
for a "map and eliminate nulls" scenarioFlux<String> alphabet = Flux.just(-1, 30, 13, 9, 20)
.handle((i, sink) -> {
String letter = alphabet(i); (1)
if (letter != null) (2)
sink.next(letter); (3)
});
alphabet.subscribe(System.out::println);
1 | Map to letters. |
2 | If the "map function" returns null…. |
3 | Filter it out by not calling sink.next . |
Which will print out:
M I T
4.5. Schedulers
Reactor, like RxJava, can be considered concurrency agnostic. That is, it does not enforce a concurrency model but rather leaves you, the developer, in command. However, that doesn’t prevent the library from helping you with concurrency.
In Reactor, the execution model and where the execution happens is determined by
the Scheduler
that is used. A
Scheduler
is an interface that can abstract a wide range of implementations. The
Schedulers
class has static methods that give access to the following execution contexts:
-
the current thread (
Schedulers.immediate()
) -
a single, reusable thread (
Schedulers.single()
). Note that this method reuses the same thread for all callers, until the Scheduler is disposed. If you want a per-call dedicated thread, useSchedulers.newSingle()
for each call. -
an elastic thread pool (
Schedulers.elastic()
). It will create new worker pools as needed, and reuse idle ones. Worker pools that stay idle for too long (default is 60s) are disposed. This is a good choice for I/O blocking work for instance.Schedulers.elastic()
is a handy way to give a blocking process its own thread, so that it doesn’t tie up other resources. See How do I wrap a synchronous, blocking call? -
a fixed pool of workers that is tuned for parallel work (
Schedulers.parallel()
). It will create as many workers as you have CPU cores.
Additionally, you can create a Scheduler
out of any pre-existing
ExecutorService
using Schedulers.fromExecutorService(ExecutorService)
. (You
can also create one from an Executor
, although doing so is discouraged.)
You can also create new instances of the various scheduler types using newXXX
methods. For example, Schedulers.newElastic(yourScheduleName)
creates a new
elastic scheduler named yourScheduleName
.
Operators are implemented using non-blocking algorithms tuned to facilitate the work stealing that can happen in some Schedulers. |
Some operators use a specific Scheduler from Schedulers
by default (and
usually give you the option of providing a different one). For instance, calling
the factory method Flux.interval(Duration.ofMillis(300))
produces a
Flux<Long>
that ticks every 300ms. This is enabled by Schedulers.parallel()
by default.
Reactor offers two means of switching the execution context (or Scheduler
) in
a reactive chain: publishOn
and subscribeOn
. Both take a Scheduler
and
let you switch the execution context to that scheduler. But the placement of
publishOn
in the chain matters, while the placement of subscribeOn
does
not. To understand that difference, you first have to remember that
nothing happens until you subscribe().
In Reactor, when you chain operators, you can wrap as many Flux
and Mono
implementations inside one another as you need. Once you subscribe, a chain of
Subscriber
objects is created, backward (up the chain) to the first
publisher. This is effectively hidden from you. All you can see is the outer
layer of Flux
(or Mono
) and Subscription
, but these intermediate
operator-specific subscribers are where the real work happens.
With that knowledge, we can have a closer look at the publishOn
and
subscribeOn
operators:
-
publishOn
applies in the same way as any other operator, in the middle of the subscriber chain. It takes signals from downstream and replays them upstream while executing the callback on a worker from the associatedScheduler
. So it affects where the subsequent operators will execute (until anotherpublishOn
is chained in). -
subscribeOn
applies to the subscription process, when that backward chain is constructed. As a consequence, no matter where you place thesubscribeOn
in the chain, it always affects the context of the source emission. However, this doesn’t affect the behavior of subsequent calls topublishOn
. They still switch the execution context for the part of the chain after them.
Only the earliest subscribeOn call in the chain is actually taken into
account.
|
4.6. Threading
Flux
and Mono
do not create threads. Some operators, such as publishOn
, do
create threads. Also, as a form of work sharing, these operators can "steal"
threads from other work pools if the other pools are currently idle.
Consequently, neither the Flux
or Mono
objects nor the Subscriber
objects
have to be smart about threads. They rely on the operators to manage the
threads and work pools.
publishOn
forces the next operator (and possibly subsequent operators after
the next one) to run on a different thread. Similarly, subscribeOn
forces the
previous operator (and possibly operators prior to the previous one) to run on
a different thread. Remember that, until you subscribe, you’re defining a
process but not starting it. For that reason, Reactor can use these rules to
determine how the processing must proceed. Then, when you subscribe, the whole
process starts working.
Consider an example showing that multiple threads can support work sharing:
Flux.range(1, 10000) (1)
.publishOn(Schedulers.parallel()) (2)
.subscribe(result) (3)
1 | Create a Flux for 10,000 items |
2 | Create as many threads as there are CPUs (minimum 4). |
3 | Nothing Happens Until You subscribe() . |
Scheduler.parallel()
creates a fixed pool of single-threaded
ExecutorService
-based workers. Because one or two threads may lead to
problems, it always creates at least four threads. The publishOn method then
shares these threaded workers, getting items from whichever is emitting when
publishOn requests an item. In this fashion, we get work sharing (a form of
resource sharing). Reactor enables several other ways to share for resources
as well, as documented in
Schedulers.
Scheduler.elastic()
also creates threads and is a handy way to create a
dedicated thread in the case of wrapping a blocking resource (such as a
synchronous service). See How do I wrap a synchronous, blocking call?.
Behind the scenes, the operators ensure thread safety by using incremental counters and guard conditions. For instance, if we have four threads hitting a single stream (as shown in the preceding example) each request increments a counter, so that future requests from the threads get the right item.
4.7. Handling Errors
For a quick look at the available operators for error handling, see the relevant operator decision tree. |
In Reactive Streams, errors are terminal events. As soon as an error occurs, it
stops the sequence and gets propagated down the chain of operators to the last
step, the Subscriber
you defined and its onError
method.
Such errors should still be dealt with at the application level. For instance,
you might display an error notification in a UI or send a meaningful error
payload in a REST endpoint. For this reason, the subscriber’s onError
method
should always be defined.
If not defined, onError throws an UnsupportedOperationException .
You can further detect and triage it with the
Exceptions.isErrorCallbackNotImplemented method.
|
Reactor also offers alternative means of dealing with errors in the middle of the chain, as error-handling operators.
Before you learn about error-handling operators, you must keep in
mind that any error in a reactive sequence is a terminal event. Even if an
error-handling operator is used, it does not allow the original sequence to
continue but rather converts the onError signal into the start of a new
sequence (the fallback one). In other words, it replaces the terminated
sequence upstream.
|
Now we can consider each means of error handling one-by-one. When relevant, we
make a parallel with imperative programming’s try
patterns.
4.7.1. Error Handling Operators
You may be familiar with several ways of dealing with exceptions in a try-catch block. Most notably:
-
Catch and return a static default value.
-
Catch and execute an alternative path with a fallback method.
-
Catch and dynamically compute a fallback value.
-
Catch, wrap to a
BusinessException
, and re-throw. -
Catch, log an error-specific message, and re-throw.
-
Use the
finally
block to clean up resources or a Java 7 "try-with-resource" construct.
All of these have equivalents in Reactor, in the form of error-handling operators.
Before looking into these operators, we first establish a parallel between a reactive chain and a try-catch block.
When subscribing, the onError
callback at the end of the chain is akin to a catch
block. There, execution skips to the catch in case an Exception
is thrown:
Flux<String> s = Flux.range(1, 10)
.map(v -> doSomethingDangerous(v)) (1)
.map(v -> doSecondTransform(v)); (2)
s.subscribe(value -> System.out.println("RECEIVED " + value), (3)
error -> System.err.println("CAUGHT " + error) (4)
);
1 | A transformation is performed that can throw an exception. |
2 | If everything went well, a second transformation is performed. |
3 | Each successfully transformed value is printed out. |
4 | In case of an error, the sequence terminates and an error message is displayed. |
This is conceptually similar to the following try/catch block:
try {
for (int i = 1; i < 11; i++) {
String v1 = doSomethingDangerous(i); (1)
String v2 = doSecondTransform(v1); (2)
System.out.println("RECEIVED " + v2);
}
} catch (Throwable t) {
System.err.println("CAUGHT " + t); (3)
}
1 | If an exception is thrown here… |
2 | …the rest of the loop is skipped… |
3 | …and the execution goes straight to here. |
Now that we have established a parallel, we’ll look at the different error handling cases and their equivalent operators.
Static fallback value
The equivalent of (1) (catch and return a static default value) is
onErrorReturn
:
Flux.just(10)
.map(this::doSomethingDangerous)
.onErrorReturn("RECOVERED");
You also have the option of filtering (choosing) when to recover with a default value versus letting the error propagate, depending on the exception that occurred:
Flux.just(10)
.map(this::doSomethingDangerous)
.onErrorReturn(e -> e.getMessage().equals("boom10"), "recovered10");
Fallback Method
If you want more than a single default value and you have an alternative safer
way of processing your data, you can use onErrorResume
. This would be the
equivalent of (2) (catch and execute an alternative path with a fallback
method).
For example, if your nominal process is fetching data from an external and unreliable service, but you also keep a local cache of the same data that can be a bit more out of date but is more reliable, you could do the following:
Flux.just("key1", "key2")
.flatMap(k ->
callExternalService(k) (1)
.onErrorResume(e -> getFromCache(k)) (2)
);
1 | For each key, we asynchronously call the external service. |
2 | If the external service call fails, we fallback to the cache for that key.
Note that we always apply the same fallback, whatever the source error, e , is. |
Like onErrorReturn
, onErrorResume
has variants that let you filter which
exceptions to fallback on, based either on the exception’s class or a
Predicate
. The fact that it takes a Function
also allows you to choose a
different fallback sequence to switch to, depending on the error encountered:
Flux.just("timeout1", "unknown", "key2")
.flatMap(k ->
callExternalService(k)
.onErrorResume(error -> { (1)
if (error instanceof TimeoutException) (2)
return getFromCache(k);
else if (error instanceof UnknownKeyException) (3)
return registerNewEntry(k, "DEFAULT");
else
return Flux.error(error); (4)
})
);
1 | The function allows dynamically choosing how to continue. |
2 | If the source times out, hit the local cache. |
3 | If the source says the key is unknown, create a new entry. |
4 | In all other cases, "re-throw". |
Dynamic Fallback Value
Even if you do not have an alternative safer way of processing your data, you might want to compute a fallback value out of the exception you received. This would be the equivalent of (3) (catch and dynamically compute a fallback value).
For instance, if your return type has a variant dedicated to holding an
exception (think Future.complete(T success)
vs
Future.completeExceptionally(Throwable error)
), you could instantiate the
error-holding variant and pass the exception.
This can be done in the same way as the fallback method solution, using
onErrorResume
. You need a tiny bit of boilerplate:
erroringFlux.onErrorResume(error -> Mono.just( (1)
myWrapper.fromError(error) (2)
));
1 | The boilerplate creates a Mono from Mono.just with onErrorResume . |
2 | You then wrap the exception into the ad hoc class or otherwise compute the value out of the exception. |
Catch and Rethrow
In the "fallback method" example, the last line inside the flatMap
gives us a
hint as to how item (4) (Catch, wrap to a BusinessException
, and re-throw)
could be achieved:
Flux.just("timeout1")
.flatMap(k -> callExternalService(k)
.onErrorResume(original -> Flux.error(
new BusinessException("oops, SLA exceeded", original))
)
);
However, there is a more straightforward way of achieving the same with
onErrorMap
:
Flux.just("timeout1")
.flatMap(k -> callExternalService(k)
.onErrorMap(original -> new BusinessException("oops, SLA exceeded", original))
);
Log or React on the Side
For cases where you want the error to continue propagating, but you still want
to react to it without modifying the sequence (logging it, for instance) there
is the doOnError
operator. This is the equivalent of (5) (Catch, log an
error-specific message, and re-throw). This operator, as well as all operators
prefixed with doOn
, are sometimes referred to as a "side-effect". They let
you peek inside the sequence’s events without modifying them.
The following example ensures that, when we fallback to the cache, we at least log that the external service had a failure. We can also imagine we have statistic counters to increment as an error side-effect.
LongAdder failureStat = new LongAdder();
Flux<String> flux =
Flux.just("unknown")
.flatMap(k -> callExternalService(k) (1)
.doOnError(e -> {
failureStat.increment();
log("uh oh, falling back, service failed for key " + k); (2)
})
.onErrorResume(e -> getFromCache(k)) (3)
);
1 | The external service call that can fail… |
2 | …is decorated with a logging side-effect… |
3 | …and then protected with the cache fallback. |
Using Resources and the Finally Block
The last parallel to draw with imperative programming is the cleaning up that
can be done either via a Java 7 "try-with-resources" construct or the use of
the finally
block. This is the equivalent of (6) (use the finally
block
to clean up resources or a Java 7 "try-with-resource" construct). Both have
their Reactor equivalents, using
and doFinally
:
AtomicBoolean isDisposed = new AtomicBoolean();
Disposable disposableInstance = new Disposable() {
@Override
public void dispose() {
isDisposed.set(true); (4)
}
@Override
public String toString() {
return "DISPOSABLE";
}
};
Flux<String> flux =
Flux.using(
() -> disposableInstance, (1)
disposable -> Flux.just(disposable.toString()), (2)
Disposable::dispose (3)
);
1 | The first lambda generates the resource. Here we return our mock
Disposable . |
2 | The second lambda processes the resource, returning a Flux<T> . |
3 | The third lambda is called when the Flux from 2) terminates or is
cancelled, to clean up resources. |
4 | After subscription and execution of the sequence, the isDisposed atomic
boolean would become true . |
On the other hand, doFinally
is about side-effects that you want to be
executed whenever the sequence terminates, whether with onComplete
,
onError
, or cancellation. It gives you a hint as to what kind of termination
triggered the side-effect:
LongAdder statsCancel = new LongAdder(); (1)
Flux<String> flux =
Flux.just("foo", "bar")
.doFinally(type -> {
if (type == SignalType.CANCEL) (2)
statsCancel.increment(); (3)
})
.take(1); (4)
1 | We assume we want to gather statistics. Here we use a LongAdder . |
2 | doFinally consumes a SignalType for the type of termination. |
3 | Here we increment statistics in case of cancellation only. |
4 | take(1) will cancel after 1 item is emitted. |
Demonstrating the Terminal Aspect of onError
In order to demonstrate that all these operators cause the upstream original
sequence to terminate when the error happens, we can use a more visual example
with a Flux.interval
. The interval operator ticks every x units of time with
an increasing Long
value:
Flux<String> flux =
Flux.interval(Duration.ofMillis(250))
.map(input -> {
if (input < 3) return "tick " + input;
throw new RuntimeException("boom");
})
.onErrorReturn("Uh oh");
flux.subscribe(System.out::println);
Thread.sleep(2100); (1)
1 | Note that interval executes on a timer Scheduler by default.
Assuming we want to run that example in a main class, we add a sleep here so
that the application does not exit immediately without any value being produced. |
This prints out one line every 250ms:
tick 0 tick 1 tick 2 Uh oh
Even with one extra second of runtime, no more tick comes in from the interval
.
The sequence was indeed terminated by the error.
Retrying
There is another operator of interest with regards to error handling, and you
might be tempted to use it in the case above. retry
, as its name indicates,
lets you retry an error-producing sequence.
The trouble is that it works by re-subscribing to the upstream Flux
.
This is really a different sequence, and the original one is still
terminated. To verify that, we can re-use the previous example and append a
retry(1)
to retry once instead of the onErrorReturn:
Flux.interval(Duration.ofMillis(250))
.map(input -> {
if (input < 3) return "tick " + input;
throw new RuntimeException("boom");
})
.elapsed() (1)
.retry(1)
.subscribe(System.out::println,
System.err::println); (2)
Thread.sleep(2100); (3)
1 | elapsed associates each value with the duration since previous value was
emitted. |
2 | We also want to see when there is an onError . |
3 | Ensure we have enough time for our 4x2 ticks. |
This produces the following output:
259,tick 0 249,tick 1 251,tick 2 506,tick 0 (1) 248,tick 1 253,tick 2 java.lang.RuntimeException: boom
1 | A new interval started, from tick 0. The additional 250ms duration is
coming from the 4th tick, the one that causes the exception and subsequent
retry. |
As you can see above, retry(1)
merely re-subscribed to the original interval
once, restarting the tick from 0. The second time around, since the exception
still occurs, it gives up and propagates the error downstream.
There is a more advanced version of retry
that uses a "companion" Flux
to
tell whether or not a particular failure should retry: retryWhen
. This
companion Flux
is created by the operator but decorated by the user, in order
to customize the retry condition.
The companion Flux
is a Flux<Throwable>
that gets passed to a Function
,
the sole parameter of retryWhen
. As the user, you define that function and
make it return a new Publisher<?>
. Retry cycles will go like this:
-
Each time an error happens (potential for a retry), the error is emitted into the companion
Flux
, which has been decorated by your function. -
If the companion
Flux
emits something, a retry happens. -
If the companion
Flux
completes, the retry cycle stops and the original sequence completes too. -
If the companion
Flux
produces an error, the retry cycle stops and the original sequence also stops or completes, and the error causes the original sequence to fail and terminate.
The distinction between the previous two cases is important. Simply completing
the companion would effectively swallow an error. Consider the following way of
emulating retry(3)
using retryWhen
:
Flux<String> flux =
Flux.<String>error(new IllegalArgumentException()) (1)
.doOnError(System.out::println) (2)
.retryWhen(companion -> companion.take(3)); (3)
1 | This continuously produces errors, calling for retry attempts. |
2 | doOnError before the retry will let us see all failures |
3 | Here, we consider the first 3 errors as retry-able (take(3) ) and then
give up. |
In effect, this results in an empty Flux
, but it completes successfully.
Since retry(3)
on the same Flux
would have terminated with the latest error,
this retryWhen
example is not exactly the same as a retry(3)
.
Getting to the same behavior involves a few additional tricks:
Flux<String> flux =
Flux.<String>error(new IllegalArgumentException())
.retryWhen(companion -> companion
.zipWith(Flux.range(1, 4), (1)
(error, index) -> { (2)
if (index < 4) return index; (3)
else throw Exceptions.propagate(error); (4)
})
);
1 | Trick one: use zip and a range of "number of acceptable retries + 1". |
2 | The zip function lets you count the retries while keeping track of
the original error. |
3 | To allow for 3 retries, indexes before 4 return a value to emit… |
4 | …but, in order to terminate the sequence in error, we throw the original exception after these 3 retries. |
Similar code can be used to implement an exponential backoff and retry pattern, as shown in the FAQ. |
4.7.2. Handling Exceptions in Operators or Functions
In general, all operators can themselves contain code that potentially trigger an exception or calls a user-defined callback that can similarly fail, so they all contain some form of error handling.
As a rule of thumb, an Unchecked Exception will always be propagated through
onError
. For instance, throwing a RuntimeException
inside a map
function
will translate to an onError
event:
Flux.just("foo")
.map(s -> { throw new IllegalArgumentException(s); })
.subscribe(v -> System.out.println("GOT VALUE"),
e -> System.out.println("ERROR: " + e));
The preceding code would print out:
ERROR: java.lang.IllegalArgumentException: foo
Reactor, however, defines a set of exceptions (such as OutOfMemoryError
) that
are always deemed fatal. See the Exceptions.throwIfFatal
method. These
errors mean that Reactor cannot keep operating and are thrown rather than
propagated.
Internally, there are also cases where an unchecked exception still
cannot be propagated (most notably during the subscribe and request phases),
due to concurrency races that could lead to double onError or onComplete
conditions. When these races happen, the error that cannot be propagated is
"dropped". These cases can still be managed to some extent, as the error goes
through the Hooks.onErrorDropped customizable hook.
|
You may wonder, what about Checked Exceptions?
If, for example, you need to call some method that declares it throws
exceptions, you still have to deal with those exceptions in a try-catch
block. You have several options, though:
-
Catch the exception and recover from it. The sequence continues normally.
-
Catch the exception and wrap it into an unchecked exception, then throw it (interrupting the sequence). The
Exceptions
utility class can help you with that (we get to that next). -
If you are expected to return a
Flux
(for example, you are in aflatMap
), wrap the exception into an error-producingFlux
:return Flux.error(checkedException)
. (The sequence also terminates.)
Reactor has an Exceptions
utility class that you can use to ensure that
exceptions are wrapped only if they are checked exceptions:
-
Use the
Exceptions.propagate
method to wrap exceptions if necessary. It also callsthrowIfFatal
first and does not wrapRuntimeException
. -
Use the
Exceptions.unwrap
method to get the original unwrapped exception (going back to the root cause of a hierarchy of reactor-specific exceptions).
Consider the example of a map
that uses a conversion method that can throw
an IOException
:
public String convert(int i) throws IOException {
if (i > 3) {
throw new IOException("boom " + i);
}
return "OK " + i;
}
Now imagine that you want to use that method in a map
. You must now explicitly
catch the exception, and your map function cannot re-throw it. So you can
propagate it to the map’s onError
method as a RuntimeException
:
Flux<String> converted = Flux
.range(1, 10)
.map(i -> {
try { return convert(i); }
catch (IOException e) { throw Exceptions.propagate(e); }
});
Later on, when subscribing to the above Flux
and reacting to errors (such as
in the UI) you could revert back to the original exception in case you want to
do something special for IOExceptions:
converted.subscribe(
v -> System.out.println("RECEIVED: " + v),
e -> {
if (Exceptions.unwrap(e) instanceof IOException) {
System.out.println("Something bad happened with I/O");
} else {
System.out.println("Something bad happened");
}
}
);
4.8. Processor
Processors are a special kind of Publisher
that are also a Subscriber
. That
means that you can subscribe
to a Processor
(generally, they implement
Flux
), but you can also call methods to manually inject data into the
sequence or terminate it.
There are several kind of Processors, each with a few particular semantics, but before you start looking into these, you need to ask yourself the following question:
4.8.1. Do I need a Processor?
Most of the time, you should try to avoid using a Processor
. They are harder
to use correctly and prone to some corner cases.
If you think a Processor
could be a good match for your use case, ask
yourself if you have tried these two alternatives:
-
Could an operator or combination of operators fit the bill? (See Which operator do I need?.)
-
Could a generator operator work instead? (Generally, these operators are made to bridge APIs that are not reactive, providing a "sink" that is similar in concept to a
Processor
in the sense that it lets you populate the sequence with data or terminate it).
If, after exploring the above alternatives, you still think you need a
Processor
, read the Choosing the right Processor
appendix to learn about the different
implementations.
4.8.2. Producing from multiple threads
FluxProcessor
sinks safely gate multi-threaded producers and can be used
by applications that generate data from multiple threads concurrently.
For example, a thread-safe serialized sink can be created for
UnicastProcessor
:
UnicastProcessor<Integer> processor = UnicastProcessor.create();
FluxSink<Integer> sink = processor.sink(overflowStrategy);
Multiple producer threads may concurrently generate data on this serialized sink:
sink.next(n);
Overflow from next
behaves in two possible ways, depending on the Processor
:
-
An unbounded processor handles the overflow itself by dropping or buffering.
-
A bounded processor blocks or "spins" on the
IGNORE
strategy or applies theoverflowStrategy
behavior specified for thesink
.
5. Which operator do I need?
In this section, if an operator is specific to Flux or Mono it is
prefixed accordingly. Common operators have no prefix. When a specific use case
is covered by a combination of operators, it is presented as a method call, with
leading dot and parameters in parentheses, like this: .methodCall(parameter) .
|
I want to deal with:
5.1. Creating a new sequence…
-
that emits a
T
, and I already have:just
-
…from an
Optional<T>
:Mono#justOrEmpty(Optional<T>)
-
…from a potentially
null
T:Mono#justOrEmpty(T)
-
-
that emits a
T
returned by a method:just
as well-
…but lazily captured: use
Mono#fromSupplier
or wrapjust
insidedefer
-
-
that emits several
T
I can explicitly enumerate:Flux#just(T...)
-
that iterates over:
-
an array:
Flux#fromArray
-
a collection or iterable:
Flux#fromIterable
-
a range of integers:
Flux#range
-
-
that emits from various single-valued sources such as:
-
a
Supplier<T>
:Mono#fromSupplier
-
a task:
Mono#fromCallable
,Mono#fromRunnable
-
a
CompletableFuture<T>
:Mono#fromFuture
-
-
that completes:
empty
-
that errors immediately:
error
-
that never does anything:
never
-
that is decided at subscription:
defer
-
that depends on a disposable resource:
using
-
that generates events programmatically (can use state):
-
synchronously and one-by-one:
Flux#generate
-
asynchronously (can also be sync), multiple emissions possible in one pass:
Flux#create
(Mono#create
as well, without the multiple emission aspect)
-
5.2. An existing sequence
-
I want to transform existing data:
-
on a 1-to-1 basis (eg. strings to their length):
map
-
…by just casting it:
cast
-
-
on a 1-to-n basis (eg. strings to their characters):
flatMap
+ use a factory method -
on a 1-to-n basis with programmatic behavior for each source element and/or state:
handle
-
running an asynchronous task for each source item (eg. urls to http request):
flatMap
+ an asyncPublisher
-returning method-
…ignoring some data: conditionally return a
Mono.empty()
in the flatMap lambda -
…retaining the original sequence order:
Flux#flatMapSequential
(this triggers the async processes immediately but reorders the results) -
…where the async task can return multiple values, from a
Mono
source:Mono#flatMapMany
-
-
-
I want to aggregate a
Flux
: (theFlux#
prefix is assumed below)-
into a List:
collectList
,collectSortedList
-
into a Map:
collectMap
,collectMultiMap
-
into an arbitrary container:
collect
-
into the size of the sequence:
count
-
by applying a function between each element (eg. running sum):
reduce
-
…but emitting each intermediary value:
scan
-
-
into a boolean value from a predicate:
-
applied to all values (AND):
all
-
applied to at least one value (OR):
any
-
testing the presence of any value:
hasElements
-
testing the presence of a specific value:
hasElement
-
-
-
I want to combine publishers…
-
in sequential order:
Flux#concat
or.concatWith(other)
-
…but delaying any error until remaining publishers have been emitted:
Flux#concatDelayError
-
…but eagerly subscribing to subsequent publishers:
Flux#mergeSequential
-
-
in emission order (combined items emitted as they come):
Flux#merge
/.mergeWith(other)
-
…with different types (transforming merge):
Flux#zip
/Flux#zipWith
-
-
by pairing values:
-
from 2 Monos into a
Tuple2
:Mono#zipWith
-
from n Monos when they all completed:
Mono#zip
-
-
by coordinating their termination:
-
from 1 Mono and any source into a
Mono<Void>
:Mono#and
-
from n sources when they all completed:
Mono#when
-
into an arbitrary container type:
-
each time all sides have emitted:
Flux#zip
(up to the smallest cardinality) -
each time a new value arrives at either side:
Flux#combineLatest
-
-
-
only considering the sequence that emits first:
Flux#first
,Mono#first
,mono.or (otherMono).or(thirdMono)
, `flux.or(otherFlux).or(thirdFlux) -
triggered by the elements in a source sequence:
switchMap
(each source element is mapped to a Publisher) -
triggered by the start of the next publisher in a sequence of publishers:
switchOnNext
-
-
I want to repeat an existing sequence:
repeat
-
…but at time intervals:
Flux.interval(duration).flatMap(tick -> myExistingPublisher)
-
-
I have an empty sequence but…
-
I want a value instead:
defaultIfEmpty
-
I want another sequence instead:
switchIfEmpty
-
-
I have a sequence but I am not interested in values:
ignoreElements
-
…and I want the completion represented as a
Mono
:then
-
…and I want to wait for another task to complete at the end:
thenEmpty
-
…and I want to switch to another
Mono
at the end:Mono#then(mono)
-
…and I want to switch to a
Flux
at the end:thenMany
-
-
I have a Mono for which I want to defer completion…
-
…only when 1-N other publishers have all emitted (or completed):
Mono#delayUntilOther
-
…and deriving these publishers from the Mono value:
Mono#delayUntil(Function)
-
-
5.3. Peeking into a sequence
-
Without modifying the final sequence, I want to:
-
get notified of / execute additional behavior [1] on:
-
emissions:
doOnNext
-
completion:
Flux#doOnComplete
,Mono#doOnSuccess
(includes the result if any) -
error termination:
doOnError
-
cancellation:
doOnCancel
-
subscription:
doOnSubscribe
-
request:
doOnRequest
-
completion or error:
doOnTerminate
(Mono version includes the result if any)-
but after it has been propagated downstream:
doAfterTerminate
-
-
any type of signal, represented as a
Signal
:Flux#doOnEach
-
any terminating condition (complete, error, cancel):
doFinally
-
-
log what happens internally:
log
-
-
I want to know of all events:
-
each represented as
Signal
object:-
in a callback outside the sequence:
doOnEach
-
instead of the original onNext emissions:
materialize
-
…and get back to the onNexts:
dematerialize
-
-
-
as a line in a log:
log
-
5.4. Filtering a sequence
-
I want to filter a sequence:
-
based on an arbitrary criteria:
filter
-
…that is asynchronously computed:
filterWhen
-
-
restricting on the type of the emitted objects:
ofType
-
by ignoring the values altogether:
ignoreElements
-
by ignoring duplicates:
-
in the whole sequence (logical set):
Flux#distinct
-
between subsequently emitted items (deduplication):
Flux#distinctUntilChanged
-
-
-
I want to keep only a subset of the sequence:
-
by taking elements:
-
at the beginning of the sequence:
Flux#take(int)
-
…based on a duration:
Flux#take(Duration)
-
…only the first element, as a
Mono
:Flux#next()
-
-
at the end of the sequence:
Flux#takeLast
-
until a criteria is met (inclusive):
Flux#takeUntil
(predicate-based),Flux#takeUntilOther
(companion publisher-based) -
while a criteria is met (exclusive):
Flux#takeWhile
-
-
by taking at most 1 element:
-
at a specific position:
Flux#elementAt
-
at the end:
.takeLast(1)
-
…and emit an error if empty:
Flux#last()
-
…and emit a default value if empty:
Flux#last(T)
-
-
-
by skipping elements:
-
at the beginning of the sequence:
Flux#skip(int)
-
…based on a duration:
Flux#skip(Duration)
-
-
at the end of the sequence:
Flux#skipLast
-
until a criteria is met (inclusive):
Flux#skipUntil
(predicate-based),Flux#skipUntilOther
(companion publisher-based) -
while a criteria is met (exclusive):
Flux#skipWhile
-
-
by sampling items:
-
by duration:
Flux#sample(Duration)
-
but keeping the first element in the sampling window instead of the last:
sampleFirst
-
-
by a publisher-based window:
Flux#sample(Publisher)
-
based on a publisher "timing out":
Flux#sampleTimeout
(each element triggers a publisher, and is emitted if that publisher does not overlap with the next)
-
-
-
I expect at most 1 element (error if more than one)…
-
and I want an error if the sequence is empty:
Flux#single()
-
and I want a default value if the sequence is empty:
Flux#single(T)
-
and I accept an empty sequence as well:
Flux#singleOrEmpty
-
5.5. Errors
-
I want to create an erroring sequence:
error
…-
…to replace the completion of a successful
Flux
:.concat(Flux.error(e))
-
…to replace the emission of a successful
Mono
:.then(Mono.error(e))
-
…if too much time elapses between onNexts:
timeout
-
-
I want the try/catch equivalent of:
-
throwing:
error
-
catching an exception:
-
and falling back to a default value:
onErrorReturn
-
and falling back to another
Flux
orMono
:onErrorResume
-
and wrapping and re-throwing:
.onErrorMap(t -> new RuntimeException(t))
-
-
the finally block:
doFinally
-
the using pattern from Java 7:
using
factory method
-
-
I want to recover from errors…
-
by falling back:
-
to a value:
onErrorReturn
-
to a
Publisher
orMono
, possibly different ones depending on the error:Flux#onErrorResume
andMono#onErrorResume
-
-
by retrying:
retry
-
…triggered by a companion control Flux:
retryWhen
-
-
-
I want to deal with backpressure "errors"[2]…
-
by throwing a special
IllegalStateException
:Flux#onBackpressureError
-
by dropping excess values:
Flux#onBackpressureDrop
-
…except the last one seen:
Flux#onBackpressureLatest
-
-
by buffering excess values (bounded or unbounded):
Flux#onBackpressureBuffer
-
…and applying a strategy when bounded buffer also overflows:
Flux#onBackpressureBuffer
with aBufferOverflowStrategy
-
-
5.6. Time
-
I want to associate emissions with a timing (
Tuple2<Long, T>
) measured…-
since subscription:
elapsed
-
since the dawn of time (well, computer time):
timestamp
-
-
I want my sequence to be interrupted if there is too much delay between emissions:
timeout
-
I want to get ticks from a clock, regular time intervals:
Flux#interval
-
I want to emit a single
0
after an initial delay: staticMono.delay
. -
I want to introduce a delay:
-
between each onNext signal:
Mono#delayElement
,Flux#delayElements
-
before the subscription happens:
delaySubscription
-
5.7. Splitting a Flux
-
I want to split a
Flux<T>
into aFlux<Flux<T>>
, by a boundary criteria:-
of size:
window(int)
-
…with overlapping or dropping windows:
window(int, int)
-
-
of time
window(Duration)
-
…with overlapping or dropping windows:
window(Duration, Duration)
-
-
of size OR time (window closes when count is reached or timeout elapsed):
windowTimeout(int, Duration)
-
based on a predicate on elements:
windowUntil
-
……emitting the element that triggered the boundary in the next window (
cutBefore
variant):.windowUntil(predicate, true)
-
…keeping the window open while elements match a predicate:
windowWhile
(non-matching elements are not emitted)
-
-
driven by an arbitrary boundary represented by onNexts in a control Publisher:
window(Publisher)
,windowWhen
-
-
I want to split a
Flux<T>
and buffer elements within boundaries together…-
into
List
:-
by a size boundary:
buffer(int)
-
…with overlapping or dropping buffers:
buffer(int, int)
-
-
by a duration boundary:
buffer(Duration)
-
…with overlapping or dropping buffers:
buffer(Duration, Duration)
-
-
by a size OR duration boundary:
bufferTimeout(int, Duration)
-
by an arbitrary criteria boundary:
bufferUntil(Predicate)
-
…putting the element that triggered the boundary in the next buffer:
.bufferUntil(predicate, true)
-
…buffering while predicate matches and dropping the element that triggered the boundary:
bufferWhile(Predicate)
-
-
driven by an arbitrary boundary represented by onNexts in a control Publisher:
buffer(Publisher)
,bufferWhen
-
-
into an arbitrary "collection" type
C
: use variants likebuffer(int, Supplier<C>)
-
-
I want to split a
Flux<T>
so that element that share a characteristic end up in the same sub-flux:groupBy(Function<T,K>)
TIP: Note that this returns aFlux<GroupedFlux<K, T>>
, each innerGroupedFlux
shares the sameK
key accessible throughkey()
.
5.8. Going back to the Synchronous world
-
I have a
Flux<T>
and I want to:-
block until I can get the first element:
Flux#blockFirst
-
…with a timeout:
Flux#blockFirst(Duration)
-
-
block until I can get the last element (or null if empty):
Flux#blockLast
-
…with a timeout:
Flux#blockLast(Duration)
-
-
synchronously switch to an
Iterable<T>
:Flux#toIterable
-
synchronously switch to a Java 8
Stream<T>
:Flux#toStream
-
-
I have a
Mono<T>
and I want:-
to block until I can get the value:
Mono#block
-
…with a timeout:
Mono#block(Duration)
-
-
a
CompletableFuture<T>
:Mono#toFuture
-
6. Testing
Whether you have written a simple chain of Reactor operators or your very own operator, automated testing is always a good idea.
Reactor comes with a few elements dedicated to testing, gathered into their own
artifact: reactor-test
. You can find that project
on Github
inside of the reactor-core repository.
To use it in your tests, add it as a test dependency:
<dependencies>
<dependency>
<groupId>io.projectreactor</groupId>
<artifactId>reactor-test</artifactId>
<scope>test</scope>
(1)
</dependency>
1 | If you use the BOM, you do not need to specify a <version> … |
dependencies
blockdependencies {
testcompile 'io.projectreactor:reactor-test'
}
The two main uses of reactor-test
are:
-
Testing a sequence follows a given scenario, step-by-step, with
StepVerifier
. -
Producing data in order to test behavior of operators downstream (for example, your own operator) with
TestPublisher
.
6.1. Testing a scenario with StepVerifier
The most common case for testing a Reactor sequence is to have a Flux
or
Mono
defined in your code (for example, returned by a method), and wanting to
test how it behaves when subscribed to.
This translates well to defining a "test scenario", where you define your
expectations in terms of events, step-by-step: what is the next expected event?
Do you expect the Flux
to emit a particular value? Or maybe to do nothing for
the next 300ms? All of that can be expressed through the StepVerifier
API.
For instance, you could have the following utility method in your codebase that
decorates a Flux
:
public <T> Flux<T> appendBoomError(Flux<T> source) {
return source.concatWith(Mono.error(new IllegalArgumentException("boom")));
}
So in order to test it, you want to verify the following scenario:
I expect this
Flux
to first emitfoo
, then emitbar
, then produce an error with the message,boom
. Subscribe and verify these expectations.
In the StepVerifier
API, this translates to:
@Test
public void testAppendBoomError() {
Flux<String> source = Flux.just("foo", "bar"); (1)
StepVerifier.create( (2)
appendBoomError(source)) (3)
.expectNext("foo") (4)
.expectNext("bar")
.expectErrorMessage("boom") (5)
.verify(); (6)
}
1 | Since our method needs a source Flux , Define a simple one for
testing purposes. |
2 | Create a StepVerifier builder that wraps and verify a Flux . |
3 | Here we pass the Flux to be tested (the result of calling our utility
method) |
4 | The first signal we expect to happen upon subscription is an onNext , with
the value foo . |
5 | The last signal we expect to happen is a termination of the sequence with an
onError . The exception should have boom as a message. |
6 | It is important to trigger the test by calling verify() . |
The API is a builder. You start by creating a StepVerifier
and passing the
sequence to be tested. This offers a choice of methods that allow you to:
-
express expectations about the next signals to occur: if any other signal is received (or the content of the signal does not match the expectation), the whole test fails with a meaningful
AssertionError
. For example, you might useexpectNext(T...)
,expectNextCount(long)
. -
consume the next signal. This is used when you want to skip part of the sequence OR when you want to apply a custom
assertion
on the content of the signal (for example, check that there is anonNext
and assert the emitted item is a list of size 5). For example, you might useconsumeNextWith(Consumer<T>)
. -
miscellaneous actions like pausing, running arbitrary code (for example, if you want to manipulate a test specific state or context). For example, you might use
thenAwait(Duration)
,then(Runnable)
.
For terminal events, the corresponding expectation methods (expectComplete()
and expectError()
and all its variants) switch to an API where you cannot
express expectations anymore. In that last step, all you can do is perform some
additional configuration on the StepVerifier
and then trigger the
verification, eg. with verify()
.
What happens at this point is that the StepVerifier subscribes to the tested
Flux
or Mono
and plays the sequence, comparing each new signal with the
next step in the scenario. As long as these match, the test is considered a
success. As soon as there is a discrepancy, an AssertionError
is thrown.
Remember the verify() step, which triggers the verification.
In order to help, the API includes a few shortcut methods that combine the
terminal expectations with a call to verify() : verifyComplete() ,
verifyError() , verifyErrorMessage(String) , and others.
|
Note that if one of the lambda-based expectations throws an AssertionError
, it
is reported as is, failing the test. This is useful for custom assertions.
The verify() method and derived shortcut methods (verifyThenAssertThat ,
verifyComplete() , etc.) has no timeout by default, meaning it can block
indefinitely. You can use StepVerifier.setDefaultTimeout(Duration) to globally
set a timeout for these methods, or specify one on a per-call basis with
verify(Duration) .
|
6.2. Manipulating Time
Another very interesting capability of StepVerifier
is the way it can be used
with time-based operators in order to avoid long run times for corresponding
tests. This is done through the StepVerifier.withVirtualTime
builder.
It looks like this:
StepVerifier.withVirtualTime(() -> Mono.delay(Duration.ofDays(1)))
//... continue expectations here
The way this virtual time feature works is that it plugs in a custom
Scheduler
in Reactor’s Schedulers
factory. Since these timed operators
usually use the default Schedulers.parallel()
scheduler, replacing it with a
VirtualTimeScheduler
does the trick. However, an important pre-requisite is
that the operator be instantiated after the virtual time scheduler has been
activated.
In order to increase the chances this happens correctly, the StepVerifier
does not take a simple Flux
as input. withVirtualTime
takes a Supplier
,
which allows for lazily creating the instance of the tested flux AFTER having
done the scheduler set up.
Take extra care to ensure the Supplier<Publisher<T>> can be used
in a lazy fashion. Otherwise, virtual time is not guaranteed. Especially avoid
instantiating the Flux earlier in the test code and having the Supplier
return that variable. Instead, always instantiate the Flux inside the lambda.
|
There are two expectation methods that deal with time, and they are both valid with or without virtual time:
-
thenAwait(Duration)
pauses the evaluation of steps (allowing a few signals to occur or delays to run out) -
expectNoEvent(Duration)
also lets the sequence play out for a given duration, but fails the test if any signal occurs during that time.
Both methods pause the thread for the given duration in classic mode and advance the virtual clock instead in virtual mode.
expectNoEvent also considers the subscription as an event. If you use
it as a first step, it usually fails because the subscription signal is
detected. Use expectSubscription().expectNoEvent(duration) instead.
|
In order to quickly evaluate the behavior of our Mono.delay
above, we can
finish writing our code like this:
StepVerifier.withVirtualTime(() -> Mono.delay(Duration.ofDays(1)))
.expectSubscription() (1)
.expectNoEvent(Duration.ofDays(1)) (2)
.expectNext(0) (3)
.verifyComplete(); (4)
1 | See the tip above. |
2 | Expect nothing to happen during a full day… |
3 | …then expect a delay that emits 0 … |
4 | …then expect completion (and trigger the verification). |
We could have used thenAwait(Duration.ofDays(1))
above, but expectNoEvent
has the benefit of guaranteeing that nothing happened earlier than it should
have.
Note that verify()
returns a Duration
value. This is the real-time
duration of the entire test.
Virtual time is not a silver bullet. Keep in mind that all
Schedulers are replaced with the same VirtualTimeScheduler . In some cases,
you can lock the verification process because the virtual clock is not moved
forward before an expectation is expressed, resulting in the expectation
waiting on data that can only be produced by advancing time. In most cases,
you need to advance the virtual clock for sequences to emit. Virtual time also
gets very limited with infinite sequences, which might hog the thread on which
both the sequence and its verification run.
|
6.3. Performing Post-execution Assertions with StepVerifier
After having described the final expectation of your scenario, you can switch to
a complementary assertion API instead of triggering verify()
. To do so, use
verifyThenAssertThat()
instead.
This method returns a StepVerifier.Assertions
object, which you can use to
assert a few elements of state once the whole scenario has played out
successfully (since it also calls verify()
). Typical (albeit advanced)
usage is to capture elements that have been dropped by some operator and assert
them (see the section on Hooks).
6.4. Manually Emitting with TestPublisher
For more advanced test cases, it might be useful to have complete mastery over the source of data, in order to trigger finely chosen signals that closely match the particular situation you want to test.
Another situation is when you have implemented your own operator and you want to verify how it behaves with regards to the Reactive Streams specification, especially if its source is not well behaved.
For both cases, reactor-test
offers the TestPublisher
class. This is a
Publisher<T>
that lets you programmatically trigger various signals:
-
next(T)
andnext(T, T...)
triggers 1-nonNext
signals. -
emit(T...)
does the same AND doescomplete()
. -
complete()
terminates with anonComplete
signal. -
error(Throwable)
terminates with anonError
signal.
A well behaved TestPublisher
can be obtained through the create
factory
method. Also, a misbehaving TestPublisher
can be created using the
createNonCompliant
factory method. The latter takes a number of Violation
enums that define which parts of the specification the publisher can overlook.
For instance:
-
REQUEST_OVERFLOW
: Allowsnext
calls to be made despite an insufficient request, without triggering anIllegalStateException
. -
ALLOW_NULL
: Allowsnext
calls to be made with anull
value without triggering aNullPointerException
. -
CLEANUP_ON_TERMINATE
: Allows termination signals to be sent several times in a row. This includescomplete()
,error()
andemit()
.
Finally, the TestPublisher
keeps track of internal state after subscription,
which can be asserted through its various assert*
methods.
It can be used as a Flux
or Mono
by using the conversion methods flux()
and mono()
.
Suggest Edit to "Testing"
7. Debugging Reactor
Switching from an imperative and synchronous programming paradigm to a reactive and asynchronous one can sometimes be daunting. One of the steepest steps in the learning curve is how to analyze and debug when something goes wrong.
In the imperative world, this is usually pretty straightforward: just read the stacktrace and you see where the problem originated and more: Was it entirely a failure of your code? Did the failure occur in some library code? If so, what part of your code called the library, potentially passing in improper parameters that ultimately caused the failure?
7.1. The typical Reactor Stack Trace
When you shift to asynchronous code, things can get much more complicated.
Consider the following stack trace:
java.lang.IndexOutOfBoundsException: Source emitted more than one item
at reactor.core.publisher.MonoSingle$SingleSubscriber.onNext(MonoSingle.java:120)
at reactor.core.publisher.FluxFlatMap$FlatMapMain.emitScalar(FluxFlatMap.java:380)
at reactor.core.publisher.FluxFlatMap$FlatMapMain.onNext(FluxFlatMap.java:349)
at reactor.core.publisher.FluxMapFuseable$MapFuseableSubscriber.onNext(FluxMapFuseable.java:119)
at reactor.core.publisher.FluxRange$RangeSubscription.slowPath(FluxRange.java:144)
at reactor.core.publisher.FluxRange$RangeSubscription.request(FluxRange.java:99)
at reactor.core.publisher.FluxMapFuseable$MapFuseableSubscriber.request(FluxMapFuseable.java:172)
at reactor.core.publisher.FluxFlatMap$FlatMapMain.onSubscribe(FluxFlatMap.java:316)
at reactor.core.publisher.FluxMapFuseable$MapFuseableSubscriber.onSubscribe(FluxMapFuseable.java:94)
at reactor.core.publisher.FluxRange.subscribe(FluxRange.java:68)
at reactor.core.publisher.FluxMapFuseable.subscribe(FluxMapFuseable.java:67)
at reactor.core.publisher.FluxFlatMap.subscribe(FluxFlatMap.java:98)
at reactor.core.publisher.MonoSingle.subscribe(MonoSingle.java:58)
at reactor.core.publisher.Mono.subscribeWith(Mono.java:2668)
at reactor.core.publisher.Mono.subscribe(Mono.java:2629)
at reactor.core.publisher.Mono.subscribe(Mono.java:2604)
at reactor.core.publisher.Mono.subscribe(Mono.java:2582)
at reactor.guide.GuideTests.debuggingCommonStacktrace(GuideTests.java:722)
There is a lot going on there. We get an IndexOutOfBoundsException
, which tells
us that a "source emitted more than one item".
We can probably quickly come to assume that this source is a Flux/Mono, as
confirmed by the line below that mentions MonoSingle
. So it appears to be some
sort of complaint from a single
operator.
Referring to the Javadoc for Mono#single
operator, we see that single
has a
contract: the source must emit exactly one element. It appears we had a source
that emitted more than one and thus violated that contract.
Can we dig deeper and identify that source? The following rows don’t seem very
helpful. They take us on a travel inside the internals of what seems to be a
reactive chain, through subscribes
and requests
.
By skimming over these rows, we can at least start to form a picture of the kind
of chain that went wrong: it seems to involve a MonoSingle
, a FluxFlatMap
and a FluxRange
(each get several rows in the trace, but overall these 3
classes are involved). So a range().flatMap().single()
chain maybe?
But what if we use that pattern a lot in our application? This still doesn’t
tell us much, and simply searching for single
isn’t going to find the
problem. Then the last line refers to some of our code. Finally, we are getting
close.
Hold on, though. When we go to the source file, all we see is that a
pre-existing Flux
is subscribed to:
toDebug.subscribe(System.out::println, Throwable::printStackTrace);
All of this happened at subscription time, but the Flux
itself was not
declared there. Worse, when we go to where the variable is declared, we see:
public Mono<String> toDebug; //please overlook the public class attribute
The variable is not instantiated where it is declared. We must assume a worst-case scenario where we find out that there could be a few different code paths that set it in the application. We’re still unsure of which one caused the issue.
This is kind of the Reactor equivalent of a runtime error, as opposed to a compilation error. |
What we want to find out more easily is where the operator was added into the
chain, where the Flux
was declared. We usually refer to that as the assembly
of the Flux.
7.2. Activating debug mode
Even though the stacktrace was still able to convey some information for someone with a bit of experience, we can see that it is not ideal by itself in more advanced cases.
Fortunately, Reactor comes with a debugging-oriented capability of assembly-time instrumentation.
This is done by customizing the Hook.onOperator
hook at application start
(or at least before the incriminated flux or mono can be instantiated), like so:
Hooks.onOperatorDebug();
The idea is that this starts instrumenting the calls to the Flux
(and
Mono
) operator methods (where they are assembled into the chain) by wrapping
the construction of the operator and capturing a stacktrace there. Since this is
done when the operator chain is declared, the hook should be activated before
that, so the safest way is to activate it right at the start of your
application.
Later on, if an exception occurs, the failing operator is able to refer to that capture and append it to the stack trace.
In the next section, we see how the stack trace differs and how to interpret that new information.
7.3. Reading a Stack Trace in Debug Mode
Reusing our initial example but activating the operatorStacktrace
debug
feature, the stack trace is now this:
java.lang.IndexOutOfBoundsException: Source emitted more than one item
at reactor.core.publisher.MonoSingle$SingleSubscriber.onNext(MonoSingle.java:120)
at reactor.core.publisher.FluxOnAssembly$OnAssemblySubscriber.onNext(FluxOnAssembly.java:314) (1)
...
(2)
...
at reactor.core.publisher.Mono.subscribeWith(Mono.java:2668)
at reactor.core.publisher.Mono.subscribe(Mono.java:2629)
at reactor.core.publisher.Mono.subscribe(Mono.java:2604)
at reactor.core.publisher.Mono.subscribe(Mono.java:2582)
at reactor.guide.GuideTests.debuggingActivated(GuideTests.java:727)
Suppressed: reactor.core.publisher.FluxOnAssembly$OnAssemblyException: (3)
Assembly trace from producer [reactor.core.publisher.MonoSingle] : (4)
reactor.core.publisher.Flux.single(Flux.java:5335)
reactor.guide.GuideTests.scatterAndGather(GuideTests.java:689)
reactor.guide.GuideTests.populateDebug(GuideTests.java:702)
org.junit.rules.TestWatcher$1.evaluate(TestWatcher.java:55)
org.junit.rules.RunRules.evaluate(RunRules.java:20)
Error has been observed by the following operator(s): (5)
|_ Flux.single(TestWatcher.java:55) (6)
1 | This is new: We see the wrapper operator that captures the stack. |
2 | Apart from that, the first section of the stack trace is still the same for the most part, showing a bit of the operator’s internals (so we removed a bit of the snippet here) |
3 | This is where the new stuff from debugging mode starts to appear. |
4 | First, we get some details on where the operator was assembled. |
5 | We also get a traceback of the error as it propagated through the operator chain, from first to last (error site to subscribe site). |
6 | Each operator that saw the error is mentioned along with the class and line where it originated. If an operator is assembled from within Reactor code, the latter would be omitted. |
As you can see, the captured stack trace is appended to the original error as a
suppressed OnAssemblyException
. There are two parts to it, but the first
section is the most interesting. It shows the path of construction for the
operator that triggered the exception. Here it shows that the single
that
caused our issue was created in the scatterAndGather
method, itself called
from a populateDebug
method that got executed through JUnit.
Now that we are armed with enough information to find the culprit, we can have
a meaningful look at that scatterAndGather
method:
private Mono<String> scatterAndGather(Flux<String> urls) {
return urls.flatMap(url -> doRequest(url))
.single(); (1)
}
1 | Sure enough, here is our single . |
Now we can see what the root cause of the error was: a flatMap
that performs
several HTTP calls to a few URLs is chained with single
, which is too
restrictive. After a short git blame
and a quick discussion with the author of
that line, we find out he meant to use the less restrictive take(1)
instead.
We have solved our problem.
Error has been observed by the following operator(s):
That second part of the debug stack trace was not necessarily interesting in
this particular example, because the error was actually happening in the last
operator in the chain (the one closest to subscribe
). Considering another
example might make it more clear:
FakeRepository.findAllUserByName(Flux.just("pedro", "simon", "stephane"))
.transform(FakeUtils1.applyFilters)
.transform(FakeUtils2.enrichUser)
.blockLast();
Now imagine that, inside findAllUserByName
, there is a map
that fails. Here
we would see the following final traceback:
Error has been observed by the following operator(s):
|_ Flux.map(FakeRepository.java:27)
|_ Flux.map(FakeRepository.java:28)
|_ Flux.filter(FakeUtils1.java:29)
|_ Flux.transform(GuideDebuggingExtraTests.java:41)
|_ Flux.elapsed(FakeUtils2.java:30)
|_ Flux.transform(GuideDebuggingExtraTests.java:42)
This corresponds to a flattened version of the chain of operators or rather of the section of the chain that gets notified of the error:
-
The exception originates in the first
map
. -
It is seen by a second
map
(both in fact correspond to thefindAllUserByName
method). -
Then it is seen by a
filter
and atransform
, which indicate that part of the chain is constructed via a reusable transformation function (here, theapplyFilters
utility method). -
Finally, it is seen by an
elapsed
and atransform
. Once again,elapsed
is what is applied by the transformation function of that second transform.
We are dealing with a form of instrumentation here, and creating a stack trace is costly. That is why this debugging feature should only be activated in a controlled manner, as a last resort.
7.3.1. The checkpoint()
Alternative
The debug mode is global and affects every single operator assembled into a
Flux
or Mono
inside the application. This has the benefit of allowing
after-the-fact debugging: whatever the error, we will obtain additional info
to debug it.
As we saw earlier, this global knowledge comes at the cost of an impact on performance (due to the number of populated stack traces). That cost can be reduced if we have an idea of likely problematic operators. However, we usually don’t know which operators are likely to be problematic unless we observed an error in the wild, saw we were missing assembly information, and then modified the code to activate assembly tracking, hoping to observe the same error again.
In that scenario, we have to switch into debugging mode and make preparations in order to better observe a second occurrence of the error, this time capturing all the additional information.
If you can identify reactive chains that you assemble in your application for
which serviceability is critical, a mix of both techniques can be achieved
with the checkpoint()
operator.
You can chain this operator into a method chain. The checkpoint
operator
works like the hook version, but only for its link of that particular chain.
There is also a checkpoint(String)
variant that lets you add a unique String
identifier to the assembly traceback. This way, the stack trace is omitted and
you rely on the description to identify the assembly site. A
checkpoint(String)
imposes less processing cost than a regular checkpoint
.
checkpoint(String)
includes "light" in its output (which can be handy when
searching), as shown here:
... Suppressed: reactor.core.publisher.FluxOnAssembly$OnAssemblyException: Assembly site of producer [reactor.core.publisher.FluxElapsed] is identified by light checkpoint [light checkpoint identifier].
Last but not least, if you want to add a more generic description to the
checkpoint but still rely on the stack trace mechanism to identify the assembly
site, you can force that behavior using the checkpoint("description", true)
version. We’re now back to the initial message for the traceback, augmented
with a description
, as shown here:
Suppressed: reactor.core.publisher.FluxOnAssembly$OnAssemblyException: Assembly trace from producer [reactor.core.publisher.ParallelSource], described as [descriptionCorrelation1234] : (1) reactor.core.publisher.ParallelFlux.checkpoint(ParallelFlux.java:174) reactor.core.publisher.FluxOnAssemblyTest.parallelFluxCheckpointDescription(FluxOnAssemblyTest.java:159) Error has been observed by the following operator(s): |_ ParallelFlux.checkpointnull
1 | descriptionCorrelation1234 is the description provided in the checkpoint . |
The description could be a static identifier or user-readable description, or a wider correlation ID (for instance, coming from a header in the case of an HTTP request).
When both global debugging and local checkpoint() are enabled,
checkpointed snapshot stacks are appended as suppressed error output after the
observing operator graph and following the same declarative order.
|
7.4. Logging a Stream
In addition to stack trace debugging and analysis, another powerful tool to have in your toolkit is the ability to trace and log events in an asynchronous sequence.
The log()
operator can do just that. Chained inside a sequence, it will peek
at every event of the Flux/Mono upstream of it (including onNext
, onError
and onComplete
but also subscriptions, cancellations and
requests).
For instance, suppose we have logback activated and configured and a chain
like range(1,10).take(3)
. By placing a log()
just before the take, we can
get some insight as to how it works and what kind of events it propagates
upstream to the range:
Flux<Integer> flux = Flux.range(1, 10)
.log()
.take(3);
flux.subscribe();
This prints out (through the logger’s console appender):
10:45:20.200 [main] INFO reactor.Flux.Range.1 - | onSubscribe([Synchronous Fuseable] FluxRange.RangeSubscription) (1) 10:45:20.205 [main] INFO reactor.Flux.Range.1 - | request(unbounded) (2) 10:45:20.205 [main] INFO reactor.Flux.Range.1 - | onNext(1) (3) 10:45:20.205 [main] INFO reactor.Flux.Range.1 - | onNext(2) 10:45:20.205 [main] INFO reactor.Flux.Range.1 - | onNext(3) 10:45:20.205 [main] INFO reactor.Flux.Range.1 - | cancel() (4)
Here, in addition to the logger’s own formatter (time, thread, level,
message), the log()
operator outputs a few things in its own format:
1 | reactor.Flux.Range.1 is an automatic category for the log, in case you
use the operator several times in a chain. It allows you to distinguish which
operator’s events are being logged (here, the range ). This can be overwritten
with your own custom category using the log(String) method signature. After a
few separating characters, the actual event gets printed. Here we get
onSubscribe , request , 3 onNext , and cancel . For the first line,
onSubscribe , we get the implementation of the Subscriber , which usually
corresponds to the operator-specific implementation. Between square brackets,
we get additional information if the operator can be automatically optimized
via synchronous or asynchronous fusion (see the appendix on Micro-fusion). |
2 | On the second line, we can see that an unbounded request was propagated up from downstream. |
3 | Then the range sends three values in a row. |
4 | On the last line, we see a cancel() . |
The last line, (4), is the most interesting. We can see the take
in action
there. It operates by cutting the sequence short after it has seen enough
elements emitted. In short, take()
causes the source to cancel()
once it has
emitted the user-requested amount.
8. Advanced features and concepts
8.1. Mutualizing operator usage
From a clean-code perspective, code reuse is generally a good thing. Reactor offers a few patterns that will help you reuse and mutualize code, notably for operators or combination of operators that you might want to apply regularly in your codebase. If you think of a chain of operators as a recipe, you can create a cookbook of operator recipes.
8.1.1. transform
The transform
operator lets you encapsulate a piece of an operator chain into
a function. That function will be applied to an original operator chain at
assembly time to augment it with the encapsulated operators. Doing so applies
the same operations to all the subscribers of a sequence and is basically
equivalent to chaining the operators directly. Here’s an example:
Function<Flux<String>, Flux<String>> filterAndMap =
f -> f.filter(color -> !color.equals("orange"))
.map(String::toUpperCase);
Flux.fromIterable(Arrays.asList("blue", "green", "orange", "purple"))
.doOnNext(System.out::println)
.transform(filterAndMap)
.subscribe(d -> System.out.println("Subscriber to Transformed MapAndFilter: "+d));
This produces:
blue Subscriber to Transformed MapAndFilter: BLUE green Subscriber to Transformed MapAndFilter: GREEN orange purple Subscriber to Transformed MapAndFilter: PURPLE
8.1.2. compose
The compose
operator is very similar to transform
and also lets you
encapsulate operators in a function. The major difference is that this function
is applied to the original sequence on a per-subscriber basis. It means that
the function can actually produce a different operator chain for each
subscription (by maintaining some state). Here’s an example:
AtomicInteger ai = new AtomicInteger();
Function<Flux<String>, Flux<String>> filterAndMap = f -> {
if (ai.incrementAndGet() == 1) {
return f.filter(color -> !color.equals("orange"))
.map(String::toUpperCase);
}
return f.filter(color -> !color.equals("purple"))
.map(String::toUpperCase);
};
Flux<String> composedFlux =
Flux.fromIterable(Arrays.asList("blue", "green", "orange", "purple"))
.doOnNext(System.out::println)
.compose(filterAndMap);
composedFlux.subscribe(d -> System.out.println("Subscriber 1 to Composed MapAndFilter :"+d));
composedFlux.subscribe(d -> System.out.println("Subscriber 2 to Composed MapAndFilter: "+d));
This outputs:
blue Subscriber 1 to Composed MapAndFilter :BLUE green Subscriber 1 to Composed MapAndFilter :GREEN orange purple Subscriber 1 to Composed MapAndFilter :PURPLE blue Subscriber 2 to Composed MapAndFilter: BLUE green Subscriber 2 to Composed MapAndFilter: GREEN orange Subscriber 2 to Composed MapAndFilter: ORANGE purple
8.2. Hot vs Cold
So far we have considered that all Flux
(and Mono
) are the same: they all
represent an asynchronous sequence of data, and nothing happens before you
subscribe.
Really, though, there are two broad families of publishers: cold and hot.
The description above applies to the cold family of publishers. They generate data anew for each subscription If no subscription is created, then data never gets generated.
Think of an HTTP request: each new subscriber will trigger an HTTP call, but no call is made if no one is interested in the result.
Hot publishers, on the other hand, don’t depend on any number of
subscribers. They might start publishing data right away and would continue
doing so whenever a new Subscriber
comes in (in which case said subscriber
would only see new elements emitted after it subscribed). For hot
publishers, something does indeed happen before you subscribe.
One example of the few hot operators in Reactor is just
: it directly captures
the value at assembly time and will replay it to anybody subscribing to it
later on. To re-use the HTTP call analogy, if the captured data is the result
of an HTTP call then only one network call is made, when instantiating just.
To transform just
into a cold publisher, you can use defer
. It defers the
HTTP request in our example to subscription time (and would result in a
separate network call for each new subscription).
Most other hot publishers in Reactor extend Processor .
|
Contrast these two other examples:
Flux<String> source = Flux.fromIterable(Arrays.asList("blue", "green", "orange", "purple"))
.doOnNext(System.out::println)
.filter(s -> s.startsWith("o"))
.map(String::toUpperCase);
source.subscribe(d -> System.out.println("Subscriber 1: "+d));
source.subscribe(d -> System.out.println("Subscriber 2: "+d));
This first example produces:
blue green orange Subscriber 1: ORANGE purple blue green orange Subscriber 2: ORANGE purple
Both subscribers catch all four colors, because each subscriber causes the
process defined by the operators on the Flux
to run.
Compare the first example to this second example:
UnicastProcessor<String> hotSource = UnicastProcessor.create();
Flux<String> hotFlux = hotSource.publish()
.autoConnect()
.map(String::toUpperCase);
hotFlux.subscribe(d -> System.out.println("Subscriber 1 to Hot Source: "+d));
hotSource.onNext("blue");
hotSource.onNext("green");
hotFlux.subscribe(d -> System.out.println("Subscriber 2 to Hot Source: "+d));
hotSource.onNext("orange");
hotSource.onNext("purple");
hotSource.onComplete();
The second example produces:
Subscriber 1 to Hot Source: BLUE Subscriber 1 to Hot Source: GREEN Subscriber 1 to Hot Source: ORANGE Subscriber 2 to Hot Source: ORANGE Subscriber 1 to Hot Source: PURPLE Subscriber 2 to Hot Source: PURPLE
Subscriber 1 catches all four colors. Subscriber 2, having been created after the first two colors were produced, catches only the last two colors. This difference accounts for the doubling of "ORANGE" and "PURPLE" in the output. The process described by the operators on this Flux runs regardless of when subscriptions have been attached.
8.3. Broadcast to multiple subscribers with ConnectableFlux
Sometimes, you want to not only defer some processing to the subscription time of one subscriber, but you might actually want for several of them to rendezvous and then trigger the subscription / data generation.
This is what ConnectableFlux
is made for. Two main patterns are covered in the
Flux
API that return a ConnectableFlux
: publish
and replay
.
-
publish
dynamically tries to respect the demand from its various subscribers, in terms of backpressure, by forwarding these requests to the source. Most notably, if any subscriber has a pending demand of0
, publish will pause its requesting to the source. -
replay
buffers data seen through the first subscription, up to configurable limits (in time and buffer size). It will replay the data to subsequent subscribers.
A ConnectableFlux
offers additional methods to manage subscriptions downstream
vs subscription to the original source. For instance:
-
connect
can be called manually once you’ve reached enough subscriptions to the flux. That will trigger the subscription to the upstream source. -
autoConnect(n)
can do the same job automatically oncen
subscriptions have been made. -
refCount(n)
not only automatically tracks incoming subscriptions but also detects when these subscriptions are cancelled. If not enough subscribers are tracked, the source is "disconnected", causing a new subscription to the source later if additional subscribers appear.
Consider the following example:
Flux<Integer> source = Flux.range(1, 3)
.doOnSubscribe(s -> System.out.println("subscribed to source"));
ConnectableFlux<Integer> co = source.publish();
co.subscribe(System.out::println, e -> {}, () -> {});
co.subscribe(System.out::println, e -> {}, () -> {});
System.out.println("done subscribing");
Thread.sleep(500);
System.out.println("will now connect");
co.connect();
This code produces:
done subscribing will now connect subscribed to source 1 1 2 2 3 3
With autoConnect
:
Flux<Integer> source = Flux.range(1, 3)
.doOnSubscribe(s -> System.out.println("subscribed to source"));
Flux<Integer> autoCo = source.publish().autoConnect(2);
autoCo.subscribe(System.out::println, e -> {}, () -> {});
System.out.println("subscribed first");
Thread.sleep(500);
System.out.println("subscribing second");
autoCo.subscribe(System.out::println, e -> {}, () -> {});
Which outputs:
subscribed first subscribing second subscribed to source 1 1 2 2 3 3
8.4. Three Sorts of Batching
When you have lots of elements and you want to separate them into batches, you
have three broad solutions in Reactor: grouping, windowing, and buffering.
These three are conceptually close, because they redistribute a Flux<T>
into
an aggregate. Grouping and windowing create a Flux<Flux<T>>
, while buffering
aggregates into Collection<T>
.
8.4.1. Grouping: Flux<GroupedFlux<T>>
Grouping is the act of splitting the source Flux<T>
into multiple batches by
a key.
The associated operator is groupBy
.
Each group is represented as a GroupedFlux<T>
, which lets you retrieve the
key via its key()
method.
There is no necessary continuity in the content of the groups. Once a source element produces a new key, the group for this key is opened and elements that match the key end up in said group (several groups could be open at the same time).
This means that groups:
-
Are always disjoint (a source element belongs to 1 and only 1 group).
-
Can contain elements from different places in the original sequence.
-
Are never empty.
StepVerifier.create(
Flux.just(1, 3, 5, 2, 4, 6, 11, 12, 13)
.groupBy(i -> i % 2 == 0 ? "even" : "odd")
.concatMap(g -> g.defaultIfEmpty(-1) //if empty groups, show them
.map(String::valueOf) //map to string
.startWith(g.key())) //start with the group's key
)
.expectNext("odd", "1", "3", "5", "11", "13")
.expectNext("even", "2", "4", "6", "12")
.verifyComplete();
Grouping is best suited for when you have a medium to low number of
groups. The groups must also imperatively be consumed (eg. in a flatMap ) so
that groupBy will continue fetching data from upstream and feeding more
groups. Sometimes these two constraints multiply and lead to hangs, like when
you have a high cardinality and the concurrency of the flatMap consuming the
groups is too low.
|
8.4.2. Windowing: Flux<Flux<T>>
Windowing is the act of splitting the source Flux<T>
into windows, by
criteria of size, time, boundary-defining predicates, or boundary-defining
Publisher
.
The associated operators are window
, windowTimeout
, windowUntil
,
windowWhile
and windowWhen
.
A major difference with groupBy
is that windows are always sequential. No
more than 2 windows can be open at the same time.
They can overlap though. For instance, there is a variant with a maxSize
and skip
parameters. The maxSize is the number of elements after which a
window will close, and the skip parameter is the number of elements in the
source after which a new window is opened. So if maxSize > skip
, a new window
will open before the previous one closes and the 2 windows will overlap.
This example shows overlapping windows:
StepVerifier.create(
Flux.range(1, 10)
.window(5, 3) //overlapping windows
.concatMap(g -> g.defaultIfEmpty(-1)) //show empty windows as -1
)
.expectNext(1, 2, 3, 4, 5)
.expectNext(4, 5, 6, 7, 8)
.expectNext(7, 8, 9, 10)
.expectNext(10)
.verifyComplete();
With the reverse configuration (maxSize < skip ), some elements from
the source would be dropped and not be part of any window.
|
In the case of predicate-based windowing via windowUntil
and windowWhile
,
having subsequent source elements that don’t match the predicate can also lead
to empty windows, as demonstrated in this example:
StepVerifier.create(
Flux.just(1, 3, 5, 2, 4, 6, 11, 12, 13)
.windowWhile(i -> i % 2 == 0)
.concatMap(g -> g.defaultIfEmpty(-1))
)
.expectNext(-1, -1, -1) //respectively triggered by odd 1 3 5
.expectNext(2, 4, 6) // triggered by 11
.expectNext(12) // triggered by 13
.expectNext(-1) // empty completion window, would have been omitted if all matched before onComplete
.verifyComplete();
8.4.3. Buffering: Flux<List<T>>
Buffering is very close to the behavior of windowing, with a twist: instead of
emitting windows (which each are a Flux<T>
), it emits buffers (which are
Collection<T>
- by default List<T>
).
The operators for buffering mirror those for windowing: buffer
,
bufferTimeout
, bufferUntil
, bufferWhile
, and bufferWhen
.
Where the corresponding windowing operator would open a window, a buffering operator would create a new collection and start adding elements to it. Where a window would close, the buffering operator would emit the collection.
Buffering can also lead to dropping source elements or having overlapping buffers, as shown here:
StepVerifier.create(
Flux.range(1, 10)
.buffer(5, 3) //overlapping buffers
)
.expectNext(Arrays.asList(1, 2, 3, 4, 5))
.expectNext(Arrays.asList(4, 5, 6, 7, 8))
.expectNext(Arrays.asList(7, 8, 9, 10))
.expectNext(Collections.singletonList(10))
.verifyComplete();
Unlike in windowing, bufferUntil
and bufferWhile
don’t emit an empty
buffer, as shown here:
StepVerifier.create(
Flux.just(1, 3, 5, 2, 4, 6, 11, 12, 13)
.bufferWhile(i -> i % 2 == 0)
)
.expectNext(Arrays.asList(2, 4, 6)) // triggered by 11
.expectNext(Collections.singletonList(12)) // triggered by 13
.verifyComplete();
8.5. Parallelize work with ParallelFlux
With multi-core architectures being a commodity nowadays, being able to easily
parallelize work is very important. Reactor helps with that by providing a
special type, ParallelFlux
, that exposes operators that are optimized for
parallelized work.
To obtain a ParallelFlux
, one can use the parallel()
operator on any Flux
.
This will not by itself parallelize the work however, but rather will divide
the workload into "rails" (by default as many rails as there are CPU cores).
In order to tell the resulting ParallelFlux where to execute each rail (and
by extension to execute rails in parallel) you have to use runOn(Scheduler)
.
Note that there is a recommended dedicated Scheduler for parallel work:
Schedulers.parallel()
.
Compare the next two code examples:
Flux.range(1, 10)
.parallel(2) (1)
.subscribe(i -> System.out.println(Thread.currentThread().getName() + " -> " + i));
1 | We force a number of rails instead of relying on the number of CPU cores. |
with:
Flux.range(1, 10)
.parallel(2)
.runOn(Schedulers.parallel())
.subscribe(i -> System.out.println(Thread.currentThread().getName() + " -> " + i));
The first code block produces:
main -> 1 main -> 2 main -> 3 main -> 4 main -> 5 main -> 6 main -> 7 main -> 8 main -> 9 main -> 10
The second correctly parallelizes on two threads, as shown here:
parallel-1 -> 1 parallel-2 -> 2 parallel-1 -> 3 parallel-2 -> 4 parallel-1 -> 5 parallel-2 -> 6 parallel-1 -> 7 parallel-1 -> 9 parallel-2 -> 8 parallel-2 -> 10
If, once you’ve processed your sequence in parallel, you want to revert back to a
"normal" Flux
and apply the rest of the operator chain in a sequential manner,
you can use the sequential()
method on ParallelFlux
.
Note that sequential()
is implicitly applied if you subscribe
to the ParallelFlux
with a Subscriber
, but not when using the lambda-based variants of subscribe
.
Note also that subscribe(Subscriber<T>)
merges all the rails, while
subscribe(Consumer<T>)
runs all the rails. If the subscribe()
method has a
lambda, each lambda is executed as many times as there are rails.
You can also access individual rails or "groups" as a Flux<GroupedFlux<T>>
via
the groups()
method and apply additional operators to them via the
composeGroup()
method.
9. FAQ, Best Practices, and "How do I…?"
9.1. How do I wrap a synchronous, blocking call?
It’s often the case that a source of information is synchronous and blocking. To deal with such sources in your Reactor applications, apply the following pattern:
Mono blockingWrapper = Mono.fromCallable(() -> { (1)
return /* make a remote synchronous call */ (2)
});
blockingWrapper = blockingWrapper.subscribeOn(Schedulers.elastic()); (3)
1 | Create a new Mono by using fromCallable . |
2 | Return the asynchronous, blocking resource. |
3 | Ensure each subscription will happen on a dedicated single-threaded worker
from Schedulers.elastic() . |
You should use a Mono because the source returns one value. You should use
Schedulers.elastic
because it creates a dedicated thread to wait for the
blocking resource without tying up some other resource.
Note that subscribeOn
does not subscribe to the Mono
. It specifies what
kind of Scheduler
to use when a subscribe call happens.
9.2. I used an operator on my Flux
but it doesn’t seem to apply. What gives?
Make sure that the variable you .subscribe()
to has been affected by the
operators you think should have been applied to it.
Reactor operators are decorators. They return a different instance that wraps the source sequence and add behavior. That is why the preferred way of using operators is to chain the calls.
Compare the following two examples:
Flux<String> flux = Flux.just("foo", "chain");
flux.map(secret -> secret.replaceAll(".", "*")); (1)
flux.subscribe(next -> System.out.println("Received: " + next));
1 | The mistake is here. The result isn’t attached to the flux variable. |
Flux<String> flux = Flux.just("foo", "chain");
flux = flux.map(secret -> secret.replaceAll(".", "*"));
flux.subscribe(next -> System.out.println("Received: " + next));
This sample is even better (because it’s simpler):
Flux<String> secrets = Flux
.just("foo", "chain")
.map(secret -> secret.replaceAll(".", "*"))
.subscribe(next -> System.out.println("Received: " + next));
The first version will output:
Received: foo
Received: chain
Whereas the two other versions will output the expected:
Received: ***
Received: *****
9.3. My Mono
zipWith
/zipWhen
is never called
myMethod.process("a") // this method returns Mono<Void>
.zipWith(myMethod.process("b"), combinator) //this is never called
.subscribe();
If the source Mono
is either empty
or a Mono<Void>
(a Mono<Void>
is
empty for all intents and purposes), some combinations will never be called.
This is the typical case for any transformer like the zipWith
operator, which by
definition needs an
element from each source to produce its output. If any of the sources are
empty, and
produces an empty sequence as well, so be careful of its usage.
For example, using zipWith()
after a then()
may cause this problem.
This is even more true of the variants of and
that take a Function
, meaning
that the associated Mono is chosen lazily depending on the incoming value
(which never comes in the cases of empty or Void
sequences).
You can use .defaultIfEmpty(T)
to replace an empty sequence of T
(not a
Void
sequence) with a default value, which could help avoid some of these
situations. Here’s an example this:
defaultIfEmpty
before zipWhen
myMethod.emptySequenceForKey("a") // this method returns empty Mono<String>
.defaultIfEmpty("") // this converts empty sequence to just the empty String
.zipWhen(aString -> myMethod.process("b")) //this is called with the empty String
.subscribe();
9.4. How to use retryWhen
to emulate retry(3)
?
The retryWhen
operator can be quite complex. Hopefully this snippet of code
can help you understand how it works by attempting to emulate a simpler
retry(3)
:
Flux<String> flux =
Flux.<String>error(new IllegalArgumentException())
.retryWhen(companion -> companion
.zipWith(Flux.range(1, 4), (1)
(error, index) -> { (2)
if (index < 4) return index; (3)
else throw Exceptions.propagate(error); (4)
})
);
1 | Trick one: use zip and a range of "number of acceptable retries + 1". |
2 | The zip function lets you count the retries while keeping track of
the original error. |
3 | To allow for 3 retries, indexes before 4 return a value to emit… |
4 | …but, in order to terminate the sequence in error, we throw the original exception after these 3 retries. |
9.5. How to use retryWhen
for exponential backoff?
Exponential backoff produces retry attempts with a growing delay between each of the attempts, so as not to overload the source systems and risk an all out crash. The rationale is that if the source produces an error, it is already in an unstable state and not likely to immediately recover from it. So blindly retrying immediately is likely to produce yet another error and add to the instability.
Here is how to implement an exponential backoff that delays retries and increase the delay between each attempt (pseudocode: delay = attempt number * 100 milliseconds):
Flux<String> flux =
Flux.<String>error(new IllegalArgumentException())
.retryWhen(companion -> companion
.doOnNext(s -> System.out.println(s + " at " + LocalTime.now())) (1)
.zipWith(Flux.range(1, 4), (error, index) -> { (2)
if (index < 4) return index;
else throw Exceptions.propagate(error);
})
.flatMap(index -> Mono.delay(Duration.ofMillis(index * 100))) (3)
.doOnNext(s -> System.out.println("retried at " + LocalTime.now())) (4)
);
1 | We log the time of errors. |
2 | We use the retryWhen + zipWith trick to propagate the error after 3
retries. |
3 | Through flatMap , we cause a delay that depends on the attempt’s index. |
4 | We also log the time at which the retry happens. |
When subscribed to, this fails and terminates after printing out:
java.lang.IllegalArgumentException at 18:02:29.338 retried at 18:02:29.459 (1) java.lang.IllegalArgumentException at 18:02:29.460 retried at 18:02:29.663 (2) java.lang.IllegalArgumentException at 18:02:29.663 retried at 18:02:29.964 (3) java.lang.IllegalArgumentException at 18:02:29.964
1 | first retry after about 100ms |
2 | second retry after about 200ms |
3 | third retry after about 300ms |
9.6. How do I ensure thread affinity using publishOn()
?
As described in Schedulers, publishOn()
can be used to switch
execution contexts. The publishOn
operator influences the threading context
where the rest of the operators in the chain below it will execute, up to a new
occurrence of publishOn
. So the placement of publishOn
is significant.
For instance, in the example below, the transform
function in map()
is
executed on a worker of scheduler1
and the processNext
method in
doOnNext()
is executed on a worker of scheduler2
. Single threaded
schedulers may be used to ensure thread affinity for different stages in the
chain or for different subscribers.
EmitterProcessor<Integer> processor = EmitterProcessor.create();
processor.publishOn(scheduler1)
.map(i -> transform(i))
.publishOn(scheduler2)
.doOnNext(i -> processNext(i))
.subscribe();