data:image/s3,"s3://crabby-images/f5f68/f5f681890942923e08a22c216359b610b906fb3d" alt=""
Log4j 1.x performance in synchronous and asynchronous mode
Good day, harazhiteli!
I am currently working on improving the performance of a large enterprise solution. Its specificity allows many users to perform similar actions, and accordingly, behind the scenes, the application server works with the same code. And now, at one point in the long path to speeding up the application, it was noticed that log4j takes the top places in the top of the lowest performing sections. The first thought was - excessive logging, but subsequent analysis denied it. Moreover, this is extremely important data for support engineers, and if you remove it now, then either the code will be returned back or the analysis of server production problems will be significantly more difficult.
This prompted me to this study - is there a way to optimize the logging process?
Disclaimer: due to the abundance of diagrams in this article causing psychedelic associations (like this: this is not a comparison of synchronous and asynchronous logging modes, the dragon kills a snake here!), it is not recommended for minors, people with an unstable psyche and pregnant women whose code is already in production and the issue of the next patch is not this year.
Caution, traffic under the cut.
As usual, the reason is commonplace - “competitive access”. After several experiments with a floating number of parallel threads, it became clear that the working time of the logging call is not linear, so the output is that the hard drive loses a lot.
Here are the measurement results:
data:image/s3,"s3://crabby-images/71b4b/71b4b7e12a898c62134a65f4ca76d86182bac72b" alt=""
Look for other ways of logging, upgrade libraries, you can do all this, but the task is to achieve maximum results with minimum efforts. I can tell you about the log4j 2 library too, but this will be a separate article. Now we will consider the means provided to us “out of the box” in log4j 1.x.
Among the appenders supplied with the library, there is AsyncAppender, which allows using the intermediate buffer to accumulate logging events to organize asynchronous work with the file system (if the final appender is just a file system, because it was originally intended for the SMTP logger). When generated, logging events accumulate, and only when a certain level of buffer fullness is reached, they get to the file.
Now that the approach is defined, we need to understand how effective it is, for this we will carry out the appropriate measurements.
We will measure it this way:
0) I warn you in advance, “I didn’t make beautiful data,” in places it’s clear that the processor switched to another job, and I left these places as it was. This is also part of the real work of the system.
1) We divide the tests into 3 groups:
- 10 logging events (from 1 to 10 in increments of 1)
- 550 logging events (from 10 to 100 in increments of 10)
- 5500 logging events (from 100 to 1000 in increments of 100)
2) B each group will have 3 subgroups of tests - depending on the size of the buffer (try to find the optimal one):
- 500 events
- 1500 events
- 5000 events
3) Tests will be performed synchronously and asynchronously.
4) The tests themselves are simple logging calls interspersed with “random operation” (duration from 1 to 15 ms, in order to be able to alternate access to the file).
Both results (plus intermediate, so as not to lose the thread), synchronous and asynchronous, we will visualize with diagrams, for clarity.
So, let's begin.
Here are the results:
data:image/s3,"s3://crabby-images/970be/970be0268979d271d888c8e35e935bff39d5873d" alt=""
data:image/s3,"s3://crabby-images/f0980/f09807dac53d0ef814741f9ac9d0b7d4fae89fa6" alt=""
data:image/s3,"s3://crabby-images/52eef/52eef5a10f3b8169cc21147a376c3e99d7dc7eb0" alt=""
data:image/s3,"s3://crabby-images/5d261/5d261ad3d19f32594a01ff15b45053ba61c98e1f" alt=""
data:image/s3,"s3://crabby-images/3c737/3c7379d187dc592cf23b43d378ec6aa5ec3b33ab" alt=""
Now I think you can clearly see the acceleration in asynchronous mode.
But before drawing conclusions, let's repeat our tests on 5 and 10 threads.
data:image/s3,"s3://crabby-images/48285/48285e3603ad074e49a297715d811d46207a567e" alt=""
data:image/s3,"s3://crabby-images/f6121/f6121f292eb029b59570f0c1e4c1590c34513065" alt=""
data:image/s3,"s3://crabby-images/62cec/62cec884be8e25a43c15a77641494a02dcdaa516" alt=""
data:image/s3,"s3://crabby-images/77b4f/77b4f836b462422eb321b155d2b92e68984b2f83" alt=""
data:image/s3,"s3://crabby-images/6596e/6596e4f9ffefb056b6036a62d14679509987125a" alt=""
data:image/s3,"s3://crabby-images/065c5/065c527b2717e394ce80dd3f6aebb59a9832d1eb" alt=""
data:image/s3,"s3://crabby-images/3f477/3f4776a3d888df032448ae767abe1b9ca12bb3b8" alt=""
data:image/s3,"s3://crabby-images/81630/81630a99342412a2db18148dbf71aaa9681ab322" alt=""
data:image/s3,"s3://crabby-images/2ce74/2ce74457c3808e4081d537468d5ef56401a73ffc" alt=""
data:image/s3,"s3://crabby-images/a11b0/a11b059f920e4a97467a756e2b9772f0eecdc283" alt=""
data:image/s3,"s3://crabby-images/ce60a/ce60a972e7ce40c837cec2cf04330eb47d98658b" alt=""
data:image/s3,"s3://crabby-images/e2e92/e2e92045242e0221a21dd8a375ab89df6f132dc7" alt=""
The conclusion, as they say, on the face.
Since now we can definitely talk about the advantage of the asynchronous method of logging, then let's try to increase the volume of tests by another 10 times. Add a test for 55,000 logging events (from 1000 to 10000 in increments of 1000). We take the buffer equal to 500 (since it at first glance, and later it will be proved, is the most optimal in our tests).
data:image/s3,"s3://crabby-images/0bf2e/0bf2ecbaed548f0e7ba7afc774f6d10b79ab600a" alt=""
data:image/s3,"s3://crabby-images/066a7/066a7975f62282d3387da17933d3e0b293763e22" alt=""
data:image/s3,"s3://crabby-images/809d9/809d9a5f1dd14ae13e08f86a2f735ea283874c74" alt=""
data:image/s3,"s3://crabby-images/7d2d8/7d2d800107b4848438ef5548e0528c8a6eb4ca22" alt=""
data:image/s3,"s3://crabby-images/caacb/caacbc5f7be7dee3ac829001e76b13e72c58b9d9" alt=""
data:image/s3,"s3://crabby-images/519a8/519a835e6abfcb09936e61fd945e73ff417448ef" alt=""
data:image/s3,"s3://crabby-images/eb293/eb293a3a80e9691f93faff145167752a2b4c1400" alt=""
data:image/s3,"s3://crabby-images/941d7/941d767948f9fbf3d373bb9bc435b5b8e24c9517" alt=""
data:image/s3,"s3://crabby-images/32969/32969030ce3ae1fbfe8a1de67e6c23382c83ab5f" alt=""
data:image/s3,"s3://crabby-images/2d7e3/2d7e36145a3c770f7d687c32c1a83aba563daace" alt=""
data:image/s3,"s3://crabby-images/c06a1/c06a13ae6874101414c563461e9b0c332c8354ed" alt=""
data:image/s3,"s3://crabby-images/5a40a/5a40ad68a550f77f9f117bedc33ce1e0326bdb07" alt=""
Total - 500 events , it is our buffer that allows us to work most efficiently in asynchronous mode.
And now if we even compare the total (or average) runtime of all the tests, then you can get a constant that displays the winning order of the asynchronous mode before the synchronous one, I have it - 8.9 (times) .
The above material makes it clear that the asynchronous logging strategy gives a performance gain. Then the question begs - why not use it always? In order to make a choice in favor of a particular method, you need to imagine the mechanisms that are hidden inside. Below I give a few theses taken from the offsite:
1) AsyncAppender operates with its own thread (while FileAppender itself runs in the context of the current thread), therefore, when using it, it is desirable to increase the limit of application server threads.
2) When using AsyncAppender, memory overhead occurs, since the logging events are not dumped to the file instantly, but by first filling the buffer.
3) The lock of the log file lasts longer than when using the synchronous approach, since the message buffer contains much more information for recording.
In principle, everything is prosaic, but here you need to understand that synchronization itself is also a lock, so it is important to transfer it from one place to another does not make it worse.
Use asynchrony where it is really needed:
- long-playing appenders - SMTP, JDBC
- general resource blocking - FTP, Local File Storage
But first of all, be sure to profile your code.
Excel version of this article:
docs.google.com/spreadsheet/ccc?key=0AkyN15vTZD-ddHV0Y3p4QVUxTXVZRldPcU0tNzNucWc&usp=sharing
docs.google.com/spreadsheet/ccc?key=0AkyN15vTZD-ddVhjdjjdjjdjjdjjjdjjdjjdjjjdjjjdjjdjjjdjjjdjjjdjjjdjjjjdjjjjjjjdj
Thanks for attention. I hope the article will be useful to you.
Have a productive week!
I am currently working on improving the performance of a large enterprise solution. Its specificity allows many users to perform similar actions, and accordingly, behind the scenes, the application server works with the same code. And now, at one point in the long path to speeding up the application, it was noticed that log4j takes the top places in the top of the lowest performing sections. The first thought was - excessive logging, but subsequent analysis denied it. Moreover, this is extremely important data for support engineers, and if you remove it now, then either the code will be returned back or the analysis of server production problems will be significantly more difficult.
This prompted me to this study - is there a way to optimize the logging process?
Disclaimer: due to the abundance of diagrams in this article causing psychedelic associations (like this: this is not a comparison of synchronous and asynchronous logging modes, the dragon kills a snake here!), it is not recommended for minors, people with an unstable psyche and pregnant women whose code is already in production and the issue of the next patch is not this year.
Caution, traffic under the cut.
What is the reason?
As usual, the reason is commonplace - “competitive access”. After several experiments with a floating number of parallel threads, it became clear that the working time of the logging call is not linear, so the output is that the hard drive loses a lot.
Here are the measurement results:
data:image/s3,"s3://crabby-images/71b4b/71b4b7e12a898c62134a65f4ca76d86182bac72b" alt=""
What is the solution?
Look for other ways of logging, upgrade libraries, you can do all this, but the task is to achieve maximum results with minimum efforts. I can tell you about the log4j 2 library too, but this will be a separate article. Now we will consider the means provided to us “out of the box” in log4j 1.x.
Among the appenders supplied with the library, there is AsyncAppender, which allows using the intermediate buffer to accumulate logging events to organize asynchronous work with the file system (if the final appender is just a file system, because it was originally intended for the SMTP logger). When generated, logging events accumulate, and only when a certain level of buffer fullness is reached, they get to the file.
Measurements
Now that the approach is defined, we need to understand how effective it is, for this we will carry out the appropriate measurements.
We will measure it this way:
0) I warn you in advance, “I didn’t make beautiful data,” in places it’s clear that the processor switched to another job, and I left these places as it was. This is also part of the real work of the system.
1) We divide the tests into 3 groups:
- 10 logging events (from 1 to 10 in increments of 1)
- 550 logging events (from 10 to 100 in increments of 10)
- 5500 logging events (from 100 to 1000 in increments of 100)
2) B each group will have 3 subgroups of tests - depending on the size of the buffer (try to find the optimal one):
- 500 events
- 1500 events
- 5000 events
3) Tests will be performed synchronously and asynchronously.
Synchronous Logger Configuration
Asynchronous Logger Configuration
4) The tests themselves are simple logging calls interspersed with “random operation” (duration from 1 to 15 ms, in order to be able to alternate access to the file).
Test source code
package com.ice.logger_test;
import org.apache.commons.lang3.time.StopWatch;
import org.apache.log4j.Logger;
import java.util.Random;
public class SimpleTest {
private static Logger logger = Logger.getLogger(SimpleTest.class);
private static double NANOS_TO_SEC = 1000000000.0d;
private static String LOG_MESSAGE =
"One hundred bytes log message for performing some tests using sync/async appenders of log4j library";
public static void main(String[] args) throws InterruptedException {
//performTest("Single thread");
ThreadRunner t1 = new ThreadRunner();
new Thread(t1).start();
new Thread(t1).start();
new Thread(t1).start();
new Thread(t1).start();
new Thread(t1).start();
new Thread(t1).start();
new Thread(t1).start();
new Thread(t1).start();
new Thread(t1).start();
new Thread(t1).start();
}
private static void performTest(String message) throws InterruptedException {
logger.info("Warm-up...");
logger.info("Warm-up...");
logger.info("Warm-up...");
StopWatch timer = new StopWatch();
Random ran = new Random();
for(int i = 1; i <= 10000; i += getIncrementator(i)) {
timer.reset();
timer.start();
int iterations = 0;
for(int j = 1; j <= i; j++) {
timer.suspend();
Thread.sleep(ran.nextInt(15)+1); // some work
timer.resume();
logger.info(LOG_MESSAGE);
timer.suspend();
Thread.sleep(ran.nextInt(15)+1); // some work
timer.resume();
iterations = j;
}
timer.stop();
System.out.printf(message + " %d iteration(s) %f sec\n", iterations, (timer.getNanoTime() / NANOS_TO_SEC));
}
}
private static int getIncrementator(int i) {
if(i >= 0 && i < 10) return 1;
if(i >= 10 && i < 100) return 10;
if(i >= 100 && i < 1000) return 100;
if(i >= 1000 && i < 10000) return 1000;
if(i >= 10000 && i <= 100000) return 10000;
return 0;
}
static class ThreadRunner implements Runnable {
@Override
public void run() {
try {
performTest(Thread.currentThread().getName());
} catch (InterruptedException e) {
e.printStackTrace();
}
}
}
}
Both results (plus intermediate, so as not to lose the thread), synchronous and asynchronous, we will visualize with diagrams, for clarity.
So, let's begin.
Synchronous start, 2 threads
First, let's see how the logger behaves in a synchronous configuration. Run all the scripts on 2 threads.Here are the results:
data:image/s3,"s3://crabby-images/970be/970be0268979d271d888c8e35e935bff39d5873d" alt=""
Asynchronous start, 2 threads
Buffer = 500
Now switch to asynchronous mode and at the same time try to find the optimal buffer, start with 500data:image/s3,"s3://crabby-images/f0980/f09807dac53d0ef814741f9ac9d0b7d4fae89fa6" alt=""
Buffer = 1500
Let's increase the buffer 3 times and carry out the same tests:data:image/s3,"s3://crabby-images/52eef/52eef5a10f3b8169cc21147a376c3e99d7dc7eb0" alt=""
Buffer = 5000
Let's increase the buffer by 10 times and carry out the same tests:data:image/s3,"s3://crabby-images/5d261/5d261ad3d19f32594a01ff15b45053ba61c98e1f" alt=""
Total for 2 threads
Now, for clarity, I will collect all asynchronous (to try to determine the optimal buffer) and synchronous (for clarity, I wonder who won) tests in one diagram:data:image/s3,"s3://crabby-images/3c737/3c7379d187dc592cf23b43d378ec6aa5ec3b33ab" alt=""
Now I think you can clearly see the acceleration in asynchronous mode.
But before drawing conclusions, let's repeat our tests on 5 and 10 threads.
Synchronous start, 5 threads
data:image/s3,"s3://crabby-images/48285/48285e3603ad074e49a297715d811d46207a567e" alt=""
Asynchronous start, 5 threads
Buffer = 500
data:image/s3,"s3://crabby-images/f6121/f6121f292eb029b59570f0c1e4c1590c34513065" alt=""
Buffer = 1500
data:image/s3,"s3://crabby-images/62cec/62cec884be8e25a43c15a77641494a02dcdaa516" alt=""
Buffer = 5000
data:image/s3,"s3://crabby-images/77b4f/77b4f836b462422eb321b155d2b92e68984b2f83" alt=""
Total for 5 threads
data:image/s3,"s3://crabby-images/6596e/6596e4f9ffefb056b6036a62d14679509987125a" alt=""
Synchronous start, 10 threads
data:image/s3,"s3://crabby-images/065c5/065c527b2717e394ce80dd3f6aebb59a9832d1eb" alt=""
Asynchronous start, 10 threads
Buffer = 500
data:image/s3,"s3://crabby-images/3f477/3f4776a3d888df032448ae767abe1b9ca12bb3b8" alt=""
Buffer = 1500
data:image/s3,"s3://crabby-images/81630/81630a99342412a2db18148dbf71aaa9681ab322" alt=""
Buffer = 5000
data:image/s3,"s3://crabby-images/2ce74/2ce74457c3808e4081d537468d5ef56401a73ffc" alt=""
The result for 10 threads
data:image/s3,"s3://crabby-images/a11b0/a11b059f920e4a97467a756e2b9772f0eecdc283" alt=""
data:image/s3,"s3://crabby-images/ce60a/ce60a972e7ce40c837cec2cf04330eb47d98658b" alt=""
data:image/s3,"s3://crabby-images/e2e92/e2e92045242e0221a21dd8a375ab89df6f132dc7" alt=""
The conclusion, as they say, on the face.
Since now we can definitely talk about the advantage of the asynchronous method of logging, then let's try to increase the volume of tests by another 10 times. Add a test for 55,000 logging events (from 1000 to 10000 in increments of 1000). We take the buffer equal to 500 (since it at first glance, and later it will be proved, is the most optimal in our tests).
Synchronous start, 10 streams on large volumes of data
data:image/s3,"s3://crabby-images/0bf2e/0bf2ecbaed548f0e7ba7afc774f6d10b79ab600a" alt=""
data:image/s3,"s3://crabby-images/066a7/066a7975f62282d3387da17933d3e0b293763e22" alt=""
data:image/s3,"s3://crabby-images/809d9/809d9a5f1dd14ae13e08f86a2f735ea283874c74" alt=""
data:image/s3,"s3://crabby-images/7d2d8/7d2d800107b4848438ef5548e0528c8a6eb4ca22" alt=""
Asynchronous start, 10 threads on large amounts of data
data:image/s3,"s3://crabby-images/caacb/caacbc5f7be7dee3ac829001e76b13e72c58b9d9" alt=""
data:image/s3,"s3://crabby-images/519a8/519a835e6abfcb09936e61fd945e73ff417448ef" alt=""
data:image/s3,"s3://crabby-images/eb293/eb293a3a80e9691f93faff145167752a2b4c1400" alt=""
data:image/s3,"s3://crabby-images/941d7/941d767948f9fbf3d373bb9bc435b5b8e24c9517" alt=""
The result for 10 threads on large amounts of data
data:image/s3,"s3://crabby-images/32969/32969030ce3ae1fbfe8a1de67e6c23382c83ab5f" alt=""
data:image/s3,"s3://crabby-images/2d7e3/2d7e36145a3c770f7d687c32c1a83aba563daace" alt=""
data:image/s3,"s3://crabby-images/c06a1/c06a13ae6874101414c563461e9b0c332c8354ed" alt=""
data:image/s3,"s3://crabby-images/5a40a/5a40ad68a550f77f9f117bedc33ce1e0326bdb07" alt=""
Optimal buffer
At the moment, we have already accumulated quite a lot of statistics, so let's average them all and see what kind of buffer is most suitable.2 Threads | |
![]() ![]() ![]() | ![]() ![]() ![]() |
5 Threads | |
![]() ![]() ![]() | ![]() ![]() ![]() |
10 Threads | |
![]() ![]() ![]() | ![]() ![]() ![]() |
10 Threads, a large volume of tests | |
![]() | ![]() |
Total - 500 events , it is our buffer that allows us to work most efficiently in asynchronous mode.
And now if we even compare the total (or average) runtime of all the tests, then you can get a constant that displays the winning order of the asynchronous mode before the synchronous one, I have it - 8.9 (times) .
Conclusion
The above material makes it clear that the asynchronous logging strategy gives a performance gain. Then the question begs - why not use it always? In order to make a choice in favor of a particular method, you need to imagine the mechanisms that are hidden inside. Below I give a few theses taken from the offsite:
1) AsyncAppender operates with its own thread (while FileAppender itself runs in the context of the current thread), therefore, when using it, it is desirable to increase the limit of application server threads.
2) When using AsyncAppender, memory overhead occurs, since the logging events are not dumped to the file instantly, but by first filling the buffer.
3) The lock of the log file lasts longer than when using the synchronous approach, since the message buffer contains much more information for recording.
In principle, everything is prosaic, but here you need to understand that synchronization itself is also a lock, so it is important to transfer it from one place to another does not make it worse.
Use asynchrony where it is really needed:
- long-playing appenders - SMTP, JDBC
- general resource blocking - FTP, Local File Storage
But first of all, be sure to profile your code.
Excel version of this article:
docs.google.com/spreadsheet/ccc?key=0AkyN15vTZD-ddHV0Y3p4QVUxTXVZRldPcU0tNzNucWc&usp=sharing
docs.google.com/spreadsheet/ccc?key=0AkyN15vTZD-ddVhjdjjdjjdjjdjjjdjjdjjdjjjdjjjdjjdjjjdjjjdjjjdjjjdjjjjdjjjjjjjdj
Thanks for attention. I hope the article will be useful to you.
Have a productive week!
Only registered users can participate in the survey. Please come in.
Do you need a similar article about log4j 2.x?
- 23% Yes, only about log4j 2.x 36
- 60.2% Yes, about log4j 1.x compared to log4j 2.x 94
- 16.6% No 26