
Improve performance of Java agent #134

Open · wants to merge 15 commits into base: develop
Conversation

@kusalk commented Oct 11, 2024

Hi @sohyun-ku, @taeyeon-Kim and others 👋🏾

I've been working on some performance optimisations for the Java agent. I'd be keen to get your thoughts on these and to hear whether you think they are worthwhile.

The optimisations in this PR consist of the following:

  1. MethodRegistry#extractSignature algorithm optimisation
  2. Filter synthetic class signatures during ByteBuddy advice installation instead of on invocation (a sketch follows this list)
  3. Replace MD5 hashing with MurmurHash3
  4. Rewrite InvocationRegistry to use ConcurrentHashMap (rather than LinkedBlockingQueue) and remove worker thread
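
As a rough illustration of item 2, installation-time filtering could look something like the sketch below. The package filter, advice class name, and installer class are hypothetical; the PR's actual ByteBuddy wiring is not reproduced here.

```java
import java.lang.instrument.Instrumentation;

import net.bytebuddy.agent.builder.AgentBuilder;
import net.bytebuddy.matcher.ElementMatchers;

// Hypothetical installer: synthetic types and methods are excluded when the advice is
// installed, so no per-invocation synthetic check is needed on the hot path.
public final class AgentInstaller {

    public static void install(Instrumentation instrumentation) {
        new AgentBuilder.Default()
                // Never instrument synthetic classes at all.
                .ignore(ElementMatchers.isSynthetic())
                .type(ElementMatchers.nameStartsWith("com.example."))      // hypothetical package filter
                .transform(new AgentBuilder.Transformer.ForAdvice()
                        .include(AgentInstaller.class.getClassLoader())
                        // Only weave advice into non-synthetic methods.
                        .advice(ElementMatchers.isMethod()
                                        .and(ElementMatchers.not(ElementMatchers.isSynthetic())),
                                "com.example.agent.InvocationAdvice"))     // hypothetical advice class
                .installOn(instrumentation);
    }
}
```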

I've also made the following additional changes:

  1. Use JDK21 with target level 8 for Java agent
    1. Retain target level 21 for unit test module
  2. Upgrade Mockito test dependency to latest version (unblocked by 1.i)
  3. Upgrade Gradle Shadow plugin to latest version
  4. Extract common WireMock integration test setup into abstract class
  5. Add JMH benchmark runner integration test
  6. Rework dependency injection for InvocationTracker and Scheduler
  7. Isolate SchedulerTest unit tests to only test Scheduler (unblocked by 6.)
  8. Do not log exception stack trace if collector service unavailable

These changes can mostly be reviewed commit by commit, but I can also split them into multiple PRs if that would make review easier.

Whilst developing and finalising these optimisations, I ran some JMH benchmarks on my local machine to verify and measure the performance improvements.

Whilst I have limited experience writing and running such benchmarks, I took some care to avoid the most common pitfalls. I collected these results using Java 21 and the compiler blackhole configuration. That said, there are some discrepancies that I wasn't able to fully explain, so whilst I'm not completely confident in the absolute numbers, I am reasonably confident that the ordering of the results is correct.
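
For reference, a minimal JMH setup along the lines below exercises the hashing path with the average-time mode and the compiler blackhole flag mentioned above. The class names, example signature, and static `MethodRegistry.getHash` accessor are illustrative assumptions rather than the PR's actual benchmark code.

```java
import java.util.concurrent.TimeUnit;

import org.openjdk.jmh.annotations.Benchmark;
import org.openjdk.jmh.annotations.BenchmarkMode;
import org.openjdk.jmh.annotations.Mode;
import org.openjdk.jmh.annotations.OutputTimeUnit;
import org.openjdk.jmh.annotations.Scope;
import org.openjdk.jmh.annotations.State;
import org.openjdk.jmh.runner.Runner;
import org.openjdk.jmh.runner.RunnerException;
import org.openjdk.jmh.runner.options.Options;
import org.openjdk.jmh.runner.options.OptionsBuilder;

@State(Scope.Benchmark)
@BenchmarkMode(Mode.AverageTime)
@OutputTimeUnit(TimeUnit.NANOSECONDS)
public class MethodHashBenchmark {

    private final String signature =
            "public void com.example.SomeService.handle(com.example.Request)";

    @Benchmark
    public String hashSignature() {
        // Returning the result lets JMH consume it, preventing dead-code elimination.
        return MethodRegistry.getHash(signature); // assumed static accessor, for illustration only
    }

    public static void main(String[] args) throws RunnerException {
        Options options = new OptionsBuilder()
                .include(MethodHashBenchmark.class.getSimpleName())
                // Compiler blackholes (JMH 1.30+) avoid the overhead of the Java-level Blackhole.
                .jvmArgsAppend("-Djmh.blackhole.mode=COMPILER")
                .build();
        new Runner(options).run();
    }
}
```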

Method hashing - MethodRegistry#getHash (ConcurrentHashMap cache disabled)

| Implementation | Measurement (ns/op) |
| --- | --- |
| Before | 1016.663 ± 146.226 |
| extractSignature optimised | 865.494 ± 71.256 |
| Above + RegEx moved | 581.834 ± 46.560 |
| Above + MurmurHash3 | 139.283 ± 6.211 |
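
To make the rows above concrete: 'RegEx moved' refers to hoisting the pattern out of the per-call path, and the final row swaps MD5 for MurmurHash3. The sketch below shows the general shape only; the normalisation pattern and the use of Guava's Hashing are assumptions, not the PR's actual code.

```java
import java.nio.charset.StandardCharsets;
import java.util.regex.Pattern;

import com.google.common.hash.Hashing;

// Illustrative sketch only; the PR's real MethodRegistry/Murmur implementation is not shown here.
final class SignatureHashing {

    // Compiling the pattern once, rather than calling String.replaceAll on every invocation,
    // is the sort of change the "RegEx moved" row refers to. The pattern itself is made up.
    private static final Pattern VISIBILITY_MODIFIERS =
            Pattern.compile("\\b(?:public|protected|private) ");

    static String hash(String signature) {
        String normalised = VISIBILITY_MODIFIERS.matcher(signature).replaceAll("");
        // A 128-bit MurmurHash3 is far cheaper than MD5 and needs no MessageDigest lookup.
        return Hashing.murmur3_128()
                .hashString(normalised, StandardCharsets.UTF_8)
                .toString();
    }
}
```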

Invocation registering - InvocationRegistry#register

In the following benchmarks, 'not in buffer' refers to the state of the invocations buffer immediately upon agent startup, 'in buffer' refers to registering an invocation whose hash is already present in the buffer, and 'reset buffer' refers to the state immediately after invocation data is published and the buffer is cleared.

| Implementation | Not in buffer (ns/op) | In buffer (ns/op) | Reset buffer (ns/op) |
| --- | --- | --- | --- |
| Before | 261.195 ± 17.170 (+~220 async) | 3.027 ± 0.117 | Same as not in buffer |
| ConcurrentSet* | 190.002 ± 18.514 | 7.155 ± 0.237 | Same as not in buffer |
| ConcurrentMap | 210.502 ± 6.470 | 6.298 ± 0.779 | 152.111 ± 22.679 |

*This was my initial alternative implementation (44c2418); I ended up settling on a slightly different approach, which offers better 'reset buffer' performance and is arguably simpler.
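
The general shape of the ConcurrentMap approach is sketched below. The names and the drain/swap details are illustrative assumptions; the PR's actual InvocationRegistry is not reproduced here.

```java
import java.util.concurrent.ConcurrentHashMap;

// Sketch of a ConcurrentHashMap-backed buffer with a cheap "already in buffer" fast path.
// Field and method names are illustrative and do not mirror the PR's exact code.
final class InvocationBuffer {

    // Maps method hash -> epoch second of the first sighting since the last publication.
    private volatile ConcurrentHashMap<String, Long> invocations = new ConcurrentHashMap<>();

    void register(String methodHash, long epochSecond) {
        // putIfAbsent is a near-free no-op when the hash is already buffered ("in buffer"),
        // and only pays the insertion cost the first time it is seen ("not in buffer").
        invocations.putIfAbsent(methodHash, epochSecond);
    }

    // Called at publication time: swap in a fresh map so registration never blocks on publishing.
    // A real implementation must also handle registrations racing with this swap.
    ConcurrentHashMap<String, Long> drain() {
        ConcurrentHashMap<String, Long> snapshot = invocations;
        invocations = new ConcurrentHashMap<>();
        return snapshot;
    }
}
```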

Overall - InvocationTracker#onInvocation

| Scenario | Before PR (ns/op) | After PR (ns/op) |
| --- | --- | --- |
| Hash uncached, reset buffer | 2540.221 ± 418.102 (+~220 async) | 316.088 ± 36.943 |
| Hash cached, reset buffer | 267.910 ± 4.762 (+~220 async) | 199.192 ± 9.958 |
| Hash cached, in buffer | 10.033 ± 0.389 | 12.836 ± 0.120 |

In a real-world scenario, the vast majority of invocations fall into the 'hash cached, in buffer' case, where the latency is minimal irrespective of the optimisations. However, latency spikes on application startup and momentarily after every publication. Notably, the worker thread is also eliminated, leading to further indirect performance gains. In complex web applications with many tracked methods, there can be 10,000+ tracked invocations per served request; at a few hundred nanoseconds each during those spikes, that adds up to a delay in the order of milliseconds, so these optimisations should help measurably reduce the agent's worst-case overhead.

@kusalk requested review from junoyoon and a team as code owners on October 11, 2024 11:27
```java
public static String from(String signature) {
    return Murmur.from(signature);
}
```
Contributor
Changing the hash algorithm is a big task that requires migrating DB data.
It's better to keep md5 as it is.

Author

Good point, I forgot to take backwards compatibility into consideration.

How would you feel if I made it a command-line option instead (retaining MD5 as the default)? I can extract this change into a separate PR.
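
One possible shape for this, keeping MD5 as the default and making MurmurHash3 opt-in; the property name, enum, and Guava usage below are hypothetical and nothing here is final:

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

import com.google.common.hash.Hashing;

// Hypothetical sketch: MD5 stays the default, MurmurHash3 is opt-in via configuration.
enum HashAlgorithm {
    MD5 {
        @Override
        String hash(String signature) {
            try {
                MessageDigest digest = MessageDigest.getInstance("MD5");
                byte[] bytes = digest.digest(signature.getBytes(StandardCharsets.UTF_8));
                StringBuilder hex = new StringBuilder(bytes.length * 2);
                for (byte b : bytes) {
                    hex.append(String.format("%02x", b));
                }
                return hex.toString();
            } catch (NoSuchAlgorithmException e) {
                throw new IllegalStateException("MD5 not available", e);
            }
        }
    },
    MURMUR3_128 {
        @Override
        String hash(String signature) {
            return Hashing.murmur3_128()
                    .hashString(signature, StandardCharsets.UTF_8)
                    .toString();
        }
    };

    abstract String hash(String signature);

    // e.g. -Dscavenger.hash-algorithm=MURMUR3_128 (hypothetical property name)
    static HashAlgorithm fromConfig(String value) {
        return value == null ? MD5 : valueOf(value.toUpperCase());
    }
}
```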

Contributor

Yes, it seems like a good idea to default to md5 and add it as an option for the user to configure.

cc. @kojandy @sohyun-ku

@taeyeon-Kim (Contributor) left a comment

First, thank you for contributing such a great PR.
An agent is characterised by the fact that it runs on, and uses, the client's resources.
If the Scavenger agent uses more of the user's resources (CPU, memory, etc.) than before, I think that is a bigger issue than the performance improvement.
Is it possible to measure this as well?
(It doesn't seem to be an issue from the looks of it.)

cc. @sohyun-ku @kojandy

@kusalk (Author) commented Oct 15, 2024

I'll see what I can do :)
