
Possible memory leak in .net agent #1988

Closed · kaluznyt opened this issue Oct 18, 2023 · 11 comments

Labels: bug (Something isn't working), community (To tag external issues and PRs)

Comments

@kaluznyt

We're seeing an increasing number of objects related to New Relic in our .NET Core app.

Description
Took a couple of memory dumps of a .NET Core app (running on Kubernetes) and am seeing an increasing number of objects from the newrelic.agent namespace, such as:

  • MethodCallData
  • Segment
  • ParsedSqlStatement
  • ConnectionInfo
  • DataStoreSegmentData

This is causing the application to hit an OOM error after running for a couple of days.

Expected Behavior
No constant increase in memory usage caused by the New Relic objects.

Steps to Reproduce
No particular steps required.

Your Environment
.NET Core API app running on Kubernetes, .NET Core 6, NewRelic agent version 10.17.0

The screenshots below are from dotMemory; the dumps were taken from the same container one day apart:
[dotMemory screenshots]

@kaluznyt kaluznyt added the bug Something isn't working label Oct 18, 2023
@github-actions github-actions bot added the community To tag external issues and PRs label Oct 18, 2023
@nrcventura
Member

Thank you for reporting this to us. The data types that you are reporting in this issue and in the screenshots are ones that we expect to be created for every single call to a database/datastore.

  • MethodCallData is an object that is created for every single call to an instrumented method
  • Segment is an object that typically represents a method call within a transaction
  • ParsedSqlStatement is an object that is created for each database/datastore call
  • ConnectionInfo is an object that is created for each database/datastore call
  • DatastoreSegmentData is an object that is created for each database/datastore call

In general, this data should only be considered alive until a transaction ends and is transformed into the wire models that are stored in reservoirs until they are ready to be transmitted to New Relic. The agent does have some caching in place for ParsedSqlStatement which can keep those instances alive longer.

We will need more information to better understand what is happening. Information like the following can help us understand what is going on.

  1. How many database calls do you expect to see per transaction?
  2. How many transactions do you expect to see executing concurrently?
  3. Do you have a shareable reproduction of this memory problem?

@kaluznyt
Author

Thanks for your reply.

OK, basically the app mostly calls Redis via StackExchange.Redis multiplexers (cached/singletons). It calls different Redis instances (one hosted in the cloud, one in Kubernetes itself).
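
To illustrate, the multiplexer setup looks roughly like the sketch below (a .NET 6 minimal API with a singleton StackExchange.Redis multiplexer; the endpoint, connection string, and route are placeholders, not our real configuration):

```csharp
using Microsoft.AspNetCore.Builder;
using Microsoft.AspNetCore.Http;
using Microsoft.Extensions.DependencyInjection;
using StackExchange.Redis;

var builder = WebApplication.CreateBuilder(args);

// Multiplexers are expensive to create, so each one is connected once and reused as a singleton.
// The endpoint below is an example, not a real connection string.
builder.Services.AddSingleton<IConnectionMultiplexer>(_ =>
    ConnectionMultiplexer.Connect("cloud-redis.example.com:6380,ssl=true"));

var app = builder.Build();

app.MapGet("/cache/{key}", async (string key, IConnectionMultiplexer redis) =>
{
    // Each request issues one or more Redis commands through the shared multiplexer;
    // the agent records each command as a datastore segment.
    var value = await redis.GetDatabase().StringGetAsync(key);
    return value.HasValue ? Results.Ok(value.ToString()) : Results.NotFound();
});

app.Run();
```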

It also calls SQL Server, but that's a small percentage of the overall database traffic.

In terms of database calls, it's difficult to tell since it really depends on the request, but usually multiple Redis calls go out per transaction/request. There are also many concurrent requests; I would say the traffic is quite high all the time.

As for a reproduction, I don't have anything at the moment; we're seeing this in the live environment. Perhaps I'll try to put one together. Or maybe I could run some New Relic instrumentation, if that would help in any way?

@kaluznyt
Author

This is the breakdown of a sample transaction from New Relic:
[New Relic transaction breakdown screenshot]

@kaluznyt
Author

And this is a dump from the same pod, taken today; we see a large increase in object counts compared to the previous dumps:
[dotMemory screenshot]

@kaluznyt
Author

Is ParsedSqlStatement used for Redis, or only for SQL queries? From the name I suppose it's only for SQL queries; perhaps that's something we should look into?

@kaluznyt
Author

I just took a look at one of the ParsedSqlStatement objects, and it seems it's Redis related:
[dotMemory screenshot of a ParsedSqlStatement instance]

@kaluznyt
Author

I also found that a transaction that's still held in memory is also showing in New Relic, assuming that the _transactionGuid on the TransactionMetadata is the tripId (and this one is from 2 days ago).

[screenshots: dotMemory object and the matching New Relic transaction]

@nrcventura
Member

That definitely seems like a memory leak. Since you can find that transaction in the New Relic UI, the transaction probably ended and got transformed, which would allow the garbage collector to reclaim that memory. However, it's possible that it is being referenced on another thread and being kept in memory. This may be a side-effect of how the agent maintains state by leveraging AsyncLocal storage to allow the transaction to flow with all of the async thread hops that a transaction/request may go through. Things like starting a timer, and certain ways of kicking off async background work can cause AsyncLocal state to be captured even though that async work is not part of the request.
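
To make that concrete, here is a minimal stand-alone sketch (not agent code; the TransactionState type and sizes are made up) of how a value stored in AsyncLocal can be captured by fire-and-forget background work and kept alive after the request that created it has finished:

```csharp
using System;
using System.Threading;
using System.Threading.Tasks;

// Hypothetical stand-in for a per-transaction object graph (segments, SQL statements, etc.).
class TransactionState
{
    public byte[] Payload = new byte[1_000_000];
}

class Program
{
    // The current "transaction" flows across awaits via AsyncLocal, similar in spirit to how
    // the agent tracks transaction state.
    static readonly AsyncLocal<TransactionState?> CurrentTransaction = new();

    static async Task HandleRequestAsync()
    {
        CurrentTransaction.Value = new TransactionState();

        // Fire-and-forget work started inside the request captures the current ExecutionContext,
        // including the AsyncLocal slot, so this TransactionState stays reachable for as long as
        // the background loop runs -- long after the request below has returned.
        _ = Task.Run(async () =>
        {
            while (true)
            {
                await Task.Delay(TimeSpan.FromMinutes(1));
                // CurrentTransaction.Value still references the request's TransactionState here.
            }
        });

        // Wrapping the Task.Run call in ExecutionContext.SuppressFlow() would prevent the
        // AsyncLocal value from being captured by the background task.

        await Task.Delay(10);             // simulated request work
        CurrentTransaction.Value = null;  // the request ends, but the captured copy lives on
    }

    static async Task Main()
    {
        for (var i = 0; i < 100; i++)
        {
            await HandleRequestAsync();
        }

        // At this point 100 TransactionState instances (and their payloads) are still rooted by
        // the execution contexts captured by the background tasks.
        Console.WriteLine("Requests finished, but the AsyncLocal captures keep old state alive.");
    }
}
```

Timers behave the same way: a System.Threading.Timer created while a transaction is current captures the execution context (and therefore the AsyncLocal value) unless flow is suppressed.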

To further help you, we will either need an application that we can use to reproduce the problem, or you may need to work with our support team in order to share the memory dump, and possibly some sample code, with us. By working with our support team you can avoid sharing potentially sensitive information on GitHub. For more information on our support team you can refer to Support Options. If you open a request with the support team, please reference this GitHub issue so that we can ensure that the issues are linked together.

@kaluznyt
Author

Thank you for the guidance. We'll try to investigate the cause based on your input; if we cannot figure it out, we'll open a support request to get help.

@nrcventura
Member

I'm closing this ticket for now. If a support request is opened for this, we will address the problem through the support process. If more information becomes available, and we can identify or reproduce the problem, we can reopen the ticket later.

@nrcventura closed this as not planned on Oct 23, 2023