
[CPU] Add dumping of the memory statistics #28441

Open

wants to merge 33 commits into base: master
Conversation

maxnick
Contributor

@maxnick maxnick commented Jan 14, 2025

Details:

Add yet another debug capability: dumping the following memory statistics:

  1. Memory statistics for specific memory managers: Type of the manager, number of memory regions, number of unique memory blocks, total memory size, theoretically optimal total memory size, the size of the largest memory region
  2. The size of memory allocated for scratchpads
  3. Weight cache statistics per socket: Total size, the number of memory objects

Both standard output and *.csv file dumps are supported.
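As a hedged illustration only, using the environment variable names discussed in the review below (the exact semantics may differ in the merged version), enabling the dump could look like this:

```shell
# Hypothetical usage sketch; variable names taken from this PR's review
# discussion, semantics may differ in the final implementation.

# Dump memory statistics to standard output:
export OV_CPU_MEMORY_STATISTICS_PATH=cout

# Or dump them to a CSV file, optionally raising the verbosity level:
export OV_CPU_MEMORY_STATISTICS_PATH=memory_stats.csv
export OV_CPU_MEMORY_STATISTICS_LEVEL=2

# Then run any OpenVINO CPU inference workload, e.g.:
# ./benchmark_app -m model.xml -d CPU
```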

ToDo:

  • Add corresponding documentation

Tickets:

  • ticket-id

@maxnick maxnick requested review from a team as code owners January 14, 2025 18:33
@github-actions github-actions bot added the category: CPU OpenVINO CPU plugin label Jan 14, 2025
@maxnick maxnick added this to the 2025.1 milestone Jan 14, 2025
@maxnick maxnick requested a review from a team as a code owner January 15, 2025 10:49
@maxnick maxnick requested review from kblaszczak-intel and removed request for a team January 15, 2025 10:49
@github-actions github-actions bot added the category: docs OpenVINO documentation label Jan 15, 2025
@maxnick maxnick removed the request for review from kblaszczak-intel January 15, 2025 10:49
@maxnick maxnick removed the category: docs OpenVINO documentation label Jan 15, 2025
@github-actions github-actions bot added the category: docs OpenVINO documentation label Jan 15, 2025
@@ -66,6 +66,12 @@ void DebugCapsConfig::readProperties() {

if ((envVarValue = readEnv("OV_CPU_AVERAGE_COUNTERS")))
averageCountersPath = envVarValue;

if ((envVarValue = readEnv("OV_CPU_MEMORY_STATISTICS_LEVEL")))
memoryStatisticsDumpLevel = std::stoi(envVarValue);
Contributor

@EgorDuplensky EgorDuplensky Jan 19, 2025


Shouldn't we align the handling of these environment variables with the ones we already have?
I mean, OV_CPU_MEMORY_STATISTICS_PATH could accept options like:

  • cout
  • *.csv
  • *.etc

and automatically enable level 1 when specified.

OV_CPU_MEMORY_STATISTICS_LEVEL could then be used to increase the level.

This way one could dump the statistics to a CSV file without needing to specify two environment variables.
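The suggested handling could be sketched roughly as follows (a hypothetical illustration, not the PR's actual code; the function and struct names are invented):

```cpp
#include <string>

// Sketch of the suggested env-var handling: the PATH variable selects the
// sink ("cout" or a *.csv path) and implicitly enables level 1; the optional
// LEVEL variable may only raise the level further.
struct MemoryStatisticsConfig {
    std::string path;  // "cout", "stats.csv", ...
    int level = 0;     // 0 == statistics disabled
};

MemoryStatisticsConfig parseMemoryStatisticsEnv(const char* pathVar, const char* levelVar) {
    MemoryStatisticsConfig cfg;
    if (pathVar && *pathVar) {
        cfg.path = pathVar;
        cfg.level = 1;  // specifying a sink enables statistics at level 1
    }
    if (levelVar && *levelVar) {
        const int requested = std::stoi(levelVar);
        if (requested > cfg.level)
            cfg.level = requested;  // the level variable can only increase it
    }
    return cfg;
}
```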

Contributor Author


Done

@maxnick maxnick requested a review from EgorDuplensky January 22, 2025 10:22
@maxnick
Contributor Author

maxnick commented Jan 24, 2025

@EgorDuplensky , do you have any further comments?


private:
std::shared_ptr<MemoryBlockWithRelease> m_pBlock;
size_t m_max_requested_size = 0;
Contributor


Is having a size the only reason we are introducing a debug-caps wrapper here?
If so, is it possible to implement a size() method for all the non-debug-caps implementations instead? That could be helpful for other troubleshooting use cases, e.g. debug logs.

Contributor Author


Yes, that is the reason, but not the only one. In the release build the memory blocks are shared across many tensors, so it's impossible to track the size requested by each tensor; we only maintain the minimal size that can accommodate the biggest tensor. Thus the information about each tensor's size cannot be retrieved at this level.
The fact that we don't have a size() method in the dynamic memory block interface is a deliberate design choice, which avoids additional semantic constraints. The thing is that a memory block may or may not allocate memory; a partitioned memory block, for example, doesn't allocate anything. But when we work through the interface, we don't even know the dynamic type of the memory block object, so it's difficult to interpret the result correctly. In the partitioned memory block example, returning the partition size may give the impression that a memory block of this size is allocated, while it isn't; it's just a view. Also, receiving the max size of a "memory block with reuse" may be unexpected when we call this method on an abstract object. Therefore:

  1. Introducing this method into the memory block interface won't let us avoid the wrappers, because such shared memory blocks don't store information about each resize request.
  2. Without knowledge of the dynamic type of the object, it's not clear how to interpret the result of an abstract size() request. For types that can unambiguously define the meaning of the size() result, such a method is introduced (e.g. MemoryBlockWithReuse).
  3. Beyond this low-level memory management subsystem, the memory size may be requested from the memory descriptor of the corresponding memory object (for logging and troubleshooting purposes).
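The shared-block argument above can be illustrated with a minimal sketch (hypothetical types, not the PR's actual classes): the shared block only keeps enough memory for the largest request, while the per-tensor debug wrapper additionally remembers the maximum size this particular tensor ever asked for.

```cpp
#include <algorithm>
#include <cstddef>
#include <memory>
#include <vector>

// Simplified stand-in for a memory block shared across many tensors: it
// only tracks the capacity needed to fit the biggest request seen so far.
class SharedMemoryBlock {
public:
    bool resize(size_t size) {
        if (size <= m_capacity)
            return false;   // existing allocation is already large enough
        m_capacity = size;  // grow to accommodate the biggest tensor
        m_data.resize(size);
        return true;
    }
    size_t capacity() const { return m_capacity; }

private:
    size_t m_capacity = 0;
    std::vector<char> m_data;
};

// Debug-caps wrapper: records the per-tensor maximum requested size, which
// the shared block itself cannot recover once many tensors share it.
class DebugMemoryBlockWrapper {
public:
    explicit DebugMemoryBlockWrapper(std::shared_ptr<SharedMemoryBlock> block)
        : m_pBlock(std::move(block)) {}
    bool resize(size_t size) {
        m_max_requested_size = std::max(m_max_requested_size, size);
        return m_pBlock->resize(size);
    }
    size_t maxRequestedSize() const { return m_max_requested_size; }

private:
    std::shared_ptr<SharedMemoryBlock> m_pBlock;
    size_t m_max_requested_size = 0;
};
```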

}

private:
std::vector<std::shared_ptr<MemoryBlockWithRelease>> m_unique_blocks;
Contributor


It should be possible to count unique blocks even without this data structure, isn't it?
It will be slower for sure, but do we care how fast we collect these statistics?

Contributor Author

@maxnick maxnick Jan 29, 2025


This is also used to calculate the actually allocated memory size. The wrappers store the memory size requested by each tensor, but not the really allocated memory.
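The two points in this exchange can be sketched side by side (hypothetical types, not the PR's actual code): unique blocks can indeed be recounted on the fly from the per-region block pointers, but a dedicated list of unique blocks is also what allows summing the actually allocated sizes rather than the per-tensor requested sizes.

```cpp
#include <cstddef>
#include <memory>
#include <unordered_set>
#include <vector>

// Simplified stand-in for a memory block with a real allocation size.
struct Block {
    size_t capacity = 0;
};

// Reviewer's suggestion: recount unique blocks without a dedicated
// structure, by deduplicating the per-region block pointers (slower,
// but speed hardly matters for statistics collection).
size_t countUniqueBlocks(const std::vector<std::shared_ptr<Block>>& regionBlocks) {
    std::unordered_set<const Block*> unique;
    for (const auto& b : regionBlocks)
        unique.insert(b.get());
    return unique.size();
}

// Author's point: the unique-blocks list is also used to sum the memory
// that was really allocated (block capacities), not the requested sizes.
size_t totalAllocatedSize(const std::vector<std::shared_ptr<Block>>& uniqueBlocks) {
    size_t total = 0;
    for (const auto& b : uniqueBlocks)
        total += b->capacity;
    return total;
}
```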

@@ -371,10 +618,20 @@ void MemoryControl::releaseMemory() {
m_allocated = false;
}

edgeClusters MemoryControl::findEdgeClusters(const std::vector<EdgePtr>& graphEdges) {
#ifdef CPU_DEBUG_CAPS
MemoryStatistics MemoryControl::dumpStatistics() const {
Contributor


What about using free friend functions instead? They would still be able to access the private fields, and we could move all the debug-caps-related logic into a separate file to avoid cluttering the production logic.

Contributor Author

@maxnick maxnick Jan 29, 2025


Yes, it could make the code cleaner, but my idea was that this statistics calculation is strictly bound to the specific memory manager type (in terms of data members and the underlying algorithm), so once the main memory management implementation is changed, this statistics collection subroutine most likely needs to be changed too.
Moreover, I didn't even want to wrap them in the CPU_DEBUG_CAPS macro, but since we want to keep the main version as lightweight and fast as possible, some of these implementations become ill-formed, as they access debug-only versions of data members.
Thus, if you are still sure that it's better to move them into a separate file even though it will be harder to keep them up to date, I'll do it.
What do you think?
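For reference, the friend-function pattern under discussion could be sketched like this (hypothetical names, not the PR's actual declarations): the dump routine is declared a friend inside the class but defined in a separate debug-caps-only translation unit, so it can read private fields without cluttering the production code.

```cpp
#include <cstddef>

// Minimal stand-in for the statistics record being produced.
struct MemoryStatistics {
    size_t totalSize = 0;
};

class MemoryControlSketch {
public:
    explicit MemoryControlSketch(size_t total) : m_totalSize(total) {}
    // Declared here; the definition could live in a separate
    // debug-caps-only file (e.g. a *_debug.cpp translation unit).
    friend MemoryStatistics dumpStatistics(const MemoryControlSketch& mc);

private:
    size_t m_totalSize;  // private field the free friend function may read
};

// Definition that would live in the separate debug-caps file.
MemoryStatistics dumpStatistics(const MemoryControlSketch& mc) {
    return MemoryStatistics{mc.m_totalSize};
}
```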

Labels
category: CPU OpenVINO CPU plugin category: docs OpenVINO documentation

2 participants