Feature idea: perf analyzer behavior control #808

Talavig · 2024-01-07T16:09:27Z

We would like to suggest an idea we thought about and we do not think is currently available in model analyzer or perf analyzers:
When using model analyzer, we want to replicate the behavior of the prod environment when testing. For example, we want to send a lot of requests at once (a local spike of usage), or a gradual, more controlled increased in requests in specific timestamps, or any other pattern of usage.
We would like to propose the following solution: add a field in the config file with that can take on of two kinds of values: either keywords that represent different usage behaviors like spikes or gradual increase, or a path to a file. This file can contain a json generated by the customer with specific "directions" to the wanted behavior. For example, for 3 seconds send x request and then send a spike of requests.
Is such a feature possible to implement? It can be very valuable to us.

tgerdesnv · 2024-01-11T14:26:17Z

I do believe that Perf Analyzer supports something similar to what you are asking for via custom interval mode:
https://github.com/triton-inference-server/client/blob/main/src/c%2B%2B/perf_analyzer/docs/inference_load_modes.md#custom-interval-mode

Talavig · 2024-01-15T14:44:32Z

We tried using this option through model analyzer, and got the following error:

Cannot use --concurrency-range or --request-rate-range with --request-intervals.

We added it to the config.yaml in the following way:

perf_analyzer_flags:
  {request-intervals: ./time_intervals.txt}

Do you have any idea why this may happen?

tgerdesnv · 2024-01-16T16:06:23Z

That is likely a bug :(. MA generally specifies the concurrency with every request and probably needs to be updated to not specify it when request-intervals are specified. I don't think it will be hard to fix. I'll try to reproduce and fix today.

tgerdesnv · 2024-01-16T21:49:39Z

@Talavig Are you running into this from a brute search? Or quick search?

Talavig · 2024-01-17T08:50:37Z

We have originally encountered it using brute search, but we have tried using quick search as well and encountered the same error.

eladamittai · 2024-03-20T11:20:47Z

Hello, I'm currently experiencing the same issue with model analyzer 24.02. Is there an update on the progress or a certain thing I need to do in the yaml to enable it?

YaliEkstein · 2024-03-25T08:26:22Z

That is likely a bug :(. MA generally specifies the concurrency with every request and probably needs to be updated to not specify it when request-intervals are specified. I don't think it will be hard to fix. I'll try to reproduce and fix today.

Hey, I as well try to use it and the bug still exists. Is there any news on an update or a solve to this issue?

tgerdesnv · 2024-03-25T18:29:46Z

I am going to try to find time to get this fixed this week

eladamittai · 2024-03-25T18:32:37Z

That will be awesome! Thanks for the update! Can you tell whether it'll come out as part of the 24.03 image?

tgerdesnv · 2024-03-25T19:42:30Z

It will definitely not be 24.03. That being said, once I get a fix in you can always work with the main branch if you want it asap:
https://github.com/triton-inference-server/model_analyzer/blob/main/docs/install.md#alternative-installation-methods

eladamittai · 2024-03-25T19:52:28Z

Great! Thank you so much!

tgerdesnv · 2024-03-30T21:33:17Z

This work is still in-progress

eladamittai · 2024-05-07T06:28:26Z

Hey, is there an update?

eladamittai · 2024-07-01T15:32:03Z

@tgerdesnv hey, is the feature done?

tgerdesnv linked a pull request Jan 17, 2024 that will close this issue

Add support for custom intervals #814

Open

dyastremsky added the enhancement New feature or request label Mar 19, 2024

tgerdesnv mentioned this issue Mar 27, 2024

use of time intervals in model analyzer #850

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature idea: perf analyzer behavior control #808

Feature idea: perf analyzer behavior control #808

Talavig commented Jan 7, 2024

tgerdesnv commented Jan 11, 2024 •

edited

Loading

Talavig commented Jan 15, 2024

tgerdesnv commented Jan 16, 2024

tgerdesnv commented Jan 16, 2024

Talavig commented Jan 17, 2024

eladamittai commented Mar 20, 2024

YaliEkstein commented Mar 25, 2024

tgerdesnv commented Mar 25, 2024

eladamittai commented Mar 25, 2024

tgerdesnv commented Mar 25, 2024

eladamittai commented Mar 25, 2024

tgerdesnv commented Mar 30, 2024

eladamittai commented May 7, 2024

eladamittai commented Jul 1, 2024

Feature idea: perf analyzer behavior control #808

Feature idea: perf analyzer behavior control #808

Comments

Talavig commented Jan 7, 2024

tgerdesnv commented Jan 11, 2024 • edited Loading

Talavig commented Jan 15, 2024

tgerdesnv commented Jan 16, 2024

tgerdesnv commented Jan 16, 2024

Talavig commented Jan 17, 2024

eladamittai commented Mar 20, 2024

YaliEkstein commented Mar 25, 2024

tgerdesnv commented Mar 25, 2024

eladamittai commented Mar 25, 2024

tgerdesnv commented Mar 25, 2024

eladamittai commented Mar 25, 2024

tgerdesnv commented Mar 30, 2024

eladamittai commented May 7, 2024

eladamittai commented Jul 1, 2024

tgerdesnv commented Jan 11, 2024 •

edited

Loading