Skip to content

Releases: triton-inference-server/model_analyzer

Release 1.45.0 corresponding to NGC container 24.10

29 Oct 15:38
6f7ea79
Compare
Choose a tag to compare
Update README for r24.10 (#939)

* Update README for Release 24.10

* Update README.md

Co-authored-by: Misha Chornyi <[email protected]>

* Revert "Update README.md"

This reverts commit 682c999fbd35b514cb7ae5f52b1ecb842651365e.

---------

Co-authored-by: Misha Chornyi <[email protected]>
Co-authored-by: Misha Chornyi <[email protected]>

Release 1.44.0 corresponding to NGC container 24.09

27 Sep 16:51
046f9e6
Compare
Choose a tag to compare
Update README for r24.09 (#934)

* Update README for r24.09

* Update branch name in README

Release 1.43.0 corresponding to NGC container 24.08

27 Aug 18:03
4a3412e
Compare
Choose a tag to compare
v1.43.0

Update README.md for 24.08 Release. (#928)

Release 1.42.0 corresponding to NGC container 24.07

24 Jul 19:33
2b6668c
Compare
Choose a tag to compare

New Features and Improvements

  • Optuna search mode
    • Allows you to search any parameter that can be specified in the model configuration, using a hyperparameter optimization framework

Release 1.41.0 corresponding to NGC container 24.06

28 Jun 00:32
6213afe
Compare
Choose a tag to compare
v1.41.0

Update README.md for 24.06 (#905)

Release 1.40.0 corresponding to NGC container 24.05

25 May 02:04
ce28ec9
Compare
Choose a tag to compare
Update README and versions for 1.40.0 / 24.05 (#883)

* Update README and versions for 1.40.0 / 24.05

Release 1.39.0 corresponding to NGC container 24.04

30 Apr 17:46
01f9e1a
Compare
Choose a tag to compare

New Features and Improvements

  • Model Analyzer now supports profiling Large Language Models (LLMs) using GenAI-Perf

Release 1.38.0 corresponding to NGC container 24.03

27 Mar 17:01
39dd647
Compare
Choose a tag to compare
v1.38.0

Update README.md for 1.38.0 / 24.03 (#848)

Release 1.37.0 corresponding to NGC container 24.02

01 Mar 01:12
5366d6f
Compare
Choose a tag to compare
v1.37.0

Update README.md for 24.02 (#830)

Release 1.36.0 corresponding to NGC container 24.01

30 Jan 01:17
4e45d78
Compare
Choose a tag to compare

New Features and Improvements

  • Model Analyzer now correctly loads and optimizes ensemble models
  • Model Analyzer now correctly works with SSL via gRPC
  • Model Analyzer now handles the case of optimizing a model on a remote Triton server without requiring a local GPU