[TRANSFORMATIONS] SDPAToPagedAttention transformation: support decompression case in the Qwen-7b-Chat pattern #28514

Open
wants to merge 2 commits into
base: releases/2025/0

CuriousPanCake
Contributor

When Qwen-7b-Chat is executed in lower precision, the model contains a
decompression step, which adds extra Convert operations to the graph
(i.e. FP16 to FP32).

Handle this optional Convert in PositionIDsReplacerQwen.

Added a unit test for this case.
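
The idea of matching a pattern with an optional Convert can be illustrated with a minimal, self-contained sketch. This is a toy model in Python, not the OpenVINO pattern API and not the actual PositionIDsReplacerQwen pattern: the `Node` class, the helper names, and the `Multiply`/`Slice` op layout are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """A toy graph node: an op type plus the nodes feeding it (hypothetical)."""
    op: str
    inputs: List["Node"] = field(default_factory=list)


def skip_optional(node: Node, op: str) -> Node:
    """Return the producer of `node` if `node` is a single-input `op`, else `node`."""
    if node.op == op and len(node.inputs) == 1:
        return node.inputs[0]
    return node


def match_slice_source(root: Node) -> Optional[Node]:
    """Match Multiply(data, [Convert](Slice)) and return the Slice node.

    The Convert between Slice and Multiply is treated as optional, so one
    matcher covers both the plain graph and the decompressed graph.
    The op layout here is an assumption made for this sketch.
    """
    if root.op != "Multiply" or len(root.inputs) != 2:
        return None
    candidate = skip_optional(root.inputs[1], "Convert")
    return candidate if candidate.op == "Slice" else None


# Build the two graph variants around the same Slice node.
slice_node = Node("Slice")
plain = Node("Multiply", [Node("Parameter"), slice_node])
decompressed = Node("Multiply", [Node("Parameter"), Node("Convert", [slice_node])])
```

With this sketch, `match_slice_source(plain)` and `match_slice_source(decompressed)` both return the same `slice_node`, while a graph with an unrelated op in place of `Slice` does not match. A unit test for the decompression case can therefore reuse the reference graph and differ only in the inserted Convert.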

Tickets:

Signed-off-by: Andrii Staikov [email protected]
Signed-off-by: Ivan Tikhonov [email protected]

…ression case in the Qwen-7b-Chat pattern (openvinotoolkit#28493)

When Qwen-7b-Chat is executed in lower precision, the model contains a
decompression step, which adds extra Convert operations to the graph
(i.e. FP16 to FP32).

Handle this optional Convert in PositionIDsReplacerQwen.

Added a unit test for this case.

### Tickets:
 - *CVS-157308*

Signed-off-by: Andrii Staikov <[email protected]>
Signed-off-by: Ivan Tikhonov <[email protected]>

---------

Co-authored-by: Ivan Tikhonov <[email protected]>
@github-actions github-actions bot added the "category: transformations" (OpenVINO Runtime library - Transformations) label Jan 17, 2025
@itikhono itikhono marked this pull request as ready for review January 17, 2025 11:16
@itikhono itikhono requested a review from a team as a code owner January 17, 2025 11:16
@itikhono itikhono requested review from itikhono and removed request for a team January 17, 2025 11:16
@itikhono itikhono added this to the 2025.0 milestone Jan 17, 2025
Labels
category: transformations OpenVINO Runtime library - Transformations

2 participants