This repository has been archived by the owner on Oct 25, 2024. It is now read-only.

update ipex api #1650

Draft · wants to merge 2 commits into main
Conversation

intellinjun (Collaborator) commented:

Type of Change

feature or bug fix or documentation or others
API changed or not: yes

Description

update ipex api
[Detailed description provided as a screenshot attached to the original PR.]

Signed-off-by: intellinjun <[email protected]>
github-actions bot commented on Jul 3, 2024:

⛈️ Required checks status: Has failure 🔴

Warning
If you do not have access to re-run the CI-Summary bot, please contact VincyZhang for help. If you push a new commit, all of the workflows will be re-triggered.

Groups summary

🟢 Format Scan Tests workflow
Check ID                 Status     Error details
format-scan (pylint)     success    —
format-scan (bandit)     success    —
format-scan (cloc)       success    —
format-scan (cpplint)    success    —

🔴 NeuralChat Unit Test
Check ID                         Status         Error details
neuralchat-unit-test-baseline    cancelled 🚫   —
neuralchat-unit-test-PR-test     failure        download
Generate-NeuralChat-Report       skipped        —

🟡 Chat Bot Test workflow
Check ID                                              Status       Error details
call-inference-llama-2-7b-chat-hf / inference test    no_status    —
call-inference-mpt-7b-chat / inference test           no_status    —

These checks are required after the changes to intel_extension_for_transformers/neural_chat/models/model_utils.py.


Thank you for your contribution! 💜

Note
This comment is automatically generated and will be updated every 180 seconds for the next 6 hours. If you have any other questions, contact VincyZhang or XuehaoSun for help.

a32543254 (Contributor) commented:

@changwangss could you help review this PR?

Review context from intel_extension_for_transformers/neural_chat/models/model_utils.py (the opening of the try block is implied by the except clause and closing arguments; the fallback call is truncated in the snippet):

    try:
        model = intel_ipex.optimize(
            model.eval(),
            dtype=torch_dtype,
            inplace=True,
            level="O1",
            auto_kernel_selection=True,
        )
    except AssertionError:
        model = intel_ipex.optimize(
            # ... (truncated in the review snippet)
changwangss (Collaborator) commented on Jul 3, 2024:

Could you also change it at line 849? As far as I know, ipex.llm.optimize is the API IPEX recommends for running bf16 LLM inference.
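
For reference, a minimal sketch of that recommended path. The checkpoint name is illustrative (borrowed from the Chat Bot Test workflow above), and the call follows the ipex.llm.optimize signature in recent IPEX releases; it is not the PR's code:

```python
import torch
import intel_extension_for_pytorch as ipex
from transformers import AutoModelForCausalLM

# Illustrative checkpoint; any causal LM follows the same pattern.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-chat-hf", torch_dtype=torch.bfloat16
)
model.eval()

# ipex.llm.optimize applies LLM-specific optimizations and is the path
# IPEX recommends for bf16 LLM inference, unlike the generic
# ipex.optimize call used in the current diff.
model = ipex.llm.optimize(model, dtype=torch.bfloat16, inplace=True)
```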

a32543254 (Contributor) left a comment:

LGTM

Signed-off-by: intellinjun <[email protected]>
a32543254 marked this pull request as draft on July 5, 2024 at 03:15.
a32543254 (Contributor) commented:

Converted to draft, since intel_ipex.llm.optimize makes the model unable to run.
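
One way to land the API update without that regression would be to guard the new call behind a fallback. A minimal sketch under stated assumptions: the helper name is hypothetical, the argument names follow the diff context above, and this is not the PR's actual fix:

```python
import torch
import intel_extension_for_pytorch as intel_ipex

def ipex_optimize_with_fallback(model, torch_dtype=torch.bfloat16):
    """Hypothetical helper: prefer ipex.llm.optimize, fall back to ipex.optimize."""
    try:
        # LLM-specific path recommended for bf16 inference.
        return intel_ipex.llm.optimize(model.eval(), dtype=torch_dtype, inplace=True)
    except (AssertionError, RuntimeError):
        # Generic path, mirroring the existing try/except in model_utils.py.
        return intel_ipex.optimize(
            model.eval(), dtype=torch_dtype, inplace=True, level="O1"
        )
```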
