-
Notifications
You must be signed in to change notification settings - Fork 9
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Disable MLP Fused Ops if Not SwiGLU, Depracted Fast Quantized Peft Pl…
…ugin, Update Benchmarks (#106) * disable MLP fused op for non-silu, and removed all qpeft plugin Signed-off-by: Yu Chin Fabian Lim <[email protected]> * fix the filter drops rule Signed-off-by: Yu Chin Fabian Lim <[email protected]> * fix all models Signed-off-by: Yu Chin Fabian Lim <[email protected]> * fix Signed-off-by: Yu Chin Fabian Lim <[email protected]> * accurately set trl in bnb qpeft fix and file rename Signed-off-by: Yu Chin Fabian Lim <[email protected]> * update bench Signed-off-by: Yu Chin Fabian Lim <[email protected]> --------- Signed-off-by: Yu Chin Fabian Lim <[email protected]>
- Loading branch information
Showing
13 changed files
with
391 additions
and
370 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
192 changes: 0 additions & 192 deletions
192
...s/fused-ops-and-kernels/src/fms_acceleration_foak/framework_plugin_fast_quantized_peft.py
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.