Skip to content
This repository has been archived by the owner on Jan 13, 2025. It is now read-only.

Refactor level1 nrm2 operator #466

Merged
merged 2 commits into from
Oct 16, 2023

Conversation

s-Nick
Copy link
Collaborator

@s-Nick s-Nick commented Sep 27, 2023

This PR refactors the implementation of the nrm2 operator.

These changes are based on the work done for asum operator which was merged with PR #458. Now nrm2 implementation shares the same structure of asum adjusting the execution call to its needs.
Update for nrm2 each architecture backend file to manage the different kernel sizes and these numbers are set empirically using available hardware.

Copy link
Collaborator

@OuadiElfarouki OuadiElfarouki left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All looks neat to me thanks for the refacto!

Copy link
Collaborator

@muhammad-tanvir-1211 muhammad-tanvir-1211 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, Thanks.

The nrm2 implementation is reduction based, so now it shares the
same implementation of asum with its adjustments.
Preliminary update of all backend to support the newest changes.
@muhammad-tanvir-1211 muhammad-tanvir-1211 merged commit dd587dd into codeplaysoftware:master Oct 16, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants