Commit
Add ExpertParallel Mixture-of-Experts Plugin (#99)
* initial commit
* include prepare_scattermoe
* fixes and add scenarios-moe. Allow gradient_accum=null mode
* missed out on CONTENTS.yaml
* update readme, code cleanup, add comments and initial bench
* more cleanup and update pf bench
* add more comments and minor refactoring
* finish up comments
* add padding free to granite moe
* fmt and lint
* install workflow + more fmt + fix test
* go back to dtensors for sharded checkpoints
* add scattermoe checkpoint restorer utility
* fmt + lint
* more cleanup
* improved documentation on state dict inference
* add more tests on inferring checkpoint metadata
* update configs for mixtral
* update granite configs
* fix readme and update GraniteMoE to FOAK
* commit benches

---------

Signed-off-by: Yu Chin Fabian Lim <[email protected]>