Releases · bytedance/lightseq

It's been a long time since our last release (v2.2.0). Over the past year, we have focused on int8 quantization.

In this release, LightSeq supports int8 quantized training and inference. Compared with PyTorch QAT, LightSeq int8 training achieves a 3x speedup without any performance loss. Compared with the previous LightSeq fp16 inference, the int8 engine achieves a speedup of up to 1.7x.
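For readers new to quantization-aware training, the sketch below shows the general idea of int8 "fake quantization" in plain Python: values are rounded to int8 codes and mapped back, so the model sees int8 rounding error during training. It uses a generic symmetric per-tensor scheme chosen purely for illustration. The function names are hypothetical, and this is not LightSeq's kernel code or necessarily the scheme it uses.

```python
def int8_scale(values):
    """Pick a symmetric per-tensor scale so the largest magnitude maps to 127."""
    max_abs = max(abs(v) for v in values)
    return max_abs / 127 if max_abs > 0 else 1.0


def quantize_int8(values, scale):
    """Map floats to integer codes, clamped to the symmetric int8 range."""
    return [max(-127, min(127, round(v / scale))) for v in values]


def dequantize_int8(codes, scale):
    """Map integer codes back to approximate floats."""
    return [c * scale for c in codes]


def fake_quantize(values):
    """Quantize then dequantize, so the result carries int8 rounding error."""
    scale = int8_scale(values)
    return dequantize_int8(quantize_int8(values, scale), scale)


weights = [0.0, 0.3, -1.0, 1.0]
approx = fake_quantize(weights)
```

Each value in approx differs from the original weight by at most half a quantization step, which is the error the training process learns to tolerate.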

The LightSeq int8 engine supports multiple models, such as Transformer, BERT, and GPT. For int8 training, users only need to switch the model to quantization mode with model.apply(enable_quant). For int8 inference, users only need to use QuantTransformer in place of the fp16 Transformer.
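Assuming the model is a PyTorch nn.Module, the one-line switch works because Module.apply(fn) calls fn on every submodule and then on the module itself. The sketch below mimics that traversal with a small stand-in class so it runs without PyTorch or LightSeq. The Layer class, the quant_mode flag, and the body of enable_quant shown here are hypothetical illustrations, not LightSeq's implementation.

```python
class Layer:
    """Stand-in for torch.nn.Module, reduced to what apply() needs."""

    def __init__(self, name, *children):
        self.name = name
        self.children = list(children)
        self.quant_mode = False  # hypothetical flag, for illustration only

    def apply(self, fn):
        # Same order as torch.nn.Module.apply: children first, then self.
        for child in self.children:
            child.apply(fn)
        fn(self)
        return self


def enable_quant(layer):
    """Hypothetical callback: turn on quantization for one layer."""
    layer.quant_mode = True


def all_layers(layer):
    """Yield a layer and all of its descendants."""
    yield layer
    for child in layer.children:
        yield from all_layers(child)


model = Layer(
    "transformer",
    Layer("encoder", Layer("attention"), Layer("ffn")),
    Layer("decoder", Layer("attention"), Layer("ffn")),
)
model.apply(enable_quant)  # one call reaches every nested layer
```

Because the callback is applied recursively, the caller does not need to know how deeply the layers are nested.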

Other changes include support for models such as MoE, bug fixes, and performance improvements.

It's been a long time since our last release (v1.2.0). For the past six months, we have focused on training efficiency.

In this release, LightSeq supports fast training for models in the Transformer family!

We provide highly optimized custom operators for PyTorch and TensorFlow, which cover the entire training process for Transformer-based models. Users of LightSeq can use these operators to build their own models with efficient computation.

In addition, we integrate our custom operators into popular training libraries such as Fairseq, Hugging Face, and NeurST, which enables a 1.5x-3x end-to-end speedup compared to the native versions.

With only a small amount of code, you can enjoy the excellent performance provided by LightSeq. Try it now!

Training

support lightseq-train to accelerate Fairseq training, including an optimized Transformer model, Adam, and label-smoothed loss
Hugging Face BERT training example
NeurST Transformer training example for TensorFlow users

Inference

support GPT Python wrapper
inference APIs are moved to lightseq.inference

This release changes the inference API: all inference APIs have moved to lightseq.inference. For example, use import lightseq.inference and model = lightseq.inference.Transformer("$PB_PATH", max_batch_size).
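A minimal sketch of the new import location, assuming LightSeq is installed and a model has already been exported to a protobuf file. The constructor call follows the example in the release note; the helper names and the batch-size check are hypothetical additions, and the import is deferred so the file can still be loaded on machines without LightSeq.

```python
import importlib.util


def lightseq_available() -> bool:
    """Return True if the lightseq package can be found in this environment."""
    return importlib.util.find_spec("lightseq") is not None


def load_transformer(pb_path: str, max_batch_size: int):
    """Load an exported Transformer through the relocated inference API."""
    # Hypothetical sanity check, not part of LightSeq itself.
    if max_batch_size <= 0:
        raise ValueError("max_batch_size must be a positive integer")

    # Inference APIs live under lightseq.inference as of this release.
    import lightseq.inference

    return lightseq.inference.Transformer(pb_path, max_batch_size)
```

Code written against the earlier import path needs only the import line changed; the constructor arguments shown in the release note stay the same.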

Releases: bytedance/lightseq

Support HIP

Release 3.0.1

What's Changed

Contributors

Release 3.0.0

Release 2.2.0

Inference

Fixes

Release 2.1.3

Training

Inference

Fixes

Release 2.1.0

Training

Inference

Fixes

Release 2.0.2

Release 2.0.1

Release 2.0.0

Training

Inference

Release 1.2.0