fix(deps): update dependency bitsandbytes to ^0.44.0 #169

Open · wants to merge 1 commit into master

Conversation

renovate[bot] (Contributor) commented on Oct 30, 2024

This PR contains the following updates:

Package: bitsandbytes
Change: ^0.43.1 -> ^0.44.0

Release Notes

TimDettmers/bitsandbytes (bitsandbytes)

v0.44.1

Compare Source

What's Changed

Full Changelog: bitsandbytes-foundation/bitsandbytes@0.44.0...0.44.1

v0.44.0: New AdEMAMix optimizer, Embeddings quantization, and more!

Compare Source

New optimizer: AdEMAMix

The AdEMAMix optimizer is a modification of AdamW that tracks two EMAs of past gradients to better leverage gradient history. This allows for faster convergence with less training data and improved resistance to forgetting.

We've implemented 8bit and paged variations: AdEMAMix, AdEMAMix8bit, PagedAdEMAMix, and PagedAdEMAMix8bit. These can be used with a similar API to existing optimizers.

import bitsandbytes as bnb

# AdEMAMix takes a third beta for the slow EMA and an alpha coefficient
# that weights its contribution to the update
optimizer = bnb.optim.PagedAdEMAMix8bit(
    model.parameters(),
    lr=1e-4,
    betas=(0.9, 0.999, 0.9999),
    alpha=5.0,
    eps=1e-8,
    weight_decay=1e-2,
)

8-bit Optimizers Update

The block size for all 8-bit optimizers has been reduced from 2048 to 256 in this release. This departs from the block size used in the implementation proposed in the original paper and improves accuracy.
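
As a quick illustration (not taken from the release notes), the 8-bit optimizers remain drop-in replacements for their 32-bit counterparts, and the smaller block size applies transparently. A minimal sketch, assuming a CUDA device is available:

import torch
import bitsandbytes as bnb

model = torch.nn.Linear(512, 512).cuda()

# drop-in replacement for torch.optim.Adam; optimizer state is quantized
# to 8 bits block-wise
optimizer = bnb.optim.Adam8bit(model.parameters(), lr=1e-3)

loss = model(torch.randn(4, 512, device="cuda")).sum()
loss.backward()
optimizer.step()
optimizer.zero_grad()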

CUDA Graphs support

A fix to enable CUDA Graphs capture of kernel functions was made in #1330. This allows for performance improvements with inference frameworks like vLLM. Thanks @jeejeelee!
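
For context, here is a minimal sketch of the standard PyTorch capture/replay pattern around a bitsandbytes 8-bit linear layer; the layer choice, shapes, and warmup counts are illustrative assumptions, not taken from the release notes:

import torch
import bitsandbytes as bnb

# illustrative 8-bit inference layer; requires a CUDA device
layer = bnb.nn.Linear8bitLt(64, 64, has_fp16_weights=False).half().cuda()
x = torch.randn(8, 64, device="cuda", dtype=torch.float16)

# warm up on a side stream before capture (standard CUDA Graphs practice)
s = torch.cuda.Stream()
s.wait_stream(torch.cuda.current_stream())
with torch.cuda.stream(s), torch.no_grad():
    for _ in range(3):
        layer(x)
torch.cuda.current_stream().wait_stream(s)

# capture one forward pass, then replay it after writing new data into x
g = torch.cuda.CUDAGraph()
with torch.cuda.graph(g), torch.no_grad():
    y = layer(x)

x.copy_(torch.randn_like(x))
g.replay()  # y now holds the output for the new contents of x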

Quantization for Embeddings

The trend of LLMs using larger vocabularies continues, and the embeddings can take up a significant portion of a quantized model's footprint. We now have implementations of Embedding4bit and Embedding8bit thanks to @galqiwi!

Example usage:

import torch
import torch.nn as nn

from bitsandbytes.nn import Embedding4bit

# full-precision reference embedding and its 4-bit counterpart
fp16_module = nn.Embedding(128, 64)
quantized_module = Embedding4bit(128, 64)

# copy the trained weights, then move to GPU; quantization happens on transfer
quantized_module.load_state_dict(fp16_module.state_dict())

quantized_module = quantized_module.to(0)
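
Building on the snippet above (a hypothetical follow-on, not part of the release notes), the quantized table can then be used like a regular nn.Embedding, as long as the indices live on the same device:

# illustrative lookup with the quantized table created above
indices = torch.randint(0, 128, (2, 8), device=0)
embedded = quantized_module(indices)
print(embedded.shape)  # torch.Size([2, 8, 64])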

Continuous Builds

We are now building binary wheels for each change on main. These builds can be used to preview upcoming changes.

🚤 Continuous Build

What's Changed

New Contributors

Full Changelog: bitsandbytes-foundation/bitsandbytes@0.43.3...v0.44.0


Configuration

📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.
