Releases: chroma-core/chroma
Latest
Version: 0.5.19.dev62
Git ref: refs/heads/main
Build Date: 2024-11-16T01:45
PIP Package: chroma-0.5.19.dev62.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.19.dev62
DockerHub Image: chromadb/chroma:0.5.19.dev62
0.5.18
Version: 0.5.18
Git ref: refs/tags/0.5.18
Build Date: 2024-11-05T19:15
PIP Package: chroma-0.5.18.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.18
DockerHub Image: chromadb/chroma:0.5.18
What's Changed
- [ENH]: refine some default arguments for foyer by @MrCroxx in #3037
- [BUG] Fix NAC getting stuck by @sanketkedia in #2991
- [ENH] Remove collection_id from get_collection by @drewkim in #3040
- [TST] Move test utilities to individual crates by @Sicheng-Pan in #3041
- [ENH] Output block size when writing to storage. by @rescrv in #3042
- [ENH] Add fastapi middleware for http.disconnect by @HammadB in #3038
- [CHORE] More details logged to understand compaction. by @rescrv in #3050
- [CHORE] Bump github actions/checkout to v4. by @rescrv in #3053
- [CLN] Remove quota_rules from QuotaProvider by @drewkim in #3054
- [CHORE] Log block id on cold get. by @rescrv in #3055
- [DOCS] Update CIP Chroma Improvement Proposals reference by @emmanuel-ferdman in #3056
- [BUG] Convert OpenCLIP embeddings to numpy arrays by @itaismith in #3051
- [ENH]: replace
get_block_ids_*
withget_block_ids_range()
in SparseIndex by @codetheweb in #2921 - [ENH]: replace
get_*
methods on Arrow blocks withget_range()
by @codetheweb in #2934 - [ENH]: replace
get_*
methods on memory blockfile impl withget_range()
by @codetheweb in #2935 - [ENH]: replace
.get_*
methods on blockfile API with.get_range()
by @codetheweb in #2936 - [ENH]: make
get_range()
a lazy Stream by @codetheweb in #3009 - [ENH] Comment about integer division. by @rescrv in #3059
- [CLN] Remove QuotaProvider by @drewkim in #3061
- [RELEASE] 0.5.18 by @itaismith in #3063
New Contributors
- @emmanuel-ferdman made their first contribution in #3056
Full Changelog: 0.5.17...0.5.18
0.5.17
Version: 0.5.17
Git ref: refs/tags/0.5.17
Build Date: 2024-10-31T21:00
PIP Package: chroma-0.5.17.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.17
DockerHub Image: chromadb/chroma:0.5.17
What's Changed
- [ENH] In-process filtering of foyer traces under 1ms. by @rescrv in #3014
- [BUG] Buffer unordered to limit NAC by @HammadB in #3019
- [CLN] Update chat-with-docs example by @itaismith in #3021
- Docs update by @itaismith in #3023
- [CLN] Docs Tweaks by @itaismith in #3024
- [CLN] Clean up log service migration template by @HammadB in #3016
- [ENH] Azure Deployment Guide by @itaismith in #3012
- [CLN] make ValueError and TypeError present as InvalidArgumentError by @HammadB in #3017
- [ENH] Latency histograms for get/insert/remove/clear of cache. by @rescrv in #3018
- [CLN] Separate validation and transformation logic by @atroyn in #2899
- [ENH] Rendezvous hashing support replication by @HammadB in #2969
- [ENH] Sometimes simulation disagrees with math by @HammadB in #3030
- [CLN] Fix broken links in LlamaIndex docs by @itaismith in #3034
- [BUG] Remove unnecessary call to
get_metadata
for query by @Sicheng-Pan in #3035 - [RELEASE] 0.5.17 by @itaismith in #3036
Full Changelog: 0.5.16...0.5.17
0.5.16
Version: 0.5.16
Git ref: refs/tags/0.5.16
Build Date: 2024-10-29T00:33
PIP Package: chroma-0.5.16.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.16
DockerHub Image: chromadb/chroma:0.5.16
What's Changed
- [ENH] Add metadata to block by @HammadB in #2915
- [ENH] Add blockfile root by @HammadB in #2916
- [ENH] Sparse Index Reader/Writer Split by @HammadB in #2918
- [CLN] Refactor sysdb and log service by @sanketkedia in #2893
- [ENH] Add count index by @HammadB in #2938
- [BUG] Fix migration out of order by @sanketkedia in #2965
- [ENH] Production guides update by @itaismith in #2966
- [ENH] Bump foyer to 0.12 by @MrCroxx in #2955
- Update Chart.yaml by @HammadB in #2970
- [BUG] fix blockfile migration by @HammadB in #2971
- [CHORE] Remove unused import by @eculver in #2973
- [ENH] HTTP API routes overhaul by @drewkim in #2954
- [ENH]: disable Posthog profiles by @codetheweb in #2974
- [TST] Consolidate shared package version helpers by @drewkim in #2975
- [ENH] Add tenant/database to Collection operations by @drewkim in #2982
- [ENH] Stand up @rate_limit in front of add and query by @drewkim in #2980
- [ENH] Stand up @rate_limit in front of all operations by @drewkim in #2986
- [CHORE] Update host path for disk cache by @eculver in #2972
- [BUG] Ensure tenant/database are threaded through everywhere by @drewkim in #2989
- [BUG]: fix sparse index writer panic by @codetheweb in #2976
- [TST]: update test_persist to create 1+ collections by @codetheweb in #2933
- [CHORE] Add QuotaError as a ChromaError by @drewkim in #2990
- [CHORE] cleanup spammy log line. by @rescrv in #2994
- [CHORE] enable tracing for our crates by @rescrv in #2993
- [ENH] Enable tracing::instrument for foyer cache calls. by @rescrv in #2996
- [ENH] Configure tracing via RUST_LOG. by @rescrv in #2999
- [ENH] Fix error propagation in sysdb by @sanketkedia in #2896
- [CHORE] Cleanup clippy warnings for Rust 1.82 by @rescrv in #3000
- [CLN] Update QuotaEnforcer to be naive by @drewkim in #2997
- [CHORE] Set chroma-cache to error-level tracing. by @rescrv in #3005
- [CHORE] Update docker to avoid critical authz regression by @rescrv in #3002
- [CHORE] upgrade rustls and h2 to avoid bugs by @rescrv in #3001
- [BUG] Silence NotFound HSNW errors. by @rescrv in #3004
- [CHORE] having chroma=tracing enabled tracing for chroma-cache too. by @rescrv in #3006
- [PERF] Flush blocks in parallel by @HammadB in #3008
- [ENH] Add typed Uuids to separate IndexUuid from CollectionUuid. by @rescrv in #3007
- [CHORE] Upgrade tracing libraries to latest. by @rescrv in #3013
- [RELEASE] 0.5.16 by @itaismith in #3015
New Contributors
Full Changelog: 0.5.15...0.5.16
0.5.15
Version: 0.5.15
Git ref: refs/tags/0.5.15
Build Date: 2024-10-16T08:58
PIP Package: chroma-0.5.15.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.15
DockerHub Image: chromadb/chroma:0.5.15
What's Changed
- [CHORE] Bump rust version and CI/CD to 1.81.0 by @rescrv in #2958
- [RELEASE] 0.5.15 by @itaismith in #2959
Full Changelog: 0.5.14...0.5.15
0.5.13
Version: 0.5.13
Git ref: refs/tags/0.5.13
Build Date: 2024-10-10T23:05
PIP Package: chroma-0.5.13.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.13
DockerHub Image: chromadb/chroma:0.5.13
What's Changed
- [ENH] Make block and sparse index caches use persistent type. by @rescrv in #2904
- [ENH] Use binary search for gt/gte/lt/lte by @Sicheng-Pan in #2894
- [ENH] Deployment guide updates by @itaismith in #2906
- [BUG]: Lint job - python setup failure by @tazarov in #2924
- [BUG]: Log purge correctness by @tazarov in #2923
- [CLN] Make delete return None by @itaismith in #2880
- [BUG] Remove callouts to discord production support in docs by @itaismith in #2929
- [RELEASE] 0.5.13 by @itaismith in #2930
Full Changelog: 0.5.12...0.5.13
0.5.12
Version: 0.5.12
Git ref: refs/tags/0.5.12
Build Date: 2024-10-08T21:42
PIP Package: chroma-0.5.12.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.12
DockerHub Image: chromadb/chroma:0.5.12
What's Changed
- [CHORE] Switch to debug_struct over f.write by @HammadB in #2867
- [CHORE] Cleanup some clippy errors and all warnings for rust/types by @rescrv in #2858
- [CHORE] Remove the need to pop from the disk cache during compaction by @rescrv in #2865
- [CHORE] clean up cache, distance, index, storage clippy warnings by @rescrv in #2860
- [CHORE] Make a ChromaError type for RuntimeTypeError by @rescrv in #2866
- [ENH] We own the
chroma
crate by @rescrv in #2870 - hosted in early access by @jeffchuber in #2874
- [CHORE] Clippy for more. by @rescrv in #2871
- [ENH]: 21x faster full text querying by @codetheweb in #2833
- [ENH] Implement
SignedRoaringBitmap
and update where/where_document AST by @Sicheng-Pan in #2846 - [ENH] Implement
MetadataProvider
,RoaringMetadataFilter
, and refactorMetadataFilteringOperator
by @Sicheng-Pan in #2847 - [CLN] Refactor
MergeMetadataResultsOperator
by @Sicheng-Pan in #2848 - [CLN] Cleanup codebase with the refactored metadata filtering pipeline by @Sicheng-Pan in #2849
- [CHORE] Finish clippy work. Enable it. Make docs not warn too. by @rescrv in #2879
- [ENH] Implement serde for SparseIndex by @HammadB in #2881
- [ENH] Change the semantics of
$ne
,$nin
, and$not_contains
for local Chroma by @Sicheng-Pan in #2875 - [CHORE] Cleanup clippy warnings from merges pre-clippy. by @rescrv in #2885
- [TST] Move clippy into the lint job by @HammadB in #2886
- [ENH]: add tenant, database, and collection IDs to trace when available by @codetheweb in #2889
- [TST] Move rust benchmark crate to test crate by @Sicheng-Pan in #2900
- [ENH] Disk and memory-backed cache with Foyer 0.10. by @rescrv in #2890
- [TST] Implement utility functions to instantiate blockfile provider by @Sicheng-Pan in #2901
- [TST] Bound hypothesis version for breaking changes by @Sicheng-Pan in #2907
- [TST] Implement utility functions to instantiate segment by @Sicheng-Pan in #2902
- [ENH] RateLimitEnforcer abstract class by @drewkim in #2898
- [ENH] Automatically set version release as the latest release on Github by @itaismith in #2908
- [TST] Benchmark metadata filtering by @Sicheng-Pan in #2903
- [TST] Enable tests for $ne, $nin, and $not_contains for hosted Chroma by @Sicheng-Pan in #2909
- [RELEASE] 0.5.12 by @itaismith in #2913
Full Changelog: 0.5.11...0.5.12
0.5.11
Version: 0.5.11
Git ref: refs/tags/0.5.11
Build Date: 2024-09-27T00:35
PIP Package: chroma-0.5.11.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.11
DockerHub Image: chromadb/chroma:0.5.11
What's Changed
- [CHORE] cleanup Debug of HnswIndexProvider by @rescrv in #2864
- [CHORE] Cleanup clippy warnings in rust/blockstore by @rescrv in #2859
- [PERF] Convert embeddings representation to numpy by @drewkim in #2803
- [RELEASE] 0.5.11 by @itaismith in #2869
Full Changelog: 0.5.10...0.5.11
0.5.10
Version: 0.5.10
Git ref: refs/tags/0.5.10
Build Date: 2024-09-27T00:28
PIP Package: chroma-0.5.10.tar.gz
Github Container Registry Image: ghcr.io/chroma-core/chroma:0.5.10
DockerHub Image: chromadb/chroma:0.5.10
What's Changed
- [RELEASE] 0.5.10 by @itaismith in #2862
Full Changelog: 0.5.9...0.5.10
0.5.7
What's Changed
- [ENH] Add support for FE tolerations by @HammadB in #2560
- [ENH] Support FE replica count by @HammadB in #2561
- [ENH] fix types export for CJS consumers/NPM types badge by @codetheweb in #2563
- [ENH] CIP: Write-Ahead Log Pruning & Vacuuming by @codetheweb in #2498
- [BUG]: Pydantic 1.9+ compatibility by @tazarov in #2229
- [BUG] HNSW Rust params should have defaults by @HammadB in #2564
- [ENH] enable retries on all gRPC channels by @codetheweb in #2572
- [TST] add hypothesis collection id to logs by @sanketkedia in #2573
- [ENH] return
chroma-trace-id
header, include trace ID in thrown errors by @codetheweb in #2570 - [ENH] Connection pool FD leak v2 by @tazarov in #2014
- [ENH] Properly display memberlist in SysDB by @Ishiihara in #2565
- [CHORE] Add logs to memberlist by @HammadB in #2583
- [CHORE] Add more debugs to memberlist by @HammadB in #2584
- [PERF] Trace auth requests by @HammadB in #2589
- [ENH] simplify logic for when to persist index changes (re-apply with fix) by @codetheweb in #2545
- [ENH] add
.clean_log()
to Producers by @codetheweb in #2549 - [ENH] automatically clean WAL by @codetheweb in #2557
- [ENH] add vacuum CLI command by @codetheweb in #2519
- [BUG]: Pydantic 1.x Union Types by @tazarov in #2577
- [PERF] Add sysdb catalog traces by @HammadB in #2593
- [ENH] Add sysdb cpu/memory requests and limits to helm chart by @HammadB in #2596
- [BUG] Sysdb k8s resource typo by @HammadB in #2600
- [ENH] JS Client Refactor (Full Stack) by @AlabasterAxe in #2542
- [ENH] add tracing to gorm by @codetheweb in #2606
- [ENH] propogate nested error message by @codetheweb in #2608
- [BUG] fix nil dereference during error log by @codetheweb in #2609
- [ENH] prefetch APIs for Record segment and blockfile by @sanketkedia in #2547
- [ENH] Dispatch prefetch() as operator by @sanketkedia in #2562
- [ENH] Tag operators as IO or other + dispatcher schedules IO tasks on the main runtime by @sanketkedia in #2603
- [TST] Isolated Test Environments using testcontainers by @AlabasterAxe in #2607
- [ENH] multipart S3 file uploads by @codetheweb in #2590
- [CLN] Refactor rust codebase into many crates by @HammadB in #2618
- [ENH]: only use multipart upload if object size > part size by @codetheweb in #2622
- [CLN] Move distance, and index into their own crates by @HammadB in #2624
- [BUG]: fix nightly test workflow by @codetheweb in #2635
- [BLD]: fix JS dev release workflow by @codetheweb in #2636
- [BUG] fix multi collection log purge by @codetheweb in #2617
- [ENH]: don't allow a null collection ID on segments by @codetheweb in #2626
- [CHORE] Move pull log trace record count to structured field by @HammadB in #2642
- [PERF] Make pull logs an I/O operator so it runs on main runtime by @HammadB in #2641
- [BUG] S3 client not respecting timeout by @HammadB in #2643
- [ENH] gRPC segment requests should include the collection ID by @codetheweb in #2613
- [PERF]: bump compaction partition size to 10k by @codetheweb in #2647
- [ENH]: allow configuring replica count of compaction service by @codetheweb in #2650
- [BUG]: fix tracing for IO tasks by @codetheweb in #2652
- [ENH]: fix compactor traces, add additional info to trace by @codetheweb in #2653
- [ENH]: add record count to span when submitting embeddings by @codetheweb in #2654
- [ENH] fix embedding validation span pollution by @codetheweb in #2658
- [ENH]: lift frontend gRPC retry to application & trace by @codetheweb in #2659
- [ENH] Fetch blocks in parallel and make load_blocks take a slice instead by @HammadB in #2662
- [CLN] Only use distance avx/neon/sse if enabled by @HammadB in #2664
- [BLD]: fix nightly tests pod log upload by @codetheweb in #2670
- [ENH] Make Rust/C++ FFI error handling robust by @HammadB in #2667
- [ENH] Add timeouts to FE grpc clients by @HammadB in #2671
- [ENH]: allow using node selectors for pods instead of tolerations by @codetheweb in #2672
- [ENH] [PERF] add indices to metadata columns by @levand in #2623
- [BLD]: bump
time
dependency by @codetheweb in #2679 - [ENH]: skip brute force search when log is empty by @codetheweb in #2666
- [ENH]: allow configuring replicaCount and nodeSelector on log service by @codetheweb in #2678
- [ENH]: improve compaction traces by @codetheweb in #2680
- [ENH] Network Admission Control APIs by @sanketkedia in #2628
- [ENH] Construct and pass NAC by @sanketkedia in #2630
- [ENH] Block manager and HNSW provider consume Network Admission Control by @sanketkedia in #2631
- [ENH] NAC rate limits requests by @sanketkedia in #2632
- [ENH]: use
follows_from()
standard instead of manually adding parent attribute by @codetheweb in #2691 - [ENH]: cleanup HNSW temporary files after registering result by @codetheweb in #2646
- [PERF] Multipart s3 downloads passing through NAC by @sanketkedia in #2698
- [CLN] Reorganize delta module into seperate module and split out impls and [PERF] Refactor bf get_size to avoid nested loops by @HammadB in #2674
- [ENH] Evict other versions of hnsw index of the collection when another version is fetched by @sanketkedia in #2707
- [ENH]: add metric for total # of uncompacted log records by @codetheweb in #2696
- [ENH] Add gc for log service by @Ishiihara in #2634
- [BUG]: fix log service metrics service name by @codetheweb in #2713
- [PERF] Binary search get_prefix() calls in blockfile by @sanketkedia in #2717
- [ENH]: update JS CloudClient to accept
tenant
parameter by @codetheweb in #2726 - [ENH]: add config fields for max partition size & max compaction size in compactor by @codetheweb in #2732
- [PERF] Decrease compaction RAM usage and increase speed by @sanketkedia in #2729
- [PERF]: remove mutex around tokenizer by @codetheweb in #2735
- [PERF]: better locking of uncommitted tracking maps (decrease compaction time by 3x) by @codetheweb in #2736
- [ENH]: purge block cache after compaction by @codetheweb in #2741
- [PERF] Move write to blockfiles off main runtime by @sanketkedia in #2740
- [ENH] Synchronize post s3.get() operations in providers by @sanketkedia in #2712
- [ENH]: make test_sanity recall based by @codetheweb in #2742
- [PERF] Add include metadata to MetadataReader to avoid unncessary payload size by @HammadB in #2750
- [PERF]: speed up
get_target_block_id()
by @codetheweb in #2743 - [ENH]: skip querying
MetadataSegmentReader
for empty where clause by @codetheweb in https://githu...