Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feat] Add LiteLLMEmbeddings - Support SemanticChunking through LiteLLM #154

Open
wants to merge 3 commits into
base: development
Choose a base branch
from

Conversation

Dhan996
Copy link

@Dhan996 Dhan996 commented Jan 25, 2025

added litellm support to identify and embed with any Huggingface embedding model that has feature extraction
added the same to the registry to support AutoEmbeddings.
added tests for the same.
key things to notice:

timeout is dependent on model size, since the model needs to be loaded onto the local hardware first.
context length, dimensions, and such measures are dependent on the model
token_counter is a callable from litellm, which would also need time to load.
currently only supports huggingface models. Litellm can support more models such as voyage, mistral, etc, but the API keys should be given in parameters.

bhavnicksm and others added 2 commits January 29, 2025 20:13
…run timings (chonkie-ai#156)

* [DOCS] Benchmarking update (chonkie-ai#145)

* Add wiki 500k benchmark results

* Update benchmarks

* bahut tej hai chonkie bhai

* blah blah

---------

Co-authored-by: Bhavnick Minhas <[email protected]>

* Update benchmarks with corrected memory usage and size metrics

---------

Co-authored-by: Shreyash Nigam <[email protected]>
@bhavnicksm bhavnicksm changed the base branch from main to development January 29, 2025 14:54
@bhavnicksm bhavnicksm changed the title LiteLLM Integration [Feat] Add LiteLLMEmbeddings - Support SemanticChunking through LiteLLM Jan 29, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[FEAT] Add an ability to use OpenAI / VoyageAI / Cohere embeddings with SDPMChunker via LiteLLM
2 participants