
mistral[minor]: Added Retrying Mechanism in case of Request Rate Limit Error for MistralAIEmbeddings #27818

Merged
merged 10 commits into langchain-ai:master on Dec 11, 2024

Conversation

keenborder786
Contributor

  • Description: In the event of a Rate Limit Error from the MistralAI server, parsing the response JSON raises a KeyError. To address this, a simple retry mechanism has been implemented to handle cases where the request limit is exceeded (a minimal sketch of the approach follows this list).
  • Issue: Cannot create MistralAI embeddings from pdf or urls #27790
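
A minimal sketch of the described approach (illustrative only, not the PR diff): retry the embeddings request when the server answers with HTTP 429. The RateLimitError class and embed_batch helper are hypothetical names, and tenacity and httpx are assumed as dependencies.

import httpx
from tenacity import retry, retry_if_exception_type, stop_after_attempt, wait_fixed


class RateLimitError(Exception):
    """Raised when the embeddings endpoint returns HTTP 429."""


@retry(
    retry=retry_if_exception_type(RateLimitError),
    wait=wait_fixed(30),         # wait between attempts
    stop=stop_after_attempt(5),  # give up after a handful of attempts
)
def embed_batch(client: httpx.Client, model: str, batch: list[str]) -> dict:
    # client is assumed to be configured with the Mistral base URL and API key
    response = client.post("/embeddings", json={"model": model, "input": batch})
    if response.status_code == 429:
        raise RateLimitError("Requests rate limit exceeded")
    return response.json()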

@dosubot dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Nov 1, 2024

@dosubot dosubot bot added Ɑ: embeddings Related to text embedding models module 🤖:bug Related to a bug, vulnerability, unexpected error with an existing feature labels Nov 1, 2024
@keenborder786
Contributor Author

@eyurtsev

batch_responses = []

@retry(
retry=retry_if_exception_type(Exception),
Collaborator

Generally, retries should never be implemented on 4xx errors (except for 408 and 429); e.g., a 403 should not be retried by default.

  • What do we do in other parts of the code? Perhaps there's a better example that can be adopted?
  • What do other models do in the code base in terms of exposing the retry parameters so users can adjust? (e.g.,what if someone wants to have the first retry after 1 second rather than 30 seconds?)

Contributor Author

  • @eyurtsev it is a 429 error, therefore retrying makes sense.
  • I was not able to find any related example.
  • Yes, I can expose the wait and stop seconds as parameters (see the sketch after this list).
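
A sketch of what exposing those parameters could look like (the EmbeddingsClient wrapper and its field names are hypothetical, assuming tenacity's wait_exponential and stop_after_attempt):

import httpx
from tenacity import retry, retry_if_exception_type, stop_after_attempt, wait_exponential


class EmbeddingsClient:
    """Hypothetical wrapper; the parameter names are illustrative, not the PR's final API."""

    def __init__(self, max_retries: int = 5, retry_min_seconds: int = 1, retry_max_seconds: int = 30):
        self.max_retries = max_retries
        self.retry_min_seconds = retry_min_seconds
        self.retry_max_seconds = retry_max_seconds

    def with_retry(self, func):
        # Build the retry decorator from instance settings so a user can choose,
        # e.g., a 1-second first wait instead of 30 seconds.
        return retry(
            retry=retry_if_exception_type(httpx.TimeoutException),
            wait=wait_exponential(min=self.retry_min_seconds, max=self.retry_max_seconds),
            stop=stop_after_attempt(self.max_retries),
        )(func)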

    for batch in self._get_batches(texts)
)
if response.status_code == 429:
    raise Exception("Requests rate limit exceeded")
Collaborator

This code takes an exception that was good and informative and turns it into a broad Exception of type Exception -- this is usually not a good pattern for exception handling: the stack trace will be partially lost, the exception type is less specific, etc.

Contributor Author

  • There was no specific exception being raised to begin with, but I can change it from a general Exception.

Collaborator

@eyurtsev eyurtsev left a comment


Makes sense to add a retry mechanism. Added a few questions to see if we can improve how it's configured.

@keenborder786
Contributor Author

@eyurtsev please check now

@keenborder786
Contributor Author

@eyurtsev

@keenborder786
Contributor Author

@eyurtsev

@keenborder786
Contributor Author

@eyurtsev this is really important, please give feedback if needed

@keenborder786
Contributor Author

@eyurtsev

@keenborder786
Contributor Author

@eyurtsev

1 similar comment
@keenborder786
Contributor Author

@eyurtsev

url="/embeddings",
json=dict(
model=self.model,
input=batch,
),
)
for batch in self._get_batches(texts)
)
if response.status_code == 429:
Collaborator

Any reason not to use raise_for_status? We're trying not to drop the original exception, which might have useful information inside it.
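
For illustration, a minimal sketch of the raise_for_status alternative (embed_batch is a hypothetical helper): httpx.Response.raise_for_status raises an httpx.HTTPStatusError that keeps the request, response, and status code attached, so no information is dropped.

import httpx


def embed_batch(client: httpx.Client, model: str, batch: list[str]) -> dict:
    response = client.post("/embeddings", json={"model": model, "input": batch})
    response.raise_for_status()  # raises httpx.HTTPStatusError on any 4xx/5xx response
    return response.json()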

batch_responses = []

@retry(
retry=retry_if_exception_type(httpx.TimeoutException),
Collaborator

(No need to change if you don't want.) This is OK because it's probably the dominant failure mode.

But it's very common to retry 5xx errors as well, and 408 (request timeout).
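
An illustrative predicate along those lines (not part of this PR): retry timeouts plus 408, 429, and 5xx responses, using tenacity's retry_if_exception with a hypothetical embed_batch helper.

import httpx
from tenacity import retry, retry_if_exception, stop_after_attempt, wait_exponential


def _is_retryable(exc: BaseException) -> bool:
    # Retry network timeouts and the status codes that are conventionally safe to retry.
    if isinstance(exc, httpx.TimeoutException):
        return True
    if isinstance(exc, httpx.HTTPStatusError):
        code = exc.response.status_code
        return code in (408, 429) or 500 <= code < 600
    return False


@retry(
    retry=retry_if_exception(_is_retryable),
    wait=wait_exponential(min=1, max=30),
    stop=stop_after_attempt(5),
)
def embed_batch(client: httpx.Client, model: str, batch: list[str]) -> dict:
    response = client.post("/embeddings", json={"model": model, "input": batch})
    response.raise_for_status()
    return response.json()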

@dosubot dosubot bot added the lgtm PR looks good. Use to confirm that a PR is ready for merging. label Dec 11, 2024
@eyurtsev
Collaborator

@keenborder786 sorry for how long it took. Pushed a minor change to raise on error, so the original information isn't lost.

@eyurtsev eyurtsev changed the title Added Retrying Mechanism in case of Request Rate Limit Error for MistralAIEmbeddings mistral[minor]: Added Retrying Mechanism in case of Request Rate Limit Error for MistralAIEmbeddings Dec 11, 2024
@eyurtsev eyurtsev merged commit a37afbe into langchain-ai:master Dec 11, 2024
30 checks passed
Labels
🤖:bug Related to a bug, vulnerability, unexpected error with an existing feature
Ɑ: embeddings Related to text embedding models module
lgtm PR looks good. Use to confirm that a PR is ready for merging.
size:S This PR changes 10-29 lines, ignoring generated files.