Auto Recovery Logic for chat completion clients on different types of server errors · microsoft/autogen#3632

(4 comments) (0 reactions) (0 assignees) Python (8,759 forks)

help wanted, proj-extensions, size-medium

Repository metrics

Stars: 58,033
PR merge metrics: (no merged PRs in the last 30 days)

Description

What feature would you like to be added?

It would be nice if we could support a model client that auto-recovers from various host-related errors using configurable logic such as retries.

Here is a good example of an apparently transient error:

openai.APIStatusError: Error code: 424 - {'error': {'message': 'Error occurred while processing image(s).', 'type': 'failed_dependency', 'param': None, 'code': None}}

Moreover, even for rate limit errors like the one below, the client doesn't always retry; the behavior is inconsistent:

openai.RateLimitError: Error code: 429 - {'error': {'code': '429', 'message': 'Rate limit is exceeded. Try again in 1 seconds.'}}
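
The 429 message above carries a server-side hint ("Try again in 1 seconds."). Here is a minimal sketch of how a client could extract that hint before waiting, assuming the wording matches the message quoted here. `parse_retry_hint` is a hypothetical helper for illustration, not part of openai or autogen.

```python
import re
from typing import Optional

# Matches hints such as "Try again in 1 seconds." (wording taken from the
# error message quoted above; other servers may phrase it differently).
_RETRY_HINT = re.compile(r"try again in (\d+(?:\.\d+)?)\s*seconds?", re.IGNORECASE)


def parse_retry_hint(message: str) -> Optional[float]:
    """Return the suggested wait in seconds, or None if no hint is present."""
    match = _RETRY_HINT.search(message)
    return float(match.group(1)) if match else None
```

A caller could fall back to its own backoff schedule whenever the helper returns `None`.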

Why is this needed?

Currently, the chat completion client for OpenAI fails to recover and crashes on a host of errors. The exception is rate-limit errors, which it retries automatically. Other errors occur at random, and that forces applications built on top of the client to implement their own auto-recovery logic.
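
To make the request concrete, here is a rough sketch of what configurable recovery logic could look like. It is not autogen's implementation. `StatusError`, `RetryPolicy`, and `call_with_retry` are hypothetical names, the defaults are illustrative, and the retried status codes (424 and 429) are simply the two shown in this issue. It uses only the standard library and retries with exponential backoff plus jitter.

```python
import random
import time
from dataclasses import dataclass
from typing import Callable, FrozenSet, TypeVar

T = TypeVar("T")


class StatusError(Exception):
    """Hypothetical stand-in for an API error that carries an HTTP status code."""

    def __init__(self, status_code: int, message: str = "") -> None:
        super().__init__(message)
        self.status_code = status_code


@dataclass(frozen=True)
class RetryPolicy:
    """Hypothetical configuration object; all defaults are illustrative."""

    max_attempts: int = 3
    base_delay: float = 0.5  # upper bound of the wait before the first retry
    max_delay: float = 8.0   # cap on any single wait
    retry_statuses: FrozenSet[int] = frozenset({424, 429})

    def should_retry(self, exc: Exception) -> bool:
        # Any exception exposing a matching `status_code` counts as transient.
        return getattr(exc, "status_code", None) in self.retry_statuses

    def delay(self, attempt: int) -> float:
        # Exponential backoff with full jitter: wait in [0, capped backoff].
        backoff = min(self.max_delay, self.base_delay * 2 ** (attempt - 1))
        return random.uniform(0, backoff)


def call_with_retry(
    fn: Callable[[], T],
    policy: RetryPolicy = RetryPolicy(),
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, retrying errors the policy marks as transient."""
    for attempt in range(1, policy.max_attempts + 1):
        try:
            return fn()
        except Exception as exc:
            # Give up on non-retryable errors and on the final attempt.
            if not policy.should_retry(exc) or attempt == policy.max_attempts:
                raise
            sleep(policy.delay(attempt))
    raise ValueError("max_attempts must be at least 1")
```

Because `should_retry` only looks for a `status_code` attribute, the same policy should also apply to real client exceptions that expose one (`openai.APIStatusError` does in openai 1.x, but verify against the version in use). Passing `sleep` as a parameter keeps the wrapper testable without real waiting.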

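Contributor guide

Approach: Check the existing client code in autogen/oai/client.py or similar files to understand the current rate-limit retry logic. Implement a retry mechanism, configurable through parameters, that handles 424 and 429 errors. Use a library such as tenacity to implement the configurable retries. Make sure the solution stays consistent with the existing error-handling patterns.
Tech stack: python
Domain: ai, backend
Issue type: feature
Difficulty: 3
Estimated time: 1-2 days
Activity status: active
Clarity: mostly clear
Prerequisites: Python, OpenAI API experience
Beginner friendliness: 40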
