Support model APIs that require strict alternating user-assistant roles · microsoft/autogen#5965

Repository metrics

Stars: (58,033 stars)
PR merge metrics: (No merged PRs in 30d)

Description

For the 400 server error about alternating user-assistant roles in messages, we need to handle this outside of model client, i.e., in AssistantAgent and SelectorGroupChat, where the model clients are used. Based on the model family in model_info field, we should inject an empty user message when there are consecutive assistant messages.

Originally posted by @ekzhu in https://github.com/microsoft/autogen/issues/5961#issuecomment-2727207546

Reference of 400 error:

openai.BadRequestError: Error code: 400 - {'error': {'message': 'deepseek-reasoner does not support successive user or assistant messages (messages[1] and messages[2] in your input). You should interleave the user/assistant messages in the message sequence.', 'type': 'invalid_request_error', 'param': None, 'code': 'invalid_request_error'}}

Steps:

Investigate what are the model families that require this strict alternating user-assistant roles? (DeepSeek R1, Mistral AI)
In AssistantAgent and SelectorGroupChat, where model client is used, ensure the messages are following the strict order when the above model families are involved. We can do this by concatenation of messages with repeated roles, or injecting empty message -- need to test them.

Contributor guide

Research direction: Identify model families requiring strict alternating roles (e.g., DeepSeek R1, Mistral AI) by testing their API behaviors. Then modify AssistantAgent and SelectorGroupChat to inject empty user messages or concatenate consecutive assistant messages when those models are used.
Tech stack: python
Domain: aibackend
Issue type: Feature
Difficulty: 3
Estimated time: 1-2 days
Activity status: Active
Clarity: Clear
Prerequisites: PythonAI model API knowledge
Newbie friendliness: 65

Repository metrics

Description

Contributor guide

Get fresh easy issues in your inbox.