Why such restrictions in ChatPromptParser.cs ? #10252
-
Hello, When parsing a chat prompt, "invalid" messages are discarded
private static bool IsValidChatMessage(PromptNode node)
{
return
node.TagName.Equals(MessageTagName, StringComparison.OrdinalIgnoreCase) &&
node.Attributes.ContainsKey(RoleAttributeName) &&
IsValidChildNodes(node);
}
private static bool IsValidChildNodes(PromptNode node)
{
var textTagsCount = node.ChildNodes.Count(n => n.TagName.Equals(TextTagName, StringComparison.OrdinalIgnoreCase));
return textTagsCount == 1 || (textTagsCount == 0 && node.Content is not null);
} I get it for the first condition but not the other 2
For instance the following payload is valid {
"messages": [
{
"role": "user",
"content": [
{
"type": "image_url",
"image_url": {
"url": ""
}
}
]
},
{
"role": "user",
"content": [
{
"type": "text",
"text": "This is a sample payload to demonstrate that having multiple text part is ok"
},
{
"type": "text",
"text": "Just say \"Hi github\""
}
]
},
{
"role": "user",
"content": []
}
],
"model": "gpt-4o",
"frequency_penalty": 0,
"presence_penalty": 0,
"stream": false,
"temperature": 0,
"top_p": 0,
"max_tokens": 1000
} When using I'm asking this because I noticed that GPT-4o answer differently these 2 payloads, and the one working is not possible with this parser
{
"messages": [
{
"role": "user",
"content": [
{
"type": "text",
"text": "The name of this image is image1.png"
},
{
"type": "image_url",
"image_url": {
"url": ""
}
}
]
},
{
"role": "user",
"content": [
{
"type": "text",
"text": "What do you see in the image ?"
}
]
}
],
"model": "gpt-4o",
"frequency_penalty": 0,
"presence_penalty": 0,
"stream": false,
"temperature": 0,
"top_p": 0,
"max_tokens": 1000
} GPT-4o's response :
{
"messages": [
{
"role": "user",
"content": [
{
"type": "text",
"text": "The name of this image is image1.png"
}
]
},
{
"role": "user",
"content": [
{
"type": "image_url",
"image_url": {
"url": ""
}
}
]
},
{
"role": "user",
"content": [
{
"type": "text",
"text": "What do you see in the image ?"
}
]
}
],
"model": "gpt-4o",
"frequency_penalty": 0,
"presence_penalty": 0,
"stream": false,
"temperature": 0,
"top_p": 0,
"max_tokens": 1000
} Response :
The second payload would be considered invalid because the message containing the image does not contain a Thanks in advance |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
Tagging @markwallace-microsoft |
Beta Was this translation helpful? Give feedback.
-
@ThDuquennoy I have created a new issue to track this request #10278. If you are interested in contributing the enhancements please create a PR and we will work with you to get it merged. |
Beta Was this translation helpful? Give feedback.
@ThDuquennoy I have created a new issue to track this request #10278. If you are interested in contributing the enhancements please create a PR and we will work with you to get it merged.