fix: Remove evaluation metric key from schema which failed on some LLMs by jsonbailey · Pull Request #105 · launchdarkly/python-server-sdk-ai

jsonbailey · 2026-03-11T22:40:27Z

Note

Medium Risk
Changes the structured output schema and parsing contract for judge evaluations, which can affect compatibility with existing providers/models and downstream consumers of the judge response shape. Test updates reduce risk but runtime behavior may differ if any integrations still return the old evaluations object.

Overview
Updates judge structured-output handling to use a fixed schema { "evaluation": { "score", "reasoning" } } rather than generating a dynamic evaluations object keyed by evaluation_metric_key.

Judge now always builds the schema (no key parameter), parses only the single evaluation payload, and marks evaluations as failed when that object is missing/invalid; tests are updated accordingly and dynamic schema helper methods are removed.

Written by Cursor Bugbot for commit 916df2a. This will update automatically on new commits.
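To make the contract in the overview concrete, here is a minimal sketch of a fixed schema and a parser that treats a missing or invalid `evaluation` object as a failure. It is not the SDK's actual code: the function names `build_evaluation_schema` and `parse_evaluation` are hypothetical, and the field types (`score` as a number, `reasoning` as a string) are assumptions, since the PR summary only names the fields. `additionalProperties` is set to `False` on each object because a follow-up commit in this PR notes that OpenAI schemas require it.

```python
from typing import Any, Dict, Optional


def build_evaluation_schema() -> Dict[str, Any]:
    """Return a fixed JSON schema; no metric key is embedded in it (hypothetical helper)."""
    return {
        "type": "object",
        "properties": {
            "evaluation": {
                "type": "object",
                "properties": {
                    # Field types are assumptions; the PR only names the fields.
                    "score": {"type": "number"},
                    "reasoning": {"type": "string"},
                },
                "required": ["score", "reasoning"],
                "additionalProperties": False,
            }
        },
        "required": ["evaluation"],
        "additionalProperties": False,
    }


def parse_evaluation(response: Any) -> Optional[Dict[str, Any]]:
    """Return the single evaluation payload, or None if it is missing or invalid."""
    if not isinstance(response, dict):
        return None
    evaluation = response.get("evaluation")
    if not isinstance(evaluation, dict):
        return None
    score = evaluation.get("score")
    reasoning = evaluation.get("reasoning")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(score, bool) or not isinstance(score, (int, float)):
        return None
    if not isinstance(reasoning, str):
        return None
    return {"score": float(score), "reasoning": reasoning}
```

Under this sketch, a response that still uses the old keyed `evaluations` object returns `None`, which is the compatibility risk the note describes: callers would see the evaluation marked as failed rather than parsed.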

cursor

Cursor Bugbot has reviewed your changes and found 3 potential issues.


packages/sdk/server-ai/tests/test_judge.py

fix: Remove evaluation metric key from schema which failed on some LLMs

f8c6eba

jsonbailey requested a review from a team as a code owner March 11, 2026 22:40

additional properties is required for openai schemas

49f5e2e

cursor bot reviewed Mar 11, 2026

packages/sdk/server-ai/tests/test_judge.py (outdated, resolved)

packages/sdk/server-ai/tests/test_judge.py (resolved)

packages/sdk/server-ai/tests/test_judge.py (resolved)

fix tests

916df2a

jsonbailey wants to merge 3 commits into main from jb/aic-1897/remove-keys-from-evaluation-structure
