feat: add code-based evaluator support by jariy17 · Pull Request #739 · aws/agentcore-cli

jariy17 · 2026-03-31T04:44:43Z

Summary

- Add managed and external code-based evaluator support across schema, CLI flags, TUI wizard, and template scaffolding
- EvaluatorConfigSchema becomes a mutually exclusive union of llmAsAJudge / codeBased
- New CLI flags: --type code-based|llm-as-a-judge, --lambda-arn, --timeout
- TUI wizard with 3 branching flows: LLM, managed code-based, external code-based
- Scaffold Python Lambda template with @custom_code_based_evaluator() decorator
- Block code-based evaluators from online eval configs at schema, CLI, and TUI layers
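For reviewers, the mutual-exclusion rule and the online eval guard described above can be sketched roughly as follows. This is a minimal illustration in plain TypeScript, not the schema code from this PR: only the `llmAsAJudge` and `codeBased` keys come from the summary, while the inner field names (`model`, `instructions`, `lambdaArn`, `timeoutSeconds`) and both function names are assumptions made for the example.

```typescript
// Hypothetical config shapes. Only the two top-level keys are from the PR summary.
interface LlmAsAJudgeConfig {
  model: string;
  instructions: string;
}

interface CodeBasedConfig {
  lambdaArn?: string; // assumed: set for an external evaluator
  timeoutSeconds?: number; // assumed field name
}

// Exactly one of the two keys may be present.
type EvaluatorConfig =
  | { llmAsAJudge: LlmAsAJudgeConfig; codeBased?: never }
  | { codeBased: CodeBasedConfig; llmAsAJudge?: never };

interface EvaluatorEntry {
  name: string;
  config: EvaluatorConfig;
}

// Checks only the mutual-exclusion rule, not the inner fields.
function validateEvaluatorConfig(input: unknown): EvaluatorConfig {
  if (typeof input !== "object" || input === null) {
    throw new Error("evaluator config must be an object");
  }
  const obj = input as Record<string, unknown>;
  const hasLlm = obj.llmAsAJudge !== undefined;
  const hasCode = obj.codeBased !== undefined;
  if (hasLlm === hasCode) {
    throw new Error("exactly one of llmAsAJudge or codeBased must be set");
  }
  return input as EvaluatorConfig;
}

// Rejects an online eval config that references any code-based evaluator.
function assertOnlineEvalAllowed(
  evaluators: EvaluatorEntry[],
  referencedNames: string[],
): void {
  const blocked = evaluators
    .filter((e) => referencedNames.includes(e.name) && e.config.codeBased !== undefined)
    .map((e) => e.name);
  if (blocked.length > 0) {
    throw new Error(
      `code-based evaluators cannot be used in online eval configs: ${blocked.join(", ")}`,
    );
  }
}
```

Expressing the rule as a union type lets the compiler reject objects that set both keys, while the runtime check covers untyped input such as a parsed JSON config file.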

Test plan

- Unit tests for EvaluatorPrimitive (add, remove, previewRemove)
- Online eval config blocking (removal is blocked while the evaluator is still referenced)
- ESLint, Prettier, Secretlint pass
- TUI manual testing (add managed, add external, remove)
- Integration test with agentcore deploy
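The "remove blocked when referenced" behavior in the test plan could look something like the sketch below. It is a hypothetical stand-in for the real EvaluatorPrimitive methods: the function names, the `OnlineEvalConfigRef` and `RemovePreview` shapes, and the error text are all assumptions made for illustration.

```typescript
// Hypothetical view of an online eval config: its name and the evaluators it uses.
interface OnlineEvalConfigRef {
  name: string;
  evaluators: string[];
}

interface RemovePreview {
  evaluator: string;
  blockedBy: string[]; // names of online eval configs that still reference it
  canRemove: boolean;
}

// Reports which online eval configs would block the removal, without changing anything.
function previewRemoveEvaluator(
  evaluatorName: string,
  onlineConfigs: OnlineEvalConfigRef[],
): RemovePreview {
  const blockedBy = onlineConfigs
    .filter((c) => c.evaluators.includes(evaluatorName))
    .map((c) => c.name);
  return { evaluator: evaluatorName, blockedBy, canRemove: blockedBy.length === 0 };
}

// Returns the evaluator list without the named entry, or throws if it is still referenced.
function removeEvaluator(
  allEvaluators: string[],
  evaluatorName: string,
  onlineConfigs: OnlineEvalConfigRef[],
): string[] {
  const preview = previewRemoveEvaluator(evaluatorName, onlineConfigs);
  if (!preview.canRemove) {
    throw new Error(
      `cannot remove "${evaluatorName}": referenced by ${preview.blockedBy.join(", ")}`,
    );
  }
  return allEvaluators.filter((n) => n !== evaluatorName);
}
```

A unit test along these lines would assert that the preview lists the referencing config and that the remove call throws until the reference is gone.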

Note

Second commit vendors the SDK wheel temporarily until bedrock-agentcore is published to PyPI with code-based evaluator support.

github-actions · 2026-03-31T04:45:27Z

Package Tarball

aws-agentcore-0.5.0.tgz

How to install

npm install https://github.com/aws/agentcore-cli/releases/download/pr-739-tarball/aws-agentcore-0.5.0.tgz

github-actions · 2026-03-31T16:48:59Z

Coverage Report

| Status | Category | Percentage | Covered / Total |
| --- | --- | --- | --- |
| 🔵 | Lines | 45.78% | 6596 / 14407 |
| 🔵 | Statements | 45.34% | 7007 / 15453 |
| 🔵 | Functions | 44.53% | 1178 / 2645 |
| 🔵 | Branches | 45.97% | 4369 / 9502 |

Generated in workflow #1554 for commit 0d7d519 by the Vitest Coverage Report Action

jariy17 added 2 commits March 31, 2026 00:37

feat: add code-based evaluator support

85a6d7d

Add managed and external code-based evaluator support across schema, CLI flags, TUI wizard, and template scaffolding. Block code-based evaluators from online eval configs at schema, CLI, and TUI layers.

temp: use pyproject.toml with vendored SDK wheel

8ffe3a7

Vendor the SDK wheel and add binary-aware template rendering until the SDK is published to PyPI. To be removed once the SDK is publicly available.
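"Binary-aware template rendering" presumably means that the renderer copies binary assets such as the vendored wheel byte for byte instead of running text substitution over them. The following is a minimal sketch of that idea under stated assumptions: the extension list, the NUL-byte heuristic, the `{{ Name }}` placeholder syntax, and the function names are all illustrative and are not taken from the PR.

```typescript
// Assumed list of extensions that are always treated as binary.
const BINARY_EXTENSIONS = [".whl", ".zip", ".png"];

// Treats a file as binary if its extension is listed or its first bytes contain NUL.
function isBinaryTemplate(path: string, bytes: Uint8Array): boolean {
  const lower = path.toLowerCase();
  if (BINARY_EXTENSIONS.some((ext) => lower.endsWith(ext))) {
    return true;
  }
  return bytes.subarray(0, 8000).includes(0);
}

// Returns binary files untouched; substitutes {{ Name }} placeholders in text files.
function renderTemplateFile(
  path: string,
  bytes: Uint8Array,
  vars: Record<string, string>,
): Uint8Array {
  if (isBinaryTemplate(path, bytes)) {
    return bytes; // decoding and re-encoding could corrupt the archive
  }
  const text = new TextDecoder("utf-8").decode(bytes);
  const rendered = text.replace(/\{\{\s*(\w+)\s*\}\}/g, (match: string, key: string) => {
    const value = Object.prototype.hasOwnProperty.call(vars, key) ? vars[key] : undefined;
    return value ?? match; // unknown placeholders are left as-is
  });
  return new TextEncoder().encode(rendered);
}
```

The important property is that a wheel never passes through a string decode step, since a UTF-8 round trip can replace invalid byte sequences and break the archive.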

jariy17 requested a review from a team March 31, 2026 04:44

github-actions bot added the size/l PR size: L label Mar 31, 2026

jariy17 temporarily deployed to e2e-testing March 31, 2026 04:44 — with GitHub Actions Inactive

jariy17 had a problem deploying to e2e-testing March 31, 2026 14:32 — with GitHub Actions Failure

jariy17 force-pushed the code-based-evaluator branch from 2637c56 to d182cc5 Compare March 31, 2026 14:48

jariy17 had a problem deploying to e2e-testing March 31, 2026 14:48 — with GitHub Actions Failure

jariy17 force-pushed the code-based-evaluator branch from d182cc5 to 14b84b5 Compare March 31, 2026 15:05

jariy17 had a problem deploying to e2e-testing March 31, 2026 15:05 — with GitHub Actions Failure

fix: update asset snapshot and regenerate package-lock.json

5e46523

- Update asset file listing snapshot for new evaluator templates
- Regenerate package-lock.json to fix stale aws-cdk bundled dep (@aws-cdk/cloud-assembly-schema 52.2.0 -> 53.11.0)

jariy17 force-pushed the code-based-evaluator branch from 14b84b5 to 5e46523 Compare March 31, 2026 15:28

jariy17 temporarily deployed to e2e-testing March 31, 2026 15:29 — with GitHub Actions Inactive

fix: show correct evaluator type in status display

0d7d519

Status command was hardcoding "LLM-as-a-Judge" for all evaluators. Now derives the label from item.config.codeBased to distinguish code-based evaluators.
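The status fix amounts to choosing the label from the config instead of using a constant. A minimal sketch, assuming a simplified `StatusItem` shape and a "Code-based" label string (only `item.config.codeBased` and the "LLM-as-a-Judge" label appear in the commit message):

```typescript
// Simplified, hypothetical shape of an item shown by the status command.
interface StatusItem {
  name: string;
  config: { llmAsAJudge?: object; codeBased?: object };
}

// Derives the display label from the config rather than hardcoding it.
function evaluatorTypeLabel(item: StatusItem): string {
  return item.config.codeBased !== undefined ? "Code-based" : "LLM-as-a-Judge";
}
```

Because the schema makes the two keys mutually exclusive, checking for `codeBased` alone is enough to tell the two evaluator types apart.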

jariy17 temporarily deployed to e2e-testing March 31, 2026 16:42 — with GitHub Actions Inactive

jariy17 wants to merge 4 commits into main from code-based-evaluator
