Problem: I have a Playwright test for an onboarding flow that currently assesses LLM-generated content using an LLM-as-a-judge method with a binary pass/fail outcome (score &gt; 60). While functional, this approach cannot track score trends over time, which would provide more valuable insights.

Suggested solution: Implement a custom metrics system that:<ol><li>Captures the actual LLM evaluation scores from tests</li><li>Exposes these scores to Checkly's dashboard</li><li>Enables visualization of individual scores per run and aggregated metrics (7-day and 14-day averages)</li><li>Creates an extensible framework for any future custom metrics beyond just LLM evaluation scores</li></ol>
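To make the requested aggregation concrete, here is a minimal sketch of how per-run scores could be turned into 7-day and 14-day averages. It does not use any Checkly API, since that is exactly what this request asks for. The `ScoreSample` type, the `passed` and `rollingAverage` helpers, and the sample data are all hypothetical and only illustrate the idea.

```typescript
// Hypothetical shape of one recorded LLM-judge score (not an existing Checkly type).
interface ScoreSample {
  timestamp: Date; // when the check run finished
  score: number;   // judge score, assumed to be on a 0-100 scale
}

// Threshold taken from the post: a run passes when score > 60.
const PASS_THRESHOLD = 60;

// Current binary outcome, kept alongside the raw score.
function passed(sample: ScoreSample): boolean {
  return sample.score > PASS_THRESHOLD;
}

// Average of all scores recorded in the last `windowDays` days before `now`.
// Returns null when the window contains no samples.
function rollingAverage(
  samples: ScoreSample[],
  windowDays: number,
  now: Date
): number | null {
  const cutoff = now.getTime() - windowDays * 24 * 60 * 60 * 1000;
  const inWindow = samples.filter((s) => {
    const t = s.timestamp.getTime();
    return t > cutoff && t <= now.getTime();
  });
  if (inWindow.length === 0) {
    return null;
  }
  const sum = inWindow.reduce((acc, s) => acc + s.score, 0);
  return sum / inWindow.length;
}

// Illustrative data only; these are not real results.
const now = new Date("2024-01-15T00:00:00Z");
const samples: ScoreSample[] = [
  { timestamp: new Date("2024-01-14T12:00:00Z"), score: 80 },
  { timestamp: new Date("2024-01-10T12:00:00Z"), score: 70 },
  { timestamp: new Date("2024-01-05T12:00:00Z"), score: 60 },
  { timestamp: new Date("2023-12-20T12:00:00Z"), score: 90 },
];

const sevenDay = rollingAverage(samples, 7, now);     // 75
const fourteenDay = rollingAverage(samples, 14, now); // 70
console.log({ sevenDay, fourteenDay });
```

Storing the raw score next to the pass/fail result is what makes the trend visible: in this example the run scoring exactly 60 fails the check, yet it still contributes to the 14-day average.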

Custom Metrics on Dashboard

Berk Durmus

Checkly
