API view set for retrieving results. It gives users access to the results associated with their organization through two endpoints: one that lists results and one that retrieves a single result in detail.
API key authentication. Include the API key in the header of each request.
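As a minimal sketch of header-based API key authentication, the snippet below builds (but does not send) a request for one result. The header name `X-API-KEY`, the base URL, and the `/results/<id>/` path are placeholders chosen for illustration; none of them is stated in this reference, so substitute the values from your account documentation.

```python
from urllib.request import Request

# Hypothetical values: this reference does not name the header, host, or path.
API_KEY_HEADER = "X-API-KEY"
BASE_URL = "https://api.example.com"


def build_result_request(result_id: int, api_key: str) -> Request:
    """Build a GET request for a single result with the API key in a header."""
    url = f"{BASE_URL}/results/{result_id}/"
    return Request(url, headers={API_KEY_HEADER: api_key}, method="GET")


req = build_result_request(42, "my-secret-key")
print(req.full_url)
```

Passing `req` to `urllib.request.urlopen` would send it; that step is left out here because the real endpoint is not known.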
A unique integer value identifying this result.
Name of the result
Example: "Test Result 1"
Maximum length: 255
Current status of the result
running - Running
completed - Completed
failed - Failed
pending - Pending
in_progress - In Progress
evaluating - Evaluating
in_queue - In Queue
timeout - Timeout
cancelled - Cancelled
Number of runs that fully met their expected outcomes with a score of 100
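The nine status values listed above can be modeled on the client side as an enumeration, so that an unexpected value fails loudly instead of passing through as a bare string. This is a sketch only; the class name `ResultStatus` and the `label` helper are illustrative and not part of the API.

```python
from enum import Enum


class ResultStatus(str, Enum):
    """Status values as documented for a result."""

    RUNNING = "running"
    COMPLETED = "completed"
    FAILED = "failed"
    PENDING = "pending"
    IN_PROGRESS = "in_progress"
    EVALUATING = "evaluating"
    IN_QUEUE = "in_queue"
    TIMEOUT = "timeout"
    CANCELLED = "cancelled"

    @property
    def label(self) -> str:
        # Derive the display label, e.g. "in_queue" becomes "In Queue".
        return self.value.replace("_", " ").title()


status = ResultStatus("in_queue")  # raises ValueError for an unknown value
print(status.label)
```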
Total number of runs that had expected outcomes defined
Success rate of the test runs
Whether this test was run in text mode instead of voice mode
Example: true or false
Whether this result was created by a scheduled cronjob
Runs
{
"<run_id>": {
"id": "<integer>",
"scenario": "<integer>",
"expected_outcome": {
"score": 0,
"explanation": [
"❌ The main agent did not provide the standard greeting, emergency disclaimer, or ask how they can help (00:21).",
"❌ The main agent did not explain the distinction between Nacogdoches Health Partners and FastTrack Express Clinic (entire call)."
],
"outcome_alignments": [
{
"aligned": false,
"outcome": "The main agent did not provide the standard greeting, emergency disclaimer, or ask how they can help (00:21).",
"prompt_part": "The main agent should provide the standard greeting and emergency disclaimer, asking how they can help."
},
{
"aligned": false,
"outcome": "The main agent did not explain the distinction between Nacogdoches Health Partners and FastTrack Express Clinic (entire call).",
"prompt_part": "The main agent should explain the distinction between Nacogdoches Health Partners (focused on comprehensive primary care and chronic conditions) and FastTrack Express Clinic (focused on acute or urgent matters)."
}
]
}
}
}
Overall evaluation of the test runs
Example:
{
"success_rate": "number",
"metric_summary": {
"metric_id": {
"id": "integer",
"name": "string",
"type": "string",
"score": "number",
"explanation": "string (optional)",
"function_name": "string",
"vocera_defined_metric_code": "string (optional)",
"p50": "number (for numeric metrics)"
}
},
"worst_performing_metrics": {
"binary_adherence": [
"array of metric_ids"
]
},
"numeric_metrics": [
{
"name": "string",
"type": "numeric",
"value": "number",
"percentiles": {
"p50": "number"
}
}
],
"enum_metrics": [
"array of metric_ids"
],
"extra_metrics": [
{
"name": "string (e.g., 'Expected Outcome', 'Average Ringing Duration')",
"type": "string",
"value": "number",
"percentiles": {
"p50": "number (optional)"
}
}
]
}
Total duration of the test runs for this result
Example: 22:30
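To illustrate how the overall evaluation object described above might be consumed, the sketch below collects the median (`p50`) of each entry in `numeric_metrics`. The field names come from the schema example; the helper name `numeric_medians` and the sample values are invented for illustration.

```python
from typing import Any


def numeric_medians(evaluation: dict[str, Any]) -> dict[str, float]:
    """Map each numeric metric name to its p50, skipping entries without one."""
    medians: dict[str, float] = {}
    for metric in evaluation.get("numeric_metrics", []):
        p50 = metric.get("percentiles", {}).get("p50")
        if p50 is not None:
            medians[metric["name"]] = p50
    return medians


# Illustrative data shaped like the schema; the numbers are made up.
sample_evaluation = {
    "success_rate": 0.5,
    "numeric_metrics": [
        {"name": "Latency", "type": "numeric", "value": 1.8, "percentiles": {"p50": 1.6}},
        {"name": "Interruptions", "type": "numeric", "value": 2, "percentiles": {}},
    ],
}
print(numeric_medians(sample_evaluation))
```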
Total number of test runs associated with this result
Example: 10
Number of test runs that have completed successfully
Example: 10
Number of test runs that were marked as successful
Example: 10
Number of test runs that failed or encountered errors
Example: 10
List of scenario names used in the test runs for this result
Example:
[
{
"id": 123,
"name": "Scenario 1"
},
{
"id": 456,
"name": "Scenario 2"
}
]
List of critical categories for this result
Example:
[
{
"id": 2950,
"name": "Pronunciation Analysis",
"eval_type": "continuous_qualitative",
"simulation_enabled": true,
"observability_enabled": true
},
{
"id": 3284,
"name": "Latency",
"eval_type": "numeric",
"simulation_enabled": true,
"observability_enabled": false
},
{
"id": 3295,
"name": "Detect Silence in Conversation",
"eval_type": "binary_qualitative",
"simulation_enabled": true,
"observability_enabled": true
}
]
Reasons the test runs failed
Example:
{
"issues": [
{
"rank": 1,
"run_ids": [
34588
],
"description": "The agent did not provide the standard greeting, emergency disclaimer, or ask how they could help.",
"affected_count": 1
},
{
"rank": 2,
"run_ids": [
34588
],
"description": "The agent did not explain the distinction between the primary care and express clinics.",
"affected_count": 1
}
],
"total_failed_runs": 1
}
Timestamp when this test result was created
Example: 2021-01-01 00:00:00
Timestamp when this test result was last updated
Example: 2021-01-01 00:00:00
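The runs object and the failure reasons shown earlier can be cross-referenced through run IDs. The sketch below assumes the runs object is keyed by run ID as a string (JSON object keys are always strings) and that `run_ids` in each issue holds integers, as in the examples. The function names and the abbreviated sample data are illustrative only.

```python
from typing import Any


def misaligned_prompt_parts(runs: dict[str, Any]) -> dict[str, list[str]]:
    """For each run ID, list the prompt parts whose outcome was not aligned."""
    result: dict[str, list[str]] = {}
    for run_id, run in runs.items():
        expected = run.get("expected_outcome") or {}
        alignments = expected.get("outcome_alignments", [])
        missed = [a["prompt_part"] for a in alignments if not a.get("aligned")]
        if missed:
            result[run_id] = missed
    return result


def issues_for_run(failed_reasons: dict[str, Any], run_id: int) -> list[str]:
    """Return the descriptions of issues affecting run_id, ordered by rank."""
    issues = sorted(failed_reasons.get("issues", []), key=lambda i: i["rank"])
    return [i["description"] for i in issues if run_id in i.get("run_ids", [])]


# Abbreviated, illustrative data shaped like the examples in this reference.
runs = {
    "34588": {
        "expected_outcome": {
            "score": 0,
            "outcome_alignments": [
                {"aligned": False, "prompt_part": "Provide the standard greeting."},
                {"aligned": True, "prompt_part": "Confirm the caller's name."},
            ],
        }
    }
}
failed_reasons = {
    "issues": [
        {"rank": 2, "run_ids": [34588], "description": "No clinic distinction.", "affected_count": 1},
        {"rank": 1, "run_ids": [34588], "description": "No standard greeting.", "affected_count": 1},
    ],
    "total_failed_runs": 1,
}
print(misaligned_prompt_parts(runs))
print(issues_for_run(failed_reasons, 34588))
```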