What it does
For every project and every LLM Judge metric, Cekura runs a daily audit that:
- Pulls all failing calls for that (project, metric) from the last 24 hours (falling back to 7 days if the day’s pool is small).
- Hands the transcripts, the metric card, and an audit prompt to an LLM agent.
- Has the agent read a sample of the calls, cluster them by root cause, and write a 1-6 theme breakdown.
- Saves the result on the metric’s Insights card in the dashboard.
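The window fallback in the first step can be illustrated with a minimal sketch. This is not Cekura's implementation: the function name, the input shape, and the `MIN_FAILING_CALLS` value are all assumptions made for illustration, since the actual minimum pool size is not stated here.

```python
from datetime import datetime, timedelta, timezone

# Hypothetical threshold: the real minimum pool size is not documented here.
MIN_FAILING_CALLS = 5


def select_failing_calls(failures, now, min_calls=MIN_FAILING_CALLS):
    """Pick the failing-call pool for one (project, metric) pair.

    `failures` is a list of (call_id, failed_at) tuples, where `failed_at`
    is a timezone-aware datetime. Returns (window_label, call_ids).
    """
    # Try the last 24 hours first, then fall back to the last 7 days.
    for label, window in (("24h", timedelta(hours=24)), ("7d", timedelta(days=7))):
        cutoff = now - window
        pool = [call_id for call_id, failed_at in failures if failed_at >= cutoff]
        if len(pool) >= min_calls:
            return label, pool
    # Too few failures even over 7 days: nothing to audit.
    return None, []
```

When the function returns `None` as the label, the pool is too small even over 7 days, which corresponds to the case where no themes are produced for that metric.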
Viewing and generating insights in the dashboard
Navigate to Observability → Insights in the sidebar. The page displays a card for each LLM Judge metric enabled on your project. Use the View dropdown at the top of the page to filter by view. When a view is selected, the page shows only metrics and insights for agents in that view. Each card shows one of the following:
- The latest failure-mode audit with identified themes
- A message indicating not enough failures were found in the analyzed window
- A Generate button if no audit has run yet
Which metrics are eligible
Insights are currently supported only for LLM Judge metrics. Code-based metrics such as Latency, WPM, and Talk Ratio are excluded. Eligible metrics are:
- Custom metrics with type: llm_judge.
- Predefined metrics: CSAT, Critical Deviations Continuous, Critical Info Check, Critical Info Check bool, Expected Outcome, Gibberish Detection, Hallucination, Letterwise Pronunciation Detection, Main Agent Early End Call, Not Early Termination, Pronunciation Analysis, Pronunciation test, Relevancy, Response Consistency, STT Errors, Tool call Accuracy, Unnecessary Repetition Count, Unnecessary Repetition Score.
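The eligibility rule above can be expressed as a small check. This is a sketch only: the metric dictionary shape (`name`, `type`, `predefined` keys) is an assumption for illustration and may not match how Cekura represents metrics. The predefined names are taken from the list above.

```python
# Predefined metrics listed as eligible in this section.
PREDEFINED_ELIGIBLE = frozenset({
    "CSAT",
    "Critical Deviations Continuous",
    "Critical Info Check",
    "Critical Info Check bool",
    "Expected Outcome",
    "Gibberish Detection",
    "Hallucination",
    "Letterwise Pronunciation Detection",
    "Main Agent Early End Call",
    "Not Early Termination",
    "Pronunciation Analysis",
    "Pronunciation test",
    "Relevancy",
    "Response Consistency",
    "STT Errors",
    "Tool call Accuracy",
    "Unnecessary Repetition Count",
    "Unnecessary Repetition Score",
})


def is_eligible(metric: dict) -> bool:
    """Return True if a metric would qualify for failure-mode insights.

    Assumed (hypothetical) shape: {"name": str, "type": str, "predefined": bool}.
    """
    if metric.get("predefined"):
        # Predefined metrics qualify only if they appear in the list.
        return metric.get("name") in PREDEFINED_ELIGIBLE
    # Custom metrics qualify only when they are LLM Judge metrics.
    return metric.get("type") == "llm_judge"
```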
How the daily run works
A Celery Beat cron fires at 06:00 UTC daily. For each (project, eligible-metric) pair, our agent analyses the failing calls and writes the resulting themes to the metric’s Insights card. If a metric has fewer failing calls than the minimum threshold even over 7 days, the card shows “Not enough metric failure instances in last 7 days.”
On-demand generation via API
For programmatic access, you can trigger an audit without waiting for the next daily run by calling the generate endpoint.
Poll the status field until it is succeeded or failed.
When status is succeeded, the failure_modes array contains the themes.
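The polling step can be sketched as a small helper. The endpoint URL and request format are not reproduced here, so the helper takes a caller-supplied `fetch_audit` function (hypothetical) that performs the HTTP request and returns the parsed JSON body as a dict. Only the `status` values `succeeded` and `failed` and the `failure_modes` field come from the text above. The interval and timeout defaults are arbitrary assumptions.

```python
import time


def wait_for_audit(fetch_audit, interval_s=5.0, timeout_s=300.0,
                   sleep=time.sleep, clock=time.monotonic):
    """Poll until the audit reaches a terminal status.

    `fetch_audit` is a hypothetical callable returning the audit as a dict.
    Returns the `failure_modes` list on success.
    """
    deadline = clock() + timeout_s
    while True:
        audit = fetch_audit()
        status = audit.get("status")
        if status == "succeeded":
            # Themes are available once the audit has succeeded.
            return audit.get("failure_modes", [])
        if status == "failed":
            raise RuntimeError("audit generation failed")
        if clock() >= deadline:
            raise TimeoutError("audit did not finish before the timeout")
        # Any other status is treated as still in progress.
        sleep(interval_s)
```

Passing `sleep` and `clock` as parameters keeps the helper easy to test without real delays. In normal use the defaults are sufficient.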