AI Quality Report

The Quality (AI) tab shows graded conversation counts, average score, good/bad/borderline breakdown, flagged conversations with issue tags, quality trend over time, per-agent scores, and a topic table with bad-rate per topic.

Updated June 11, 2026

The Quality (AI) tab is at Reports - Quality (AI). Every conversation your team handles can be automatically graded by Cove AI using a 0-100 scale. This tab surfaces the grades, flags the worst conversations, and breaks quality down by agent and topic.

💡

Quality grading requires AI Auto-QA to be enabled. If you see “No graded conversations in range yet”, go to Settings - Reports to turn on Auto-QA.

Quality AI report tab

The five headline stats

💡

Stat	What it measures
Graded	Total conversations graded by the AI in the period
Avg score	Mean quality score across all graded conversations (0-100)
Good	Count and percentage of conversations graded “good”
Bad	Count and percentage of conversations graded “bad”
Borderline	Count of “neutral” conversations whose score falls below the borderline threshold (quietly drifting toward bad)

Scores are assigned per conversation by period date (the day the conversation was assessed, in your org’s time zone).

Flagged conversations

The “Flagged conversations” card is the most actionable part of this tab. It lists conversations graded bad, plus borderline neutrals that are close to bad but not quite there.

Three filter tabs let you narrow the list:

Bad - conversations explicitly graded “bad” (default view)
Borderline - “neutral” conversations whose score is below the borderline threshold
All - both bad and borderline

Each flagged conversation shows:

A colored left border (red = bad, amber = borderline)
The grade badge (“Bad” or “Borderline”)
The agent’s name
The conversation topic (AI-generated)
The score out of 100
The date of assessment
Issue tags (see below)
A plain-English summary of what went wrong
A quoted excerpt from the conversation as evidence
A direct link to open the conversation in the inbox

The list shows 5 conversations at a time, sorted by lowest score first. Click Load more to fetch the next page.

Issue tags

Each bad or borderline conversation can carry one or more issue tags. These are set by the AI grader to identify the specific failure mode:

💡

Tag	Meaning
Rude	Agent used language that was impolite or disrespectful
Dismissive	Agent brushed off the customer’s concern without engaging with it
Wrong info	Agent gave incorrect information
Unclear	Agent’s response was hard to understand or ambiguous
Ignored question	Agent did not address a question the customer asked
Missed context	Agent missed relevant information available in the conversation
Long delay	Agent took an unusually long time to respond
No follow-up	Agent promised to follow up but did not
Overpromised	Agent committed to something that was not delivered
Abandoned	The conversation was left without resolution

A conversation can have multiple issue tags if more than one failure mode occurred.

Quality over time

A stacked area chart shows the daily mix of good, neutral, and bad grades across the period. This makes it easy to spot degradations - if the “bad” band suddenly thickens on a particular day, something changed.

By agent

The “By agent” section shows two things side by side:

A score list - each agent’s name and their average quality score for the period, colored from green (high) to red (low).
A stacked bar chart - each agent’s conversation count broken into good, neutral, and bad segments.

If an agent has fewer than 5 graded conversations in the period, a warning badge appears next to their score. A score based on a small sample is less reliable.

Topics

The topics table shows what subjects appeared most in graded conversations and which topics are driving the most bad grades.

💡

Column	What it shows
Topic	AI-generated topic label for the conversation (free-form, not a fixed enum)
Graded	How many graded conversations carried this topic in the period
Bad rate	Percentage of graded conversations with this topic that received a “bad” grade

Topics with a non-zero bad rate are shown in red. A high bad rate on a specific topic (e.g. “billing”, “refund”) suggests a training or process problem in that area, not a random sample.

The table shows up to 12 topics, ranked by volume.

Email reports

If AI Auto-QA email delivery is not configured, a banner appears at the top of the tab offering to set it up. Daily quality report emails are sent from Settings - Reports when enabled.

Role access

Only organization owners and admins can view the Quality tab. The Quality tab also accepts the agent filter (top of the reports page), so admins can scope the view to a single agent’s conversations.

Was this article helpful?

Reports Overview Agent Performance Agent Availability Live Escalations Report Your Dashboard