Clusters

Clusters is in beta

Clusters is currently in beta. We'd love to hear your feedback as we develop this feature.

Requirements

Traces or generations captured – Set up LLM event capture using the installation guide.
AI data processing enabled – Your organization must have AI data processing consent enabled, the same requirement as trace summarization.
Automatic daily runs – Clustering runs automatically every day. No manual setup is needed.

Clusters automatically group similar LLM traces or generations together, helping you discover patterns in how users interact with your LLM features without reading every conversation.

Why use clusters?

Discover usage patterns – See what users are actually asking your LLM to do.
Find problem areas – Identify clusters with high error rates, costs, or latency.
Monitor trends – Daily runs show how usage patterns evolve over time.
Surface outliers – Items that don't fit any cluster are flagged as unusual behavior.

How clusters work

PostHog analyzes your recent traces or generations from the past 7 days.
Similar items are grouped together based on their content.
AI generates a title and description for each cluster.
Metrics like average cost, latency, token usage, and error rate are computed per cluster.

Clustering runs automatically every day — no setup needed beyond having traces captured.

Viewing clusters

Navigate to LLM analytics > Clusters to see the overview.

Trace vs generation clustering

Clusters can operate at two levels:

Traces – Cluster entire conversations, giving you a high-level view of what users are doing.
Generations – Cluster individual LLM calls, useful for analyzing specific model interactions.

Toggle between them using the selector at the top of the page.

Distribution bar

The color-coded bar at the top shows the proportional size of each cluster. Hover over a segment to see the cluster name and item count. Click a segment to jump to that cluster.

Scatter plot

The interactive 2D scatter plot visualizes how items are distributed across clusters.

Feature	Description
Dots	Each dot represents one trace or generation, color-coded by cluster
Centroids	Larger dots marking the center of each cluster
Click dot	Navigate to the trace or generation detail
Click centroid	Navigate to the cluster detail page
Drag to zoom	Zoom into a region of the plot
Double-click	Reset zoom

Cluster cards

Below the scatter plot, a grid of cards shows each cluster with:

AI-generated title and description summarizing the cluster's content
Size – Item count and percentage of total
Metrics – Average cost, average latency, average tokens, error rate, and total cost

Expand a card to preview the traces or generations in that cluster.

Items that don't fit any cluster appear in a special Outliers cluster with a dashed border.

Cluster detail page

Click a cluster card or centroid to open the detail page. Here you'll find:

Cluster title, description, and item count with the date range
Focused scatter plot showing only items in this cluster
Paginated list of traces or generations with AI summaries you can expand to see flow diagrams, summary bullets, and notes
Click any item to view the full trace timeline

Clusters in trace view

When viewing an individual trace, the Clusters tab shows which cluster(s) that trace belongs to. This is useful for understanding what category a specific conversation falls into.