Segmentation/Clustering (Common Groups)

Discover hidden patterns and natural groupings in your data with AI-powered clustering

Clustering automatically discovers natural groupings in your data—customer segments, behavior patterns, or performance tiers—that you might not have thought to look for. Scoop's AI handles the complex math and delivers clear, actionable segment descriptions in plain English.

What Is Clustering?

AspectDescription
DefinitionMachine learning that groups similar records together
MethodAnalyzes multiple attributes simultaneously
OutputNamed segments with clear descriptions
Skill RequiredNone—Scoop handles the data science

Clustering vs. Filtering

ApproachHow It WorksExample
FilteringYou define the rule"Show deals over $100K"
GroupingYou pick the column"Group by Region"
ClusteringAI finds the patterns"Discover natural segments"

Clustering considers combinations of attributes across multiple columns simultaneously, finding patterns you might never have thought to look for.

Tip: Use clustering when you want to discover "unknown unknowns"—patterns you didn't know existed in your data.

How Clustering Works in Scoop

The Workflow

Select Dataset → Click "Find Common Groups" → AI Analyzes → Segments Revealed
      ↓                    ↓                        ↓              ↓
   Your data          One click              EM algorithm    Named segments
                                            + Rules models   with descriptions

Step-by-Step

  1. Select Your Dataset: Load the dataset to analyze (deals, customers, tickets, etc.)
  2. Click "Find Common Groups": Access from Data Science Studio tab
  3. AI Builds Clusters: Scoop determines optimal number of clusters automatically
  4. Clusters Named: AI translates statistical patterns into human descriptions
  5. Results Delivered: See summaries, profiles, and defining rules

What Happens Behind the Scenes

StageAlgorithmPurpose
ClusteringExpectation Maximization (EM)Finds natural groupings
InterpretationDecision TreesIdentifies defining attributes
NamingRules ModelsCreates human-readable descriptions

Tip: You don't need to understand the algorithms. Scoop handles variable selection, scaling, and cluster assignment automatically.

Understanding Cluster Results

Cluster Profiles

Each cluster includes:

ComponentDescription
NameDescriptive label based on key characteristics
SizeNumber and percentage of records
Key AttributesWhat defines this segment
RulesConditions that predict membership

How Clusters Are Named

Scoop creates meaningful names from the data:

Pattern DetectedCluster Name
High revenue, short cycles"Quick Wins"
Large deals, many stakeholders"Strategic Accounts"
Frequent engagement"High Touch Customers"
Low activity, at-risk signals"Needs Attention"

Names are derived from actual patterns—not random labels.

Cluster Rules

Each cluster comes with defining rules:

Cluster: "Fast Closers"
Rules:
- Deal Size < $50K
- Sales Cycle < 30 days
- Decision Makers = 1-2
- Industry IN (Tech, Retail)

These rules tell you exactly why records belong to this segment.

Example: Sales Opportunity Clusters

Sample Results

After running clustering on opportunities:

ClusterSizeKey Characteristics
Quick Wins35%Small deals, single decision maker, fast close
Enterprise Deals15%Large value, long cycle, multiple stakeholders
Stalled Opportunities20%Aging deals, low engagement, needs intervention
Competitive Battles18%Competitor mentioned, price sensitive
Expansion Deals12%Existing customers, upsell/cross-sell

What Each Cluster Tells You

ClusterActionable Insight
Quick WinsPrioritize for this month's close
Enterprise DealsAssign senior reps, plan resources
Stalled OpportunitiesIntervention needed—re-engage or disqualify
Competitive BattlesArm reps with competitive positioning
Expansion DealsLeverage existing relationship

Example: Support Ticket Clusters

Sample Results

ClusterKey Characteristics
Quick ResolutionsResolved < 1 hour, repeat customers, known issues
Escalated CriticalsHigh severity, specific products, long resolution
Low EngagementMinimal customer response, incomplete descriptions
Complex InvestigationsMultiple back-and-forths, cross-team involvement

Insights

  • Quick Resolutions: Document common fixes for knowledge base
  • Escalated Criticals: Route to senior agents immediately
  • Low Engagement: Implement proactive outreach
  • Complex Investigations: Allocate specialist time

Example: Customer Segmentation

Sample Results

ClusterSizeProfile
Power Users8%High engagement, many features, advocates
Steady State45%Regular usage, core features only
Growing22%Increasing activity, adding users
At Risk15%Declining usage, no recent logins
Dormant10%Minimal activity, churn candidates

Actions by Segment

ClusterRecommended Action
Power UsersReference program, beta access
Steady StateFeature adoption campaigns
GrowingUpsell opportunities, success check-ins
At RiskIntervention, health check calls
DormantRe-engagement campaign or churn prevention

Using Cluster Results

Filter by Cluster

Once clusters are identified, use them as filters:

  • View all records in a specific cluster
  • Compare metrics across clusters
  • Track cluster distribution over time

Export for Action

Use CaseHow
Targeted campaignsExport cluster for marketing automation
Sales prioritizationFilter opportunities by cluster
Support routingAssign tickets based on cluster

Track Changes

Monitor how clusters evolve:

  • Are "At Risk" customers increasing?
  • Is "Quick Wins" cluster growing?
  • Movement between clusters over time

Best Practices

When to Use Clustering

Good FitPoor Fit
Discover unknown patternsConfirm known groups
Explore large datasetsVery small datasets
Find natural segmentsSimple filtering needed
Multi-dimensional analysisSingle-attribute grouping

Data Preparation

PracticeWhy
Include relevant attributesMore columns = richer segments
Clean data qualityMissing values affect clustering
Sufficient recordsNeed enough data for patterns
Mix of attributesVariety enables discovery

Interpreting Results

TipDescription
Review cluster sizesVery small clusters may be noise
Check rulesUnderstand what defines each segment
Validate with domain knowledgeDo segments make business sense?
Test actionabilityCan you act on these insights?

Common Applications

DomainClustering Use
SalesOpportunity segmentation, rep assignment
MarketingCustomer segments, campaign targeting
SupportTicket routing, workload planning
ProductUser personas, feature usage patterns
FinanceCustomer value tiers, risk groups

Troubleshooting

Too Many/Few Clusters

Scoop automatically determines optimal cluster count. If results seem off:

  • Check if data has clear natural groupings
  • Verify data quality and completeness
  • Consider filtering to relevant subset

Clusters Don't Make Sense

If segments aren't meaningful:

  • Review which columns are included
  • Check for data quality issues
  • Validate with domain experts

Clusters Too Similar

If segments overlap significantly:

  • Data may not have strong natural groupings
  • Consider adding more discriminating attributes
  • Focus on the most distinct clusters

Related Topics