Back to overview
Lesson 3 of 6

Building a Traffic Quality Scoring Framework

8 min read

Why a Scoring Framework Beats Manual Review

When an affiliate program has 20 partners, an experienced manager can mentally track who sends quality traffic and who does not. At 200 partners, that is impossible. A scoring framework replaces subjective judgment with consistent, measurable evaluation. It allows programs to process hundreds of traffic streams simultaneously, flag degradation before it impacts revenue, and make commission decisions based on data rather than relationships.

The goal is not to build a perfect model. It is to build a model that is consistently better than no model. Even a basic scoring framework that catches the bottom 10% of traffic quality will improve program economics by reducing wasted commission spend on partners who deliver negative-value users.

The Four Scoring Dimensions

A robust traffic quality score combines four distinct measurement dimensions. Each captures a different aspect of quality and compensates for blind spots in the others. Relying on any single dimension creates exploitable gaps -- affiliates will optimize for the metric you measure and ignore everything else.

DimensionWhat It MeasuresExample MetricsWeight Range
Technical SignalsWhether the traffic is human and legitimateBot detection rate, IP diversity, device fingerprint uniqueness, VPN/proxy percentage15-25%
Behavioral EngagementHow users interact after arrivingTime on site, pages per session, verification completion rate, deposit within 24h25-35%
Conversion QualityWhether conversions translate to revenueFTD-to-active ratio, average deposit size, first-week revenue, bonus-to-deposit ratio25-35%
Retention and LTVWhether users generate long-term valueDay-7 retention, day-30 retention, months active, cumulative revenue per user15-25%

Start with equal weights across all four dimensions and adjust after 60-90 days of data collection. Programs with strong fraud detection can reduce the Technical Signals weight. Programs with high churn should increase the Retention and LTV weight. The weighting reflects what matters most to your specific business model.

Selecting Metrics for Each Dimension

Each dimension needs 3-5 specific metrics that are measurable, comparable across partners, and resistant to manipulation. Avoid metrics that affiliates can easily game -- registration count is gameable through incentivized traffic, but day-7 retention is much harder to fake because it requires genuine user engagement over time.

  • Technical: Unique IP ratio (unique IPs / total clicks), device diversity score, percentage of clicks from known data center IPs, average time between click and conversion
  • Behavioral: Registration-to-verification rate, average session duration on first visit, number of product pages viewed, deposit within first 48 hours
  • Conversion: Average first deposit amount, FTD-to-second-deposit ratio, revenue generated in first 7 days, bonus claim rate (lower is often better)
  • Retention: Day-7 active rate, day-30 active rate, average sessions per week in month one, cumulative net revenue at day 60

Scoring Methodology

The simplest effective approach is percentile-based scoring within each dimension. For each metric, rank all active affiliates by their 30-day rolling average, then assign a percentile score from 0-100. Average the percentile scores within each dimension, then apply dimension weights to produce a composite Quality Score from 0-100.

This percentile approach has two advantages: it automatically adjusts to your program's baseline (no need to set absolute thresholds for each metric), and it clearly identifies which partners are below average relative to their peers. A partner with a Quality Score of 35 is in the bottom third of your program -- that is actionable regardless of what the absolute numbers look like.

Quality Score RangeClassificationTypical Action
80-100PremiumEligible for enhanced commission tiers and early access to promotions
60-79StandardNormal commission rates, standard program terms
40-59Watch ListReduced commission rates or additional qualification requirements applied
20-39ProbationAccount under review, payouts held pending quality improvement
0-19CriticalAccount suspended or terminated, payouts forfeited per program terms

Document your scoring methodology and communicate quality expectations to partners during onboarding. Affiliates who understand how quality is measured will self-select into higher-quality traffic sources. Opacity about quality standards attracts partners who rely on not being measured.

Common Scoring Mistakes

  • Over-weighting conversion rate: High conversion rate from incentivized traffic looks great on paper but produces zero LTV
  • Ignoring sample size: A partner with 5 conversions and 100% retention is not higher quality than one with 500 conversions and 70% retention -- apply minimum volume thresholds before scoring
  • Static scoring: Recalculate scores on a rolling 30-day basis, not quarterly snapshots that miss rapid degradation
  • Single-metric shortcuts: Using deposit amount alone as a quality proxy misses users who deposit large and chargeback immediately
  • Not accounting for vertical mix: If your program spans iGaming and Forex, score each vertical separately because baseline metrics differ significantly

Key Takeaways

  • A four-dimension scoring framework (technical, behavioral, conversion, retention) provides comprehensive quality measurement that resists manipulation
  • Percentile-based scoring automatically adjusts to your program baseline and clearly identifies below-average partners
  • Apply minimum volume thresholds before scoring to avoid small-sample distortions
  • Recalculate quality scores on a rolling 30-day basis to catch rapid traffic quality degradation
  • Communicate scoring methodology to affiliates -- transparency attracts quality partners and deters low-quality sources