Scoring Configuration

Debtmap uses a weighted scoring model to calculate technical debt priority. This chapter explains how to configure scoring weights, role multipliers, and related settings that affect how functions and files are prioritized.

Quick Reference

Here’s a quick overview of all scoring defaults (from src/config/scoring.rs):

Configuration	Default Value	Purpose
Scoring Weights
`coverage`	0.50	Weight for test coverage gaps
`complexity`	0.35	Weight for code complexity
`dependency`	0.15	Weight for dependency criticality
Role Multipliers
`pure_logic`	1.2	Prioritize pure computation
`orchestrator`	0.8	Reduce for delegation functions
`io_wrapper`	0.7	Reduce for I/O wrappers
`entry_point`	0.9	Slight reduction for main/CLI
`pattern_match`	0.6	Reduce for pattern matching
`debug`	0.3	Debug/diagnostic functions
`unknown`	1.0	No adjustment
Role Coverage Weights
`entry_point`	0.6	Reduce coverage penalty
`orchestrator`	0.8	Reduce coverage penalty
`pure_logic`	1.0	No reduction
`io_wrapper`	0.5	Reduce for I/O wrappers
`pattern_match`	1.0	Standard penalty
`debug`	0.3	Lowest coverage expectations
`unknown`	1.0	Standard penalty
Role Multiplier Clamping
`clamp_min`	0.3	Minimum multiplier
`clamp_max`	1.8	Maximum multiplier
`enable_clamping`	true	Enable clamping

Scoring Weights

The [scoring] section controls how different factors contribute to the overall debt score. Debtmap uses a weighted sum model where weights must sum to 1.0.

[scoring]
coverage = 0.50      # Weight for test coverage gaps (default: 0.50)
complexity = 0.35    # Weight for code complexity (default: 0.35)
dependency = 0.15    # Weight for dependency criticality (default: 0.15)

Active weights (used in scoring):

coverage - Prioritizes untested code (default: 0.50)
complexity - Identifies complex areas (default: 0.35)
dependency - Considers impact radius (default: 0.15)

Unused weights (reserved for future features):

semantic - Not currently used (default: 0.00)
security - Not currently used (default: 0.00)
organization - Not currently used (default: 0.00)

Validation rules:

All weights must be between 0.0 and 1.0
Active weights (coverage + complexity + dependency) must sum to 1.0 (±0.001 tolerance)
If weights don’t sum to 1.0, they will be automatically normalized

Example - Prioritize complexity over coverage:

[scoring]
coverage = 0.30
complexity = 0.55
dependency = 0.15

Source: src/config/scoring.rs:14-40 (ScoringWeights)

Complexity Weights

The [complexity_weights] section controls how cyclomatic and cognitive complexity are combined in the final scoring:

[complexity_weights]
cyclomatic = 0.3      # Weight for cyclomatic complexity (default: 0.3)
cognitive = 0.7       # Weight for cognitive complexity (default: 0.7)
max_cyclomatic = 50.0 # Maximum cyclomatic for normalization (default: 50.0)
max_cognitive = 100.0 # Maximum cognitive for normalization (default: 100.0)

Why cognitive complexity is weighted higher:

Cognitive complexity correlates better with bug density
Cyclomatic complexity is a proxy for test cases needed
The 70/30 split balances maintainability with testability

Source: src/config/scoring.rs:335-381 (ComplexityWeightsConfig)

Role Multipliers

Role multipliers adjust complexity scores based on a function’s semantic role:

[role_multipliers]
pure_logic = 1.2        # Prioritize pure computation (default: 1.2)
orchestrator = 0.8      # Reduce for delegation functions (default: 0.8)
io_wrapper = 0.7        # Reduce for I/O wrappers (default: 0.7)
entry_point = 0.9       # Slight reduction for main/CLI (default: 0.9)
pattern_match = 0.6     # Reduce for pattern matching (default: 0.6)
debug = 0.3             # Debug/diagnostic functions (default: 0.3)
unknown = 1.0           # No adjustment (default: 1.0)

These multipliers help reduce false positives by recognizing that different function types have naturally different complexity levels. The debug role has the lowest multiplier (0.3) since debug and diagnostic functions typically have low testing priority.

Source: src/config/scoring.rs:206-333 (RoleMultipliers)

Role-Based Scoring Configuration

DebtMap uses a two-stage role adjustment mechanism to accurately score functions based on their architectural role and testing strategy. This section explains how to configure both stages.

Stage 1: Role Coverage Weights

The first stage adjusts how much coverage gaps penalize different function types. This recognizes that not all functions need the same level of unit test coverage.

Configuration (.debtmap.toml under [scoring.role_coverage_weights]):

[scoring.role_coverage_weights]
entry_point = 0.6       # Reduce coverage penalty (often integration tested)
orchestrator = 0.8      # Reduce coverage penalty (tested via higher-level tests)
pure_logic = 1.0        # Pure logic should have unit tests, no reduction (default: 1.0)
io_wrapper = 0.5        # I/O wrappers are integration tested (default: 0.5)
pattern_match = 1.0     # Standard penalty
debug = 0.3             # Debug functions have lowest coverage expectations (default: 0.3)
unknown = 1.0           # Standard penalty (default behavior)

Rationale:

Function Role	Weight	Why This Value?
Entry Point	0.6	CLI handlers, HTTP routes, `main` functions are integration tested, not unit tested
Orchestrator	0.8	Coordination functions tested via higher-level tests
Pure Logic	1.0	Core business logic should have unit tests (default: 1.0)
I/O Wrapper	0.5	File/network operations tested via integration tests (default: 0.5)
Pattern Match	1.0	Standard coverage expectations
Debug	0.3	Debug/diagnostic functions have lowest testing priority (default: 0.3)
Unknown	1.0	Default when role cannot be determined

Example Impact:

# Emphasize pure logic testing strongly
[scoring.role_coverage_weights]
pure_logic = 1.5        # 50% higher penalty for untested logic
entry_point = 0.5       # 50% lower penalty for untested entry points
io_wrapper = 0.4        # 60% lower penalty for untested I/O

# Conservative approach (smaller adjustments)
[scoring.role_coverage_weights]
pure_logic = 1.1        # Only 10% increase
entry_point = 0.9       # Only 10% decrease

How It Works:

When a function has 0% coverage:

Entry Point (weight 0.6): Gets 60% penalty instead of 100% penalty
Pure Logic (weight 1.0): Gets 100% penalty (standard emphasis on testing)
I/O Wrapper (weight 0.5): Gets 50% penalty

This prevents entry points from dominating the priority list due to low unit test coverage while emphasizing the importance of testing pure business logic.

Source: src/config/scoring.rs:383-455 (RoleCoverageWeights)

Stage 2: Role Multiplier with Clamping

The second stage applies a final role-based multiplier to reflect architectural importance. This multiplier is clamped by default to prevent extreme score variations.

Configuration (.debtmap.toml under [scoring.role_multiplier]):

[scoring.role_multiplier]
clamp_min = 0.3           # Minimum multiplier (default: 0.3)
clamp_max = 1.8           # Maximum multiplier (default: 1.8)
enable_clamping = true    # Enable clamping (default: true)

Parameters:

Parameter	Default	Description
`clamp_min`	0.3	Minimum allowed multiplier - prevents functions from becoming invisible
`clamp_max`	1.8	Maximum allowed multiplier - prevents extreme score spikes
`enable_clamping`	true	Whether to apply clamping (disable for prototyping only)

Clamp Range Rationale:

Default [0.3, 1.8]: Balances differentiation with stability

Lower bound (0.3): I/O wrappers still contribute 30% of their base score
- Prevents them from becoming invisible in the priority list
- Ensures simple wrappers aren’t completely ignored
Upper bound (1.8): Critical functions get at most 180% of base score
- Prevents one complex function from dominating the entire list
- Maintains balanced prioritization across different issues

When to Adjust Clamp Range:

# Wider range for more differentiation
[scoring.role_multiplier]
clamp_min = 0.2           # Allow more reduction
clamp_max = 2.5           # Allow more emphasis

# Narrower range for more stability
[scoring.role_multiplier]
clamp_min = 0.5           # Less reduction
clamp_max = 1.5           # Less emphasis

# Disable clamping (not recommended for production)
[scoring.role_multiplier]
enable_clamping = false   # Allow unclamped multipliers
# Warning: May cause unstable prioritization

When to Disable Clamping:

Prototyping: Testing extreme multiplier values for custom scoring strategies
Special cases: Very specific project needs requiring wide multiplier ranges
Not recommended for production use as it can lead to unstable prioritization

Example Impact:

Without clamping:

Function: critical_business_logic (Pure Logic)
  Base Score: 45.0
  Role Multiplier: 2.5 (unclamped)
  Final Score: 112.5 (dominates entire list)

With clamping (default):

Function: critical_business_logic (Pure Logic)
  Base Score: 45.0
  Role Multiplier: 1.8 (clamped from 2.5)
  Final Score: 81.0 (high priority, but balanced)

Source: src/config/scoring.rs:457-493 (RoleMultiplierConfig)

Complete Example Configuration

Here’s a complete example showing both stages configured together:

# Stage 1: Coverage weight adjustments
[scoring.role_coverage_weights]
pure_logic = 1.0        # Pure logic should have unit tests (default: 1.0)
entry_point = 0.6       # Reduce penalty for integration-tested entry points
orchestrator = 0.8      # Partially reduce penalty for orchestrators
io_wrapper = 0.5        # I/O wrappers are integration tested (default: 0.5)
pattern_match = 1.0     # Standard
debug = 0.3             # Debug functions have lowest coverage expectations (default: 0.3)
unknown = 1.0           # Standard

# Stage 2: Role multiplier with clamping
[scoring.role_multiplier]
clamp_min = 0.3         # I/O wrappers contribute at least 30%
clamp_max = 1.8         # Critical functions get at most 180%
enable_clamping = true  # Keep clamping enabled for stability

How the Two Stages Work Together

The two-stage approach ensures role-based coverage adjustments and architectural importance multipliers work independently:

Example Workflow:

1. Calculate base score from complexity (10) and dependencies (5)
   -> Base = 15.0

2. Stage 1: Apply coverage weight based on role (Entry Point, weight 0.6)
   -> Coverage penalty reduced from 1.0 to 0.4
   -> Preliminary score = 15.0 * 0.4 = 6.0

3. Stage 2: Apply clamped role multiplier (Entry Point, multiplier 1.2)
   -> Clamped to [0.3, 1.8] -> stays 1.2
   -> Final score = 6.0 * 1.2 = 7.2

Key Benefits:

Coverage adjustments don’t interfere with role multiplier
Both mechanisms contribute independently to final score
Clamping prevents instability from extreme values
Configuration flexibility for different project needs

Verification

To see how role-based adjustments affect your codebase:

# Show detailed scoring breakdown
debtmap analyze . --verbose

# Look for lines like:
#   Coverage Weight: 0.6 (Entry Point adjustment)
#   Adjusted Coverage Penalty: 0.4 (reduced from 1.0)
#   Role Multiplier: 1.2 (clamped from 1.5)

For more details on how role-based adjustments reduce false positives, see the Role-Based Adjustments section in the Scoring Strategies guide.

Score Normalization

The [normalization] section controls how raw scores are normalized to a 0-10 scale:

[normalization]
linear_threshold = 10.0         # Use linear scaling below this value
logarithmic_threshold = 100.0   # Use logarithmic scaling above this value
sqrt_multiplier = 3.33          # Multiplier for square root scaling
log_multiplier = 10.0           # Multiplier for logarithmic scaling
show_raw_scores = true          # Show both raw and normalized scores

Normalization ensures scores are comparable across different codebases and prevents extreme outliers from dominating the results.

Source: src/config/scoring.rs:557-610 (NormalizationConfig)

Context Multipliers

The [context_multipliers] section dampens scores for non-production code (spec 191):

[context_multipliers]
examples = 0.1              # 90% reduction for example files
tests = 0.2                 # 80% reduction for test files
benchmarks = 0.3            # 70% reduction for benchmarks
build_scripts = 0.3         # 70% reduction for build scripts
documentation = 0.1         # 90% reduction for documentation
enable_context_dampening = false  # Disabled by default (use --context flag)

When enabled with --context, these multipliers reduce false positive urgency scores for code that doesn’t represent production complexity.

Source: src/config/scoring.rs:612-676 (ContextMultipliers)

Data Flow Scoring

The [data_flow_scoring] section configures analysis of data flow patterns (spec 218):

[data_flow_scoring]
enabled = true              # Enable data flow scoring
purity_weight = 0.4         # Weight for function purity
refactorability_weight = 0.3 # Weight for refactorability
pattern_weight = 0.3        # Weight for pattern recognition

Data flow scoring rewards functions with:

Pure data transformations (no side effects)
High refactorability potential
Recognized functional patterns (map, filter, fold)

Source: src/config/scoring.rs:678-724 (DataFlowScoringConfig)

Rebalanced Scoring Presets

The [rebalanced_scoring] section allows using predefined presets or custom weights (spec 136):

[rebalanced_scoring]
preset = "balanced"         # Preset: balanced, quality-focused, size-focused, test-coverage
# Or override with custom weights:
# complexity_weight = 0.35
# coverage_weight = 0.30
# structural_weight = 0.15
# size_weight = 0.10
# smell_weight = 0.10

Available Presets:

Preset	Focus	Best For
`balanced`	Even distribution	General use
`quality-focused`	Complexity + coverage	Mature projects
`size-focused`	File/function size	Refactoring projects
`test-coverage`	Coverage gaps	Test improvement efforts

Source: src/config/scoring.rs:495-554 (RebalancedScoringConfig)

Scoring Strategies - In-depth explanation of scoring algorithms
Role-Based Adjustments - How semantic roles affect scoring
Thresholds Configuration - Configure when code is flagged as debt
Tiered Prioritization - How scores map to priority tiers

Keyboard shortcuts

Debtmap Documentation