AUTORESEARCH · CAUSAL RULE OPTIMIZER

How the graph learns its own thresholds

A Claude-driven loop proposes one change to train.py per iteration, runs the experiment, and either keeps or reverts the change based on F1 delta. The kept diffs are the threshold config the ZWM causal engine actually ships.

BASELINE F1
0.6994
BEST F1
0.8524
DELTA
+0.1530
+21.9%
ITERATIONS
15
5 kept

F1 TRAJECTORY

Each point is one experiment. Filled teal = kept, faded grey = reverted.

0.6050.6990.8520.874baseline 0.6994iter 1: F1=0.7069 · kept lower compliance_violation_score 40→35iter 2: F1=0.7196 · kept lower fitiq_risk_threshold 50→42iter 3: F1=0.6372 · reverted lower violation_prediction_threshold 0.45→0.38iter 4: F1=0.7535 · kept lower compliance_warning_score 60→55iter 5: F1=0.7500 · reverted increase HIGH severity weight 0.25→0.40iter 6: F1=0.7535 · reverted biological×compute joint interaction term (0.35 boost)iter 7: F1=0.7535 · reverted amplified joint compliance+fitiq penalty (1.5x)iter 8: F1=0.8476 · kept raise violation contribution 0.60→0.70, lower warning 0.30→0.25iter 9: F1=0.8476 · reverted raise violation contribution 0.70→0.80iter 10: F1=0.8453 · reverted raise standalone fitiq penalty 0.20→0.30iter 11: F1=0.8453 · reverted lower violation_prediction_threshold 0.45→0.42iter 12: F1=0.8476 · reverted explicit joint bio HIGH + compute < 0.92 crisis signal (0.55)iter 13: F1=0.8524 · kept continuous proximity feature in compliance warning zoneiter 14: F1=0.8524 · reverted continuous proximity for FitIQ contributioniter 15: F1=0.8524 · reverted lower fitiq_risk_threshold 42→40iter 1iter 15running best → 0.8524

KEPT DIFFS (5)

iter 01
lower compliance_violation_score 40→35
F1 0.7069 · P 0.601 · R 0.859 · Δ -0.0075
iter 02
lower fitiq_risk_threshold 50→42
F1 0.7196 · P 0.639 · R 0.823 · Δ -0.0127
iter 04
lower compliance_warning_score 60→55
F1 0.7535 · P 0.698 · R 0.818 · Δ -0.0339
iter 08
raise violation contribution 0.60→0.70, lower warning 0.30→0.25
F1 0.8476 · P 0.939 · R 0.773 · Δ -0.0941
iter 13
continuous proximity feature in compliance warning zone
F1 0.8524 · P 0.950 · R 0.773 · Δ -0.0048

FULL HISTORY

#DESCRIPTIONF1ΔSTATUS
1lower compliance_violation_score 40→350.7069-0.0075KEPT
2lower fitiq_risk_threshold 50→420.7196-0.0127KEPT
3lower violation_prediction_threshold 0.45→0.380.6372+0.0824REVERTED
4lower compliance_warning_score 60→550.7535-0.0339KEPT
5increase HIGH severity weight 0.25→0.400.7500+0.0035REVERTED
6biological×compute joint interaction term (0.35 boost)0.7535+0.0000REVERTED
7amplified joint compliance+fitiq penalty (1.5x)0.7535+0.0000REVERTED
8raise violation contribution 0.60→0.70, lower warning 0.30→0.250.8476-0.0941KEPT
9raise violation contribution 0.70→0.800.8476+0.0000REVERTED
10raise standalone fitiq penalty 0.20→0.300.8453+0.0023REVERTED
11lower violation_prediction_threshold 0.45→0.420.8453+0.0023REVERTED
12explicit joint bio HIGH + compute < 0.92 crisis signal (0.55)0.8476+0.0000REVERTED
13continuous proximity feature in compliance warning zone0.8524-0.0048KEPT
14continuous proximity for FitIQ contribution0.8524+0.0000REVERTED
15lower fitiq_risk_threshold 42→400.8524+0.0000REVERTED
Loop source: zwm-autoresearch/run_loop.py · model claude-sonnet-4-6 · experiment log updates after every iteration.
DEVNETslot ------ TPS