GraphMemBench v2

A κ-sensitive benchmark for cyclic memory

Eight deterministic synthetic tiers designed to isolate what κ-topology actually contributes to retrieval. T1–T6 test κ-sensitive cycle detection; T7–T8 benchmark graph algorithm quality (evidence paths, causal ordering). Each tier runs topology ON vs OFF side by side.

From κ=0 controls to graph algorithm benchmarks

T1–T2 are acyclic controls. T3–T6 increase in κ complexity and retrieval difficulty. T7–T8 benchmark graph algorithms: weighted shortest paths (Dijkstra) and causal DAG ordering (toposort).

T1

κ = 0
Linear chains

Acyclic control. Direct causal chains A→B→C→D with no loop-back. Topology should stay inert — no SCCs, no κ, no deliberate routing.

Δ topology on/off: 0pp
max κ observed: 0 / 0  ✓ correct

T2

κ = 0
Branching DAG

Acyclic control. Tree-shaped causal decomposition: root→{A,B}→{A1,A2,B1}. Tests that fan-out alone doesn't trigger false-positive κ routing.

Δ topology on/off: 0pp
max κ observed: 0 / 0  ✓ correct

T3

κ = 1
Simple cycles

Disjoint 3–5 node SCCs forming directed cycles A→B→C→A. Primary test for cycle-root paradox and membership-in-cycle retrieval.

Δ kappa_recall: +100pp
Δ routing_precision: +100pp
Δ cycle_root_accuracy: +100pp
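
The SCC detection this tier exercises is the textbook kind. A minimal Python sketch (illustrative only, not the project's Elixir implementation) using Kosaraju's algorithm, showing how a directed cycle A→B→C→A collapses into one SCC while a T1-style linear chain yields only singletons:

```python
from collections import defaultdict

def sccs(edges):
    """Kosaraju's algorithm: strongly connected components of a digraph."""
    graph, rgraph = defaultdict(list), defaultdict(list)
    nodes = set()
    for u, v in edges:
        graph[u].append(v)
        rgraph[v].append(u)
        nodes.update((u, v))

    order, seen = [], set()
    def dfs1(u):
        seen.add(u)
        for v in graph[u]:
            if v not in seen:
                dfs1(v)
        order.append(u)  # record post-order finish time
    for u in nodes:
        if u not in seen:
            dfs1(u)

    comp = {}
    def dfs2(u, root):
        comp[u] = root
        for v in rgraph[u]:
            if v not in comp:
                dfs2(v, root)
    for u in reversed(order):  # sweep reversed graph in decreasing finish time
        if u not in comp:
            dfs2(u, u)

    groups = defaultdict(set)
    for u, r in comp.items():
        groups[r].add(u)
    return list(groups.values())

# A 3-node cycle collapses into a single SCC; a linear chain (T1) does not.
cycle = sccs([("A", "B"), ("B", "C"), ("C", "A")])
chain = sccs([("A", "B"), ("B", "C"), ("C", "D")])
```

An acyclic control like T1 is exactly the case where every SCC is a singleton, which is why topology should stay inert there.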

T4

κ ≥ 2
Multi-SCC fault-line

Dense bidirectional ring + chord edges per SCC. Tests fault-line detection and high-κ routing. Configurable chord density scales κ upward.

Δ kappa_recall: +100pp
Δ faultline_mrr: positive signal
max κ observed: 2 / 0

T5

κ = 0–1
Adversarial contradiction

Paired old/new facts per subject, half joined by bidirectional contradicts (κ=1), half by one-way supersedes (κ=0). Tests belief revision under κ pressure.

Δ kappa_recall (κ=1 subset): +100pp
fresh_belief_top1_rate tracked
belief_revision_rank_mean tracked

T6

mixed κ
κ-discrimination

SCCs planted across a density ladder [1..5] — the retriever must distinguish κ values, not just detect that a cycle exists. New metrics: kappa_discrim_accuracy and kappa_discrim_mae.

Δ kappa_discrim_accuracy: +100pp
Δ kappa_discrim_mae: −2.0
max κ observed: 2 / 0
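
Assuming the two discrimination metrics follow their standard definitions (exact-match accuracy and mean absolute error over predicted κ values), they reduce to a few lines. A hedged sketch, with toy numbers that are not from the benchmark:

```python
def discrim_metrics(predicted, gold):
    """Exact-match accuracy and mean absolute error over per-SCC kappa estimates."""
    pairs = list(zip(predicted, gold))
    accuracy = sum(p == g for p, g in pairs) / len(pairs)
    mae = sum(abs(p - g) for p, g in pairs) / len(pairs)
    return accuracy, mae

# Toy example: the retriever nails 3 of 5 kappa values on a density ladder.
acc, mae = discrim_metrics([1, 2, 2, 4, 5], [1, 2, 3, 4, 4])
```

Note the two metrics fail differently: accuracy punishes any miss equally, while MAE rewards being off by one over being off by four, which is why both are tracked.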

T7

algorithm
Evidence path tracing

Weighted DAGs with known shortest paths (Dijkstra). Tests whether retrieval preserves evidence chain ordering and hop counts. Alternate paths probe multi-path awareness.

path_node_recall: 1.00
path_order_accuracy: 0.60
hop_count_mae: 4.00 (baseline)

T8

algorithm
Causal DAG ordering

Layered DAGs with known topological order and critical path depth. Tests causal sequencing (toposort), longest-path detection, and source/sink identification.

ordering_accuracy: 0.46 (baseline)
critical_depth_mae: 2.67
source_sink_recall: 1.00

T1–T6: topology on vs off

Every tier passes the ≥3pp κ-metric gate (or — for the κ=0 controls — correctly produces zero delta). Sanity mode, deterministic from --seed 42.

Tier | κ class | kappa_recall on | kappa_recall off | Δ | max κ on/off | p50 on/off | Gate
T1 | κ=0 linear | 0.00 | 0.00 | 0pp | 0 / 0 | 30ms / 32ms | ✓ correct
T2 | κ=0 branching | 0.00 | 0.00 | 0pp | 0 / 0 | 30ms / 29ms | ✓ correct
T3 | κ=1 cycle | 1.00 | 0.00 | +100pp | 1 / 0 | 10.2s / 32ms | ✓ pass
T4 | κ≥2 multi-SCC | 1.00 | 0.00 | +100pp | 2 / 0 | 43ms / 32ms | ✓ pass
T5 | κ=0–1 contradict | 1.00 | 0.00 | +100pp | 1 / 0 | 41ms / 39ms | ✓ pass
T6 | mixed κ | 1.00 | 0.00 | +100pp | 2 / 0 | 46ms / 38ms | ✓ pass

T7–T8: graph algorithm quality

These tiers are κ-independent — they benchmark whether the retriever preserves structural properties of the ingested graph. Topology on/off should not affect these metrics (and doesn't).

T7 · Evidence Path Tracing (Dijkstra)

Metric | Topology ON | Topology OFF | Δ
path_node_recall | 1.00 | 1.00 | 0
path_order_accuracy | 0.60 | 0.60 | 0
hop_count_mae | 4.00 | 4.00 | 0
alt_path_detected | 1.00 | 1.00 | 0
latency p50 / p95 | 32ms / 200ms | 35ms / 275ms | ~0

Interpretation. All gold shortest-path nodes are retrieved (recall = 1.0) and alternate paths are detected. Ordering accuracy is 0.60 — the retriever returns the right nodes but doesn't perfectly preserve Dijkstra ordering, as expected (similarity ranking ≠ shortest-path ranking). Hop count MAE of 4.0 reflects the same ordering gap: retrieved node count is correct, but inferred hops differ from the gold chain length.

Baseline established. These numbers set the floor for the trace_evidence_path MCP tool, which uses Dijkstra directly and should achieve MAE ≈ 0.
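
The gold paths behind these metrics come from standard Dijkstra. A minimal Python sketch (an illustration of the algorithm, not the project's trace_evidence_path code) showing how a gold shortest path and its hop count can be derived for comparison against retrieval order:

```python
import heapq

def dijkstra(graph, src, dst):
    """Shortest weighted path src -> dst.
    graph: {node: [(neighbor, weight), ...]}; assumes dst is reachable."""
    dist, prev = {src: 0}, {}
    heap = [(0, src)]
    while heap:
        d, u = heapq.heappop(heap)
        if u == dst:
            break
        if d > dist.get(u, float("inf")):
            continue  # stale heap entry, already relaxed via a shorter route
        for v, w in graph.get(u, []):
            nd = d + w
            if nd < dist.get(v, float("inf")):
                dist[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    path, node = [], dst
    while node != src:  # walk predecessor links back to the source
        path.append(node)
        node = prev[node]
    path.append(src)
    return list(reversed(path)), dist[dst]

# Weighted DAG with a heavy direct edge and a cheaper multi-hop detour.
g = {"A": [("B", 1), ("C", 5)], "B": [("C", 1), ("D", 4)], "C": [("D", 1)]}
path, cost = dijkstra(g, "A", "D")
```

Here the gold chain is A→B→C→D at cost 3 (three hops), beating both the A→C→D and A→B→D alternatives; hop_count_mae compares hops inferred from retrieval against this gold chain length.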

T8 · Causal DAG Ordering (Toposort)

Metric | Topology ON | Topology OFF | Δ
ordering_accuracy | 0.46 | 0.46 | 0
critical_depth_mae | 2.67 | 2.67 | 0
source_sink_recall | 1.00 | 1.00 | 0
latency p50 / p95 | 34ms / 205ms | 33ms / 201ms | ~0

Interpretation. Source and sink nodes are perfectly retrieved (recall = 1.0). Ordering accuracy is 0.46 — roughly coin-flip — because similarity-based retrieval doesn't respect causal layer ordering. Critical depth MAE of 2.67 confirms the retriever sees nodes from some but not all layers.

Baseline established. These results quantify the gap that the DAG algorithms (Graphonomous.Algorithms.DAG) close: Kahn's toposort + longest-path DP should push ordering accuracy toward 1.0 and depth MAE toward 0.
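
The two algorithms named here compose naturally, since Kahn's algorithm visits nodes in an order that lets the longest-path DP run in a single pass. A hedged Python sketch of the combination (illustrative only, not Graphonomous.Algorithms.DAG itself):

```python
from collections import defaultdict, deque

def topo_order_and_depth(edges):
    """Kahn's toposort plus longest-path DP over a DAG.
    Returns (topological order, critical path depth in edges)."""
    succ, indeg, nodes = defaultdict(list), defaultdict(int), set()
    for u, v in edges:
        succ[u].append(v)
        indeg[v] += 1
        nodes.update((u, v))

    queue = deque(sorted(n for n in nodes if indeg[n] == 0))  # sources first
    order, depth = [], {n: 0 for n in nodes}
    while queue:
        u = queue.popleft()
        order.append(u)
        for v in succ[u]:
            depth[v] = max(depth[v], depth[u] + 1)  # longest-path DP step
            indeg[v] -= 1
            if indeg[v] == 0:
                queue.append(v)
    return order, max(depth.values())

# Two-layer diamond: root fans out, then reconverges at the sink.
order, critical_depth = topo_order_and_depth(
    [("root", "A"), ("root", "B"), ("A", "sink"), ("B", "sink")]
)
```

Every valid topological order places root before A/B and A/B before sink, which is the layer ordering that similarity-based retrieval alone gets roughly coin-flip right.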

All results: sanity mode, --seed 42, fallback embedder. T7/T8 are κ-independent — topology on/off produces identical algorithm metrics by design.

Scale the benchmark until it breaks

The point of a synthetic benchmark is to control the difficulty axis. The CLI flags below let you push Graphonomous along five orthogonal pressure points and watch where performance degrades.

1 · Density

--density N

T4 chord-edge density. 1 = bidirectional ring only, 5 = dense layered chords. Higher density increases κ and forces fault-line detection to work harder.

2 · Distractor flood

--distractors N

Injects N acyclic chains of noise alongside the gold SCCs. Beyond some threshold the similarity stage drops gold clusters from its top-K, collapsing kappa_recall.
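
The collapse mechanism is a plain top-K cutoff. A toy model (hypothetical scores, not benchmark data) showing how gold recall goes from perfect to zero once distractors outscore the gold cluster under a fixed retrieval limit:

```python
def topk_gold_recall(gold_scores, distractor_scores, k):
    """Fraction of gold items that survive a top-k cut over the merged ranking."""
    ranked = sorted(
        [(s, True) for s in gold_scores] + [(s, False) for s in distractor_scores],
        reverse=True,
    )
    kept = sum(is_gold for _, is_gold in ranked[:k])
    return kept / len(gold_scores)

# 5 gold SCC members against shallow vs. deep pools of noise chains.
recall_small = topk_gold_recall([0.9] * 5, [0.8] * 10, k=10)   # gold outranks noise
recall_flood = topk_gold_recall([0.7] * 5, [0.8] * 100, k=10)  # noise fills top-k
```

Topology can only amplify what the similarity stage surfaces, so once gold clusters fall out of the top-K, kappa_recall collapses regardless of the topology setting.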

3 · Retrieval squeeze

--similarity-limit N
--final-limit N
--expansion-hops N
--neighbors-per-node N

Shrink the retrieval budget that topology gets to work with. At similarity-limit=3, hops=0, T4 kappa_recall drops to 0.90 and faultline_mrr falls to 0.60.

4 · Mixed-κ tier (T6)

--tier 6

Discrimination, not detection. SCCs planted across a density ladder force the retriever to resolve the exact κ value. Tracks kappa_discrim_accuracy and MAE.

5 · Content homogenization

--homogenize true

Strips distinctive domain words and replaces them with system-N tokens. Similarity retrieval can no longer shortcut to the right cluster — topology has to carry the weight.

Run it locally

All tiers are deterministic and reproduce bit-for-bit from the same seed. Fixtures live in priv/graphmembench/fixtures/T{1..8}/.

# Single tier, sanity mode (5 SCCs, 10 questions)
$ mix benchmark.graphmembench --tier 3 --sanity --topology on

# Full-size run, 100 questions per tier
$ mix benchmark.graphmembench --tier 4 --density 3

# Squeeze the retrieval budget until topology breaks
$ mix benchmark.graphmembench --tier 4 --density 5 \
    --similarity-limit 3 --final-limit 5 --expansion-hops 0

# Homogenized content — similarity can't shortcut
$ mix benchmark.graphmembench --tier 6 --homogenize true \
    --distractors 100

# T7: Evidence path tracing (Dijkstra shortest paths)
$ mix benchmark.graphmembench --tier 7 --sanity --topology on

# T8: Causal DAG ordering (toposort + longest path)
$ mix benchmark.graphmembench --tier 8 --sanity --topology on

Additive-only design. GraphMemBench v2 is a harness, not a patch — it never modifies retriever.ex, topology.ex, or the LongMemEval path. Running it cannot affect production behavior. Results land in benchmark_results/graphmembench_T{N}_topology_{on,off}.json for offline comparison.