4 Debugging and Profiling

4.1 Tracing with rust_log

panproto uses the tracing crate for structured logging. Control verbosity through the RUST_LOG environment variable:

# Debug output for a single crate
RUST_LOG=panproto_inst=debug cargo nextest run -p panproto-inst

# Trace-level output (very verbose) for the migration engine
RUST_LOG=panproto_mig=trace cargo nextest run -p panproto-mig

# Multiple crates at different levels
RUST_LOG=panproto_gat=info,panproto_mig=debug cargo nextest run --workspace

# Everything at debug
RUST_LOG=debug cargo nextest run --workspace

Filtering a single test with tracing

Combine RUST_LOG with a test filter to get focused output:

RUST_LOG=panproto_mig=debug cargo nextest run -p panproto-mig -- test_lift_roundtrip

4.2 Stepping through the w-type restrict pipeline

The restrict pipeline in panproto-inst transforms a W-type instance (a tree-shaped data record whose structure is determined by the schema) when a migration removes or restructures schema elements. The pipeline has five key functions that serve as natural debugging waypoints:

anchor_surviving() in crates/panproto-inst/src/wtype.rs: determines which nodes survive the migration. If output is missing data, check whether the surviving set is too small.
reachable_from_root() in crates/panproto-inst/src/wtype.rs: BFS from root through anchor-surviving nodes. If nodes appear unreachable despite surviving, the reachability check is the place to look.
ancestor_contraction() in crates/panproto-inst/src/wtype.rs: merges ancestor chains when intermediate vertices are removed. Panics here usually mean the surviving set is inconsistent with the vertex remap.
resolve_edge() in crates/panproto-inst/src/wtype.rs: rewires edges according to the compiled migration’s edge remap. If edges point to wrong targets, inspect the resolver.
reconstruct_fans() in crates/panproto-inst/src/wtype.rs: rebuilds fan structures (ordered child collections) after restrict. Missing or reordered children point to fan reconstruction issues.

Set RUST_LOG=panproto_inst=debug to see entry/exit traces for each function, including the sizes of intermediate data structures.

4.3 Inspecting WASM boundary data

The WASM boundary (panproto-wasm) serializes all data as MessagePack via rmp-serde. When debugging WASM integration issues, you may need to inspect the raw bytes.

4.3.1 Hex-dump a MessagePack payload

In the TypeScript SDK, add a temporary log before deserialization:

// In sdk/typescript/src/msgpack.ts
console.log('msgpack hex:', Buffer.from(bytes).toString('hex'));

Then decode the hex with a MessagePack inspector (e.g., msgpack.org or the msgpack-tools CLI).

4.3.2 Common deserialization errors

Error	Cause	Fix
`InvalidMarkerRead`	Empty or truncated payload	Check that the Rust side returned data (not an empty `Vec<u8>`)
`TypeMismatch`	Schema version skew between Rust and TS	Rebuild WASM and TS SDK from the same commit
`LengthMismatch` on arrays	Struct field added/removed on one side	Ensure `#[serde(rename)]` and TS types match
`UnknownVariant`	Enum variant added in Rust but not in TS	Add the variant to the TS `types.ts` discriminated union

4.4 Benchmarking with divan

Three crates have benchmarks using the divan framework:

# GAT engine benchmarks (theory construction, morphism checking, colimit)
cargo bench -p panproto-gat

# Instance benchmarks (W-type construction, restrict, parse)
cargo bench -p panproto-inst

# Migration benchmarks (compile, lift, compose)
cargo bench -p panproto-mig

4.4.1 Interpreting output

divan produces output like:

panproto_gat         fastest       │ slowest       │ median        │ mean
├─ colimit_small     142.3 ns      │ 289.1 ns      │ 148.7 ns      │ 152.4 ns
├─ colimit_large     12.41 µs      │ 18.92 µs      │ 13.05 µs      │ 13.28 µs
╰─ morphism_check    84.21 ns      │ 127.6 ns      │ 88.43 ns      │ 90.12 ns

Focus on the median column. Differences under 5% are noise; investigate anything over 10%.

4.4.2 CI regression checks

CI runs benchmarks on every PR and compares against the main branch baseline. If a regression exceeds the threshold (currently 15%), CI reports it as a warning (not a failure). The benchmark data is attached as a PR comment.

To reproduce a CI benchmark comparison locally:

# Baseline on main
git checkout main
cargo bench -p panproto-gat -- --save-baseline main

# Your branch
git checkout your-branch
cargo bench -p panproto-gat -- --baseline main

4.5 Common debugging scenarios

4.5.1 Migration compiles but lift returns wrong data

Check the surviving set. Run with RUST_LOG=panproto_inst=debug and look at the anchor_surviving output. Are all expected nodes present?
Inspect the vertex remap. Print the compiled migration’s vertex_remap field. Does every source vertex map to the correct target vertex?
Trace the lift. The lift reconstructs source-schema data from a target instance. If it produces wrong values, the resolver functions in the compiled migration may be incorrect.

4.5.2 Existence check passes but restrict panics

The existence check (panproto-mig/src/existence.rs) verifies that a migration can be applied, but it does not run the full restrict pipeline. A panic during restrict usually means:

Resolver coverage gap. The compiled migration has a resolver for every edge in the target schema, but one of them returns an invalid handle. Add a test that calls restrict() directly on a minimal instance.
Fan arity mismatch. A hyper-edge changed its arity (number of targets), but the fan reconstruction did not account for it. Check reconstruct_fans() with tracing enabled.

4.5.3 Property test found a failure but cannot reproduce it

Look for the seed in the failure output: proptest: seed = 0x1234abcd....

Re-run with that seed:

PROPTEST_SEED=0x1234abcd cargo nextest run -p panproto-gat -- test_name

If the test is non-deterministic (it should not be), check for HashMap iteration order sensitivity. panproto uses rustc-hash (FxHashMap) which has a fixed hash seed, so iteration order should be deterministic.

4.5.4 WASM tests pass locally but fail in CI

CI runs WASM tests in headless Chrome. If you are testing locally with Node.js, there may be differences in TextEncoder/TextDecoder behavior.
Reproduce CI conditions: wasm-pack test --headless --chrome crates/panproto-wasm.
If headless Chrome is not installed, use --headless --firefox or install Chrome via your system package manager.

4.5.5 Colimit produces unexpected merged theory

When two theories are merged via colimit() and the result has unexpected sorts or operations:

Verify the span. The colimit is computed over a span (two structure-preserving maps from a shared base theory to each of the theories being merged). Print both morphisms and confirm that the shared sorts map correctly in each direction.
Check equation preservation. If the merged theory has conflicting equations, colimit() will return an error. But if equations are merely missing, the merged theory may be valid but semantically wrong. Inspect the equations in the output theory.

Trace with debug logging:

RUST_LOG=panproto_gat=debug cargo nextest run -p panproto-gat -- test_colimit

The debug output shows each sort and operation merge step.

4.5.6 Schema validation passes but normalize changes structure

The normalize() function in panproto-schema collapses reference chains and canonicalizes vertex ordering. If your schema looks correct before normalization but wrong after:

Check for reference cycles. normalize() follows reference edges to collapse chains. A cycle causes infinite recursion (which Rust will catch as a stack overflow).
Inspect edge targets. Print the schema’s edges before and after normalization. Look for edges whose targets changed unexpectedly; this usually means a reference chain was collapsed through an intermediate vertex you did not expect.
Run validate before and after: If validate() passes before normalization but fails after, the normalization may have introduced an edge-rule violation. File a bug.

4.6 Profiling with system tools

For deeper performance analysis beyond divan benchmarks:

4.6.1 CPU profiling on macOS

# Using Instruments (Xcode)
cargo build -p panproto-gat --release
xcrun xctrace record --template "Time Profiler" --launch -- \
  ./target/release/deps/panproto_gat-*  # adjust binary name

# Using samply
cargo install samply
samply record cargo bench -p panproto-gat

4.6.2 Memory profiling

panproto avoids allocator-heavy patterns, but if you suspect a memory issue:

# Using DHAT (heap profiling via valgrind)
cargo install cargo-valgrind
cargo valgrind --tool=dhat -p panproto-inst -- test_name

# Or use the built-in allocator stats in tests
RUST_LOG=panproto_inst=trace cargo nextest run -p panproto-inst -- test_large_instance

Look for unexpected Vec growth or bumpalo arena sizes in the trace output.

4.7 Quick reference

Goal	Tool / Command
Structured logs	`RUST_LOG=panproto_crate=debug cargo nextest run ...`
WASM payload inspection	Hex-dump MessagePack bytes, decode with msgpack tools
Handle debugging	`slab_stats()` API in `panproto-wasm`
Benchmarks	`cargo bench -p panproto-gat` (also `-inst`, `-mig`)
Reproduce property test	`PROPTEST_SEED=... cargo nextest run ...`
Reproduce CI WASM tests	`wasm-pack test --headless --chrome crates/panproto-wasm`

# Debugging and Profiling {#sec-debugging-profiling} ## Tracing with rust_log panproto uses the `tracing` crate for structured logging. Control verbosity through the `RUST_LOG` environment variable: ```bash # Debug output for a single crate RUST_LOG=panproto_inst=debug cargo nextest run -p panproto-inst # Trace-level output (very verbose) for the migration engine RUST_LOG=panproto_mig=trace cargo nextest run -p panproto-mig # Multiple crates at different levels RUST_LOG=panproto_gat=info,panproto_mig=debug cargo nextest run --workspace # Everything at debug RUST_LOG=debug cargo nextest run --workspace ``` ::: {.callout-tip} ## Filtering a single test with tracing Combine `RUST_LOG` with a test filter to get focused output: ```bash RUST_LOG=panproto_mig=debug cargo nextest run -p panproto-mig -- test_lift_roundtrip ``` ::: ## Stepping through the w-type restrict pipeline The restrict pipeline in `panproto-inst` transforms a W-type instance (a tree-shaped data record whose structure is determined by the schema) when a migration removes or restructures schema elements. The pipeline has five key functions that serve as natural debugging waypoints: 1. **`anchor_surviving()`** in `crates/panproto-inst/src/wtype.rs`: determines which nodes survive the migration. If output is missing data, check whether the surviving set is too small. 2. **`reachable_from_root()`** in `crates/panproto-inst/src/wtype.rs`: BFS from root through anchor-surviving nodes. If nodes appear unreachable despite surviving, the reachability check is the place to look. 3. **`ancestor_contraction()`** in `crates/panproto-inst/src/wtype.rs`: merges ancestor chains when intermediate vertices are removed. Panics here usually mean the surviving set is inconsistent with the vertex remap. 4. **`resolve_edge()`** in `crates/panproto-inst/src/wtype.rs`: rewires edges according to the compiled migration's edge remap. If edges point to wrong targets, inspect the resolver. 5. **`reconstruct_fans()`** in `crates/panproto-inst/src/wtype.rs`: rebuilds fan structures (ordered child collections) after restrict. Missing or reordered children point to fan reconstruction issues. Set `RUST_LOG=panproto_inst=debug` to see entry/exit traces for each function, including the sizes of intermediate data structures. ## Inspecting WASM boundary data The WASM boundary (`panproto-wasm`) serializes all data as MessagePack via `rmp-serde`. When debugging WASM integration issues, you may need to inspect the raw bytes. ### Hex-dump a MessagePack payload In the TypeScript SDK, add a temporary log before deserialization: ```typescript // In sdk/typescript/src/msgpack.ts console.log('msgpack hex:', Buffer.from(bytes).toString('hex')); ``` Then decode the hex with a MessagePack inspector (e.g., [msgpack.org](https://msgpack.org) or the `msgpack-tools` CLI). ### Common deserialization errors | Error | Cause | Fix | |---|---|---| | `InvalidMarkerRead` | Empty or truncated payload | Check that the Rust side returned data (not an empty `Vec<u8>`) | | `TypeMismatch` | Schema version skew between Rust and TS | Rebuild WASM and TS SDK from the same commit | | `LengthMismatch` on arrays | Struct field added/removed on one side | Ensure `#[serde(rename)]` and TS types match | | `UnknownVariant` | Enum variant added in Rust but not in TS | Add the variant to the TS `types.ts` discriminated union | ### Handle-related debugging The WASM slab allocator (`crates/panproto-wasm/src/slab.rs`) uses opaque integer handles to reference Rust objects from JavaScript. Common handle issues: - **Invalid handle panic**: The JS side passed a handle that was never allocated. Check for off-by-one errors or stale handles after a `free()` call. - **Double-free panic**: The JS side called `free()` twice on the same handle. Add a guard in the TS wrapper to nullify handles after freeing. - **Memory leak**: Handles allocated but never freed. Use the `slab_stats()` debug API to check the count of live handles. ## Benchmarking with divan Three crates have benchmarks using the `divan` framework: ```bash # GAT engine benchmarks (theory construction, morphism checking, colimit) cargo bench -p panproto-gat # Instance benchmarks (W-type construction, restrict, parse) cargo bench -p panproto-inst # Migration benchmarks (compile, lift, compose) cargo bench -p panproto-mig ``` ### Interpreting output divan produces output like: ``` panproto_gat fastest │ slowest │ median │ mean ├─ colimit_small 142.3 ns │ 289.1 ns │ 148.7 ns │ 152.4 ns ├─ colimit_large 12.41 µs │ 18.92 µs │ 13.05 µs │ 13.28 µs ╰─ morphism_check 84.21 ns │ 127.6 ns │ 88.43 ns │ 90.12 ns ``` Focus on the **median** column. Differences under 5% are noise; investigate anything over 10%. ### CI regression checks CI runs benchmarks on every PR and compares against the `main` branch baseline. If a regression exceeds the threshold (currently 15%), CI reports it as a warning (not a failure). The benchmark data is attached as a PR comment. To reproduce a CI benchmark comparison locally: ```bash # Baseline on main git checkout main cargo bench -p panproto-gat -- --save-baseline main # Your branch git checkout your-branch cargo bench -p panproto-gat -- --baseline main ``` ## Common debugging scenarios ### Migration compiles but lift returns wrong data 1. **Check the surviving set.** Run with `RUST_LOG=panproto_inst=debug` and look at the `anchor_surviving` output. Are all expected nodes present? 2. **Inspect the vertex remap.** Print the compiled migration's `vertex_remap` field. Does every source vertex map to the correct target vertex? 3. **Trace the lift.** The lift reconstructs source-schema data from a target instance. If it produces wrong values, the resolver functions in the compiled migration may be incorrect. ### Existence check passes but restrict panics The existence check (`panproto-mig/src/existence.rs`) verifies that a migration *can* be applied, but it does not run the full restrict pipeline. A panic during restrict usually means: 1. **Resolver coverage gap.** The compiled migration has a resolver for every edge in the target schema, but one of them returns an invalid handle. Add a test that calls `restrict()` directly on a minimal instance. 2. **Fan arity mismatch.** A hyper-edge changed its arity (number of targets), but the fan reconstruction did not account for it. Check `reconstruct_fans()` with tracing enabled. ### Property test found a failure but cannot reproduce it 1. Look for the **seed** in the failure output: `proptest: seed = 0x1234abcd...`. 2. Re-run with that seed: ```bash PROPTEST_SEED=0x1234abcd cargo nextest run -p panproto-gat -- test_name ``` 3. If the test is non-deterministic (it should not be), check for `HashMap` iteration order sensitivity. panproto uses `rustc-hash` (`FxHashMap`) which has a fixed hash seed, so iteration order should be deterministic. ### WASM tests pass locally but fail in CI 1. CI runs WASM tests in headless Chrome. If you are testing locally with Node.js, there may be differences in `TextEncoder`/`TextDecoder` behavior. 2. Reproduce CI conditions: `wasm-pack test --headless --chrome crates/panproto-wasm`. 3. If headless Chrome is not installed, use `--headless --firefox` or install Chrome via your system package manager. ### Colimit produces unexpected merged theory When two theories are merged via `colimit()` and the result has unexpected sorts or operations: 1. **Verify the span.** The colimit is computed over a span (two structure-preserving maps from a shared base theory to each of the theories being merged). Print both morphisms and confirm that the shared sorts map correctly in each direction. 2. **Check equation preservation.** If the merged theory has conflicting equations, `colimit()` will return an error. But if equations are merely *missing*, the merged theory may be valid but semantically wrong. Inspect the equations in the output theory. 3. **Trace with debug logging:** ```bash RUST_LOG=panproto_gat=debug cargo nextest run -p panproto-gat -- test_colimit ``` The debug output shows each sort and operation merge step. ### Schema validation passes but normalize changes structure The `normalize()` function in `panproto-schema` collapses reference chains and canonicalizes vertex ordering. If your schema looks correct before normalization but wrong after: 1. **Check for reference cycles.** `normalize()` follows reference edges to collapse chains. A cycle causes infinite recursion (which Rust will catch as a stack overflow). 2. **Inspect edge targets.** Print the schema's edges before and after normalization. Look for edges whose targets changed unexpectedly; this usually means a reference chain was collapsed through an intermediate vertex you did not expect. 3. **Run validate before and after:** If `validate()` passes before normalization but fails after, the normalization may have introduced an edge-rule violation. File a bug. ## Profiling with system tools For deeper performance analysis beyond divan benchmarks: ### CPU profiling on macOS ```bash # Using Instruments (Xcode) cargo build -p panproto-gat --release xcrun xctrace record --template "Time Profiler" --launch -- \ ./target/release/deps/panproto_gat-* # adjust binary name # Using samply cargo install samply samply record cargo bench -p panproto-gat ``` ### Memory profiling panproto avoids allocator-heavy patterns, but if you suspect a memory issue: ```bash # Using DHAT (heap profiling via valgrind) cargo install cargo-valgrind cargo valgrind --tool=dhat -p panproto-inst -- test_name # Or use the built-in allocator stats in tests RUST_LOG=panproto_inst=trace cargo nextest run -p panproto-inst -- test_large_instance ``` Look for unexpected `Vec` growth or `bumpalo` arena sizes in the trace output. ## Quick reference | Goal | Tool / Command | |---|---| | Structured logs | `RUST_LOG=panproto_crate=debug cargo nextest run ...` | | WASM payload inspection | Hex-dump MessagePack bytes, decode with msgpack tools | | Handle debugging | `slab_stats()` API in `panproto-wasm` | | Benchmarks | `cargo bench -p panproto-gat` (also `-inst`, `-mig`) | | Reproduce property test | `PROPTEST_SEED=... cargo nextest run ...` | | Reproduce CI WASM tests | `wasm-pack test --headless --chrome crates/panproto-wasm` |

4.1 Tracing with rust_log

4.2 Stepping through the w-type restrict pipeline

4.3 Inspecting WASM boundary data

4.3.1 Hex-dump a MessagePack payload

4.3.2 Common deserialization errors

4.3.3 Handle-related debugging

4.4 Benchmarking with divan

4.4.1 Interpreting output

4.4.2 CI regression checks

4.5 Common debugging scenarios

4.5.1 Migration compiles but lift returns wrong data

4.5.2 Existence check passes but restrict panics

4.5.3 Property test found a failure but cannot reproduce it

4.5.4 WASM tests pass locally but fail in CI

4.5.5 Colimit produces unexpected merged theory

4.5.6 Schema validation passes but normalize changes structure

4.6 Profiling with system tools

4.6.1 CPU profiling on macOS

4.6.2 Memory profiling

4.7 Quick reference