Towards an optimal debugging framework library.
This text is intended as an overview of debugging techniques and as motivation
for a uniform execution representation and setup to efficiently mix and match
the appropriate techniques for system-level debugging, with a focus on
statically optimizing compiled languages to keep complexity and scope limited.
The author accepts the irony of such statements, given "C having no ABI" and
many systems in practice having no (stable) ABI, but reality is simplified in
this text for brevity and sanity.
- Theory of debugging.
- Practical methods with tradeoffs.
- Uniform execution representation.
- Abstraction problems during problem isolation.
- Possible implementations.
- Theory of debugging.
A program
can be represented as an (often non-deterministic) state machine,
such that a bug is a bad transition rule between those states.
It is usually assumed that the developer/user knows correct and incorrect
(bad) system states and that the code represents a somewhat correct model of
the intended semantics.
An execution witness is then the set of states and state transitions
encountered on a specific program run. If the execution witness shows a
"bad state", then there must be a bug.
Thus a debugger can be seen as a query engine over the states and transitions
of a buggy execution witness.
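To make this view concrete, the following is a minimal C sketch (with invented
names, not the interface of any existing debugger) of an execution witness as
a recorded state sequence and of a "debugger query" that searches it for the
first invariant violation:

    /* Minimal sketch: an execution witness as a recorded sequence of
       states that can be queried for the first "bad" state. All names
       are illustrative, not taken from an existing debugger API. */
    #include <stdbool.h>
    #include <stddef.h>
    #include <stdio.h>

    typedef struct {
        long step;    /* which transition produced this state */
        int balance;  /* example program state: an account balance */
    } State;

    /* Invariant defining "good" states: the balance never goes negative. */
    static bool state_ok(const State *s) { return s->balance >= 0; }

    /* Query over the witness: index of the first bad state, or -1. */
    static long first_bad_state(const State *witness, size_t len) {
        for (size_t i = 0; i < len; i++)
            if (!state_ok(&witness[i]))
                return (long)i;
        return -1;
    }

    int main(void) {
        /* A recorded run where transition 2 (a buggy withdrawal rule)
           drives the balance below zero. */
        State witness[] = { {0, 10}, {1, 5}, {2, -3}, {3, -3} };
        long bad = first_bad_state(witness, sizeof witness / sizeof witness[0]);
        if (bad >= 0)
            printf("bad state after transition %ld (balance=%d)\n",
                   witness[bad].step, witness[bad].balance);
        return 0;
    }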
A frequent operation is isolating the bug source to deterministic components,
where encapsulating non-determinism usually simplifies the process.
In contrast, concurrent code is tricky to debug, because one needs to trace
multiple execution flows to estimate where the incorrect state originates.
One can generally categorize debugging methods into the following list (asoul)
- automate the process to minimize errors/oversights during debugging,
  to guard against probabilistic errors, to document the process etc
- simplify and isolate system components and changes over time
- observe the system while running it to trace state or state changes
- understand the expected and actual code semantics of the involved
  systems (for example userspace processes, kernel, build system,
  compiler, source code, linker, object code, assembly, hardware etc)
  to the degree necessary
- learn, extend and ensure how and which system invariants are satisfied
with the fundamental constraints being (feel)
- finding out correct system component semantics
- ensuring deterministic reproducibility of the problem
- limited time and effort
Common debugging methods to "feel a soul" come with various tradeoffs and are
listed here from compile-time to runtime debugging, with less to more run-time
data collection:
- Formal verification as ahead-of-time or compile-time invariant resolution.
- Validation as runtime invariant checks (see the sketch after this list).
- Testing as sample-based runtime invariant checks.
- Stepping via a "classical debugger" to manipulate task execution context
  and memory, optionally with source code location translation, via REPL
  commands, graphically, via scripting or (rarely) freely programmable.
- Logging as dumping (a simplification of) state with context
  around bugs (usually with timestamps in production systems).
- Tracing as dumping (a simplification of) runtime behavior
  via temporal relations (usually timestamps).
- Recording as encoded dumping of the runtime so that it can be replayed
  with the time and state determinism specified beforehand.
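To make the validation and logging entries above concrete, here is a minimal,
hypothetical C sketch (CHECK and log_event are invented names, not an existing
library) combining a runtime invariant check with timestamped logging:

    /* Minimal sketch: runtime invariant validation plus timestamped
       logging. CHECK() and log_event() are illustrative names only. */
    #include <stdio.h>
    #include <stdlib.h>
    #include <time.h>

    static void log_event(const char *msg) {
        /* Logging: dump (a simplification of) state with a timestamp. */
        fprintf(stderr, "[%ld] %s\n", (long)time(NULL), msg);
    }

    /* Validation: a runtime invariant check that aborts on violation. */
    #define CHECK(cond, msg) \
        do { \
            if (!(cond)) { \
                log_event("invariant violated: " msg); \
                abort(); \
            } \
        } while (0)

    static int withdraw(int balance, int amount) {
        CHECK(amount >= 0, "withdraw amount must be non-negative");
        int result = balance - amount;
        CHECK(result >= 0, "balance must not go negative");
        return result;
    }

    int main(void) {
        int balance = 10;
        balance = withdraw(balance, 4);   /* passes both checks */
        log_event("withdraw of 4 succeeded");
        balance = withdraw(balance, 20);  /* trips the second CHECK */
        return balance;
    }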
Simplification and isolation mean applying both ideas to all potential
sub-components, including but not limited to
hardware, code versioning including dependencies, the source system,
the compiler framework and the target system. Typical methods are
- Bisection via git or the actual binaries.
- Reduction via removal of system parts or trying to reproduce
with (a minimal) example.
- Statistical analysis of collected data on how the problem
  manifests in the given environment(s) etc.
Debugging is domain- and design-specific and relies on core component(s)
of the system to be debugged to provide the necessary debug functionality.
For example, software-based hardware debugging relies on interfaces to
the hardware like JTAG, kernel debugging on kernel compilation or
configuration and elevated (user) permissions, and userspace debugging on
process and user permissions, system configuration or, on Posix systems,
on a child process being debugged via ptrace.
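As a concrete illustration of the last point, the following minimal C sketch
(error handling mostly omitted) shows a parent process on Linux using ptrace
to single-step a child process:

    /* Minimal sketch: debugging a child process via ptrace on Linux. */
    #include <stdio.h>
    #include <sys/ptrace.h>
    #include <sys/types.h>
    #include <sys/wait.h>
    #include <unistd.h>

    int main(void) {
        pid_t child = fork();
        if (child == 0) {
            /* Child: allow the parent to trace it, then exec the debuggee. */
            ptrace(PTRACE_TRACEME, 0, NULL, NULL);
            execlp("echo", "echo", "hello", (char *)NULL);
            _exit(127);
        }
        /* Parent: wait for the stop caused by the exec, then single-step
           until the child exits. */
        int status;
        waitpid(child, &status, 0);
        long steps = 0;
        while (WIFSTOPPED(status)) {
            steps++;
            if (ptrace(PTRACE_SINGLESTEP, child, NULL, NULL) == -1)
                break;
            waitpid(child, &status, 0);
        }
        printf("child exited after %ld single steps\n", steps);
        return 0;
    }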
- Practical methods with tradeoffs.
Usually semantics are not "set in stone" or do not offer sufficient
tradeoffs, so formal verification is rarely an option aside from using
models as a design and planning tool.
Depending on the domain and environment, problematic behavior of hardware
or software components must to a greater or lesser degree be 1. avoided and
2. traceable, and there exist various (domain) metrics as decision helpers.
Very well designed systems explain to their users how to debug bugs
regarding functional behavior and time behavior, including internal and
external system resources, up to the degree to which correct system usage
and task execution are intended.
Access restrictions limit or rule out stepping, whereas storage limitations
limit or rule out logging, tracing and recording.
TODO: requirements on system design for formal verification vs debugging.
No-surprise rule: the core system enabling debugging (in any form) must be
correct to the degree necessary.
TODO: good argumentation on ignoring linker speak, language footguns etc.
- Bugs related to functional behavior.
- Bugs related to time behavior.
- Internal and external system resources.
- Debugging hard(ware) problems.
Hardware design reviews with extensive focus on core components
(power, battery, periphery, buses, memory/flash and debug/test infrastructure)
enable debugging and component tests against product and assembly defects.
TODO Elimination or mitigation of time channels, formal methods? attack fuzzing?
software toggles?
- Kernel and platform problems.
The managing environment the code is running on can vary a lot.
As an example, the typical four phases of the Linux boot process
(system startup, bootloader stage, kernel stage, and init process)
each have their own debugging infrastructure and methods.
Generally, working with (introspection-restricted) platforms requires
1. reverse engineering and "trying to find info" and/or 2. "using some
tracing tool" and, 3. for open source, "adjusting the source and staring
at kernel dumps / using a debugger".
Kernels are rarely designed for tracing, recording or formal verification
due to their internal complexity, and virtualization is slow and hides many
classes of synchronization bugs.
- Detectable Undefined Behavior
Compiler and runtime sanitizers
- C
- C++
- Zig with -OReleaseSafe turns "undefined behavior" into
runtime-checked disallowed behavior except for
- TODO
- TODO
- TODO
- TODO
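As a small C illustration of runtime-detectable undefined behavior (the exact
report depends on the compiler and sanitizer version), signed integer overflow
is caught at runtime by UndefinedBehaviorSanitizer when the program is built
with -fsanitize=undefined (clang or gcc):

    /* Minimal sketch: detectable undefined behavior. Built with e.g.
       "cc -fsanitize=undefined overflow.c", the signed overflow below is
       reported at runtime by UBSan instead of silently wrapping or being
       optimized away. */
    #include <limits.h>
    #include <stdio.h>

    int main(void) {
        int x = INT_MAX;
        int y = x + 1;  /* signed integer overflow: undefined behavior */
        printf("%d\n", y);
        return 0;
    }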
- Undetectable Undefined Behavior
Staring at the source code, at backend intermediate representations like
LLVM IR or at the resulting assembly, and reducing the problem. Unfortunately,
backend optimizers like LLVM do not offer frontend language authors debug
APIs and related tooling, because they were not designed for that purpose.
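For contrast, here is a hedged C example of undefined behavior that common
sanitizers (ASan, UBSan) typically do not flag: type punning through an
incompatible pointer cast violates strict aliasing and may only surface
indirectly, e.g. as an optimization-dependent wrong result:

    /* Minimal sketch: undefined behavior that typical sanitizers do not
       detect. Reading a float object through an int pointer violates the
       strict aliasing rule; optimizers may reorder or cache accesses based
       on the assumption that the two pointers cannot alias. */
    #include <stdio.h>

    static int read_as_int(float *f) {
        return *(int *)f;  /* strict aliasing violation: undefined behavior */
    }

    int main(void) {
        float f = 1.0f;
        printf("%d\n", read_as_int(&f));
        return 0;
    }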
- Miscompilations
Tools like Miri or Cerberus run the program in an interpreter,
but may not cover all possible program semantics due to ambiguity
and may not be feasible to apply, so the best remaining option is usually
to reduce the problem.
- Memory problems
sanitizers, validators, simulators, tracers; TODO: which ones, their
configurability and costs (a sanitizer sketch follows this list)
- out-of-bounds access
sanitizer
- null pointer dereference
sanitizer
- type confusion
sanitizer
- integer overflow
sanitizer
- use after free
sanitizer
- invalid stack access
sanitizer
- usage of uninitialized memory
sanitizer
- data races
sanitizer
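As one concrete data point for the list above (a minimal sketch; report
details and overhead depend on the toolchain), a heap use-after-free is
reported at runtime by AddressSanitizer when building with -fsanitize=address;
data races instead need ThreadSanitizer (-fsanitize=thread) and reads of
uninitialized memory MemorySanitizer (-fsanitize=memory, clang):

    /* Minimal sketch: a heap use-after-free that AddressSanitizer reports
       at runtime when the program is built with "cc -fsanitize=address". */
    #include <stdio.h>
    #include <stdlib.h>

    int main(void) {
        int *p = malloc(sizeof *p);
        if (!p)
            return 1;
        *p = 42;
        free(p);
        printf("%d\n", *p);  /* use after free: reported by ASan */
        return 0;
    }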
- Resource leaks (Freestanding/Kernel)
sanitizer
- Freezes (deadlocks, softlocks, signal safety, unbounded loops etc)
sanitizer, validator
- Performance problems
simulator, tracer
- Logic problems of software systems can be described as problems
related to incorrectly applied logic in how the code solves the
intended and follow-up problems, ignoring hardware problems, kernel
problems, different types of UB, miscompilations, memory problems,
resource leaks, freezes and performance issues.
- (temporary) inconsistency of state (relations)
- incorrect math, e.g. for edge cases (see the sketch below)
- incorrect modeling of external and internal state and synchronization
- incorrect protocol handling
- insufficient handling of the software requirements, or insufficient
  requirements themselves
The sources of these problems are usually
- incorrect constraints on the design, meaning how the different
  parts should interact and work towards the goals for the use
  cases
- unclear, unspecified or incorrectly assumed hardware or software
  guarantees by components
- implementation oversights, unintended use cases, infeasibility
  of a general solution due to time and/or money constraints
Formal modeling of the design, model checking, code review, writing
tests for edge cases or runtime validation are typically used, with the
best practice being to write code in a risk-aware, testable and
debuggable way. The methods and their scope are very wide and very
domain- and use-case-specific, so no general or short recommendation
can be made.
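As a small, generic illustration of "writing tests for edge cases" (the
function names are hypothetical), the classic midpoint computation is only
wrong near the edges of the integer range, so it is the edge-case test that
exposes the math/logic bug:

    /* Minimal sketch: an edge-case test exposing an "incorrect math"
       logic bug. midpoint_buggy() overflows for large inputs;
       midpoint_fixed() avoids the overflow. Both names are illustrative. */
    #include <assert.h>
    #include <limits.h>

    static int midpoint_buggy(int lo, int hi) { return (lo + hi) / 2; }
    static int midpoint_fixed(int lo, int hi) { return lo + (hi - lo) / 2; }

    int main(void) {
        /* Typical-case test: both versions pass. */
        assert(midpoint_buggy(0, 10) == 5);
        assert(midpoint_fixed(0, 10) == 5);
        /* Edge-case test near INT_MAX: the buggy version would overflow
           (undefined behavior), so only the fixed version is exercised
           here and it passes. */
        assert(midpoint_fixed(INT_MAX - 2, INT_MAX) == INT_MAX - 1);
        return 0;
    }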
TODOs:
- Tooling and performance tradeoffs.
- minimal descriptions for C, Rust, Zig; Posix, Linux, Windows
Ideally, only the system behavior and the interactions with domain- and
use-case-specific parts require cognitive load from the programmer, whereas
the other error classes have standard approaches to isolate and eliminate
them. Unifying debug tooling simplifies usage for greater developer
productivity, and exposing it as a library allows this process to be
automated.
- Uniform execution representation.
As was shown before, modern languages simplify the detection or elimination
of memory problems and of runtime-detectable undefined behavior.
So far undetectable undefined behavior may become detectable, if backend
optimizers are redesigned with appropriate APIs.
Detecting miscompilations requires strict formal reasoning about the executed
source code semantics or formal verification of the compiler itself, which
shall not be discussed here.
This leaves hardware problems, kernel problems, resource leaks, freezes,
performance problems and logic problems.
TODO: what they have in common + motivation
TODO: Uniform execution representation and queries over program execution.
- Abstraction problems during problem isolation.
TODO: origin detection, isolation and abstraction
- Possible implementations.
TODO: (query system data vs modify the system vs other) to validate approaches;
Program modification and validation language, query language and alternatives.