Autonomous agents are live in DeFi. They allocate, rebalance, and execute across protocols with minimal oversight. Spearbit evaluates these systems under pressure to surface behaviors static analysis cannot reveal. Details below.
Inference-driven logic responds to market data, protocol state, and user input. Without enforcement boundaries, agents can trigger transactions based on malformed prompts, unverified assumptions, or adversarial context.
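To make "enforcement boundary" concrete, here is a minimal sketch of a policy check that sits between an agent's proposed transaction and execution. Every name in it (`ProposedTx`, `Policy`, `check`, the allowlist fields, the spend cap) is a hypothetical illustration, not Spearbit tooling or any protocol's API. It assumes the agent's output has already been parsed into a structured proposal.

```python
from __future__ import annotations

from dataclasses import dataclass
from decimal import Decimal


@dataclass(frozen=True)
class ProposedTx:
    """A transaction the agent wants to send (hypothetical shape)."""
    target: str      # contract address the agent wants to call
    action: str      # named operation, e.g. "deposit"
    amount: Decimal  # token amount in whole units


@dataclass(frozen=True)
class Policy:
    """Static limits set by the operator, outside the model's control."""
    allowed_targets: frozenset
    allowed_actions: frozenset
    max_amount: Decimal


def check(tx: ProposedTx, policy: Policy) -> list[str]:
    """Return every policy violation; an empty list means the tx may proceed."""
    violations = []
    if tx.target.lower() not in policy.allowed_targets:
        violations.append("target not in allowlist")
    if tx.action not in policy.allowed_actions:
        violations.append("action not permitted")
    # Reject NaN, infinities, zero, and negatives before comparing to the cap.
    if not tx.amount.is_finite() or tx.amount <= 0:
        violations.append("amount must be a positive finite number")
    elif tx.amount > policy.max_amount:
        violations.append("amount exceeds cap")
    return violations
```

The design point is that the check is deterministic and lives outside the inference path: a malformed or adversarial prompt can change what the agent proposes, but not what the boundary allows.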
We simulate edge conditions across prompt injection, output validation, and fallback behavior. Our reviews confirm execution intent, trace authority, and expose where live systems diverge from their design.