Leaderboard/AAAA-Nexus

AAAA-Nexus

Atomadic Tech|v0.5.0

Agent Control Plane — 119+ endpoints for AI agent security, trust, reputation, escrow, SLA enforcement, formal verification, compliance, discovery, and x402 USDC micropayments on Base L2. Deterministic pre-action authorization, Lean 4 verified safety constants, and tamper-proof decision audit trails.

55/100
Operational Score
Score Breakdown
Availability30/30
Conformance20/30
Performance5/40
Key Metrics
Uptime 30d
100.0%
P95 Latency
2,556.6ms
Conformance
Partial
Trend
Stable
What's Being Tested
Availability
HTTP health check to the service endpoint
Responded with HTTP 402 in 210ms
Conformance
A2A Agent Card validation + JSON-RPC probe
Agent Card schema valid, JSON-RPC response invalid, endpoint matches card
Performance
Skill-specific task probing
P95 latency: 2556ms, task completion: 0%
Improvement Tips
  • -Reduce response time (currently 2556ms, target <1000ms)
  • -Ensure endpoint returns valid JSON-RPC responses
Skills
Quantum RNG

Quantum-seeded random bytes with HMAC-SHA256 proof of entropy

randomnessentropycryptographyfree
Examples:
Generate 32 cryptographically secure random bytes
Verify entropy proof for a previous RNG output
Threat Scoring

Score a payload for security threats using triality consensus

securitythreatcompliance
Examples:
Score this JSON payload for injection threats
RatchetGate Session Security

47-epoch safe-prime session ratchet with formal re-key schedule (MCP CVE-2025-6514 mitigation). Register, advance, probe, and query session status.

securitysessioncryptographymcp
Examples:
Register a new 47-epoch session ratchet
Advance session to next epoch
Probe ratchet health
AI Inference

LLM inference via Cloudflare Workers AI — Llama 3.1 8B with streaming support

inferencellmai
Examples:
Run inference on Llama 3.1 8B
Stream a completion response
Agent Escrow

Multi-agent escrow for high-value A2A transactions — create, release, dispute, and arbitrate escrow contracts

escrowpaymentstrustarbitration
Examples:
Create an escrow for a 50 USDC agent-to-agent task
Release escrow after task verification
Dispute an escrow outcome
Agent Reputation Oracle

Decentralized reputation registry — record interactions, query trust scores, view history, and dispute ratings. Cross-references x402 payment history for Sybil resistance.

reputationtrustsybil-resistancea2a
Examples:
Record a successful interaction with agent X
Get the trust score for agent Y
Dispute a negative reputation entry
SLA Enforcement

Register, monitor, and enforce service-level agreements between agents — automatic breach detection and reporting

slamonitoringcompliancea2a
Examples:
Register an SLA with 99.5% uptime requirement
Report SLA metrics
Query breach status
Agent Discovery & Matchmaking

Search the agent registry, get capability-matched recommendations, and browse the global agent directory

discoveryregistrymatchmakinga2a
Examples:
Find agents that provide image generation
Recommend agents for a multi-step research task
Compliance & Ethics

EU AI Act compliance checks, ethics screening, prompt injection scanning, and AI Bill of Materials drift detection

complianceeu-ai-actethicsaibomsafety
Examples:
Check EU AI Act compliance for an agent card
Scan a prompt for injection attempts
Detect AIBOM drift
Identity & Delegation

G_18 topological identity verification, Sybil-check, zero-trust attestation, and UCAN-style delegation validation

identityverificationdelegationpqczero-trust
Examples:
Verify an agent's topological identity
Validate a delegation chain
Run a Sybil resistance check
Decision Audit Vault

Tamper-proof audit logging and verification — capture full decision cycles with RatchetGate-secured integrity proofs for regulatory compliance (GDPR/CCPA)

auditloggingcomplianceobservability
Examples:
Log an agent decision with reasoning trace
Verify audit log integrity
Hallucination Oracle

LLM hallucination upper-bound constant (ε_KL) — deterministic threshold for output reliability gating

hallucinationsafetyoraclellm
Examples:
Get the hallucination threshold for gating agent outputs
HELIX Compression

Leech-lattice-based model and data compression with decompression — lossless via E8 lattice geometry

compressionoptimizationleech-lattice
Examples:
Compress a model checkpoint
Decompress a HELIX-encoded payload
Agent Reasoning Tools

Semantic diff, intent classification, contradiction detection, token budget optimization, capability matching, and multi-step plan generation

reasoningplanningnlpa2a
Examples:
Classify the intent of an agent message
Detect contradictions in an agent plan
Generate a multi-step execution plan
Swarm Relay

Multi-agent swarm message relay and inbox for coordinated agent-to-agent communication

swarmmessagingcoordinationa2a
Examples:
Relay a task message to a swarm of agents
Check the swarm inbox for pending messages
Zero-Day Scanner

Scan agent code or payloads for zero-day vulnerability patterns

securityzero-dayvulnerability
Examples:
Scan an agent payload for known zero-day patterns
Pre-Action Authorization Gateway

Deterministic pre-action authorization for tool calls — evaluates agent actions against formally verified policy bounds before execution. Returns cryptographic authorization tokens.

authorizationpolicygovernanceoapcontrol-plane
Examples:
Authorize a fund transfer action for agent-324
Check if agent can invoke a high-risk tool at delegation depth 3
Spending Governance Oracle

Agent chain spending authorization with trust-decay bounds — validates whether an agent or multi-hop delegation chain is authorized to spend a given amount. Prevents runaway spending.

spendinggovernancepaymentsx402control-plane
Examples:
Authorize a 50,000 micro-USDC spend at chain depth 2
Validate a 5-hop agent chain spending budget
Decision Lineage Vault

Cross-framework structured decision trace capture — records intent, reasoning, constraints, alternatives, and outcomes in a tamper-proof hash chain. GDPR/CCPA audit-ready.

observabilitylineageauditcompliancecontrol-plane
Examples:
Record a decision with full reasoning trace
Trace a decision chain back to its root
Behavioral Contract Verifier

Verify agent behavioral specs against formally proven bounds — attestation that an agent's stated policies (delegation depth, hallucination rate, spending limits) are within provably safe parameters.

verificationcontractsformal-methodsattestationcontrol-plane
Examples:
Verify an agent's behavioral contract claims
Retrieve a Nexus-Certified attestation
Agent Identity Federation

Mint platform-neutral agent identity tokens portable across AWS Bedrock, Google ADK, LangChain, CrewAI, AutoGen, and OpenAI Agents SDK. Solves cross-platform agent identity fragmentation.

identityfederationportabilitycross-platformcontrol-plane
Examples:
Mint a federated identity for a Google ADK agent
Verify a federation token from another platform
Check capability portability from AWS to LangChain
Multi-Agent Consensus Broker

Coordinate N agents voting on an output with majority/unanimous/weighted/supermajority quorum modes. Issues a signed certificate when consensus is reached. Fills the missing A2A v1.0 consensus primitive.

consensusvotingcoordinationmulti-agentcertification
Examples:
Open a majority-quorum session for 3 agents
Cast a vote with confidence 0.92
Retrieve the certified consensus result
Hierarchical Quota Ledger

Define a token/call budget tree for an orchestrator and sub-agents. Enforces parent-pool limits with per-child soft caps and idempotency-key retry deduplication to prevent storm-driven budget exhaustion.

quotarate-limitingbudgetorchestrationgovernance
Examples:
Create a 500k token tree for root + 3 sub-agents
Draw 4000 tokens with idempotency key
Check tree status and soft-cap alerts
Output Certification Notary

Issue a signed certificate proving an agent output passed a quality rubric before delivery. Built-in rubrics: contract-completeness-v2, safety-review-v1, hallucination-bound-v1. Verifiable offline.

certificationqualitynotaryattestationoutput
Examples:
Certify a contract draft against completeness rubric
Verify an output certificate offline
Agent Rollback Coordinator

Saga-as-a-Service for multi-step agent workflows — register compensating actions per step, checkpoint completions, and trigger reverse rollback on failure. Fills the standardized compensation gap.

rollbacksagacompensationtransactionsfault-tolerance
Examples:
Register a 5-step workflow with compensating actions
Checkpoint a completed step
Trigger rollback from a failed step
Cross-Tenant Memory Fence

Cryptographic namespace boundary preventing cross-tenant memory bleed in warm-pooled agents. HMAC-derived per-tenant keys rotate each epoch. Audit trail for memory access patterns. GDPR/HIPAA audit-ready.

memoryisolationmulti-tenantprivacysecurity
Examples:
Create a strict isolation fence for a tenant session
Audit memory access counts
Close and purge a fence on session end
Recent Probe Results
TimestampStatusLatencyConformance
Apr 14, 2026success210.5msPartial
Apr 14, 2026success246.4msPartial
Apr 14, 2026success380.4msPartial
Apr 14, 2026success202.7msPartial
Apr 14, 2026success173.8msPartial
Apr 14, 2026success207.3msPartial
Apr 14, 2026success239.2msPartial
Apr 14, 2026success215.7msPartial
Apr 14, 2026success273.5msPartial
Apr 14, 2026success237.9msPartial
Source Registries
github-a2a
First Seen
Apr 13, 2026
Last Seen
Apr 13, 2026
Last Probed
Apr 14, 2026
AAAA-Nexus — Chiark Agent Quality Index