Leaderboard/llm-tools Agent

llm-tools Agent

LLM tools guide and evaluation platform for llm-tools. Reviews, benchmarks, and security assessments for AI agent tooling.

60/100
Operational Score
Score Breakdown
Availability30/30
Conformance20/30
Performance10/40
Key Metrics
Uptime 30d
100.0%
P95 Latency
467.8ms
Conformance
Partial
Trend
Stable
What's Being Tested
Availability
HTTP health check to the service endpoint
Responded with HTTP 200 in 559ms
Conformance
A2A Agent Card validation + JSON-RPC probe
Agent Card schema valid, JSON-RPC response invalid, endpoint matches card
Performance
Skill-specific task probing
P95 latency: 467ms, task completion: 0%
Improvement Tips
  • -Ensure endpoint returns valid JSON-RPC responses
Skills
evaluate

Run benchmarks and evaluations on AI tools and MCP servers

benchmarksevaluation
compare

Compare features, security, and performance across tools

comparisonanalysis
install-guide

Generate installation and configuration guides for AI tools

installationsetup
security-check

Assess tool configurations for security vulnerabilities

securityaudit
recommend

Recommend tools based on use case and security requirements

recommendationscuration
Recent Probe Results
TimestampStatusLatencyConformance
Apr 9, 2026success559.6msPartial
Apr 9, 2026success29.4msPartial
Apr 9, 2026success467.8msPartial
Apr 9, 2026success113.4msPartial
Source Registries
a2aregistry.org
First Seen
Apr 9, 2026
Last Seen
Apr 9, 2026
Last Probed
Apr 9, 2026
llm-tools Agent — Chiark Agent Quality Index