Leaderboard/ai.plith/plith
MCP ServerScored via MCP protocol probing: initialize handshake, tools/list conformance, and ping + tool invocation performance.

ai.plith/plith

AI agent infrastructure: dedup, cost prediction, validation, governance, failure intelligence.

97/100
Operational Score
Score Breakdown
Availability30/30
Conformance30/30
Performance37/40
Key Metrics
Uptime 30d
100.0%
P95 Latency
178.6ms
Conformance
Pass
Trend
What's Being Tested
Availability
HTTP health check to the service endpoint
Responded with HTTP 200 in 172ms
Conformance
MCP initialize handshake + tools/list
Valid MCP server info returned, tools/list responded
Performance
MCP ping + zero-arg tool invocation benchmarking
P95 latency: 178ms, task completion: 100%
Skills
dedupq_check

Before executing any LLM task, check if an identical or semantically similar task has already been completed. Returns cached result on hit, saving one LLM call. On a miss, execute your task and call dedupq_complete to cache the result for future hits. Costs 1 credit.

dedupq_complete

After executing a task, store the result so future identical or similar tasks return a cache hit via dedupq_check. Costs 2 credits.

burnrate_estimate

Before executing a multi-step agent plan, estimate the total LLM cost. Returns per-step breakdown and optimization suggestions. If the estimate exceeds your budget, pipe the same plan into burnrate_optimize. Costs 1 credit.

burnrate_track

Log the actual cost of an LLM call after execution. Call this after every LLM request to build calibration data that improves burnrate_estimate accuracy over time. Free — no credits charged. Returns the recorded cost entry with computed margin versus the prior estimate when one exists for this model and token range.

burnrate_optimize

Get a cheaper equivalent plan by substituting models with lower-cost alternatives. Call after burnrate_estimate if the estimated cost exceeds your budget. Returns the optimized plan with substituted models, new per-step costs, total savings, and whether the target_budget is met. Optionally set target_budget to constrain the optimization. Costs 1 credit.

burnrate_budget

Get today's tracked LLM spend, per-model breakdown, projection, and budget alerts. Free — no credits charged.

qualitygate_validate

After your agent generates output, validate it against your rules before shipping. Runs deterministic checks (regex, JSON schema, syntax) plus optional LLM-powered tone and factual analysis. Returns a structured verdict (pass, warn, or fail) with a 0-100 score and per-check issue details. Use qualitygate_trends to spot recurring failure patterns over time. Variable cost: 1 credit per deterministic check, 8 credits per LLM check.

guardrail_check

Evaluate a proposed agent action against your governance policies. Returns allow or deny with the matched policy reason. Requires at least one active policy created via guardrail_create_policy. Deterministic rule evaluation — no LLM. Costs 1 credit.

guardrail_create_policy

Create a persistent governance policy that guardrail_check evaluates on every subsequent call. Define rules using and/or/not operators over action types, resource patterns, and budget thresholds. Call this before using guardrail_check — checks require at least one active policy. Policies persist until explicitly deleted. Duplicate policy names return an error. Returns the created policy with its ID and active status.

pitfalldb_query

Check for known failure patterns before executing a task type. Returns pitfalls with severity, fix suggestions, and confidence scores. After your agent runs, submit failures via pitfalldb_report so others benefit. Costs 2 credits.

pitfalldb_report

Report an agent failure. PII-scrubbed before storage. Linked to existing pitfalls if similar. Free — no credits charged.

rigor_plan

Before executing a complex task, get a structured workflow plan with per-step cost estimates. Classifies your task, selects the optimal framework sequence, and returns the full plan without executing anything. Free — no credits charged.

rigor_execute

Execute a structured workflow end-to-end. Call rigor_plan first (free) to preview the step sequence and cost estimate before committing credits. Classifies the task, selects the optimal tool sequence, and executes each step with the right LLM model. Returns a complete deliverable — solution designs, competitive analyses, governance documents, and more. Supports SSE streaming for real-time progress, webhook callback, or polling.

rigor_status

Check the status of a running or completed Rigor workflow. Returns progress, step results, and the full deliverable when complete. Use after rigor_execute with polling delivery to retrieve results.

rigor_workflows

List all Rigor workflows for your organization with filtering and pagination. Returns status, progress, capacity usage, and available actions per workflow. Use to monitor workflow state, understand concurrent limit usage, and identify stuck or completed workflows.

Tools
15 tools verified via live probe
verified 1d ago
Server: plithVersion: 1.0.0Protocol: 2024-11-05
Recent Probe Results
TimestampStatusLatencyConformance
May 30, 2026success172msPass
May 29, 2026success160.1msPass
May 29, 2026success158.5msPass
May 27, 2026success178.6msPass
May 27, 2026success178.1msPass
May 27, 2026success122.5msPass
May 26, 2026success145.5msPass
May 25, 2026success156.9msPass
May 22, 2026success263.8msPass
May 22, 2026success154.6msPass
Source Registries
mcp-registry
First Seen
Apr 15, 2026
Last Seen
May 29, 2026
Last Probed
May 30, 2026