Fairvisor vs. Envoy Rate Limiting

The Situation

Envoy rate limiting (lyft/ratelimit) is a proven pattern in Envoy-centric stacks, but it typically depends on external services and Redis.

Fairvisor provides in-process AI-aware enforcement with no Redis dependency on the request path.

Capability	Fairvisor	Envoy Rate Limit Service (lyft/ratelimit)
Architecture	In-process (OpenResty/LuaJIT)	External gRPC service + Redis
External dependencies	None (MVP)	Redis (required)
Latency profile	In-process decision path	External call path (gateway -> service -> Redis)
Limit keys	JWT claims, headers, path, UA, combinations	Headers, path from Envoy descriptors
Cost-based budgets	Yes	No
AI features	Loop detection, token counting, circuit breaker, AI crawler detection	No
Staged actions	Warn -> throttle -> reject	Reject (binary)
Shadow mode	Yes	No
Management UI	SaaS dashboard	None (YAML config)
Analytics	Per-tenant dashboard, cost attribution	Prometheus counters
High availability	In-process, no external SPOF in hot path	Redis SPOF unless clustered