Skip to content

Apache APISIX Upstream Marked Unhealthy

DodaTech Updated 2026-06-24 2 min read

You encounter a apisix configuration issue that prevents your workflow from completing. This guide walks through the fix step by step.

Wrong ❌

{
  "upstream": {
    "type": "roundrobin",
    "nodes": { "backend:3000": 1 },
    "checks": {
      "active": {
        "type": "http",
        "http_path": "/health",
        "healthy": { "interval": 2, "successes": 1 },
        "unhealthy": { "interval": 1, "http_failures": 1 }
      }
    }
  }
}

Wrong Output

Upstream nodes marked unhealthy immediately. Single failure triggers unhealthy with 1s recheck interval.
{
  "upstream": {
    "type": "roundrobin",
    "nodes": { "backend:3000": 1, "backend2:3000": 1 },
    "checks": {
      "active": {
        "type": "http",
        "http_path": "/health",
        "timeout": 5,
        "concurrency": 1,
        "healthy": { "interval": 10, "successes": 3 },
        "unhealthy": { "interval": 5, "http_failures": 5, "timeouts": 3 }
      },
      "passive": {
        "http_failures": 10,
        "timeouts": 7
      }
    }
  }
}

Right Output

Nodes require 3 successes to become healthy, 5 failures to become unhealthy. Passive checks add extra tolerance.

Prevention

  • Set realistic healthy/unhealthy thresholds.
  • Use both active and passive checks.
  • Passive check failures should be higher than active.
  • Use multiple upstream nodes for fault tolerance.
  • Monitor upstream health via APISIX dashboard.

DodaTech applies similar defensive patterns across Doda Browser, DodaZIP, and Durga Antivirus Pro infrastructure for production reliability.

Common Mistakes with upstream health

  1. Misunderstanding that String is [Char] with poor performance for large text operations
  2. Using foldl instead of foldl' causing stack overflow on large lists
  3. Forgetting deriving (Show, Eq) on custom data types needed for debugging

These mistakes appear frequently in real-world APISIX code. DodaTech's contributors have identified these patterns through analysis of open-source projects and production systems.

Practice Exercise

Write a pure function that safely divides two integers using Maybe, then test it with edge cases like division by zero and negative numbers.

This exercise reinforces the concepts covered in this guide. Try implementing it before checking online solutions.

FAQ

**Q: What is the most common cause of this apisix error?**

A: Configuration drift between environments and version mismatches are the top causes. Always verify both before deeper troubleshooting.

Q: Can this error affect production traffic?

A: Yes. Depending on whether it occurs in the control plane or data plane, it can block all traffic or cause silent failures.

Q: How do I monitor for this error in production?

A: Set up log-based alerts for the error signature shown above. Most monitoring platforms support pattern matching on log entries.

Q: Is there a quick rollback procedure?

A: Revert the configuration change and restart the service. For data-plane errors, replay affected records from the source of truth.


This quick fix is part of the DodaTech infrastructure engineering series. Learn more at DodaTech tutorials.

Built by the developers of DodaTech

Doda Browser, DodaZIP & Durga Antivirus Pro