Nomad Troubleshooting
Debugging the cluster control plane.
Infrastructure
| Role | IP Address | Hostname |
|---|---|---|
| Nomad Server (Agent) | 138.199.222.109 | nomad-server-01 |
| Nomad Client 1 | 91.99.17.219 | nomad-client-01 |
| Nomad Client 2 | 91.99.60.214 | nomad-client-02 |
Steps
If UI is down or jobs failing:
1. SSH Access
ssh root@138.199.222.1092. Service Status
systemctl status nomadIf inactive/failed:
systemctl restart nomad3. Cluster Members
Verify server sees clients:
nomad server members
nomad node status- members: Should be
alive. - node status: Clients should be
ready.
4. Logs
journalctl -u nomad -fCommon Issues:
- Disk Space:
df -h - Consul:
systemctl status consul
5. Client Debugging
SSH into specific client (e.g., 91.99.17.219).
systemctl status nomad
journalctl -u nomad -n 100Cluster Restart
Last Resort for inconsistent state:
- Stop Nomad on all clients.
- Stop Nomad on server.
- Start Nomad on server (wait for leader election).
- Start Nomad on clients.
Last updated on