v6.db.transport.rest API status page inconsistent with my test results #33

LilaHexe0 · 2024-12-17T14:12:31Z

The API status page (https://stats.uptimerobot.com/57wNLs39M/793274556) claims 100% uptime on days when my own monitoring (Datadog) indicates otherwise.

Most, if not all, of the failures are 503 errors.

How exactly does UptimeRobot check whether the service is operational?

traines-source · 2024-12-18T02:38:40Z

I think the status page monitors only indirectly, if at all, whether HAFAS requests themselves are working. One could switch to monitoring the /health endpoint directly, though that would of course entail many additional requests to HAFAS.

The 503s you've been encountering are mostly due to errors on the HAFAS side, and it seems that the DB HAFAS mgate.exe endpoint will be shut off soon (see public-transport/hafas-client#331 and schildbach/public-transport-enabler#610).

schaerfo · 2025-01-09T08:54:24Z

I agree, using the /health endpoint for status monitoring would result in uptime stats that reflect real-world use cases of the API better.

derhuerst · 2025-01-09T15:27:24Z

At least with regard to DB's HAFAS API (and v6.db.transport.rest), this seems obsolete now that it has likely been shut off for good.

However, let me make a more general point that applies to other HAFAS-based *.transport.rest APIs: the /health endpoint does not use caching. If you all use it to monitor the API's availability, you'll quickly exhaust the shared resource "requests from the server's single static IP to HAFAS", so you would effectively prioritise your personal insight into when the API is available over everyone's access to it. To keep the request rate low, I don't see any solution other than making the /health endpoint private.

As an alternative, I suggest that you monitor "user-need-driven" requests (for actual public transport data), specifically their rate of success/error and the time of the last successful one.
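To illustrate what monitoring "user-need-driven" requests could look like on the client side, here is a minimal sketch in Python. It assumes your application already makes requests to the API and can report each response's HTTP status code; the `RequestStats` class, its method names, and the one-hour window are hypothetical choices for illustration, not part of any *.transport.rest tooling.

```python
import time
from collections import deque
from typing import Callable, Deque, Optional, Tuple


class RequestStats:
    """Rolling success/error stats over requests the application makes anyway.

    No extra probe requests are sent; call record() after each real request.
    """

    def __init__(self, window_seconds: float = 3600.0,
                 clock: Callable[[], float] = time.time) -> None:
        self.window_seconds = window_seconds
        self.clock = clock  # injectable so the class can be tested
        self.last_success: Optional[float] = None
        self._events: Deque[Tuple[float, bool]] = deque()

    def record(self, status_code: int) -> None:
        """Store the outcome of one real request (2xx counts as success)."""
        now = self.clock()
        ok = 200 <= status_code < 300
        self._events.append((now, ok))
        if ok:
            self.last_success = now
        self._prune(now)

    def _prune(self, now: float) -> None:
        """Drop events older than the rolling window."""
        cutoff = now - self.window_seconds
        while self._events and self._events[0][0] < cutoff:
            self._events.popleft()

    def success_rate(self) -> Optional[float]:
        """Fraction of successful requests in the window, or None if empty."""
        self._prune(self.clock())
        if not self._events:
            return None
        return sum(ok for _, ok in self._events) / len(self._events)
```

An alert could then fire when `success_rate()` drops below a threshold you choose, or when `last_success` is older than some limit, without sending a single additional request towards HAFAS.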

It might also be worthwhile to add Prometheus-/OpenMetrics-compatible metrics to hafas-rest-api and expose them to the public, so you can ingest and monitor them.
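As a rough idea of what such publicly exposed metrics might look like, here is a sketch in Python that renders request counters in the Prometheus text exposition format. hafas-rest-api itself is a JavaScript project and, as far as this thread states, does not expose these metrics; the metric names `hafas_requests_total` and `hafas_last_success_timestamp_seconds` and the `render_metrics` function are hypothetical.

```python
from typing import Dict, Optional


def render_metrics(status_counts: Dict[int, int],
                   last_success: Optional[float]) -> str:
    """Render counters in the Prometheus text exposition format.

    status_counts maps an HTTP status code to the number of upstream
    requests that ended with it; last_success is a Unix timestamp or None.
    The metric names are made up for this sketch.
    """
    lines = [
        "# HELP hafas_requests_total Upstream HAFAS requests by HTTP status.",
        "# TYPE hafas_requests_total counter",
    ]
    for status in sorted(status_counts):
        lines.append(
            f'hafas_requests_total{{status="{status}"}} {status_counts[status]}'
        )
    if last_success is not None:
        lines += [
            "# HELP hafas_last_success_timestamp_seconds "
            "Unix time of the last successful upstream request.",
            "# TYPE hafas_last_success_timestamp_seconds gauge",
            f"hafas_last_success_timestamp_seconds {last_success}",
        ]
    return "\n".join(lines) + "\n"
```

Because the server would only report counts of requests it has already handled, any number of people could scrape such an endpoint without causing extra HAFAS traffic.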

derhuerst added the question (Further information is requested) label Jan 6, 2025

LilaHexe0 closed this as completed Jan 9, 2025

LilaHexe0 closed this as not planned Jan 9, 2025

derhuerst added the ops (operations) label Jan 9, 2025
