Link Search Menu Expand Document
Consortium of European Social Science Data Archives

Observability

CESSDA IT observability is a business process that collects and analyses data from its infrastructure and leverages that data to improve business results and drive value creation for the organization. Observability of the infrastructure includes:

(one of CDC, CVS, EQB, ELSST)

  • Infrastructure Management tools, CI/CT process with Jenkins

The observability strategy is made up of three pillars:

CESSDA Observability

  1. Metrics Monitoring: Metrics represent the raw measurements of resource usage or behaviour that can be observed and collected throughout the infrastructure. The monitoring system collects, exposes, and analyses those values to improve awareness of the characteristics and behaviour of the components the the core products are composed of.

  2. Logging & aggregation: Logs are detailed records of events that occur within the components or the Infrastructure in general. The central logging system collects, aggregates, filters and enriches the contents of the logs and enables visualisation of system and product events, which can aid debugging and problem solving.

  3. Alerting: Alerting is the responsive component of the observability system that performs actions based on changes in metrics, threshold violations or/and the occurrence of unacceptable conditions. Alert notification, sent via email or dedicated Slack channels, are based on the following elements:

  • Metrics: that are generated from the monitoring system.

  • Logs: produced from the logging system if failures or unacceptable conditions occur.

  • Service Endpoints: availability is monitored by Uptime Robot.

  • Processes: generated when there are failures in the CI/CT processes.


Table of contents