4.2 Grafana Dashboards

If you used the scripts provided to install System Insight, the sample dashboards are included in this installation.

If you installed System Insight manually and have your own installation of Grafana, you can import the dashboards provided in the directory $MZ_HOME/scripts/str-templates/system-insight/dashboards into your Grafana installation.
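As an alternative to importing each file through the Grafana user interface, the import can be scripted. The following is a minimal sketch, not a supported tool. It assumes that the dashboards directory contains dashboard JSON files, that your Grafana version exposes the `POST /api/dashboards/db` HTTP API endpoint, and that you have created an API token. The function names, base URL, and token are hypothetical placeholders. Dashboards that declare data source inputs may still need to be imported through the user interface.

```python
import json
import urllib.request
from pathlib import Path


def build_import_payload(dashboard: dict, overwrite: bool = True) -> dict:
    """Wrap a dashboard definition in the body sent to POST /api/dashboards/db."""
    dashboard = dict(dashboard)  # shallow copy so the caller's dict is untouched
    dashboard["id"] = None       # a null id lets Grafana assign its own internal id
    return {"dashboard": dashboard, "overwrite": overwrite}


def import_dashboards(directory: str, base_url: str, token: str) -> None:
    """POST every *.json file in `directory` to the Grafana instance at `base_url`."""
    for path in sorted(Path(directory).glob("*.json")):
        dashboard = json.loads(path.read_text(encoding="utf-8"))
        request = urllib.request.Request(
            f"{base_url.rstrip('/')}/api/dashboards/db",
            data=json.dumps(build_import_payload(dashboard)).encode("utf-8"),
            headers={
                "Content-Type": "application/json",
                "Authorization": f"Bearer {token}",  # assumed: API token authentication
            },
            method="POST",
        )
        with urllib.request.urlopen(request) as response:
            print(path.name, response.status)
```

A call might look like `import_dashboards(os.path.expandvars("$MZ_HOME/scripts/str-templates/system-insight/dashboards"), "https://<your grafana installation url>:3000", token)`, with `os` imported and `token` set to your own API token.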

After installing the sample dashboards, go to https://<your grafana installation url>:3000/.

Dashboards Design

The sample dashboards rely heavily on the Grafana concept of Templates to provide filtering possibilities to narrow down the scope of the data displayed. Examples of this are templates to enable filtering on service, pico instance or workflow name. For further information on Grafana Templates, see http://docs.grafana.org/reference/templating/. 
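In a dashboard's JSON definition, templates are stored as variables under `templating.list`. The sketch below lists the variables that a dashboard defines, which can help when you decide which filters to keep or change. The trimmed dashboard and the variable names `server` and `pico_instance` are illustrative assumptions, not the names used in the sample dashboards.

```python
import json


def template_variable_names(dashboard: dict) -> list:
    """Return the names of the template variables defined in a dashboard JSON model."""
    variables = dashboard.get("templating", {}).get("list", [])
    return [variable["name"] for variable in variables]


# Hypothetical, heavily trimmed dashboard definition used only for illustration.
example = json.loads("""
{
  "title": "Example",
  "templating": {"list": [{"name": "server", "type": "query"},
                          {"name": "pico_instance", "type": "query"}]}
}
""")

print(template_variable_names(example))  # ['server', 'pico_instance']
```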

Another feature frequently used in the dashboards is Repeat Row/Repeat Panel, which lets you design a row or panel once and then replicate it for each value selected in a Template. An example of this can be seen in the Host dashboard, where the rows are repeated once per server selected in the Server template.
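In the dashboard JSON, a repeated row or panel carries a `repeat` field that names the template variable it is replicated over. The following sketch finds such rows and panels. It assumes one of two layouts, a top-level `rows` list (older dashboard schemas) or a flat `panels` list with nested `panels` inside rows (newer schemas); the example dashboard and the variable name `server` are hypothetical.

```python
def find_repeats(dashboard: dict) -> list:
    """Return (title, variable) pairs for every row or panel that has `repeat` set."""
    found = []

    def visit(items):
        for item in items:
            if item.get("repeat"):
                found.append((item.get("title", ""), item["repeat"]))
            visit(item.get("panels", []))  # panels nested inside a row

    visit(dashboard.get("rows", []))    # assumed older layout: top-level rows
    visit(dashboard.get("panels", []))  # assumed newer layout: flat panel list
    return found


# Hypothetical dashboard fragment: one row repeated per selected server.
example = {
    "panels": [
        {"type": "row", "title": "$server", "repeat": "server", "panels": []},
        {"type": "graph", "title": "CPU Utilization"},
    ]
}

print(find_repeats(example))  # [('$server', 'server')]
```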

All graphs and panels provide a tooltip that summarizes the intent of the graph, the data from which it is derived, and any specific configuration applied to the display.

Sample Dashboards

Six sample dashboards are provided. Each graph and panel has a tooltip with information to help you decide how to customize the view for your requirements. To display a tooltip, hover your cursor over the i in the top left-hand corner of the table or graph.

The sample dashboards provided are:

Overview 

The Overview dashboard

The Overview dashboard provides an overview of the system, focusing on high-level statistics. Using templates, you can filter on server, pico type and pico instance. The dashboard includes the following graphs and panels:

  • Platform uptime
  • Pico uptimes
  • Throughput per execution context
  • CPU usage per host
  • JVM Memory usage
  • Network I/O per host
  • Storage I/O per host

Host

The Host dashboard

The Host dashboard provides CPU and storage utilization data for the servers hosting the system. Using templates, you can filter on servers and mount directories. The dashboard includes the following graphs and panels:

  • CPU Utilization
  • CPU Over Time
  • Server Uptime
  • Pico Uptimes
  • Swap Space Usage
  • Swap Activity
  • Disk Usage
  • Disk I/O

Network

The Network dashboard

The Network dashboard provides I/O information on the network interfaces of the servers running the system. Using templates, you can filter on server and network interfaces. The dashboard includes the following graphs and panels:

  • Traffic (bytes)
  • Traffic (packets)
  • Packets dropped and errors
  • Network statistics on <host name>

Pico

The Pico dashboard

The Pico dashboard provides pico-related data with a focus on JVM details, for example, uptime and garbage collections. Using templates, you can filter the data and specify an interval for the garbage collection details. The dashboard includes the following graphs and panels:

  • Pico Uptime
  • Garbage Collection last 5m
  • Average Duration of Garbage Collections last 5m
  • Memory Usage
  • Active Threads

Trends

The Trends dashboard

The Trends dashboard provides a comparison between high and low resolution data to see trends in CPU and JVM memory utilization. Using templates, you can filter on server, pico type and pico instance. The dashboard includes the following graphs and panels:

  • CPU Utilization Hourly
  • JVM Memory Usage Last Hour
  • CPU Utilization Weekly
  • JVM Memory Weekly

Workflows

The Workflows dashboard

The Workflows dashboard provides basic information about running workflows with a focus on throughput. Using templates, you can filter on Execution Context and workflow details. The dashboard includes the following graphs and panels:

  • Workflow Throughput (panel)
  • Running Workflows
  • Throughput per Execution Context
  • Workflow Throughput (graph)