Cassandra Monitoring

Get API access to information to manage and review the health of your cluster.

Sidebar Menu

Cassandra monitoring API

Our Cassandra Monitoring API enables access to Apache Cassandra performance metrics, such as: CPU utilization, disk utilization, reads and write, latency tasks, pending compactions, live cells and tombstones per read, sstable, task statistics of thread pool and more. The Instaclustr Console also delivers a visual view of these metrics as part of our monitoring dashboard.

Get started in minutes

For all Apache Cassandra clusters and nodes managed through the Instaclustr Managed Platform you can get started with monitoring your Cassandra database within just a few minutes of creating your cluster from a centralised console. All metrics data can be viewed on a per node basis or for all nodes in a cluster to help you diagnose issue and plan capacity.  All available metrics are updated every 20 seconds.

The metric information viewable on our monitoring dashboard is available as part of our console once you authenticate the monitoring API.

Get in Touch

Advantages of Monitoring Tool

Understand immediately when your Cassandra instance is not performing as expected.

Identify the root cause of any problem your database may be experiencing fix it fast.

Save time digging through and analyzing multiple reports to understand your metrics.

Cluster Summary

View the summary of key metrics to understand the health of your cluster at a glance.

Metrics List

Get a tabular view of key metrics which is great for understanding relatively static information (for example, tombstones per read over the last 5 minutes)

Metrics Graphs View

A graphical view of metric values over time – ideal for pinpointing the time when there was a sudden change in a metric value or identifying trends over time.

Cluster Health Check

Instaclustr’s Cluster Health check exposes a  number of indicators to help you understand your cluster’s long term performance. There are three potential states for each indicator:

  • Green represents a healthy state;
  • Amber represents a warning state; and
  • Red represents failed state.

View important Cassandra node metrics and find the problem

The Instaclustr Managed Platform provides a Monitoring Dashboard through our console that enabled access to key metrics that help you with checking the health of your cluster and to manage overall Cassandra performance.

CPU Usage

A sustained high CPU usage is an indicator that your cluster is reaching processing capacity and you may need to consider adding capacity to cope with any increase in load.

Disk Usage

Keeping an eye on disk usage should be a key part of your capacity planning. Maintaining disk usage at less than 70% during normal operations is recommended.

Read+Write per Second

This metrics helps you identify changing levels of load on your cluster. A significantly uneven distribution could indicate misconfiguration of the driver or data model issues.

Reads and Latency

The Metris is great for diagnosing whether read issues are impacting all tables (capacity or infrastructure issue) or a single table (data issue).

Pending Compaction

Compactions are a continuous, background process in Cassandra. A high or increasing number of pending compactions indicates that your cluster does not have sufficient capacity to process the level of operations it processes.

Active / Pending Repairs

Repair is a Cassandra operation ensures that data consistency is eventually attained across the ring. Repairs are a scheduled operation and represent an additional load on the cluster.

Partition Size

Checks the size of the largest partition in each table. We recommended limiting the maximum partition size to 10MB for optimal performance with 100MB as un upper limit for ongoing stability. Large partitions may significantly impact the performance of Cassandra operation

Tombstones to Live Cells

Checks the average ratio of the number of tombstones and live cells per read in each table. High ratios of tombstones to live cells (greater than 5x as a starting guide) can cause substantially reduced performance in reads from a table.

SSTables Per Read

Monitor the latest mean and maximum metrics for each column family, averaged across the cluster.
High numbers of sstables per read (typically, more than 3 or 4 as a guide) can reduce read performance and if read performance is below desired levels you many need to change compaction strategy for the affected column family.

Replication Strategy Indicator

checks the replication class used for each keyspace. NetworkTopologyStrategy is highly recommended to ensure data is replicated to minimise impact of likely failures in your infrastructure

Replications Factor Indicator

The Replication Factor indicator checks the number of replicas set for each datacenter. A replication factor of at least 3 is required for Instaclustr SLAs to apply and highly recommended for data protection and high availability.

Thread Pool Metrics

Associated with each stage in Cassandra’s Staged Event-Driven Architecture (SEDA).

Monitoring API

Instaclustr’s monitoring API is designed to allow you to integrate the monitoring information from your Instaclustr managed cluster with external monitoring tools such as DataDog or Prometheus so you can easily construct a single monitoring view of your application stack. 

Our Support Portal has details of all the Monitoring tool that we provide and how you can get the most of our console monitoring feature.

Related Solutions

Site by Swell Design Group