instaclustr product Cassandra monitoring logo
// Most trusted platform for open source

Cassandra Monitoring

Get API access to the information needed to effectively manage and review the health of your cluster.

Cassandra Monitoring API

Advantages of the Cassandra monitoring tool
  • LOREM IPSUM DOLOR SIT AME

    Understand immediately when your Cassandra instance is not performing as expected.

  • LOREM IPSUM DOLOR SIT AME

    Identify the root cause of any problem your database may be experiencing and fix it faster.

    Instaclustr has many years of experience providing Cassandra support and has Cassandra experts and consultants on staff capable of solving any issue.

  • LOREM IPSUM DOLOR SIT AME

    Save time digging through and analyzing multiple reports to understand your metrics.

Access Cassandra metrics

The Instaclustr Managed Platform provides a Monitoring Dashboard through our console. This enables important and quick access to key metrics that help you check the health of your cluster and better manage your overall Cassandra performance.

Our Cassandra Monitoring API enables access to Apache Cassandra performance metrics including, CPU utilization, disk utilization, reads and writes, latency tasks, pending compactions, live cells and tombstones per read, SSTable, thread pool task statistics, and more. The Instaclustr Console also delivers a visual view of these metrics as part of our monitoring dashboard.

For all Apache Cassandra clusters and nodes managed through the Instaclustr Managed Platform, you can start monitoring your Cassandra database in just a few minutes by creating your cluster from a centralized console. All data metrics data can be viewed either on a per-node basis or for all nodes in a cluster, helping you diagnose any issues and enabling better capacity planning. All available metrics are updated every 20 seconds.

Once you authenticate the monitoring API, the metric information viewable on our monitoring dashboard is available as part of our console

CPU Usage

Sustained high CPU usage is an indicator that your cluster is reaching processing capacity, and you may need to consider adding capacity to cope with any increase in load.

Disk Usage

Keeping an eye on disk usage should be a key part of your capacity planning. Maintaining disk usage at less than 70% during normal operations is recommended.

Read+Write Per Second

This metric helps you identify changing levels of load on your cluster. A significantly uneven distribution could indicate driver misconfiguration or data model issues.

Pending Compaction

Compactions are a continuous, background process in Cassandra. A high or increasing number of pending compactions indicates that your cluster does not have sufficient capacity to process the level of operations it requires.

Active/ Pending Repairs

Repair is a Cassandra operation that ensures data consistency is eventually attained across the ring. Repairs are a scheduled operation and represent an additional load on the cluster.

Partition Size

This metric checks size of the largest partition in each table. We recommended limiting the maximum partition size to 10MB for optimal performance, with 100MB as an upper limit for ongoing stability. Large partitions may significantly impact the performance of Cassandra operation.

Tombstones to Live Cells

Checks the average ratio of the number of tombstones and live cells per read in each table. High ratios of tombstones to live cells (greater than 5x as a starting guide) can cause substantially reduced performance in reads from a table.

SSTable Per Reads

Monitor the latest mean and maximum metrics for each column family, averaged across the cluster.
High numbers of SSTables per read (more than 3 or 4, as a starting guide) can reduce read performance, and if read performance is below desired levels you may need to change compaction strategy for the affected column family.

Replication Strategy Indicator

This indicator checks the replication class used for each keyspace. NetworkTopologyStrategy is highly recommended to ensure data is replicated in order to minimize the impact of infrastructure failure.

Thread Pool Metrics

These metrics are associated with each stage of Cassandra’s Staged Event-Driven Architecture (SEDA).

Spin up a cluster in minutes