Instaclustr Blog Archive
2025 | Page 2
- Apache Kafka
- Dev Rel
How to size Apache Kafka® clusters for Tiered Storage: Part 2
A Kafka performance model for SSDs/EBS, network, I/O, and brokers
In Part 1 of this new series, we explored the question: how do you resize a Kafka cluster (using Solid State Drives [SSDs] for local storage) for producer and consumer workloads on topics with remote Tiered Storage enabled? Now in Part 2, we’re going to...
Paul Brebner, March 19, 2025
- Apache Cassandra
- Dev Rel
- OpenSearch
Introduction to similarity search: Part 2–Simplifying with Apache Cassandra® 5’s new vector data type
In Part 1 of this series, we explored how you can combine Cassandra 4 and OpenSearch to perform similarity searches with word embeddings. While that approach is powerful, it requires managing two different systems. But with the release of Cassandra 5, things become much simpler. Cassandra 5 introduces a native VECTOR data type and built-in...
Murilo Miranda, March 17, 2025
- Feature Releases
- Open Source
Introducing the ROI Calculator for open source services
Crunch the numbers and count your savings
Managing open source technologies can be a heavy and expensive lift, draining your time, resources, expertise, and ultimately, your budget. No two enterprises are ever alike, and how they manage their data is likely an even larger gap. With so many different factors at play, accurately determining the...
Michael Reynolds, March 12, 2025
- Apache Kafka
- Dev Rel
How to size Apache Kafka® clusters for Tiered Storage: Part 1–A Kafka performance model for SSDs, network, and I/O
Introduction: The next phase of Kafka Tiered Storage
In my previous blog series, I explored how Apache Kafka Tiered Storage is more like a dam than a fountain by comparing local vs remote storage (Part 1), performance results (Part 2), Kafka time and space (Part 3), and the impact of various consumer behaviors on the...
Paul Brebner, March 05, 2025
- Apache Cassandra
- Dev Rel
- OpenSearch
Introduction to similarity search with word embeddings: Part 1–Apache Cassandra® 4.0 and OpenSearch®
Word embeddings have revolutionized how we approach tasks like natural language processing, search, and recommendation engines. They allow us to convert words and phrases into numerical representations (vectors) that capture their meaning based on the context in which they appear. Word embeddings are especially useful for tasks where traditional keyword searches fall short, such as...
Murilo Miranda, March 05, 2025
- Open Source
Harnessing managed open source: The future of data infrastructure
Whether you’re exploring managed open source solutions or scaling your existing systems, these advancements can help your organization unlock the full potential of your data.
Instaclustr, March 04, 2025
- Apache Cassandra
- News
IBM acquires DataStax: What that means for customers–and why Instaclustr is a smart alternative
IBM’s recent acquisition of DataStax has certainly made waves in the tech industry. With IBM’s expanding influence in data solutions and DataStax’s reputation for advancing Apache Cassandra® technology, this acquisition could signal a shift in the database management landscape. For businesses currently using DataStax, this news might have sparked questions about what the future holds....
Michael Reynolds, February 28, 2025
- ClickHouse
Instaclustr for ClickHouse® on Azure now in Preview
Instaclustr for ClickHouse® is now available in preview for Azure on the NetApp® Instaclustr Managed Platform. ClickHouse is an open source, column-oriented database management system renowned for its lightning-fast query processing and high compression ratios. It is commonly used for real-time analytics, log and event analysis, and machine learning capabilities to support AI use...
Instaclustr, February 13, 2025