Paul Brebner

Paul Brebner

Technology Evangelist

Since learning to program on a VAX 11/780, Paul has extensive R&D and consulting experience in distributed systems, technology innovation, software architecture and engineering, software performance and scalability, grid and cloud computing, and data analytics and machine learning.

Paul is the Technology Evangelist at Instaclustr. He’s been learning new scalable technologies, solving realistic problems and building applications, and blogging about Apache Cassandra, Spark, Zeppelin, and Kafka.

Paul has worked at UNSW, several tech start-ups, CSIRO, UCL (UK), & NICTA. Paul has helped pre-empt and solve significant software architecture and performance problems for clients including Defence and NBN Co. Paul has an MSc in Machine Learning and a BSc (Computer Science and Philosophy).

Research net profile

Paul's Articles

ApacheCon Berlin, 22-24 October 2019

Monday 2nd December 2019

ApacheCon Europe, October 22-24, 2019, Kulturbrauerei Berlin #ACEU19 What’s better than one ApacheCon? Another ApacheCon! This year there were two Apache Conferences, one in Las Vegas and then again in Berlin. They...

Read more

Kongo 5.2: Apache Kafka Streams Examples

Tuesday 29th May 2018

Dr Black has been murdered in the Billiard Room with a Candlestick! Whodunnit?! In this blog, we’ll have a look at some simple Kafka Streams examples using the murder mystery game Cluedo (Clue...

Read more

Kongo 5.1: Apache Kafka Streams Introduction

Tuesday 29th May 2018

Abstract Apache Kafka Streams is a framework for stream data processing. In this blog, we’ll introduce Kafka Streams concepts and take a look at one of the DSL operations, Joins, in more detail....

Read more

Apache Kafka Connect Architecture Overview

Wednesday 9th May 2018

Kafka Connect is an API and ecosystem of 3rd party connectors that enables Apache Kafka to be scalable, reliable, and easily integrated with other heterogeneous systems (such as Cassandra, Spark, and Elassandra) without...

Read more

Spark Structured Streaming with DataFrames

Tuesday 28th November 2017

This blog provides an exploration of Spark Structured Streaming with DataFrames The blog extends the previous Spark MLLib Instametrics data prediction blog example to make predictions from streaming data.  We demonstrate a two-phase...

Read more

Behind the Scenes

Wednesday 25th October 2017

Spoiler alert! Kubrick’s scientific consultant Frederick Ordway once revealed that Kubrick had the props for the film destroyed because he didn’t want to ruin the illusion of 2001 for people.  If you prefer...

Read more

Fourth Contact with a Monolith

Friday 20th October 2017

“The thing’s hollow — it goes on forever — and — oh my God! — it’s full of stars!” It’s full of Spreadsheets! (DataFrames) Given that a dog, Laika, was the 1st astronaut...

Read more

Hello Cassandra! A Java Client Example

Thursday 7th September 2017

This is the third (and final) part of my blog-series on creating a demonstration Cassandra cluster, connecting, and communicating. We landed on the moon and made Second Contact with the Monolith (CQL shell)...

Read more

Cassandra Cluster Creation in Under 10 Minutes

Tuesday 29th August 2017

Enough Information I watched the classic movie “2001: a Space Odyssey” for the nth time on the weekend.  My previous favourite quote from HAL (the eventually paranoid and murderous ship AI) was:  ...

Read more


Spin up a cluster in less
than 5 minutes.
(No credit card required)

Sign Up Now

Site by Swell Design Group