March 14, 2024
Epics: What is an Agile Epic?
Introduction If you use or are interested in an agile approach to delivery, “epic” is a term you want to understand. Epics are a way of organising large items of…
March 14, 2024
The 5 Vs of Big Data
What is big data? Big data refers to large, complex data sets often derived from multiple and new sources. The reason that “big data” has received its own term (after…
March 14, 2024
Basic Analytics vs Advanced Analytics
What is “Basic Analytics”? Basic analytics, most commonly just referred to as analytics, focuses on understanding historical data. This understanding is achieved by extracting valuable insights from aggregated, clean datasets.…
March 14, 2024
What is a Data Platform?
A data platform is an integrated technological infrastructure that empowers organisations to collect, store, process, manage, analyse, and extract valuable insights from massive volumes of structured and unstructured data. All…
March 14, 2024
What Is Databricks And What’s It Used For?
While some people are very familiar with Databricks, others might not know as much. We thought it would be a good idea to break down what Databricks is, explore what…
March 14, 2024
How to run a hello world program in Databricks – Part 2
Introduction In our previous article, we saw how to get onboarded with Databricks and run your first hello world application using Spark. The first step involved creating a community…
March 14, 2024
How to run a hello world program in Databricks – Part 1
Let's get started In this post, we're going to get you started on Databricks, but first let's take a quick look at the history of the big data space. Spark has…
March 14, 2024
Databricks productivity tips and tricks
Databricks is one of the major players in the data space, but some people may still treat it like an optimised hosted service of Apache Spark. It's true that Spark…
March 13, 2024
Navigating the data lake using Rust – Part Two
Introduction In part one of this series, we discussed how Delta Lake has become the standard data format for a lakehouse architecture, and how to create and write data to…