Journal/Data lakes/The data lake question every CFO should ask first.

The data lake question every CFO should ask first.

Before you buy another SaaS dashboard, answer this one question. If you can't, you don't have a data problem — you have a procurement problem.

Published
Apr 14, 2026
Reading time
7 minutes
Category
Data lakes

There is one question every finance leader should be able to answer about their company's data — and almost none can. It is not "how much do we have," "where is it stored," or "is it in the cloud." It is this:

If I asked our systems, in plain English, "which of our customers cost us money last quarter," — could anyone here answer it without opening a spreadsheet?

If the answer is no — and it almost always is — the problem you are about to solve by buying another dashboard is not the problem you actually have.

01. The procurement trap

Most mid-sized firms own three to six SaaS systems that each contain a partial answer to that question. The CRM has the customers. The accounting tool has the cost. The support ticket platform has the time spent. The project tool has the deliverables. Each tool ships with a dashboard. None of them speak to each other.

So a finance leader, looking at a flat margin and a slow quarter, does the rational thing: she asks for a unified view. The answer arrives as a proposal for a seventh tool — a "BI platform" or an "analytics suite" — that promises to draw from the other six.

This rarely works, because the seventh tool inherits the same boundaries as the first six. The question crosses systems. The integration does not.

Buying analytics on top of disconnected systems is like buying a translator who only speaks one of the languages. — a CFO, six months in

02. What the lake actually buys you

A data lake — and we mean a small one, a few terabytes at most for a mid-sized firm — is not a technology choice. It is a posture. It says: the answers to the questions we will ask in the next three years live across systems we don't yet know we'll buy. Let's collect the raw material in one place we control.

In practical terms, this means a Postgres database, an S3 bucket, or a managed table format like Iceberg, fed by reliable connectors from each system of record. Not a copy of every dashboard. A copy of the underlying data the dashboards were built on.

The question we opened with — which customers cost us money — is unanswerable in any single SaaS tool. In a lake, it is a single SQL query, written once, run forever.

03. The CFO's checklist before the next purchase

We ask finance leaders to answer four questions before we'll help them buy anything new:

  1. What is the question you cannot currently answer? Write it down. One sentence.
  2. Which two or more systems hold parts of the answer? Name them.
  3. What would it cost to copy the underlying tables — not the dashboards — from each system into one place? It is almost always less than the seventh tool.
  4. Who would write the query? Not "who would build the dashboard." Who would write the SQL.

If you can answer all four, you don't need our help to do this. If you can't answer any of them, we can find the answers in a week — and you will not need the seventh tool.

Rule of thumb: the cost of a small, owned data lake is roughly the annual contract on one mid-tier BI tool. The lake compounds. The contract renews.

A short closing

The procurement reflex in most companies is to add tools. The compounding move is to consolidate the data underneath them. CFOs who get this right do not end up with more systems — they end up needing fewer of them, because the question they wanted answered six months ago is now a query, not a project.


Filed under: DATA · STRATEGY
First published: Apr 14, 2026