The "Un-Vendor-Lock-In" Manifesto: Why Your CFO Will Finally Hug a Data Engineer

October 17, 2025

by James Hunt, Partner

You want an opinionated, zero-fluff take on this Apache Iceberg shift written from the trenches? Pull up a chair. Let's talk about why data warehousing architectures are broken, why your data lake has probably evolved into an expensive data swamp, and why building an open lakehouse isn't just a technical flex—it's the only baseline strategy that makes financial sense anymore.

The "Un-Vendor-Lock-In" Manifesto: Why Your CFO Will Finally Hug a Data Engineer

Every data vendor in the world is trying to sell you a proprietary walled garden disguise as an open platform. They give you slick UIs, promise you planetary-scale computation, and then quietly shackle your data behind custom layouts and egress taxes.

It’s less like a tech partnership and more like a timeshare agreement you can’t escape. If compute prices rise, you are stuck paying a premium just to look at your own data.

At Vivanti, we have a simple rule that guides our global team: Technology is only as valuable as the real-world client benefits it unlocks.

Don't believe me? Look at what happens when you dismantle the silos. We recently dropped into a major bank to execute a data mesh transformation. They were drowning in fragmented reporting—Sharepoint sheets, fragile Excel layouts, and duplicated Power BI dashboards.

We decoupled their storage from the query engines. We implemented a data mesh and lakehouse architecture on AWS atop Amazon S3 and Apache Iceberg.

The results?

200% faster data pipeline execution time, turning monthly batch processing into a daily breeze.
10x faster serving of data to end-user dashboards, smashing refresh metrics from minutes down to seconds.
400% reduction in time for new table design, crushing a six-month new source development timeline into just four weeks.

Smashing the Proprietary Data Prisons

For years, enterprises bought into a 30-year-old compromise: you separate the systems that run the business from the systems that help you analyze it. Databases handled low-latency transactions; warehouses handled historical analysis. They never truly met.

But in the age of inference, your data cloud is your AI strategy. If your data lake doesn’t provide a highly reliable, point-in-time consistent view of historical data, your AI models are going to hallucinate loudly in front of your customers. AI models are like toddlers—they will ingest whatever un-sanitized garbage you accidentally set down in front of them.

This is why we push Apache Iceberg.

Iceberg is not just another file format. Parquet is a file format. Iceberg is a table format that manages those Parquet files directly in your own Amazon S3 data lake, introducing full ACID compliance, time travel, and native schema evolution straight to object storage. It effectively acts as a relational database layer sitting on top of an infinite object store.

Capability	What It Actually Means for Systems
ACID Transactions	Multiple engines can concurrently write, update, and delete object storage files without risking state corruption.
Time Travel	You can instantly query immutable snapshots of the lake exactly as it existed at any specific second in the past.
Schema Evolution	You update table structures (add, rename, drop columns) via standard SQL without breaking downstream analytics or rebuilding schemas from scratch.

We coupled this rock-solid data foundation at Cochlear, delivering an AI Walled-Garden hosted within an AWS VPC, powered by AWS Bedrock, and integrated with Snowflake. By utilizing Iceberg table formats for consistent experimentation outcomes, the foundational patterns for scaling AI safely were established with estimated cost savings reaching ~$20M USD.

Moving from Weeks of Toil to "Is It Done Already?"

Innovation stalls when your data lake is so fragile that a single bad update breaks an entire ingestion pipeline, forcing your data engineers to pull an all-nighter mucking about with brittle file fixing.

By bringing Iceberg onto AWS, we leverage native integrations like AWS Glue Data Catalog and AWS Lake Formation to enforce fine-grained access controls and track metadata lineages automatically across disparate data lakes. We have built pipelines that do not snap when an upstream software engineer modifies a schema.

Your existing analytical teams don't need to retrain in complex big-data runtimes; they query massive datasets using the standard ANSI-SQL skills they already possess. We turn the data lake into an accessible, open asset rather than an intimidating bottleneck.

We applied this exact philosophy of radical efficiency for an insurance client, transforming a tedious, weeks-long historical actuarial data preparation process into a fully automated, 20-minute breeze.

At the end of the day, it is your data. You shouldn't have to pay a toll or a licensing fee to touch it. Apache Iceberg gives you total financial and architectural autonomy. We don’t start with the tool; we start with your systems constraints and business goals. Then we design the architecture that gets you there.

Curious about how to un-vendor-lock your own data fabric or stand up an open lakehouse pipeline that scales to exabytes without blowing up your compute budget? Let's trace your system paths and see where the bottlenecks are.

Happy Hacking!

The "Un-Vendor-Lock-In" Manifesto: Why Your CFO Will Finally Hug a Data Engineer

The "Un-Vendor-Lock-In" Manifesto: Why Your CFO Will Finally Hug a Data Engineer

Smashing the Proprietary Data Prisons

Moving from Weeks of Toil to "Is It Done Already?"

More articles

AI alone is not the strategy. Industrialisation is.

AI Gateway — One control plane for every AI model in your enterprise

Tell us about your project

Our offices