Main goal of Data Platform
Data Platform Series
- 1.What is data platform
- 2.Main goal of Data Platform (current)
- 3.How that goal is achieved
- 4.Why is it important
- 5.How it evolves
Main goal of Data Platform
“We help data guide the way”
It’s what the data platform team does.
Given these requirements, we can now start to think more clearly about who should be our first hire in this area. We’d like them to have:
That first person will need a lot of context, but compactly. Working on the right thing is paramount as it’s trivial to spend whole days and weeks on problems that might have a simple temporary workaround. Experience and pragmatism are definitely of high value.
How to fail less with the first hire?
The first aspect I’d like to touch on is the technical choice of build vs. buy and some guidance on tools. Not that the choices are made beforehand, but that there is clarity on (cloud) platform to be used, budget, vendor selection process, and other considerations. Same with headcount plans – it makes a significant difference whether there is a plan to hire a team of 6 within a few months or maybe hire one other person next year.
In the first couple of weeks, there should be some meetings set up in addition to the regular things (direct lead, closest stakeholders, etc.) :
The aim of those meetings is twofold – an explanation of what’s a data platform and what’s going to happen when, but also not to make a wrong turn right off the gates.
What else to keep in mind?
Everyone benefits from having someone to discuss problems and solutions with. Especially in this case, as data engineering work is a systems integration work and the challenges are somewhat unique. Things like how to store microsecond precision data in a system with millisecond precision. Or what would be a valuable way to partition some of the larger datasets. Or how to create a proper audit trail for the sensitive data. Or what happens when they go on vacation or fall sick?
My point is that the inability of a single person in a specific function to talk about these things can be a problem. The first solution is that the direct lead should be interested and collaborative and have time to help. The second solution is to hire another data engineer. And the third solution, in case the first two are not an option, is to encourage and support the single data engineer to attend some meetups and conferences.
What next?
It feels like a good place to introduce how I see the progression of value in the area of data engineering/platform.
The number of humans working on data engineering:
The following article will be about the how part – “*we help data guide the way … **by x, y, z*“.