Senior vs Junior Data Engineer #dataengineering #modernstack
The first time I was hired to work in data, it felt so exciting - I was going to be creating business value, working on AI/ML pipelines, and implementing cutting-edge technology. Then the reality quickly set in: The data infrastructure had been completely neglected for years. No one had established clear data domains. There was so much data debt I legitimately could not understand what the most important tables in the company were supposed to be doing or why they were even important. OK, not great...but then things started breaking. When the alarms began going off, people came to yell at ME. They got annoyed that I asked them to file a ticket, and then they got even more annoyed when I didn't get to their ticket in time. I would try to explain that these data quality issues weren't my fault or the fault of my team - but no one cared to listen, they just wanted their dashboard to look right so the CMO wouldn't keep hounding them that the numbers were wrong. It dawned on me that my role wasn't about building cool features or pushing the boundary on data management- it was a glorified customer support agent for SQL. The next data role I had wasn't much different - everything I had seen in the first position reared its head again - but this time, my mission was to change the culture and the people, NOT to spend my entire day languishing in a dark room writing comments asking for more information on JIRA. Now, don't get me wrong, on-call is a part of life. But when it feels like it is the ONLY thing your time is spent working on the mental burden can be worse can be worse than the paycheck you get every two weeks. And the absolute worst part is that you become non-essential in the eyes of leadership. You aren't adding to the bottom line like a software engineer working in a product org...you're a cost center. And cost centers are the first to get cut. If you are a data leader, I believe your number one goal is to make data engineering MANAGABLE for your organization. Buying an analytical database and ELT/ETL tool is just the mechanics. How do you people use those tools? What's the standard operating procedures? How do you deal with quality issues? Who is accountable? How do you decide what is a P0 and what's a P1/P2? Where do you NEED governance and quality versus where is it a nice to have? Does the rest of the business know their role in data generation and incident management? What VALUE is your team adding, how do you prove that value? Unless you can confidently answer those questions (and more) your team is going to spend more cycles hating their jobs than doing useful work. Good luck!