H

Principal Data Engineer

Harnham · United States

Full-timeStaff+PythonAWS

🔥16 people viewed this job

About the Role

Our client is looking for a Principal Data Engineer to join an engineering team building modern, enterprise-grade data platforms. This role is ideal for an experienced engineer who can define data strategy, own platform decisions end-to-end, and contribute to technical leadership across the team.In this role, you will design scalable data lakes, warehouses, and pipelines, define governance and quality standards, and drive data platform modernization across real, in-flight work where performance, reliability, and security are critical. You'll mentor more junior engineers, partner with leadership on data strategy, and bring an AI-forward mindset. You shape how AI systems integrate into enterprise operations, how teams move at real velocity, and how products create measurable impact for clients and the people they serve. We are looking for people who take ownership, lead with craft, and care deeply about their work.What You'll DoCraft & Delivery • Define data architecture and platform strategy, leading design across pipelines, warehouses, and data lakes • Build and optimize scalable data pipelines supporting batch and real-time processing • Define and enforce data governance, quality standards, and compliance frameworks across the platform • Build monitoring, logging, and alerting for data pipelines and services, and contribute to CI/CD workflows for data deployment and automation • Drive data platform modernization, optimizing for performance, cost, and scalability • Bring an AI-forward mindset to your daily work, using tools like Claude, Cursor, and other modern AI assistants to ship higher-quality work at pace • Design and implement data contracts and event flows in collaboration with backend, platform, and engineering teams • Lead the design and implementation of data pipelines for production AI/ML systems, including embeddings, vector stores, RAG data preparation, feature stores, and training/inference data flows • Integrate data services with APIs, middleware, and third-party systems to support end-to-end data consumptionCollaboration & Communication • Partner with leadership on data strategy, translating technical depth into decisions others can act on • Collaborate closely with engineering, analytics, AI, and product teams to align data platforms with broader goals • Advocate for data quality, governance, and platform best practices across teams and engagements What You'll Bring • 7+ years of professional data engineering experience, with experience leading complex data platform initiatives • Strong system architecture background with expertise in distributed data systems • Strong experience in AWS • Expert proficiency in Python, Scala, and SQL • Deep expertise with cloud-native data platforms and enterprise data warehousing • Strong expertise in data pipeline orchestration and processing • Strong experience with streaming platforms and real-time data processing (e.g., Kafka, Kinesis, Pub/Sub) • Strong data modeling expertise and experience with data transformation • Strong experience with data quality, governance, and compliance frameworks • Strong experience with container orchestration and CI/CD for data systems • Strong experience building data pipelines for production AI/ML systems, including embeddings, vector stores, RAG data preparation, feature stores, and training/inference data flows • Demonstrated leadership and technical mentoring experience across a team or organization • Strong stakeholder communication skills, with the ability to translate technical depth across audiences • Demonstrable, day-to-day usage and expert knowledge of AI-forward coding tools such as Claude and Cursor • Excellent problem-solving skills and the ability to navigate highly ambiguous technical and business challenges with sound judgment • Experience with data mesh or data fabric concepts, lakehouse architectures, or governance framework implementation is a plus Helpful Extras and Unique Skills • Experience with handling and modeling data in the healthcare industry is a plus • AWS certifications, like Certified Data Engineer – Associate, strongly preferred

Harnham is a suburb of the city of Salisbury in Wiltshire, England, centred about 0.6 miles (1 km) south of Salisbury Cathedral and across the River Avon. Harnham is split into the areas of West Harnham and East Harnham.

💬 Developer Questions

Ask the team a question — answers show up here

🎯

What does the interview process look like?

🤖

What AI/vibe coding tools does the team use daily?

👥

How big is the engineering team?

Is the team fully async or are there required meetings?

🚀

What does onboarding look like for remote hires?

🔧

Can you share more about the tech stack and architecture?

📈

What does career growth look like in this role?

📅

What does a typical day look like?

💰

Is there a salary range you can share?

📊

Is equity or stock options part of the package?

🌍

Are there timezone requirements or preferences?

🛂

Do you sponsor work visas?

🏢 Is this your listing? Claim it to answer questions

Similar Jobs

Helpful resources

Hiring for a similar role? Post your job here — it's free →