About the Role
Note: This is a remote position open to candidates in the USA. CrowdStrike is a global leader in cybersecurity, dedicated to stopping breaches with its advanced AI-native platform. The company is seeking a Principal Data Engineer with expertise in Large Language Models (LLMs) to design and deploy data infrastructure for AI-driven security products, focusing on scalable solutions and engineering excellence.
Responsibilities
- Architect, implement, and optimize data platforms and pipelines specifically designed to support LLMs, Retrieval-Augmented Generation (RAG), and sophisticated AI agentic systems at Exabyte scale
- Drive the adoption and deployment of agentic workflows and agent harnessing techniques to create autonomous, data-driven security features
- Design and implement highly scalable, fault-tolerant, and cost-effective data solutions, emphasizing rapid iteration and high-quality deployment
- Write elegant, production-ready code with a focus on performance, maintainability, and testing rigor, ensuring the ability to ship fast without compromising quality
- Provide technical leadership and deep expertise in data modeling, normalization, and semantic cataloging for AI/ML workloads
- Establish best practices for MLOps/DataOps surrounding LLMs, including monitoring, observability, and zero-touch recovery mechanisms for AI services
- Actively mentor engineers, conducting technical workshops, leading design reviews, and strengthening the team's knowledge in cutting-edge AI platform technologies
- Collaborate across the organization with Data Scientists, Product Managers, and other engineering teams to transform research prototypes into robust, production-grade services
- Own the end-to-end lifecycle of critical data services: development, testing, deployment, and monitoring
Skills
- Master's degree or PhD in Computer Science, Data Engineering, or a related STEM field, or equivalent practical experience
- 10+ years of progressive experience in Data Engineering/Platform Engineering, with at least 3 years focused on architecting and building platforms for AI/ML or Data Science at massive scale
- Demonstrable hands-on experience in LLM engineering (fine-tuning, prompt engineering, deployment), RAG, and developing agentic workflows
- Proven track record of designing and delivering large-scale distributed systems (sharding, partitioning, concurrency)
- Exceptional ability to write clean, elegant, performant, and well-tested code, coupled with a proactive mindset for delivering results quickly
- A thorough understanding of engineering practices, including effective peer code reviews, resilient architecture design, and comprehensive testing paradigms
- Prior experience in a Principal or Staff level engineering role, demonstrating technical leadership and mentorship capabilities
- Direct experience building, deploying, and managing LLMs in a production environment
- Prior experience in the cybersecurity, intelligence, or high-compliance industries
- Contributions to open-source projects related to data or AI/ML
Benefits
- Market leader in compensation and equity awards
- Comprehensive physical and mental wellness programs
- Competitive vacation and holidays for recharge
- Paid parental and adoption leaves
- Professional development opportunities for all employees regardless of level or role
- Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
- Vibrant office culture with world class amenities
- Great Place to Work Certified™ across the globe
- Eligibility for bonuses, equity grants and a comprehensive benefits package that includes health insurance, 401k and paid time off
- Variable/incentive compensation + equity + benefits
Company Overview
CrowdStrike is a cybersecurity technology firm that provides cloud-delivered protection for cloud workloads, identity, and data. It was founded in 2011, and is headquartered in Sunnyvale, California, USA, with a workforce of 5001-10000 employees. Its website is http://www.crowdstrike.com.