J

Machine Learning Engineer - Large Language Models

John Snow Labs · Remote

Full-timeLeadPyTorch

About the Role

This a Full Remote job, the offer is available from: Anywhere Company DescriptionJohn Snow Labs is an award-winning AI and NLP company, accelerating progress in data science by providing state-of-the-art software, data, and models. Founded in 2015, it helps healthcare and life science companies build, deploy, and operate AI products and services. John Snow Labs is the winner of the 2018 AI Solution Provider of the Year Award, the 2019 AI Platform of the Year Award, the 2019 International Data Science Foundation Technology award, and the 2020 AI Excellence Award. John Snow Labs is the developer of Spark NLP - the world's most widely used NLP library in the enterprise - and is the world's leading provider of state-of-the-art clinical NLP software, powering some of the world's largest healthcare & pharma companies. John Snow Labs is a global team of specialists, of which 33% hold a Ph.D. or M.D. and 75% hold at least a Master's degree in disciplines covering data science, medicine, software engineering, pharmacy, DevOps and SecOps. Job DescriptionWe are seeking a highly skilled and experienced Machine Learning Engineer to join our team. The ideal candidate will have a robust background in machine learning, natural language processing (NLP), and Large Language Models (LLMs). The role focuses on training, tuning, and evaluating AI models as part of a diverse team which includes professionals from software engineering, data science, and medicine. QualificationsKey Responsibilities: • Adapt LLMs to diverse healthcare use-cases using techniques such as Sparse Fine-Tuning (SFT), Prompt Engineering Fine-Tuning (PEFT), Direct Parameter Optimization (DPO), and Proximal Policy Optimization (PPO). • Optimize LLMs for Retriever-Augmented Generation (RAG) to enhance decision-making and information retrieval capabilities. • Collect, clean, and refine healthcare datasets for training LLMs to ensure high-quality data provisioning. • Convert models into various formats suitable for production environments, ensuring their readiness for real-world application. Requirements: • 5+ years of hands-on professional experience in software engineering, building production-grade deep learning solutions. • An academic degree in computer science, data science, or a related degree. M.Sc. or Ph.D. degree is strongly preferred. • Demonstrated expertise in model tuning frameworks like Axolotl. • Familiarity with model serving frameworks, including vLLM, TGI, and llama-cpp, to support the deployment and scalability of machine learning models. • Knowledge of model quantization techniques and frameworks to optimize AI models for performance in resource-constrained environments. • Hands-on experience with Transformer architectures and proficiency in machine learning frameworks such as PyTorch. This role offers an opportunity to be at the forefront of technological advancement in AI and healthcare, contributing to innovations that advance the healthcare, life science, and open-source AI communities. When applying, please add a cover letter which includes the words 'John Snow Labs' and explains your academic, professional ML engineering, and recent LLM engineering experience. Additional InformationWhat John Snow Labs offers: A fully virtual company, collaborating across 28 countriesCompetitive package and compensation planIndustry leader and respected brand nameLearning and developmentWe are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background create a better work environment and better products. Whatever your identity we will give your application fair consideration. This offer from "John Snow Labs" has been enriched by Jobgether.com and got a 82% flex score.

💬 Developer Questions

Ask the team a question — answers show up here

🎯

What does the interview process look like?

🤖

What AI/vibe coding tools does the team use daily?

👥

How big is the engineering team?

Is the team fully async or are there required meetings?

🚀

What does onboarding look like for remote hires?

🔧

Can you share more about the tech stack and architecture?

📈

What does career growth look like in this role?

📅

What does a typical day look like?

💰

Is there a salary range you can share?

📊

Is equity or stock options part of the package?

🌍

Are there timezone requirements or preferences?

🛂

Do you sponsor work visas?

🏢 Is this your listing? Claim it to answer questions

Similar Jobs

Helpful resources

Hiring for a similar role? Post your job here — it's free →