R

Senior AI Software Engineer

Ruby Labs · Anywhere

🔥31 people viewed this job

About the Role

This a Full Remote job, the offer is available from: European Union, Turkey, North Macedonia, Montenegro, Serbia, Georgia (USA) About usRuby Labs is a leading tech company that creates and operates innovative consumer products. We offer a diverse range of opportunities across the health, education, and entertainment industries. Our innovative teams are driving the future of consumer-led products, and we're always looking for passionate individuals to join us. Learn more about our story at: https://rubylabs.com/about-us/ About the roleAt Ruby Labs we are looking for a Senior AI Engineer to own and drive the quality, reliability, and evolution of our AI systems in production. This is a high-ownership role. You will be responsible for end-to-end delivery of major AI features, production stability of AI systems, and data-driven experimentation using tools like Langfuse, Mixpanel and OpenRouter. You'll work in a modern stack built on Next.js, TypeScript, Node.js, and Redis, collaborating closely with product, growth, data, and billing teams. Increasingly, this includes building agentic, tool-using AI systems — defining clean tool contracts (including MCP-based tools) and orchestrating how AI interacts with internal services and business systems. Our engineering organization uses a squad-based structure. You will operate within an AI engineering squad, contributing as a senior technical voice and driving engineering quality within your area of the product. Key ResponsibilitiesTake complete ownership and deliver major AI engineering features within agreed timelines. Own AI output quality, structure, and predictability across all user-facing AI interactions. Design, implement, and maintain output-type-based AI systems, including segmentation, routing, and enforcement. Ensure consistent output structure and formatting across different LLMs for the same request type. Integrate and orchestrate multiple LLM providers via OpenRouter, managing model selection, fallback strategies, and cost optimisations. Design and orchestrate tool-using and agentic AI workflows, defining clean tool contracts (including MCP-based tools), function-calling interfaces, and reliable AI-to-system integrations. Build and maintain complex, multi-step LLM workflows, including with orchestration frameworks such as LangChain or LlamaIndex, for advanced reasoning, context reuse, and retrieval. Design and manage production prompt systems with dynamic prompting, context injection, and conditional logic. Own the deployment and release of LLM experiments, prompt management, and Langfuse-based evaluation pipelines. Run A/B tests across models, analyse results, and present data-driven impact assessments of AI features and experiments. Monitor AI system metrics, quality signals, latency, and release health using Langfuse and other observability tools. Deep-debug complex LLM chains using Langfuse traces, identifying bottlenecks and optimising for cost, latency, and context-window usage, and build output-scoring systems to root-cause hallucinations and logic errors. Write clean, scalable, and maintainable TypeScript code across the Next.js and Node.js stack. Build reliable backend logic for AI systems, with strong error handling, request validation, fallback flows, and predictable behaviour in production, including reliable tool execution and AI-to-service integrations. Ensure high code quality through testing, code reviews, and clear engineering standards. Monitor, troubleshoot, and improve production performance, reliability, and system health. Drive maintainability and technical quality through solid architecture, refactoring, and disciplined release practices. Qualifications6+ years of backend/full-stack software engineering experience, including production-grade TypeScript/Node.js. Experience with Next.js and/or Python is a plus. 2+ years of experience building AI/LLM systems in production. Less experience may be considered for exceptional candidates. Deep hands-on experience working with LLM APIs (OpenAI, Anthropic, or similar) in production environments. Experience with Agentic AI, multi-agent orchestration, tool-based workflows (function calling/tool execution), and/or RAG pipelines, including indexing, retrieval, and re-ranking. Experience with LLM observability tools such as Langfuse, LangSmith, or similar platforms. Experience with AI gateways and model routing solutions, such as OpenRouter or equivalent technologies. Solid understanding of Redis and relational databases, such as PostgreSQL. Exceptional ownership mindset and personal responsibility for engineering quality and delivery. Nice to haveExperience with AI-centered development tools such as Cursor, Claude Code, Windsurf, or similar platforms. Familiarity with evaluation frameworks, including LLM-as-a-judge, RAGAS, or similar approaches. Experience working in high-pressure startup environments with rapid product iteration cycles. Experience wit

Ruby Labs has 2 open positions on Remote Vibe Coding Jobs.

💬 Developer Questions

Ask the team a question — answers show up here

🎯

What does the interview process look like?

🤖

What AI/vibe coding tools does the team use daily?

👥

How big is the engineering team?

Is the team fully async or are there required meetings?

🚀

What does onboarding look like for remote hires?

🔧

Can you share more about the tech stack and architecture?

📈

What does career growth look like in this role?

📅

What does a typical day look like?

💰

Is there a salary range you can share?

📊

Is equity or stock options part of the package?

🌍

Are there timezone requirements or preferences?

🛂

Do you sponsor work visas?

🏢 Is this your listing? Claim it to answer questions

Similar Jobs

Helpful resources

Hiring for a similar role? Post your job here — it's free →