Prompt Engineer, Agent Prompts & Evals

Remote, USA Full-time Posted 2025-11-24

About the position Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. We’re looking for prompt and context engineers to join our product engineering team to help build AI-first products, features, and evaluations. Your mission will be to bridge the gap between model capabilities and real product experience, working with product teams to build consistent, safe, and beneficial user experiences across all product surfaces. You will be deeply involved in new product feature and model releases at Anthropic, combining engineering expertise with an understanding of frontier AI applications and model quality. You’ll become an expert on Claude’s behavioral quirks and capabilities and apply that knowledge to deliver the best possible user experience across models and domains. You’ll be the first resource for product teams working on Claude’s AI infrastructure: system prompts, tool prompts, skills, and evaluations. This role requires someone who can effectively balance caring deeply about making Claude the best it can be while also supporting a wide variety of concurrent projects and efforts across many product teams. Responsibilities • Prompt Engineering Excellence: Design, test, and optimize system prompts and feature-specific prompts that shape Claude’s behavior across consumer and API products. • Evaluation Development: Build and maintain comprehensive evaluation suites that ensure model quality and consistency across product launches and updates. • Cross-functional Collaboration: Partner closely with product teams, research teams, and safeguards to ensure new features meet quality and safety standards. • Model Launch Support: Play a critical role in model releases, ensuring smooth rollouts and catching regressions before they impact users. • Infrastructure Contribution: Help build and improve the frameworks and tools that allow teams to develop and test prompts and features with confidence. • Knowledge Transfer: Mentor product engineers on prompt engineering best practices and help teams build their first evaluations. • Rapid Iteration: Work in a fast-paced environment where model capabilities advance daily, requiring quick adaptation and creative problem-solving. Requirements • 5+ years of software engineering experience with Python or similar languages. • Demonstrated experience with LLMs and prompt engineering (through work, research, or significant personal projects). • Strong understanding of evaluation methodologies and metrics for AI systems. • Excellent written and verbal communication skills – you’ll need to explain complex model behaviors to diverse stakeholders. • Ability to manage multiple concurrent projects and prioritize effectively. • Experience with version control, CI/CD, and modern software development practices. Nice-to-haves • Experience with Claude or other frontier AI models in production settings. • Background in machine learning, NLP, or related fields. • Experience with A/B testing and experimentation frameworks (e.g., Statsig). • Familiarity with AI safety and alignment considerations. • Experience building tools and infrastructure for ML/AI workflows. • Track record of improving AI system performance through systematic evaluation and iteration. Benefits • competitive compensation and benefits • optional equity donation matching • generous vacation and parental leave • flexible working hours • a lovely office space in which to collaborate with colleagues Apply tot his job Apply To this Job

Apply Now

Prompt Engineer, Agent Prompts & Evals

Similar Jobs

Experienced Customer Service Representative – Work From Home Opportunity at arenaflex

Freelance CRM Analytics Specialist Data Intelligence · Atlanta

Experienced Entry-Level Data Entry Clerk – Remote Work Opportunity at arenaflex

Advisor, Government Relations (20-month term)

Experienced Data Entry Clerk – Remote Work Opportunity with arenaflex

Experienced Virtual Customer Service Representative – Remote Work Opportunity with arenaflex for Career Growth and Development

UT Dallas Data Analyst Entry Level Opportunity

[Remote] Sr. Product Manager III (6274)

Manager/Senior Manager, Pharmacovigilance Operations

Part time jobs for students very simple just typing in notepad

Licensed School Social Worker!

[Work From Home] UPS Remote Jobs (Data Entry| Full Time) $260/Day

Senior Software Engineer, Machine Learning, Google Cloud Business...

Apply Now: Front Office VIP Coordinator Â Full Time, $33.44/Hour

Experienced or Entry-Level Remote Data Entry Specialist for Logistics and Supply Chain Management at blithequark

Experienced Overnight Customer Support Specialist – Pet Insurance Industry Leader

Remote Sales Specialist

Security Researcher, Malware Triage; Remote

fb content moderator job (Work From Home Remote)

Experienced Data Entry Specialist - Remote Opportunity at blithequark

Prompt Engineer, Agent Prompts & Evals

Similar Jobs

**Experienced Customer Service Representative – Work From Home Opportunity at arenaflex**

Freelance CRM Analytics Specialist Data Intelligence · Atlanta

**Experienced Entry-Level Data Entry Clerk – Remote Work Opportunity at arenaflex**

Advisor, Government Relations (20-month term)

**Experienced Data Entry Clerk – Remote Work Opportunity with arenaflex**

Experienced Virtual Customer Service Representative – Remote Work Opportunity with arenaflex for Career Growth and Development

UT Dallas Data Analyst Entry Level Opportunity

[Remote] Sr. Product Manager III (6274)

Manager/Senior Manager, Pharmacovigilance Operations

Part time jobs for students very simple just typing in notepad

Licensed School Social Worker!

[Work From Home] UPS Remote Jobs (Data Entry| Full Time) $260/Day

Senior Software Engineer, Machine Learning, Google Cloud Business...

Apply Now: Front Office VIP Coordinator Â Full Time, $33.44/Hour

Experienced or Entry-Level Remote Data Entry Specialist for Logistics and Supply Chain Management at blithequark

**Experienced Overnight Customer Support Specialist – Pet Insurance Industry Leader**

Remote Sales Specialist

Security Researcher, Malware Triage; Remote

fb content moderator job (Work From Home Remote)

**Experienced Data Entry Specialist - Remote Opportunity at blithequark**

Experienced Customer Service Representative – Work From Home Opportunity at arenaflex

Experienced Entry-Level Data Entry Clerk – Remote Work Opportunity at arenaflex

Experienced Data Entry Clerk – Remote Work Opportunity with arenaflex

Apply Now: Front Office VIP Coordinator Â Full Time, $33.44/Hour

Experienced Overnight Customer Support Specialist – Pet Insurance Industry Leader

Experienced Data Entry Specialist - Remote Opportunity at blithequark