Back to Jobs

QA Engineer – AI Systems

Remote, USA Full-time Posted 2026-05-07
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Everest Technologies, is seeking the following. Apply via Dice today! We are seeking a QA Engineer with a strong background in API testing and LLM fine-tuning/evaluation . You will be responsible for the quality assurance of our Agent Mesh infrastructure, ensuring that the correctly translate enterprise business logic into machine-readable actions. Your goal is to ensure that AI agents interact with our reliably, securely, and without hallucinating tool calls. Key Responsibilities • AI Tool Validation: Test the accuracy of by verifying that LLMs correctly interpret OpenAPI specifications and trigger the right C#/.NET backend logic. • Fine-Tuning Data Preparation: Curate and clean high-quality datasets (JSON/JSONL) in Python to fine-tune models for specific domain tasks and tool-calling accuracy. • Prompt Regression Testing: Develop automated test suites to ensure that updates to underlying APIs or MCP servers do not break the reasoning or planning capabilities of the AI agents. • Security & Auth QA: Validate that in Gravitee correctly enforce OAuth 2.1 and OpenFGA, preventing unauthorized data leakage through agent conversations. • Performance Testing: Use to measure latency in the agent-to-API loop and identify bottlenecks in MCP server responses. Technical Qualifications • API Testing Mastery: Expert knowledge of REST, OpenAPI, and tools like Postman or Insomnia. • Scripting: Proficiency in Python (for data processing and eval frameworks) and familiarity with C# (to understand backend MCP implementation). • LLM Evaluation: Experience with frameworks like DeepEval, Ragas, or LangSmith to measure model performance (faithfulness, relevancy, and tool-call precision). • API Management: Hands-on experience with or similar gateways to monitor and intercept traffic. • Model Context Protocol: Understanding of and how it standardizes the way LLMs access external data. Preferred Skills • Experience with Red Teaming AI agents to identify prompt injection vulnerabilities. • Knowledge of Vector Databases and how RAG (Retrieval-Augmented Generation) interacts with live API tools. • Familiarity with GitHub Actions for CI/CD integration of AI evaluation pipelines. Apply tot his job Apply To this Job

Similar Jobs

Remote Excel Data Entry Jobs - Work Online From Anywhere

Remote, USA Full-time

Disney Remote Data Entry Jobs USA

Remote, USA Full-time

**Experienced Data Entry Specialist – Alternate Investments (Remote Opportunity)**

Remote, USA Full-time

Events and Catering Administrative Assistant

Remote, USA Full-time

E-commerce Account Manager

Remote, USA Full-time

EAP Worklife Analyst - Medicare Team

Remote, USA Full-time

**Experienced Ramp Agent (Customer Service Representative) - MSP**

Remote, USA Full-time

**Experienced Customer Service Representative – E-commerce Support and Resolution**

Remote, USA Full-time

**Experienced Online Customer Service Representative – Delivering Exceptional Arenaflex Customer Experiences**

Remote, USA Full-time

**Experienced Customer Service Representative – Work From Home Opportunity at arenaflex**

Remote, USA Full-time

Content Creator, Social - Temporary Worker

Remote, USA Full-time

Remote Healthcare Call Center Representative

Remote, USA Full-time

Experienced Remote Customer Service Representative – Work from Home Opportunity with Amazon

Remote, USA Full-time

Head of Go-to-Market

Remote, USA Full-time

Dealer Performance Manager- Automotive

Remote, USA Full-time

Authorization Specialist I

Remote, USA Full-time

**Experienced Data Entry Specialist – Remote Opportunity at arenaflex**

Remote, USA Full-time

Supply Chain Manager - Partnership Product Center (Remote)

Remote, USA Full-time

Licensed Life and Health Insurance Agent – Remote Position - Flexible Hours

Remote, USA Full-time

SMB Account Manager

Remote, USA Full-time