AI Evaluation Specialist
In this role, you'll apply your expertise to help train next-generation AI systems. Your work will shape how models learn, reason, and perform through high-quality, real-world input.
Key Responsibilities
* Design and implement self-contained evaluation tasks, including prompts, supporting files, and detailed grading rubrics to assess AI performance on practical computer-based workflows.
* Define clear, unambiguous written criteria for what constitutes successful and unsuccessful task completion across diverse administrative and workflow scenarios.
* Meticulously observe and document AI agent behaviors, producing crisp, precise summaries and reports in high-quality English.
* Iterate and refine evaluation tasks and rubrics based on feedback and team collaboration to ensure robust benchmarking methodologies.
* Work cross-functionally across a wide range of domains, adapting evaluation frameworks as project requirements evolve.
* Collaborate with the customer's team to share insights and help drive continuous improvement in AI evaluation techniques.
* Champion meticulousness, structured observation, and clear written communication throughout all project deliverables.
Required Skills and Qualifications
* Minimum 3 years of experience in roles emphasizing written precision and structured thinking, such as paralegal, executive assistant, junior analyst, librarian, document archival specialist, research assistant, technical writer, or QA analyst.
* Native or fluent in English writing, with a demonstrated ability to produce observations that are succinct, specific, and unambiguous.
* Proven skill in designing or applying rubric-based evaluation, grading against set criteria, or building structured scoring frameworks.
* High attention to detail and ability to notice subtle patterns or inconsistencies others might miss.
* Exceptional written and verbal communication skills, especially for documenting nuanced observations and feedback.
* Fluency in navigating computers, common SaaS tools, web browsers, file management, and document editing platforms.
* Strong self-direction, with the ability to independently take ownership of ambiguous or loosely defined projects.
Recommended Jobs
Physician Liaison
Job Description Job Description FYZICAL Therapy & Balance Centers is seeking a full-time Physician Liaison to join our team at our Webster, TX, location! The Physician Liaison works closely wit…
Sr Application Developer - Charles River Development (CRD)
Client: Wealth Management Position: Sr Application Developer - Charles River Development (CRD) Compensation: $140K + Bonus Location: First 90 days on site and then 1 day a week WFH …
Direct Sales Representative - San Bernardino, CA
KOMPAN U.S. is looking for a Direct Sales Representative to function as the region's commercial playground and outdoor fitness equipment consultant. This position will promote and sell KOMPAN pr…
Staff Embedded & Control Systems Engineer
It's Time to Join Stryker! We are seeking a Staff Embedded & Control Systems Engineer to design and develop embedded software and control systems for medical devices. In this role, you will contri…
Conveyor Technician
Job Title: Mechatronics and Robotics Tech - Pay: $32.75/Hour Job Description As a Mechatronics & Robotics Technician, you will play a crucial role in supporting the Operations Maintenance tea…
CDL Regional Tanker/HazMat Driver
CDL A Hazmat Local Driver - QUANTIX LIQUIDS: Qualifications : ~ Safe, Dependable, and Punctual driver. ~2-years of Tanker and 1-year HazMat experience. ~ Understands and speaks Englis…
Junior Construction Engineer
Job Title: Steel Project Engineer Are you looking for an opportunity to build your career in steel construction ? Join our team and work on exciting commercial and industrial projects supportin…
Lube Technician
Loader Operator Alamo Concrete Products Company, ACP, is headquartered in San Antonio, TX with 8 Divisions throughout the Greater Central/South Texas Region. ACP produces and delivers concrete and o…