Data Scientist Job at Mercor, New York, NY

U3FmbjJSTHFiTzQ0TmpBQmIyWWVldDJWL2c9PQ==
  • Mercor
  • New York, NY

Job Description

Job Description: AI Task Evaluation & Statistical Analysis Specialist

Role Overview

We're seeking a data-driven analyst to conduct comprehensive failure analysis on AI agent performance across finance-sector tasks. You'll identify patterns, root causes, and systemic issues in our evaluation framework by analyzing task performance across multiple dimensions (task types, file types, criteria, etc.).

Key Responsibilities

  • Statistical Failure Analysis : Identify patterns in AI agent failures across task components (prompts, rubrics, templates, file types, tags)

  • Root Cause Analysis : Determine whether failures stem from task design, rubric clarity, file complexity, or agent limitations

  • Dimension Analysis : Analyze performance variations across finance sub-domains, file types, and task categories

  • Reporting & Visualization : Create dashboards and reports highlighting failure clusters, edge cases, and improvement opportunities

  • Quality Framework : Recommend improvements to task design, rubric structure, and evaluation criteria based on statistical findings

  • Stakeholder Communication : Present insights to data labeling experts and technical teams

Required Qualifications

  • Statistical Expertise : Strong foundation in statistical analysis, hypothesis testing, and pattern recognition

  • Programming : Proficiency in Python (pandas, scipy, matplotlib/seaborn) or R for data analysis

  • Data Analysis : Experience with exploratory data analysis and creating actionable insights from complex datasets

  • AI/ML Familiarity : Understanding of LLM evaluation methods and quality metrics

  • Tools : Comfortable working with Excel, data visualization tools (Tableau/Looker), and SQL

Preferred Qualifications

  • Experience with AI/ML model evaluation or quality assurance

  • Background in finance or willingness to learn finance domain concepts

  • Experience with multi-dimensional failure analysis

  • Familiarity with benchmark datasets and evaluation frameworks

  • 2-4 years of relevant experience

Job Tags

Remote job,

Similar Jobs

Augusta University

Cyber Security Training Developer Job at Augusta University

 ...Cyber Security Training Developer Job ID: 292890 Location: Augusta University Full/Part Time: Full Time Regular/Temporary...  ...Linux+; Cisco CCNA/CyberOps; AWS Cloud Practitioner/Associate; entry-level GIAC). Preferred Qualifications IT or cyber related... 

West Valley-Mission Community College District

Full-Time Biology Instructor Job at West Valley-Mission Community College District

 ...Full-Time Biology Instructor Closing Date: 02/17/2026 Definition: The Department of Biology is seeking a talented individual to fill the position of Cell and General Biology full-time instructor . West Valley College is part of the West Valley-Mission Community... 

Driscoll Foods

Class A/B Delivery Driver Job at Driscoll Foods

Join Driscoll Foods Day Warehouse for the opportunity to build a career, make great money, and become involved with a growing company! Description of Essential Job Functions: Able to work a flexible schedule including any of the shifts (Day, night, weekend). Unload...

Project Solutions Inc.

Certified Wastewater Treatment Operator Job at Project Solutions Inc.

 ...with occasional on-call duties Are you a hands-on Wastewater Treatment Plant Operator with a knack for keeping things running smoothly...  ...environment. Must pass a pre-employment background check Water and Wastewater Certification Preferred. Excellent... 

The Shella Foundation

Data Entry Assistant Job at The Shella Foundation

 ...initiatives, we empower individuals to live independently in their homes. Our work also inspires families to advocate for accessible, high-...  ...Enter data accurately into spreadsheets or online systems Update and maintain existing records Check...