Remote Language Model Evaluator at Mercor

Mercor • Toronto, Canada

Location Toronto, ON
Job Type Full-time
Posted May 26, 2026

Role Description

Elevate AI language models as a Language Model Evaluator with Mercor, working remotely. Conduct evaluations in English and Punjabi to enhance model performance and accuracy.
Mercor seeks a Generalist for a contract role focused on language modeling. This remote position requires native Punjabi proficiency and strong English writing ability. Your primary task is to generate high-quality evaluation data that assesses the reasoning quality, clarity, and factual accuracy of model responses.
Key Responsibilities:
• Conduct fact-checking using trusted public sources
• Generate evaluation data based on response quality
• Assess reasoning clarity and tone in model outputs
• Ensure model responses meet conversational guidelines
• Work independently to meet deadlines and improve AI performance
Requirements:
• Bachelor's degree in a relevant field
• Native Punjabi and strong English writing skills
• Significant experience with large language mod...
