Writer | $36.16/hr Remote
Crossing Hurdles • Remote, Canada
Role Description
- Evaluate LLM-generated responses for accuracy, relevance, and effectiveness across a wide range of topics.
- Perform fact-checking using reliable public sources and external tools.
- Create high-quality human evaluation data by annotating response strengths, gaps, and factual errors.
- Assess reasoning quality, tone, clarity, and completeness of AI-generated outputs.
- Ensure responses follow expected conversational behavior and system guidelines.
- Apply consistent annotations using defined taxonomies, benchmarks, and evaluation frameworks.
Requirements
- Native-level or near-native fluency in French (ILR 5 / CEFR C2) with strong English proficiency.
- Proven experience using large language models and understanding real-world LLM use cases.
- Excellent writing skills with the ability to provide clear, nuanced, and structured feedback.
- Strong attention to detail and analytical thinking. ...