Bilingual AI Response Evaluator
Obsidian
Job Description
PLEASE NOTE: You must take the Bilingual Competency interview in Russian to be considered for this role.
Location
Global
Type
Contract Work
Fluent Language Skills Required
Russian (native fluency) and English (strong proficiency)
Why this role matters
Your job is to assess Russian-language AI-generated responses and identify their specific strengths and areas for improvement. Your work will be used to create the “perfect AI-generated response” at a later stage of this project. Note that the analysis you write will be in English.
What You'll Do
- Conduct fact-checking using trusted public sources and external tools
- Generate high-quality human evaluation data by identifying response strengths, areas for improvement, and factual inaccuracies
- Assess reasoning quality, clarity, tone, and completeness of responses
- Ensure model responses align with expected conversational behav...