Trust & Safety Evaluator with English from United Kingdom

TELUS Digital AI Data Solutions

🌍 100% Remote Full-time

Job Description

Trust & Safety Evaluators conduct adversarial testing and safety evaluation of generative AI features. The main tasks are crafting queries, evaluating the safety of generated content, and providing critical feedback. This role requires creative thinking about potential misuse, deep cultural and linguistic knowledge, and the ability to identify subtle safety risks.

Key Responsibilities

  • Write, review and evaluate diverse and challenging queries designed to test the system's limits and expose problematic outputs. Queries will target specific risk topics, including explicit and/or offensive content.
  • Design and execute sequences of queries simulating realistic, unfolding conversations.
  • Craft attack scenarios using techniques such as crescendo attacks and context manipulation to test the system.
  • Age-Appropriate Safety Evaluation: to guide adversarial query crafting and safety evaluation.
  • Assign risk ratings to AI-generated content bas...