UK, England / Full-time / Hybrid
ActiveFence is seeking a motivated and detail-oriented professional to play a key role in shaping the future of technology as a Generative AI Tools Researcher. This is your chance to work at the forefront of innovation, analyzing emerging content trends and helping leading foundation models stay ahead of the curve while prioritizing safety and reliability.
In this role, you’ll collaborate with experts across diverse fields such as Hate Speech, Misinformation, Intellectual Property and Copyright, and Child Safety, contributing to meaningful advancements in AI. You’ll also oversee complex workflows with multiple stakeholders to deliver high-quality data used to evaluate cutting-edge AI models, including Large Language Models (LLMs), Text-to-Image, Text-to-Video, and more.
Responsibilities:
Must-Have Qualifications:
Preferred Qualifications:
ActiveFence is a mission-oriented company that proactively protects online users and platforms from harmful content, disinformation, and coordinated manipulation. We are part of the Trust & Safety industry, providing a critical layer of defense against dangerous online activity. Our technology detects and analyzes problematic content across multiple platforms, enabling our customers to stay one step ahead of bad actors.