Discover 3 key automations to optimize your moderation efforts Read 3 Essential Automations for Smarter Moderation
Improve your detection and simplify moderation - in one AI-powered platform.
Stay ahead of novel risks and bad actors with proactive, on-demand insights.
Proactively stop safety gaps to produce safe, reliable, and compliant models.
Deploy generative AI in a safe and scalable way with active safety guardrails.
Online abuse has countless forms. Understand the types of risks Trust & Safety teams must keep users safe from on-platform.
Protect your most vulnerable users with a comprehensive set of child safety tools and services.
Our out-of-the-box solutions support platform transparency and compliance.
Keep up with T&S laws, from the Online Safety Bill to the Online Safety Act.
Over 70 elections will take place in 2024: don't let your platform be abused to harm election integrity.
Protect your brand integrity before the damage is done.
From privacy risks, to credential theft and malware, the cyber threats to users are continuously evolving.
Stay ahead of industry news in our exclusive T&S community.
Make compliance an integral part of your Trust & Safety posture. From contextual AI to custom insights, ActiveFence assists leading Trust & Safety teams with compliance amidst a turbulent regulatory climate.
The Digital Services Act aims to create a single, frictionless set of rules for digital service providers to operate in when conducting business in each of the EUʼs 27 member states.
The DSA requires online platforms to establish both ad-hoc and ongoing Trust & Safety mechanisms, in accordance with the active EU user-base. Failure to comply risks heavy fines, which can add up to 6% of annual turnover. In the case of repeat breaches, companies risk a ban in the entire EU.
ActiveFence provides both automatic operational tools and tailored intelligence solutions to support DSA compliance, including:
Australia’s Online Safety Act requires platforms to proactively protect their users from harmful content. Platforms must detect and remove illegal content within 24 hours of notification. This includes cyberbullying , CSAM, and terrorist content accessible to users in Australia, regardless of where it is hosted.
Platforms that fail to comply with mandatory reporting obligations risk civil penalties, or fines of up to $555,000 AUD (c. $369,270 USD).
As part of an international trend in new and stricter national accountability standards for online platforms, Ireland, Singapore and Turkey have also recently enacted online safety regulations. Platforms which are subject to these laws face new requirements for accountability and risk stringent penalties for non-compliance.
ActiveFence assists platforms with accurate detection of illegal content, automated actioning, and proactive threat intelligence, in over 100 languages and across 14+ abuse areas, such as child safety, cyberbullying, hate speech, misinformation, and terrorism.
The UK’s Online Safety Act has passed its final parliamentary debate, and is set to become law. Platforms are required to conduct risk assessments, remove illegal content, and implement measures to protect children from online harm. Social media platforms would need to remove illegal content, take down material in breach of platform policies, and demonstrate enforcement of user age limits to mitigate circumvention. This includes taking preventative measures to tackle child sexual exploitation and abuse (CSEA).
Platforms will risk fines; tech executives will risk fines, imprisonment, or both. Maximum fines for non-compliance will be the greater of— (a) £18 million (c.$21.7 million USD), and (b) 10% of the person’s qualifying worldwide revenue for the person’s most recent complete accounting period.
ActiveFence specializes in safeguarding children online, equipped with a dedicated team of researchers and years of experience. Our comprehensive suite of solutions includes AI-powered Child Safety models, proactive risk reporting, and red teaming to uncover platform loopholes that bad actors can exploit. Choose one or all of our solutions to ensure compliance and protect children from harm.
The EU’s enactment of TERREG prohibits material that incites or solicits terrorist offenses, including the provision of instructions or techniques for committing such offenses; and content inciting or soliciting participation in a terrorist group.
Platforms are required to remove or disable terrorist content within one hour upon receipt of notice. Platforms must designate points of contact to carry out removal orders. Platforms that fail to comply risk fines of up to 4% of their global turnover.
ActiveFence’s dedicated research team for violent extremism has years of experience in detecting terrorist content across geographies, and with contextual nuance. Our comprehensive solutions include terrorism AI models in 100+ languages, enabling quick action on terrorist violations, as well as proactive intelligence reporting to shed light on the most sophisticated threat actor methodologies.
As of February 2022, online platforms with over 2 million users in Germany must investigate and delete flagged content within 24 hours upon receipt of notice. Specifically, platforms are obligated to delete and report content that violates criminal law. Failure to comply risks fines of up to €50 million (c. $53.3 million USD).
ActiveFence’s AI-based detection solution triggers automatic actioning, such as removal, suspension, and escalation, allowing you to quickly tackle criminal violations. With our content moderation platform, you can create prioritized queues based on risk score and violation type, including those falling under NetzDG, so you can handle high-risk items first.
The Republic of India’s Information Technology Rules, 2021 require social media companies to remove content that is libelous, racially or ethnically objectionable, and “knowingly and intentionally communicates any information which is patently false or misleading in nature but may reasonably be perceived as a fact” within 36 hours of notification. Platforms are required to reveal the identity of the user who shared the content.
ActiveFence offers a range of detection solutions for harmful content, including profanity, hate speech, misinformation and disinformation. Our Risk Score Engine provides AI-based detection with 99%+ accuracy, triggering automated actioning to meet the 36-hour requirement.
*ActiveFence is not a legal service provider and does not provide legal advice. The content on this page is intended for informational purposes only.
Starting February 2024, the DSA is applicable for all online platforms active in the EU. Are you ready?
The much awaited Online Safety Act has been passed in the UK. Learn about its core requirements, and how to prepare.
Transparency reports are a core requirement of the DSA. Here's everything you need to know to get started.