• 5th floor, CT1, Building C14 Bac Ha, To Huu street, Dai Mo ward, Hanoi, Vietnam

  • The industry:

    Legal

  • Duration:

    3 months

Introduction

For non-governmental organizations (NGOs) focused on public safety and digital ethics, monitoring the vast landscape of the internet for illegal activity is a monumental task. We developed a specialized Law Violation Detection system designed to identify and classify illegal online gambling advertisements. Over a high-impact 3-month project, we built a sophisticated pipeline that monitors digital platforms in real-time to ensure compliance with government regulations and protect vulnerable populations from predatory content.

 

Challenges

The sheer scale and speed of digital advertising make manual oversight impossible for NGOs:

  • Multi-Platform Proliferation: Illegal ads appear across diverse ecosystems, including Facebook, YouTube, Instagram, TikTok, and thousands of independent websites.
  • Content Complexity: Violations are not just text-based; they are often hidden within images and videos, requiring more than simple keyword filtering.
  • Evolving Tactics: Advertisers frequently change formats and hosting domains to evade detection, requiring a system that can adapt to new content types quickly.
  • Manual Bottlenecks: Monitoring these platforms manually is labor-intensive, slow, and highly prone to human error or oversight.

 

Solution

Our solution combined automated data harvesting with multimodal AI to create a comprehensive detection engine.

Core Technical Implementation:

  • Automated Data Crawling: Using Selenium and Virtual Proxies, the system autonomously scrapes ads from social media and web platforms while bypassing geographical and technical restrictions.
  • Multimodal Violation Detection: We deployed Vision-Language Models (VLM) and NLP to analyze the relationship between text, imagery, and video context. This allows the system to "understand" the intent of an ad, even if explicit keywords are avoided.
  • Orchestration & Workflow: The entire pipeline is managed via Apache Airflow, ensuring that data crawling, processing, and analysis tasks are executed reliably and at scale.
  • Advanced Classification: The AI doesn't just flag content; it classifies the specific type of violation, providing NGOs with structured data for reporting to regulatory bodies.

 

Results

The 3-month implementation provided the NGO with a powerful tool for digital advocacy and enforcement:

  • Accurate, Automated Detection: Dramatically reduced the time to identify illegal gambling ads compared to manual monitoring.
  • Comprehensive Platform Coverage: Established a singular monitoring point for all major social media and video platforms.
  • Reduced Manual Effort: Freed up NGO staff to focus on advocacy and legal action rather than tedious data collection.
  • Scalable & Future-Proof: The system is designed to adapt to new content formats and emerging social platforms, ensuring long-term utility as the digital landscape evolves.

Get in touch with experts