Datagen

AI & Data Platforms Dual-Use Technology Founded 2018

Datagen is an Israeli synthetic data startup that developed a simulation platform generating photorealistic synthetic visual data for training computer vision and AI models, acquired by Unity Technologies in 2022 for its applications in autonomous systems and defense.

Visit Website

Company Overview

Datagen is a Tel Aviv-based synthetic data company, founded in 2018, that built a simulation platform capable of generating massive volumes of photorealistic synthetic visual data for training AI and computer vision models. The platform creates labeled synthetic images and videos of humans, environments, and objects with pixel-perfect annotations, eliminating the need for expensive and time-consuming manual data collection and labeling.

The technology enables AI developers to create diverse, balanced, and bias-free training datasets at scale, addressing critical challenges in computer vision model development including edge cases, rare scenarios, and privacy-compliant training data. Unity Technologies acquired Datagen in 2022 to integrate synthetic data generation into its simulation platform, recognizing the strategic importance of synthetic data for AI training across industries.

Dual-use relevance is very high: synthetic visual data generation is critical for both commercial AI development and defense applications including training autonomous vehicle perception systems, generating synthetic imagery for military simulation and training, creating training data for object recognition in surveillance systems, developing synthetic environments for military wargaming, and training AI models for satellite imagery analysis without using classified data. The technology enables defense organizations to develop AI capabilities without exposing sensitive real-world training data. Datagen raised over $50M in funding from investors including Viola Ventures, TLV Partners, and Innovation Endeavors before its acquisition.

Dual-Use Assessment

Synthetic visual data generation for AI training serves both commercial computer vision development and defense applications including military simulation, autonomous systems training, and classified AI model development.

Key Technologies

  • Photorealistic synthetic visual data generation
  • Simulation platform for diverse scenario creation
  • Pixel-perfect automatic annotation and labeling
  • Parametric human and environment model generation
  • Bias-free and privacy-compliant synthetic dataset creation

Use Cases & Applications

  • Training autonomous vehicle perception systems with synthetic edge cases
  • Generating synthetic imagery for military simulation and training
  • Training surveillance and reconnaissance AI without real imagery
  • Creating synthetic environments for military wargaming and scenario planning
  • Developing AI models for satellite imagery analysis using synthetic data

Strategic Value to U.S.-Israel Alliance

Synthetic visual data platform enabling defense AI development without classified data exposure, critical for military autonomous systems and simulation applications.

Interested in this startup?

Learn more about our investment approach or get in touch to discuss opportunities in dual-use technology.