Research Science
/March 3, 2026
/7 min read

Why Synthetic Data is the Future of UX Research

The Fourth Paradigm

UX research has long relied on real participants, long recruiting cycles, and high cost. The Fourth Paradigm changes that by combining empirical rigor with simulation to deliver directional insight in minutes, not weeks.

For decades, user research has been rooted in empirical observation—a process of collecting primary data from real-life individuals that is often described as "painful and tedious". However, as noted by research recently published by Google and Ipsos, we are entering the "Fourth Paradigm" of scientific discovery: a new era where data exploration unifies theory, experimentation, and simulation.

The researchers from the aforementioned study indicate that synthetic data is not just a static script; it consists of computer-generated information that mimics the statistical properties and behavioral patterns of real users. Unlike traditional flat personas, synthetic data leverages what the researchers call "Silicon Samples" or "AI Twins"—synthetic respondents prompted with specific qualitative backstories to replicate the sentiments of unique, often hard-to-reach communities.

  • Velocity of Learning: Traditional recruitment takes an average of 14 days; synthetic data delivers insights in as little as 3 minutes.
  • Economic Scalability: By reducing participant incentives and recruitment fees, startups can now reduce testing costs by 30–67%.
  • Privacy by Design: Synthetic data provides a "safe practice field," allowing designers, researchers, and founders to test UX and UI patterns without risking sensitive user data or violating strict regulations like GDPR.

So can a Large Language Model (LLM) truly simulate a human user? A LLM alone cannot, as it largely represents only the average. If synthetic data is to be used for UX research, the answer for having any chance at simulating the complexity of real life human contexts lies deeper in the architecture.

For proper UX research, simulating human feedback is not just about a single data point or reaction. Rather it is conditioned on walking a "day-in-the-life" of your user’s shoes in order to build empathy and understand their journey.

Doing this is what enables UX researchers to understand how to intervene with the necessary design improvements for all the outliers, edges and anomalies which represent the human context.

Imagine if you could simulate all of your different user segments interacting with your designs in order to discover insights without the traditional cost and time constraints of recruiting real participants.

While DataDisco is not yet a replacement for human-centered design, it is using architectural depth and orchestration to enable a powerful UX research tool for rapid prototyping. So how does it work exactly?

DataDisco’s agentive and generative AI to simulate the human contexts of decision-making while interacting with digital interfaces to provide instant usability feedback.

Applying a "Think-Aloud" protocol, when you drop a prototype URL into DataDisco, AI persona agents navigate the website and explore the UI while verbalizing their internal thoughts.

E.g., "I’d expect the pricing button to be in the top navigation, not hidden in the footer".

This allows you to identify 80% of obvious usability flaws rapidly.

Newsletter

The DataDisco
Journal

Essays on synthetic research, agent workflows, and product strategy. Delivered when we have something worth reading.

No spam · Unsubscribe anytime

Get every new issue

Straight to your inbox, free forever.