Deep AI Due Diligence

For investors backing companies that buildAI. When the thesis rests on the model itself, the question is simple: is the AI a genuine, defensible advantage - or a thin wrapper around someone else's? We answer that with evidence, across the full model lifecycle.

~40%

of European 'AI' startups showed no evidence of material AI use (MMC Ventures, State of AI, 2019)

>80%

of AI projects fail - about twice the rate of non-AI IT projects (RAND Corporation, 2024)

>280x

fall in the cost to run a GPT-3.5-quality query in ~18 months - model access is commoditizing (Stanford AI Index, 2025)

50-60%

typical AI gross margins, vs. 70-90% for classic SaaS (a16z, 2020)

Is the AI defensible?

Defensibility - moat or wrapper?

  • Whether core value depends on proprietary models, proprietary data, or just a public API
  • Strength of any proprietary data advantage and how it compounds with usage
  • Real switching cost for a competitor to replicate the capability
  • Model-provider concentration and lock-in - exposure to their pricing, terms, and rate limits

IP & data provenance - who owns it?

  • Ownership of models, weights, prompts, agents, and fine-tuning datasets
  • Provenance and licensing of training data, including open-weight licence obligations
  • Copyright and IP exposure arising from training-data sources
  • Rights to use customer data for training, and the consent and privacy basis behind it

Is the ML real?

Datasets & data pipeline

  • Sourcing, curation, labeling quality, and deduplication
  • Train/eval contamination and leakage checks
  • Coverage, representativeness, and known gaps
  • PII/safety filtering and reproducibility of the data pipeline

Training & alignment

  • Base-model strategy: pre-train from scratch vs. adapt an existing model - and whether the spend/claims are credible
  • Supervised fine-tuning (SFT) data and method; parameter-efficient tuning (LoRA/QLoRA)
  • Preference optimization & alignment: RLHF (reward model + PPO), RLAIF, DPO
  • Whether the team has the preference data and infra to sustain a custom-model story

Evaluation & metrics

  • Offline benchmarks and task-specific metrics
  • Hallucination/factuality and safety evaluations
  • Human eval rigor and inter-rater agreement
  • Regression testing / CI for models, plus online metrics and A/B testing

Inference, cost & MLOps

  • Serving stack, latency, and quantization/optimization
  • Cost per request/token and how it scales - direct margin impact
  • Experiment tracking, data/model versioning, and retraining pipelines
  • Drift and production monitoring, and the data-flywheel/feedback loop

Deep AI Due Diligence is a specialist engagement, scoped to the target's stack and stage. Findings land in a board-ready report and red-flag list. Book a call and we'll scope it with you.

Investing in a company that simply uses AI? See SaaS Due Diligence.