02 / 20

It's 3:14 AM.
Your phone is screaming.

The average SRE stack costs $300K–$1M/year. PagerDuty fires 500+ alerts per week. Mean time to resolve: 47 minutes. Meanwhile, engineers burn out and quit.

$300K+

Annual SRE tooling spend (mid-size)

47 min

Industry average MTTR

500+

Alerts per week (typical)

40%

SRE burnout rate

The incumbents were built for dashboards and humans on-call. The world has moved to AI agents — but the infrastructure hasn't.

03 / 20

Why now?

AI Cost Curve

100x cheaper in 18 months

Frontier model costs dropped from $60/M tokens to $0.60. Multi-agent systems that were economically impossible in 2024 are now viable at scale.

Model Capabilities

Agents can reason, not just respond

Claude, Gemini, and GPT-4 can now follow multi-step playbooks, parse logs, and execute remediation with human-level judgment.

Market Shift

Observability → Autonomous Ops

Datadog and PagerDuty bolt on AI. We built for it from day one. The category is "Agentic SRE" and it doesn't exist yet.

04 / 20

The Multi-Agent OS
for SRE & DevOps

100+ AI agent personas organized into 16 specialized teams. 78 active in the production orchestrator. Each agent has a defined role, expertise domain, and tier-appropriate model assignment.

Active agent personas in orchestrator

Specialized teams

300+

Native integrations

Production safety gates

Incident Response

Autonomous Triage & Remediation

Incident Commander, Auto-Remediation, Escalation Manager, and SLA Guardian work together to resolve incidents without waking humans.

Observability AI

Anomaly Detection → Root Cause

Pattern Miner, Correlation Engine, and Root Cause Analyzer process metrics, logs, and traces to pinpoint issues in seconds.

Platform Governance

Policy, Quotas, Compliance

Policy Enforcer, Quota Manager, and Compliance Officer ensure every agent action meets organizational standards.

05 / 20

Three surfaces.
One platform.

WEB DASHBOARD

639 pages. Real-time war rooms.

React + TypeScript + Socket.IO. Full incident management, agent activity, integration health, cost tracking, and team dashboards. Every metric live via WebSocket.

NOVA CLI

120+ CLI tools. Ship from terminal.

Manage incidents, query agents, drill into metrics, manage runbooks, control escalations, and configure integrations — all without leaving the terminal.

NOVA SHELL AGENT

1,438 lines. 800+ metrics per tick.

Production bash agent for Linux, macOS, Windows. Collects CPU, memory, disk, network, GPU, Docker, and auto-discovers running services. Zero dependencies.

06 / 20

How we're different.

Capability	Datadog / PagerDuty	Single-agent wrappers	Nova AI Ops
Multi-agent orchestration	✗	1 agent	78 agents, 16 teams
Multi-model routing	✗	1 model	4 providers, 3 tiers, circuit breakers
Safety gates	Basic RBAC	✗	13 production gates
Post-remediation verification	✗	✗	T+5m/1h/24h probes, auto kill-switch
Digital-twin dry-run	✗	✗	Simulate before execute
Consensus voting	✗	✗	Multi-agent proposals, human override
Native integrations	700+	~20	300+ (323 connector files)
Pricing (10-seat team)	$200K+/yr	$5K–$20K/yr	$3,480/yr Standard

07 / 20

Competitive landscape.

We've analyzed 75 competitors. The market splits on two axes: AI-native vs bolt-on, and single-agent vs multi-agent OS.

AI-Native

Bolt-on AI

Single Agent

Multi-Agent OS

Datadog Bits AI

PagerDuty AIOps

Cleric AI

Causely

Nova AI Ops

08 / 20

Why this is defensible.

1
Multi-Agent Orchestration
78 specialized agents across 16 teams, each with defined roles, expertise, and model tiers. Competitors would need to build the orchestrator, the safety layer, and the agent library simultaneously.
2
13-Gate Safety Layer
Kill switch, prompt-injection defense, cost breaker, SLO gate, tenant isolation, ground-truth verifier, consensus arbiter, simulation engine, counterfactual replay, dangerous command guard, blast radius guard, context redactor, prompt egress scanner. Every gate has API routes, core logic, and audit tables.
3
Integration Density
323 native connector files covering AWS, Azure, GCP, Kubernetes, Datadog, PagerDuty, Splunk, Terraform, and 300+ more. Each integration makes the next one more valuable.
4
Enterprise RBAC From Day One
Organizations → Workspaces → Teams (hierarchical), time-bound permissions, SAML 2.0 + OIDC with PKCE, SCIM provisioning, MFA. Not bolted on — it's in the schema and middleware.
5
Multi-Model Cost Engine
Routes across 4 providers (Anthropic, Google, DeepSeek, OpenAI) with per-agent tier assignment, circuit breakers (5 failures → 60s open), and automatic fallback chains. Every LLM call logged with provider, model, tokens, latency, and cost. Designed for 5–50x lower inference cost vs single-model wrappers.

09 / 20

13 production safety gates.

Every agent action passes through multiple gates before it touches production. Every decision is logged for audit.

GATE 01

Kill Switch

Three-scope emergency brake: agent, tenant, or global. Arm/disarm with full audit trail.

GATE 02

Prompt Injection Defense

~20 regex patterns, severity ladder (none→critical), quarantine table for forensics.

GATE 03

Cost Circuit Breaker

Configurable spend ceilings per tenant. Auto-halt on breach. Budget check before every LLM call.

GATE 04

Error Budget / SLO Gate

Policy matrix: risk level × budget remaining. Blocks risky actions when error budget is depleted.

GATE 05

Tenant Isolation

Defense-in-depth: org_id on all tables, middleware verification, violation recording.

GATE 06

Ground Truth Verifier

T+5m, T+1h, T+24h post-remediation probes. Auto kill-switch on critical regression.

GATE 07

Consensus Arbiter

Multi-agent proposals, resolution voting, human override for escalated ties.

GATE 08

Simulation Engine

Digital-twin dry-run: service graph snapshot, step handlers, risk scoring before execution.

GATE 09

Counterfactual Replay

Replay past incidents with alternative actions to validate agent decision quality.

GATE 10

Dangerous Command Guard

Pattern-match destructive commands (rm -rf, DROP TABLE, kubectl delete) before execution.

GATE 11

Blast Radius Guard

Estimate impact scope before any remediation. Block actions affecting too many services.

GATE 12

Context Redactor

Strip secrets, PII, and credentials from agent context before LLM calls.

+ Gate 13: Prompt Egress Scanner — scans outbound prompts for data leakage

10 / 20

The Cost Manager moat.

Not a wrapper around one frontier model. An inference orchestration layer that routes the cheapest capable model per task and fails over automatically.

TIERED ROUTING

Right model, right task

Opus (8192 tokens) for Incident Commander and RCA. Sonnet (4096) for most agents. Haiku (1024) for scribe and summary. Per-agent tier assignment, not one-size-fits-all.

4-PROVIDER FALLBACK

Circuit breakers + auto-failover

Anthropic → Google Gemini → DeepSeek → OpenAI. Circuit breaker opens after 5 failures, 60s reset. No single provider can take down the system.

LLM TELEMETRY

Every call logged and costed

Provider, model, input/output tokens, latency, cost estimate — recorded to SQLite and broadcast via WebSocket in real time. Full audit trail for compliance.

File: backend/src/core/aiModelRouter.js · backend/src/core/llmTelemetry.js

11 / 20

Pre-revenue, not pre-usage

Real users on day 60+.

Organic signups, unpaid

Active product users

3,632

Unique site visitors

Demo bookings

Backed by the platforms we're built on

Confluent Databricks Google for Startups NVIDIA Inception MongoDB for Startups Redis for Startups AWS Activate

12 / 20

Pricing that wins.

Free

forever · 1 seat

10 agents
7-day retention
Community support
Core integrations

Standard

$50

/user/mo · up to 3 users

25 agents
30-day retention
Email support
All integrations

Team

Custom

scoped to your team

55 agents
90-day retention
Priority support
SSO / SAML
Custom dashboards

Enterprise

Custom

unlimited seats

100 agents
1-year+ retention
Dedicated CSM
SCIM provisioning
SLA guarantees
HIPAA BAA

Annual discount: 20%. Multi-year: 22% (2yr), 30% (3yr) on Enterprise.

13 / 20

$6.8B bottom-up TAM.

The SRE and DevOps tooling market is massive and fragmented. The average mid-size team spends $300K–$1M/year across observability, incident management, and automation.

$200K+

Datadog avg contract/yr

$50–100K

PagerDuty avg contract/yr

$300K–$1M

Total SRE stack per team/yr

68K+

Companies with SRE teams (est.)

Nova replaces or consolidates 3–5 tools in the stack. The wedge is incident response; the platform absorbs observability, runbooks, and cost management.

14 / 20

Go-to-market: PLG wedge.

1. Wedge

Free tier → first auto-resolved incident → "it just fixed itself" moment. Developer signs up, installs agent, connects PagerDuty. First value in under 5 minutes.

↓

2. Expand

Team invites colleagues. Standard tier unlocks 25 agents, 30-day retention, and team dashboards. Usage-based expansion within the org.

↓

3. Platform

Enterprise conversation. SSO/SAML, SCIM, tenant isolation, SLA guarantees. Nova replaces 3–5 tools and becomes the ops backbone.

SEO moat: 75 competitor comparison pages, 2,289 technical blog posts, owning "Agentic SRE" SERP.

15 / 20

The team.

Dr. Samson Tanimawo

Founder, CEO & CTO

SRE with 10+ years at JPMorgan Chase and the US Navy. PhD, MSc, MBA. Built the entire platform.

Lashae Tanimawo

Co-Founder & CMO

Brand, content, and go-to-market strategy. Driving the PLG motion and community growth.

Ikechukwu Ofeweke

Chief Strategy Officer

M.Eng, PMP. Enterprise partnerships, strategic planning, and investor relations.

Jennifer Broxson

Head of AI Strategy & BD

AI strategy, business development, and key account management.

Alim Mohammad

Founding AI Engineer

Core agent development, model routing, and safety gate implementation.

Hiring: 3 planned — Senior Backend Engineer, ML Engineer, DevRel.

16 / 20

Architecture depth.

This is not a demo. Every layer has real code, real tests, and real infrastructure.

1,282

Backend source files

910

Frontend source files

645

API route files

323

Integration connector files

DATA LAYER

Multi-tenant from day one

SQLite (WAL) with Postgres dialect translator. org_id on all tables. 7 migration files. Audit ledger, LLM event log, isolation violation tracking.

OBSERVABILITY

OpenTelemetry + Jaeger

Full OTEL SDK with auto-instrumentation. Jaeger/OTLP exporters. Prometheus metrics. Structured logging. Self-monitoring.

INFRASTRUCTURE

K8s + Blue-Green Deploy

Kustomize manifests with dev/staging/prod overlays. HPA auto-scaling. Nginx blue-green traffic switching with health checks. GitHub Actions CI/CD.

17 / 20

Credibility signals.

STARTUP PROGRAMS

7 programs accepted

Confluent, Databricks, Google for Startups, NVIDIA Inception, MongoDB for Startups, Redis for Startups, AWS Activate.

COMPLIANCE

SOC 2 Type II in progress

GDPR and CCPA live. HIPAA BAA on request. Controls modeled on AICPA Trust Services Criteria and ISO 27001 Annex A. DPA available.

SECURITY

Enterprise-grade from launch

TLS 1.3, AES-256 at rest, SAML 2.0 SSO (Okta, Azure AD, Google, JumpCloud), SCIM, MFA, API keys with prefix-based lookup, IP whitelisting.

18 / 20

Risks & mitigations.

No revenue yet

Pre-revenue by design — PLG requires product-market fit before monetization. 69 organic signups validate demand.

→ First paid conversions targeted Q3 2026

Small team (5 people)

Founder built the entire 2,000+ file codebase. Team is lean but the architecture is production-grade.

→ 3 hires planned with round proceeds

Test coverage is thin

44 test files for 2,000+ source files. Safety gates and core path are the priority.

→ Prioritizing safety gate + integration tests in Q3

Incumbents could build this

Datadog and PagerDuty have the data. But their architecture is dashboard-first, not agent-first. Retooling is a multi-year effort.

→ 13 safety gates + 78 agents = 18+ months head start

19 / 20

The ask.

Round size

$2M

Pre-seed, open

Lead indicated

$1M

Gacsym Ventures (verbal)

Close target

Q3 2026

Matches at $1M co-invested

Instrument

SAFE

or priced, founder-friendly

Use of funds

60%

Engineering

3 hires: Senior Backend, ML Engineer, DevRel. Accelerate agent library, test coverage, and integrations.

25%

Go-to-Market

PLG infrastructure, content, community, and first enterprise pilots.

15%

Infrastructure

SOC 2 completion, multi-region deploy, and compliance certifications.

The Multi-Agent OSfor SRE & DevOps

It's 3:14 AM.Your phone is screaming.

Why now?

100x cheaper in 18 months

Agents can reason, not just respond

Observability → Autonomous Ops

The Multi-Agent OSfor SRE & DevOps

Autonomous Triage & Remediation

Anomaly Detection → Root Cause

Policy, Quotas, Compliance

Three surfaces.One platform.

639 pages. Real-time war rooms.

120+ CLI tools. Ship from terminal.

1,438 lines. 800+ metrics per tick.

How we're different.

Competitive landscape.

Why this is defensible.

Multi-Agent Orchestration

13-Gate Safety Layer

Integration Density

Enterprise RBAC From Day One

Multi-Model Cost Engine

13 production safety gates.

Kill Switch

Prompt Injection Defense

Cost Circuit Breaker

Error Budget / SLO Gate

Tenant Isolation

Ground Truth Verifier

Consensus Arbiter

Simulation Engine

Counterfactual Replay

Dangerous Command Guard

Blast Radius Guard

Context Redactor

The Cost Manager moat.

Right model, right task

Circuit breakers + auto-failover

Every call logged and costed

Real users on day 60+.

Pricing that wins.

Free

Standard

Team

Enterprise

$6.8B bottom-up TAM.

Go-to-market: PLG wedge.

The team.

Dr. Samson Tanimawo

Lashae Tanimawo

Ikechukwu Ofeweke

Jennifer Broxson

Alim Mohammad

Architecture depth.

Multi-tenant from day one

OpenTelemetry + Jaeger

K8s + Blue-Green Deploy

Credibility signals.

7 programs accepted

SOC 2 Type II in progress

Enterprise-grade from launch

Risks & mitigations.

No revenue yet

Small team (5 people)

Test coverage is thin

Incumbents could build this

The ask.

Use of funds

Engineering

Go-to-Market

Infrastructure

Let's talk.

The Multi-Agent OS
for SRE & DevOps

It's 3:14 AM.
Your phone is screaming.

The Multi-Agent OS
for SRE & DevOps

Three surfaces.
One platform.