Pawvlov NLT Labs NLT Labs

For Investors

Process

The engine run that produced this brief. Every claim above traces back to a specific agent step, with its duration, its cost, and the citations it emitted.

Pipeline Run Telemetry

How this run was built

20 agents · same phases as portfolio.nltlabs.ai/pipeline. Cards show live stats from pipeline-run.json.

Partial pipeline telemetry

Lineage: PE evaluation + POC build.

Flags: workflow_in_progress

20
Agents
16h 30m
Wall
$15.80
Cost
40
Source URLs
Phase 1 — Evaluation9m 01s · 9 agents
Parallel · 6
sonnet
Market Researcher
TAM/SAM/SOM sizing + customer voice
⏱ 22s 💵 $0.22 🛠 0 🔗 7
complete confidence 55%
sonnet
Competitive Analyst
Competitor teardown + funding + moat score
⏱ 53s 💵 $0.26 🛠 0 🔗 10
complete confidence 60%
sonnet
Feasibility Analyst
Tech architecture + build risks
⏱ 26s 💵 $0.23 🛠 0 🔗 7
complete confidence 70%
sonnet
Financial Modeler
Unit economics + revenue projections
⏱ 26s 💵 $0.24 🛠 0 🔗 6
complete confidence 55%
sonnet
Regulatory Analyst
Compliance + jurisdictional risk
⏱ 35s 💵 $0.23 🛠 0 🔗 10
complete confidence 72%
sonnet
Creative Director
Brand name + tagline + positioning
⏱ 11s 💵 $0.20 🛠 0
complete confidence 72%
sonnet
Devil's Advocate
Stress-tests bull claims, surfaces fatal flaws
⏱ 1m 17s 💵 $0.27 🛠 0
complete confidence 78%
sonnet
Number Auditor
Broken-math hard cap, TAM math, contradictions
⏱ 1m 19s 💵 $0.35 🛠 0
complete
sonnet
PE Firm
9-lens rubric, score, fund/no-fund verdict
⏱ 3m 29s 💵 $0.13 🛠 0
complete
Phase 2 — Build1h 27m · 11 agents
opus
POC Director
Locks thesis, slug, brief, downstream specs
⏱ 17m 44s 💵 $4.03 🛠 0
complete confidence 85%
sonnet
Product Designer
Hardware BOM + 3D + electrical spec
⏱ 12m 56s 💵 $1.38 🛠 0
complete confidence 92%
sonnet
GTM Operator
Go-to-market plan + launch sequencing
⏱ 5m 33s 💵 $0.62 🛠 0
complete confidence 82%
sonnet
GTM QA
18-section population gate; gtm_regen converge signal
⏱ 18s 💵 $0.21 🛠 0
complete confidence 97%
Parallel · 5
sonnet
Market Pricing Researcher
Pricing tiers + comparable validation
⏱ 4m 49s 💵 $0.57 🛠 0
complete confidence 60%
opus
Visual Assets
Nano Banana · 4 photorealistic renders (hardware only)
⏱ 5m 52s 💵 $1.87 🛠 0 🔁 ×3 loop $5.56
complete confidence 75%
opus
Visual QA
Anatomy + brand-fit verdict on renders
⏱ 1m 27s 💵 $1.10 🛠 0 🔁 ×3 loop $3.36
complete confidence 100%
sonnet
Web Designer
Visual differentiation + brand tokens
⏱ 6m 48s 💵 $0.77 🛠 0
complete confidence 88%
sonnet
Copywriter
Research-first, pain-led copy
⏱ 16m 30s 💵 $1.36 🛠 0
complete confidence 82%
sonnet
Content QA
Sales-readiness rubric per page
⏱ 15m 21s 💵 $1.76 🛠 0
complete confidence 88%
opus
Frontend Builder
Astro site, /inside two-door, deploy
⏱ — 💵 $0.00 🛠 0
running

Quality scorecard

Claim evidence mix

0
Cited
1
Derived
0
Assumed
77%
Avg confidence
Key Decisions
  • [dev] Fatal flaw
    Fatal flaw

    Iatrogenic harm risk is structural, not solved: the device's value prop attacks Furbo for rewarding during fear states, but Pawvlov's own ESP32-S3 4-class classifier running at 5fps on novel pose data (no public labeled dataset exists per feas brief) will inevitably misclassify and reward fear states. One viral 'Pawvlov made my dog worse' Reddit thread destroys the entire 'therapy, not toy' positioning the brand is built on — this is a brand-existential risk the founder explicitly created by accusing competitors of the same failure mode.

  • [dev] Fatal flaw
    Fatal flaw

    $215K seed is critically underfunded with zero CAC budget: $105K tooling + $45K eng + $12K SAB + $12K FCC + $9K beta + $6K insurance + $4K reserve = $193K committed before any marketing, customer acquisition, founder salary beyond 90 days, working capital for inventory restock, classifier data collection (10K+ labeled frames), or contingency. The fin brief assumes 3.5K Y1 units sold with no documented CAC channel and no marketing line item. Hardware DTC with $0 acquisition budget does not ship 3.5K units.

  • [dev] Fatal flaw
    Fatal flaw

    Timeline math is broken: 90-day MVP is contradicted by the feas brief (6-9 month industry norm), 8-12 week tooling, 8-12 week FCC cert, and a non-existent labeled training dataset. Realistic first ship is month 9-12, leaving 0-3 months in Y1 to hit the fin brief's $1M revenue target. Seed funds only ~12 months of opex per fin brief; a 3-month slip burns the entire runway before unit-1 retail.

  • [dev] Rebuttal
    Rebuttal · market · medium/high

    TAM ~$2.16B by applying ~$60 annual anxiety-solution spend across the ~36M US dogs with noise anxiety → The $60/year/dog multiplier has no cited source. ThunderShirt is a $40 one-time purchase, Adaptil diffusers are $25-60/year, and trazodone runs $15-40/month for severe cases only (~5-10% of anxious dogs, not 33%). The 33% prevalence figure is owner-reported behavioral observation, not clinical diagnosis — addressable population for a $249 device is materially smaller. Realistic TAM is likely $300-500M, not $2.16B.

  • [dev] Rebuttal
    Rebuttal · comp · high/medium

    Moat score of 4 with silent-auger + floor-tray delivery, DACVB SAB endorsement, and 'therapy not toy' brand wedge as defensibility → None of these are durable moats. The silent auger is mechanical engineering, not patentable IP — PETLIBRO/Furbo can replicate in one hardware cycle. SAB endorsements transfer to any well-funded competitor. The 'therapy' brand wedge is an FTC substantiation liability without clinical trials, not a moat. Comp brief admits incumbents could ship anxiety mode in 12-18 months; realistic moat score is 2-3.

  • [dev] Rebuttal
    Rebuttal · feas · high/high

    Feasibility score 6 with 90-day MVP buildable using mature off-the-shelf components → The feas brief itself contradicts the score by flagging the 90-day timeline as ~2-3x unrealistic, the labeled dataset as non-existent, the auger spec as aggressive, and the 49% tooling commitment as the dominant capital risk. With four structural unknowns (classifier accuracy, dataset acquisition, auger noise floor, 3-SKU coordination), the score should be 3-4, not 6.

  • [dev] Rebuttal
    Rebuttal · fin · high/high

    Year 1 revenue $1M with breakeven month 18, assuming 70% subscription attach and $60-90 CAC → Both undefended assumptions. Furbo Nanny attach reportedly <30%; Peloton attach <50%. DTC pet-hardware CAC benchmarks $100-200. At 40% attach and $150 CAC, the fin brief's own sensitivities show breakeven slips past month 24 — beyond the seed runway. The Y1 $1M figure also requires 3.5K units shipped, which assumes first retail ship by month 6 — impossible given 8-12 week tooling + 8-12 week FCC + non-existent classifier dataset.

  • [dev] Rebuttal
    Rebuttal · reg · high/medium

    Risk level 'medium' with FCC + UL certifications and ToS architecture sufficient to manage state privacy/biometric exposure → Continuous in-home camera doing pose classification plus always-on audio recording in two-party-consent states is high-risk, not medium. BIPA class actions have settled at $650M (Facebook), $92M (TikTok). Pose data may qualify as biometric identifier under IL/TX/WA statutes. Cloud inference path means data leaves the device, eliminating the 'on-device' defense. Risk level should be elevated to high; a single BIPA class certification kills the company.

Showing first 8 of 9 decisions.

Source URLs cited (40)

Pipeline v3.2 · run 2026-06-20 · pe=0a113dbc · poc=1fe2d510