What does Josh Arnold invest in?

$10K-$75K angel checks in AI-native companies at the pre-seed and seed stages, focused on teams building compounding AI systems.

What is Josh Arnold's consulting focus?

Outcome-first AI sprints for teams moving AI from prototype to production. Applied intelligence for enterprise operations.

Email hello@josharnold.co with your deck or project brief. 48-hour response time.

Playbook

Recommendation thesis

Run vendor selection like procurement plus experiments: score quality, cost, latency, and risk on your real tasks before signing.
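One way to make the four scores comparable is a weighted scorecard. The sketch below is illustrative only: the weights, the `VendorScores` fields, and the example numbers are all assumptions, and it presumes each score has already been normalized to 0..1 with higher meaning better.

```python
from dataclasses import dataclass

# Hypothetical weights; tune them to your own priorities. They sum to 1.0.
WEIGHTS = {"quality": 0.4, "cost": 0.2, "latency": 0.2, "risk": 0.2}


@dataclass
class VendorScores:
    """Scores normalized to 0..1, where higher is always better.

    For cost, latency, and risk this means the raw measurement has already
    been inverted (cheaper, faster, and safer all score closer to 1).
    """
    name: str
    quality: float
    cost: float
    latency: float
    risk: float


def weighted_score(v: VendorScores, weights: dict[str, float] = WEIGHTS) -> float:
    """Return the weighted sum of the four normalized scores."""
    return sum(getattr(v, key) * w for key, w in weights.items())


def rank(vendors: list[VendorScores]) -> list[tuple[str, float]]:
    """Sort vendors from best to worst by weighted score."""
    scored = [(v.name, round(weighted_score(v), 3)) for v in vendors]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Made-up example numbers, not real vendor data.
candidates = [
    VendorScores("vendor_a", quality=0.9, cost=0.4, latency=0.6, risk=0.7),
    VendorScores("vendor_b", quality=0.8, cost=0.8, latency=0.7, risk=0.6),
]
ranking = rank(candidates)
```

With these assumed weights, a vendor that is slightly weaker on quality can still rank first if it is much cheaper, which is the kind of tradeoff a demo alone will not surface.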

Why now

Evaluation dataset represents real production prompts and constraints.

What breaks without this

Teams end up choosing vendors based only on benchmark marketing.

Decision framework

Evaluation dataset represents real production prompts and constraints.

Commercial terms include explicit usage, retention, and support expectations.

You can run side-by-side tests on quality and operational cost.
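A side-by-side test can be as small as a loop that sends the same prompts to every candidate and records quality, latency, and cost. The following is a minimal sketch under stated assumptions: the vendors are offline stub functions, quality is graded by exact match, and cost is a hypothetical flat per-call price. A real run would replace all three with your own clients, grader, and pricing.

```python
import time
from statistics import mean
from typing import Callable

# A "vendor" here is any function mapping a prompt to a completion.
Vendor = Callable[[str], str]


def run_side_by_side(
    vendors: dict[str, Vendor],
    dataset: list[tuple[str, str]],      # (prompt, expected answer) pairs
    cost_per_call: dict[str, float],     # hypothetical flat per-call price
) -> dict[str, dict[str, float]]:
    """Run every vendor on the same prompts and summarize the results."""
    report = {}
    for name, call in vendors.items():
        hits, latencies = [], []
        for prompt, expected in dataset:
            start = time.perf_counter()
            answer = call(prompt)
            latencies.append(time.perf_counter() - start)
            # Exact match is a stand-in; swap in your own quality grader.
            hits.append(1.0 if answer.strip() == expected else 0.0)
        report[name] = {
            "accuracy": mean(hits),
            "mean_latency_s": mean(latencies),
            "total_cost": cost_per_call[name] * len(dataset),
        }
    return report


# Stub vendors so the sketch runs offline; replace with real client calls.
def stub_upper(prompt: str) -> str:
    return prompt.upper()


def stub_echo(prompt: str) -> str:
    return prompt


dataset = [("abc", "ABC"), ("hello", "HELLO")]
report = run_side_by_side(
    {"upper": stub_upper, "echo": stub_echo},
    dataset,
    cost_per_call={"upper": 0.002, "echo": 0.001},
)
```

Because every vendor sees the identical dataset, differences in the report reflect the vendors rather than the prompts.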

Recommended path

Score each shortlisted vendor on quality, cost, latency, and risk using your real tasks, and do it before signing.

Side-by-side scoring prevents lock-in based on superficial demos.

Implementation sequence

Commercial terms include explicit usage, retention, and support expectations.

Tradeoffs

This approach is a poor fit for organizations without legal or security review capacity.

Criterion	Recommended when	Not recommended when
Evaluation dataset	The evaluation dataset represents real production prompts and constraints.	Teams are choosing vendors based only on benchmark marketing.
Commercial terms	Commercial terms include explicit usage, retention, and support expectations.	The organization has no legal or security review capacity.
Side-by-side testing	You can run side-by-side tests on quality and operational cost.	The project has not defined latency and quality requirements.
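The table can also be read as a pre-flight check before starting a vendor evaluation. The sketch below encodes each row as a boolean and lists whatever is still missing. The `Readiness` field names and the blocker messages are hypothetical labels chosen for illustration, not terms from the playbook.

```python
from dataclasses import dataclass


@dataclass
class Readiness:
    """Yes/no answers for the conditions in the table (names are illustrative)."""
    has_production_eval_set: bool
    has_explicit_commercial_terms: bool
    can_run_side_by_side_tests: bool
    has_legal_or_security_review: bool
    has_latency_and_quality_targets: bool


def blockers(r: Readiness) -> list[str]:
    """Return a message for every unmet condition, in table order."""
    checks = [
        (r.has_production_eval_set,
         "Build an evaluation dataset from real production prompts."),
        (r.has_explicit_commercial_terms,
         "Get explicit usage, retention, and support terms."),
        (r.can_run_side_by_side_tests,
         "Set up side-by-side tests on quality and operational cost."),
        (r.has_legal_or_security_review,
         "Line up legal or security review capacity."),
        (r.has_latency_and_quality_targets,
         "Define latency and quality requirements."),
    ]
    return [message for ok, message in checks if not ok]


ready = Readiness(True, True, True, True, True)
not_ready = Readiness(True, False, True, False, True)

ready_blockers = blockers(ready)
open_blockers = blockers(not_ready)
```

An empty list means every row of the table is satisfied; anything else is the work to finish first.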

Before

Teams choose vendors based only on benchmark marketing.

After

Side-by-side scoring prevents lock-in based on superficial demos.