What does The Portland Company do?

The Portland Company is a software development agency that builds web apps, native iOS and Android apps, AI and LLM integrations, and blockchain applications. We work with founders, product teams, and enterprises to ship production software fast — handling architecture, design, engineering, deployment, and ongoing iteration. With two decades of experience across consumer and enterprise products, we specialize in turning ambitious ideas into shipped software using modern stacks like Next.js, Swift, Kotlin, and the latest AI tooling.

Spencer Hill is the founder of The Portland Company. He has spent two decades building software across web, mobile, AI, and blockchain, working with everyone from early-stage startups to Fortune 500 companies. Spencer leads engagements personally — clients work directly with him on architecture, technical strategy, and execution rather than being passed off to junior staff. He is hands-on with code and product, and he is the primary point of contact for every project the firm takes on.

Do you work with startups?

Yes. A significant share of our work is with early-stage startups, from pre-seed founders building a first prototype to Series A and B companies scaling their engineering capacity. We help founders move from idea to shipped MVP quickly, then continue as a long-term engineering partner as the company grows. We are comfortable working with non-technical founders, technical co-founders who need extra hands, and venture-backed teams who need senior engineering leadership without the time and cost of hiring full-time.

Do you build AI agents?

Yes. We build custom AI agents, LLM-powered features, and retrieval-augmented generation systems using the OpenAI API, Anthropic API, LangChain, and the Model Context Protocol. Our work includes tool-using agents, document understanding pipelines, voice interfaces, and agentic workflows wired into existing products. We focus heavily on evaluations, guardrails, observability, and cost control so AI features actually make it to production instead of getting stuck in proof-of-concept purgatory. Engagements range from focused prototypes to long-running AI product partnerships.

Do you build blockchain apps?

Yes. We build smart contracts and decentralized applications on Ethereum and EVM-compatible chains, written in Solidity and tested with Foundry and Hardhat. Our blockchain work includes DeFi protocols, token systems, NFT mechanics, on-chain governance, and wallet-integrated front-ends using wagmi and viem. We treat security as non-negotiable: every contract is unit-tested, fuzz-tested, and prepared for third-party audit before launch. We work with both crypto-native founders and traditional companies bringing existing products on-chain.

Where are you located?

The Portland Company is headquartered in Portland, Oregon, United States. We work remotely with clients worldwide and have shipped projects for teams across North America, Europe, and Asia. Time zone overlap is rarely an issue — we adapt our working hours to match client needs and use asynchronous communication for everything that does not strictly require a live conversation. When projects warrant it, we travel to client sites for kickoffs, workshops, or critical milestones, but the vast majority of engagements run end-to-end remotely.

Do you work with enterprises?

Yes. We work with enterprise teams that need senior engineering capacity without the overhead of expanding headcount, or that need specialized expertise in AI, blockchain, or mobile that is hard to hire for. We are comfortable with procurement processes, SOC 2 requirements, NDAs, and security reviews. Past enterprise work has spanned financial services, healthcare, retail, and media. Engagements typically run as multi-month or multi-year partnerships where we embed alongside internal teams, ship production code, and transfer knowledge as we go.

What sets The Portland Company apart?

Three things. First, you work directly with Spencer and a small senior team — there is no layer of account managers or junior engineers between you and the people writing code. Second, we have genuine breadth across web, mobile, AI, and blockchain, so we can build complete products rather than handing off pieces to other vendors. Third, we ship. We have spent twenty years optimizing the path from idea to production software, and our engagements are measured in shipped features and live users, not slide decks and roadmaps.

Scope an AI Agent Project: From Idea to MVP

What you'll learn

Most agent projects fail in scoping, not in code. This guide is the checklist we run with every client before a single prompt is written: how to pick a success metric, inventory the tools the agent needs, choose between single-shot, ReAct, and planner-executor architectures, set up evals from day one, model the unit economics, and cut a defensible MVP. The output is a short scoping doc that gives engineering a real plan and gives stakeholders a real number to grade the project on.

Prerequisites

Access to the workflow or process the agent will automate
A stakeholder who can name the business outcome in one sentence
API access to the systems the agent will read from and write to
A sample of 30-100 real inputs (tickets, emails, PRs, etc.)
Budget authority for model spend and engineering time

Steps

1
Define the single success metric
Pick one number that, if it moves, the project worked. Resolution rate on support tickets, deals enriched per hour, percent of PRs auto-triaged correctly. Vague goals like 'improve productivity' kill agent projects because there's no way to know when you're done.
2
Map the tools the agent actually needs
List every external action: read a CRM record, write a calendar event, query a warehouse, call an internal API, post to Slack. Each tool needs an owner, a stable schema, idempotency guarantees, and a permission model. The tool surface is 70% of the engineering work — model choice is 5%.
3
Choose a loop architecture
Three viable shapes. (a) Single-shot: one prompt in, one structured output out — use when the task is bounded and tools aren't needed. (b) ReAct / tool-calling loop: model decides at each step whether to act or finish — use for most agent work. (c) Planner-executor: a planning pass produces a typed plan, a deterministic executor runs it — use when steps are expensive, audit logs matter, or actions touch money.
4
Stand up evals before writing the agent
Build a labeled dataset of 30-100 representative inputs with expected outputs or rubric scores. Wire it into CI so every prompt change is measured. Without evals you're flying blind — every 'this feels better' becomes religious debate.
5
Estimate the unit economics and budget
Calculate tokens-per-task across input, output, and tool round-trips. Multiply by model price and expected task volume. Cap max_tokens, max_steps, and total spend per task. Agents without budgets routinely spend $10 on $0.50 jobs because nobody set the guardrails.
6
Design the human-in-the-loop fallback
Decide which actions require human approval, which can run autonomously, and what the escalation path looks like when confidence is low. The agent should know how to say 'I don't know' and where to send it.
7
Cut the MVP scope ruthlessly
From the full feature list, pick the smallest slice that moves the success metric on a defined subset of inputs. One workflow, one tool surface, one user cohort. Ship in 4-6 weeks. Everything else is roadmap.
8
Plan rollout, monitoring, and a kill switch
Run shadow mode (agent runs, humans still act), then assisted mode (agent suggests, humans approve), then autonomous mode for the safest task subset. Log every step. Wire a feature flag to stop the agent globally in under 60 seconds.

Common pitfalls

Skipping evals. Without a labeled dataset you cannot tell if a prompt change made the agent better or worse. Build evals before the agent.
Treating the LLM as the product. The LLM is the engine. The product is the tools, the eval harness, the rollout strategy, and the kill switch.
No step or token cap. An agent without max_steps and max_tokens will eventually do something expensive and embarrassing.
Scope creep before the MVP ships. Every new tool added pre-launch doubles the eval surface. Resist.
Skipping shadow mode. Going straight to autonomous is how you discover, in production, that the agent confidently mislabels 8% of inputs.

Next steps

If you want a partner to run this scoping with you and ship the MVP:

AI & agent development services — agent design, evals, and production rollout.
Web development — internal tooling and dashboards for human-in-the-loop review.
Book a scoping call.

What you'll learn

Prerequisites

Steps

Define the single success metric

Map the tools the agent actually needs

Choose a loop architecture

Stand up evals before writing the agent

Estimate the unit economics and budget

Design the human-in-the-loop fallback

Cut the MVP scope ruthlessly

Plan rollout, monitoring, and a kill switch

Common pitfalls

Next steps