What does The Portland Company do?

The Portland Company is a software development agency that builds web apps, native iOS and Android apps, AI and LLM integrations, and blockchain applications. We work with founders, product teams, and enterprises to ship production software fast — handling architecture, design, engineering, deployment, and ongoing iteration. With two decades of experience across consumer and enterprise products, we specialize in turning ambitious ideas into shipped software using modern stacks like Next.js, Swift, Kotlin, and the latest AI tooling.

Spencer Hill is the founder of The Portland Company. He has spent two decades building software across web, mobile, AI, and blockchain, working with everyone from early-stage startups to Fortune 500 companies. Spencer leads engagements personally — clients work directly with him on architecture, technical strategy, and execution rather than being passed off to junior staff. He is hands-on with code and product, and he is the primary point of contact for every project the firm takes on.

Do you work with startups?

Yes. A significant share of our work is with early-stage startups, from pre-seed founders building a first prototype to Series A and B companies scaling their engineering capacity. We help founders move from idea to shipped MVP quickly, then continue as a long-term engineering partner as the company grows. We are comfortable working with non-technical founders, technical co-founders who need extra hands, and venture-backed teams who need senior engineering leadership without the time and cost of hiring full-time.

Do you build AI agents?

Yes. We build custom AI agents, LLM-powered features, and retrieval-augmented generation systems using the OpenAI API, Anthropic API, LangChain, and the Model Context Protocol. Our work includes tool-using agents, document understanding pipelines, voice interfaces, and agentic workflows wired into existing products. We focus heavily on evaluations, guardrails, observability, and cost control so AI features actually make it to production instead of getting stuck in proof-of-concept purgatory. Engagements range from focused prototypes to long-running AI product partnerships.

Do you build blockchain apps?

Yes. We build smart contracts and decentralized applications on Ethereum and EVM-compatible chains, written in Solidity and tested with Foundry and Hardhat. Our blockchain work includes DeFi protocols, token systems, NFT mechanics, on-chain governance, and wallet-integrated front-ends using wagmi and viem. We treat security as non-negotiable: every contract is unit-tested, fuzz-tested, and prepared for third-party audit before launch. We work with both crypto-native founders and traditional companies bringing existing products on-chain.

Where are you located?

The Portland Company is headquartered in Portland, Oregon, United States. We work remotely with clients worldwide and have shipped projects for teams across North America, Europe, and Asia. Time zone overlap is rarely an issue — we adapt our working hours to match client needs and use asynchronous communication for everything that does not strictly require a live conversation. When projects warrant it, we travel to client sites for kickoffs, workshops, or critical milestones, but the vast majority of engagements run end-to-end remotely.

Do you work with enterprises?

Yes. We work with enterprise teams that need senior engineering capacity without the overhead of expanding headcount, or that need specialized expertise in AI, blockchain, or mobile that is hard to hire for. We are comfortable with procurement processes, SOC 2 requirements, NDAs, and security reviews. Past enterprise work has spanned financial services, healthcare, retail, and media. Engagements typically run as multi-month or multi-year partnerships where we embed alongside internal teams, ship production code, and transfer knowledge as we go.

What sets The Portland Company apart?

Three things. First, you work directly with Spencer and a small senior team — there is no layer of account managers or junior engineers between you and the people writing code. Second, we have genuine breadth across web, mobile, AI, and blockchain, so we can build complete products rather than handing off pieces to other vendors. Third, we ship. We have spent twenty years optimizing the path from idea to production software, and our engagements are measured in shipped features and live users, not slide decks and roadmaps.

Integrate Claude into a Web App: A Practical Guide

What you'll learn

This guide walks through every layer of a production-grade Claude integration in a TypeScript web app: secure key handling, prompt structure, streaming responses to the browser over Server-Sent Events, tool use loops, error recovery, and cost observability. By the end you'll have a working pattern you can drop into a Next.js, Remix, or Hono app without having to reverse-engineer the SDK.

Prerequisites

Node.js 20+ and a TypeScript-enabled web framework (Next.js App Router assumed)
An Anthropic API key with billing enabled
Familiarity with async iterators and Server-Sent Events
An observability sink (Datadog, Axiom, OpenTelemetry collector, or equivalent)

Steps

1
Provision API access and store the key securely
Create an Anthropic console account, generate an API key, and store it as a server-side environment variable. Never expose the key in client bundles — all Claude calls must originate from a server route or edge function.
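As a minimal sketch of the server-side key handling described above, the helper below reads a required variable and fails with a clear message when it is missing. The name requireEnv and the env-map parameter are illustrative assumptions, not part of the Anthropic SDK.

```typescript
// Hypothetical helper (not part of any SDK): read a required secret from an env map.
type EnvMap = Record<string, string | undefined>;

function requireEnv(env: EnvMap, name: string): string {
  const value = env[name]?.trim();
  if (!value) {
    // Name the variable, but never echo a secret value in the error message.
    throw new Error(`Missing required environment variable: ${name}`);
  }
  return value;
}

// Typical server-side usage (assumes a Node.js runtime):
//   const apiKey = requireEnv(process.env, "ANTHROPIC_API_KEY");
```

Passing the env map in explicitly keeps the helper easy to test and makes it obvious that it only runs where server-side variables exist.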

2
Install the SDK and create a typed client

Install @anthropic-ai/sdk and instantiate it once per process. Keep the client in a shared module so connection pooling and instrumentation are consistent across routes.

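// lib/anthropic.ts
import Anthropic from "@anthropic-ai/sdk";

// One shared client per process; import this module from every route.
export const anthropic = new Anthropic({
  // Server-side only. The non-null assertion assumes the variable is set at deploy time.
  apiKey: process.env.ANTHROPIC_API_KEY!,
  // Opt-in header from the prompt-caching beta; current API versions accept cache_control without it.
  defaultHeaders: { "anthropic-beta": "prompt-caching-2024-07-31" },
});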

3
Design the prompt with system, user, and assistant roles
Use the system parameter for stable instructions and personality. Pass conversation history as ordered messages. Keep system prompts short and concrete: every token in them is sent and billed on each request, and padding tends to dilute the instructions that matter.
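One way to keep that separation explicit is a small request builder, sketched below. The names buildRequest, ChatMessage, and SYSTEM_PROMPT, the 20-message history cap, and the 1024-token ceiling are all assumptions for illustration; the types are local stand-ins rather than the SDK's own.

```typescript
type Role = "user" | "assistant";

interface ChatMessage {
  role: Role;
  content: string;
}

interface ChatRequest {
  system: string;
  messages: ChatMessage[];
  max_tokens: number;
}

// Stable instructions live in one constant instead of being mixed into the history.
const SYSTEM_PROMPT = "You are a concise support assistant. Answer in plain text.";

function buildRequest(
  history: ChatMessage[],
  userInput: string,
  maxHistory = 20, // assumed cap on how many past messages are resent
): ChatRequest {
  const trimmed = userInput.trim();
  if (!trimmed) throw new Error("Empty user message");

  // Keep only the most recent messages, preserving their order.
  const recent = history.slice(-maxHistory);
  // Drop leading assistant turns so the conversation starts with a user message.
  while (recent.length > 0 && recent[0].role !== "user") recent.shift();

  return {
    system: SYSTEM_PROMPT,
    messages: [...recent, { role: "user", content: trimmed }],
    max_tokens: 1024,
  };
}
```

The returned object can be spread into a messages.create() or messages.stream() call alongside the model name.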

4
Stream responses to the browser

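Call the SDK's messages.stream() helper (or pass stream: true to messages.create()) and forward the output to the client through a server route. Streaming dramatically improves perceived latency, and Claude's content_block_delta events map cleanly onto incremental UI rendering. The route below forwards only the text deltas as a chunked plain-text response; to send true Server-Sent Events, frame each chunk as a data: line and set Content-Type to text/event-stream.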

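// app/api/chat/route.ts
import { anthropic } from "@/lib/anthropic";

export async function POST(req: Request) {
  const { messages } = await req.json();

  // messages.stream() returns a MessageStream that is async-iterable over raw API events.
  const stream = anthropic.messages.stream({
    model: "claude-opus-4-7",
    max_tokens: 1024,
    system: "You are a concise assistant.",
    messages,
  });

  const encoder = new TextEncoder();
  const body = new ReadableStream({
    async start(controller) {
      try {
        for await (const event of stream) {
          // Forward only incremental text; other event types carry metadata.
          if (event.type === "content_block_delta" &&
              event.delta.type === "text_delta") {
            controller.enqueue(encoder.encode(event.delta.text));
          }
        }
        controller.close();
      } catch (err) {
        // Surface upstream failures to the reader instead of leaving the response hanging.
        controller.error(err);
      }
    },
    cancel() {
      // The browser disconnected: stop the upstream request so tokens are not wasted.
      stream.abort();
    },
  });

  return new Response(body, {
    headers: { "Content-Type": "text/plain; charset=utf-8" },
  });
}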
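If you want the response to be actual Server-Sent Events rather than raw text chunks, the framing can be sketched as follows. encodeSseEvent and drainSseBuffer are hypothetical helper names, and the { text } JSON payload shape is an assumption; only the "data:" line followed by a blank line comes from the SSE wire format itself.

```typescript
// Server side: wrap one text delta in an SSE frame. JSON-encoding the payload
// keeps newlines inside the text from breaking the line-based "data:" format.
function encodeSseEvent(text: string): string {
  return `data: ${JSON.stringify({ text })}\n\n`;
}

// Client side: pull every complete frame out of a growing buffer and return
// the unfinished remainder so it can be prepended to the next network chunk.
function drainSseBuffer(buffer: string): { texts: string[]; rest: string } {
  const frames = buffer.split("\n\n");
  const rest = frames.pop() ?? ""; // the last piece may be an incomplete frame
  const texts: string[] = [];
  for (const frame of frames) {
    for (const line of frame.split("\n")) {
      if (line.startsWith("data: ")) {
        const payload = JSON.parse(line.slice("data: ".length)) as { text: string };
        texts.push(payload.text);
      }
    }
  }
  return { texts, rest };
}
```

On the server you would enqueue encoder.encode(encodeSseEvent(event.delta.text)) and respond with Content-Type: text/event-stream; on the client, append each decoded chunk to a buffer, call drainSseBuffer, render the returned texts, and keep rest for the next chunk.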

5
Add tool use for actions the model needs to take

Define tools with JSON Schema. When Claude returns a tool_use block, execute the tool server-side, append the tool_result to the message history, and loop until the model produces a final text response.

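import Anthropic from "@anthropic-ai/sdk";
import { anthropic } from "@/lib/anthropic";

const tools: Anthropic.Tool[] = [{
  name: "get_order_status",
  description: "Look up the current status of a customer order.",
  input_schema: {
    type: "object",
    properties: { orderId: { type: "string" } },
    required: ["orderId"],
  },
}];

// Bound the loop so a model that keeps calling tools cannot spin forever.
const MAX_STEPS = 8;

// runTool(name, input) is your application's own dispatcher; it is not defined here.
async function runAgent(messages: Anthropic.MessageParam[]) {
  for (let step = 0; step < MAX_STEPS; step++) {
    const res = await anthropic.messages.create({
      model: "claude-opus-4-7",
      max_tokens: 1024,
      tools,
      messages,
    });

    if (res.stop_reason !== "tool_use") return res;

    // One turn can contain several tool_use blocks; each needs a matching tool_result.
    const toolUses = res.content.filter(
      (b): b is Anthropic.ToolUseBlock => b.type === "tool_use",
    );
    const results: Anthropic.ToolResultBlockParam[] = await Promise.all(
      toolUses.map(async (toolUse) => ({
        type: "tool_result" as const,
        tool_use_id: toolUse.id,
        content: JSON.stringify(await runTool(toolUse.name, toolUse.input)),
      })),
    );

    // Echo the assistant turn, then answer all of its tool calls in a single user turn.
    messages.push({ role: "assistant", content: res.content });
    messages.push({ role: "user", content: results });
  }
  throw new Error(`Agent exceeded ${MAX_STEPS} tool-use steps`);
}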
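The loop above calls runTool without defining it. A minimal sketch of such a dispatcher, with input validation before anything touches a real system, might look like this. makeRunTool, parseOrderStatusInput, the ToolOutcome shape, the orderId pattern, and the injected lookupOrder function are all assumptions for illustration; a schema library such as Zod could replace the hand-written check.

```typescript
interface OrderStatusInput {
  orderId: string;
}

// Validate model-supplied input by hand; treat it as untrusted.
function parseOrderStatusInput(input: unknown): OrderStatusInput {
  if (typeof input !== "object" || input === null) {
    throw new Error("Tool input must be an object");
  }
  const { orderId } = input as Record<string, unknown>;
  // Assumed ID format: 1 to 64 letters, digits, or hyphens.
  if (typeof orderId !== "string" || !/^[A-Za-z0-9-]{1,64}$/.test(orderId)) {
    throw new Error("orderId must be a short alphanumeric string");
  }
  return { orderId };
}

type ToolOutcome = { ok: true; data: unknown } | { ok: false; error: string };

// lookupOrder stands in for your own data access and is injected for testability.
function makeRunTool(lookupOrder: (orderId: string) => Promise<unknown>) {
  return async function runTool(name: string, input: unknown): Promise<ToolOutcome> {
    try {
      switch (name) {
        case "get_order_status": {
          const { orderId } = parseOrderStatusInput(input);
          return { ok: true, data: await lookupOrder(orderId) };
        }
        default:
          return { ok: false, error: `Unknown tool: ${name}` };
      }
    } catch (err) {
      // Return failures as data so the model can react instead of crashing the loop.
      return { ok: false, error: err instanceof Error ? err.message : String(err) };
    }
  };
}
```

Returning errors as part of the tool result, rather than throwing, lets the model see what went wrong and either retry with corrected input or explain the problem to the user.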

6
Handle errors, rate limits, and partial failures
Catch 429s with exponential backoff, surface 400 validation errors as user-facing messages, and persist partial assistant turns so a network drop mid-stream does not corrupt history.
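The backoff policy described above can be sketched as below. isRetryable, backoffDelayMs, and withRetry are hypothetical names, the 500 ms base, 8 s cap, and four attempts are arbitrary starting points, and the code assumes thrown errors carry a numeric status property, as the SDK's API errors do.

```typescript
// Retry rate limits and server-side failures; everything else is the caller's bug.
function isRetryable(status: number): boolean {
  return status === 429 || status >= 500;
}

// Exponential backoff with full jitter: a random delay between 0 and
// min(capMs, baseMs * 2^attempt). rand is injectable for deterministic tests.
function backoffDelayMs(
  attempt: number,
  baseMs = 500,
  capMs = 8000,
  rand: () => number = Math.random,
): number {
  const ceiling = Math.min(capMs, baseMs * 2 ** attempt);
  return Math.floor(rand() * ceiling);
}

async function withRetry<T>(call: () => Promise<T>, maxAttempts = 4): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await call();
    } catch (err) {
      const status = (err as { status?: number }).status;
      const lastAttempt = attempt >= maxAttempts - 1;
      // Give up on non-retryable errors, unknown errors, and the final attempt.
      if (status === undefined || !isRetryable(status) || lastAttempt) throw err;
      await new Promise((resolve) => setTimeout(resolve, backoffDelayMs(attempt)));
    }
  }
}
```

The SDK client also accepts a maxRetries option and retries some failures on its own, so a wrapper like this mostly matters when you want to control the policy yourself or wrap non-SDK calls such as tool execution.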
7
Instrument cost and latency from day one
Log input_tokens, output_tokens, model, and request duration to your observability stack. Alert on per-user token spend and tail latency. Enable prompt caching for any repeated system prompt or long context.
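A sketch of the per-request record you might emit is shown below. buildUsageRecord and the field names on the record are assumptions, and the Pricing values are placeholders you must fill in from current published prices; only input_tokens and output_tokens mirror the usage object named above.

```typescript
interface Usage {
  input_tokens: number;
  output_tokens: number;
}

// USD per million tokens. Placeholder structure: supply real prices yourself.
interface Pricing {
  inputPerMTok: number;
  outputPerMTok: number;
}

interface UsageRecord {
  model: string;
  userId: string;
  inputTokens: number;
  outputTokens: number;
  durationMs: number;
  estimatedCostUsd: number;
}

function buildUsageRecord(
  model: string,
  userId: string,
  usage: Usage,
  durationMs: number,
  pricing: Pricing,
): UsageRecord {
  // Multiply before dividing to limit floating-point drift on small amounts.
  const estimatedCostUsd =
    (usage.input_tokens * pricing.inputPerMTok) / 1_000_000 +
    (usage.output_tokens * pricing.outputPerMTok) / 1_000_000;
  return {
    model,
    userId,
    inputTokens: usage.input_tokens,
    outputTokens: usage.output_tokens,
    durationMs,
    estimatedCostUsd,
  };
}
```

Measure durationMs around the API call, build the record from the response's usage field, and send it to whichever sink you chose in the prerequisites; per-user spend alerts then become a simple aggregation over estimatedCostUsd.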

Common pitfalls

Shipping the API key to the client. Anthropic does not support a public/browser key — any client-side call must proxy through your server.
Ignoring the tool_use loop. A single call is rarely enough. Build a bounded while-loop with a max-step guard so you never spin forever on a model that keeps calling tools.
Forgetting prompt caching. Long system prompts repeated per request are the #1 source of avoidable spend. Mark the static prefix as cacheable and input costs can drop by 60-90% for chat workloads where most of each request is that cached prefix.
Trusting model output blindly in tools. Validate tool inputs with Zod or the schema layer of your choice before touching production systems.
A max_tokens ceiling that is too high on user-facing turns. The parameter is required, so choose it deliberately instead of copying a large default. A runaway 4,000-token reply on every chat message compounds quickly.
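To make the prompt-caching pitfall concrete, here is a sketch of marking the static prefix as cacheable. buildSystemBlocks and the local TextBlock type are illustrative assumptions; the cache_control: { type: "ephemeral" } marker on a system text block is the API's caching mechanism.

```typescript
// Local stand-in for the shape of a system text block.
interface TextBlock {
  type: "text";
  text: string;
  cache_control?: { type: "ephemeral" };
}

function buildSystemBlocks(
  staticInstructions: string,
  perRequestContext?: string,
): TextBlock[] {
  const blocks: TextBlock[] = [
    // The marker makes everything up to and including this block cacheable.
    { type: "text", text: staticInstructions, cache_control: { type: "ephemeral" } },
  ];
  if (perRequestContext) {
    // Content that changes per request goes after the cached prefix, unmarked.
    blocks.push({ type: "text", text: perRequestContext });
  }
  return blocks;
}
```

Pass the result as the system parameter instead of a plain string. Keep the static block byte-for-byte identical across requests, since any change to the prefix produces a cache miss, and note that very short prefixes may fall below the minimum cacheable length.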

Next steps

If you want a partner to take this from prototype to production, we can help end-to-end:

AI & LLM integration services — production Claude, OpenAI, and agentic systems.
Web development — Next.js, Remix, and Cloudflare-native apps.
Get in touch to scope an integration.
