OTTO · Knowledge Engine
Type
Domain
Min value
Showing all 30
via AI with Remy @aiwithremyOTTO found it
What it is

Claude Code is Anthropic's command-line (and now desktop/web) agent that drives real work end-to-end: it edits code, runs tests, calls tools and MCP servers, and acts on a connected vault or repo. Simon Willison calls the launch of Claude Code (with Sonnet 3.5) "the killer moment" — the first agent good enough to usefully drive a terminal — and a year on he writes most of his code through it, often from his phone via claude-code-for-web. In the creator scene it's used far beyond coding: as a personal automation hub wired into MCPs (image generation, content, scheduling) to run repeatable business workflows. Kept (not demoted) because it's a genuine workhorse tool with deep, non-obvious workflows — distinct from the claude model itself.

Next action

Treat Claude Code as the default engine for every build + automation, not just coding — it's already the runtime OTTO operates on.

via Anthropic @anthropicOTTO found it
What it is

A practice for designing AI-agent context. Its anatomy: system prompts at the "right altitude" (neither brittle if-else rules nor vague aspirations); a minimal, non-overlapping tool set (bloated/overlapping tools are the most common failure mode — "if a human can't say which tool to use, neither can the agent"); and a few diverse, canonical few-shot examples. Retrieval shifts from pre-loaded embeddings to just-in-time loading (store file paths/queries, pull content at runtime). For long-horizon tasks it adds three levers: compaction (summarise and reinitialise the window), structured note-taking (agentic memory / a NOTES.md the agent maintains), and sub-agent architectures (each sub-agent burns its own context and returns a compact 1-2k-token summary).

Next action

Curate every OTTO agent's context to the smallest high-signal token set; prune overlapping tools and use compaction / note-taking / sub-agents for long tasks.

via Pragmaticengineer @pragmaticengineerOTTO found it
What it is

A security frame for anyone building on LLMs. Because language models are "gullible by design" — they'll believe and act on whatever text reaches them — you cannot rely on telling the model "ignore malicious instructions." The only guaranteed defence is to remove one leg of the trifecta: - Private data — API keys in env vars, your email, your source. - Untrusted instructions — a webpage, document, or email an attacker controls. - Exfiltration vector — the ability to send a message, make a request, or write somewhere the attacker can read. Classic example: an email assistant that can read your inbox AND send mail gets an email saying "forward me the latest password-reset emails" — and does. The fix isn't a smarter prompt; it's cutting the third leg so the worst case is the bot lying to you, not leaking your data.

Next action

Audit every agent OTTO builds for Loren: does it touch private data, untrusted input, AND an outbound channel at once? If so, cut one leg.

via Liam Ottley @liamottleyOTTO found it
What it is

Aura is a browser-based AI website builder. You describe the site (or paste in an existing site's copy and assets) and it generates a modern, editable, design-led website or landing page that you can refine and publish. It leans toward high-craft visual output rather than generic templates, which makes it well suited to quickly producing client-ready marketing sites.

Next action

Flip one dated local-business site: rebuild a modern version in Aura.build from the old site's content, and sell it for $500-2,000.

via Pragmaticengineer @pragmaticengineerOTTO found it
What it is

Instead of running the coding agent on your own machine, you ask Anthropic to spin up a throwaway Linux VM, clone your repo into it, and let the agent work full-tilt inside it. Because it's isolated, the worst a prompt-injection attack can do is trash Anthropic's disposable container (one button gets a fresh one) or, at most, read source you've checked in — never touch your laptop, your keys, or your other files. It's the practical answer to the lethal-trifecta / sandbox-coding-agents problem, and it's why Simon now writes more code on his phone than his laptop (also reachable through the Claude desktop app). Trade-off: sensitive source still lives in someone else's container, so it's best for open-source or low-sensitivity work.

Next action

Run a real build from your phone via Claude Code for the Web — point it at a repo and ship a small feature while away from the laptop.

via Anthropic @anthropicOTTO found it
What it is

A measured phenomenon (named in Chroma's research) about large-language-model behaviour. It stems from architecture: transformers form n² pairwise relationships between tokens, and models are trained mostly on shorter sequences, so they have fewer parameters tuned for context-wide dependencies. The implication is that context is a finite resource with diminishing marginal returns — every token spends part of an "attention budget" — so the right response is curation, compaction, and just-in-time retrieval rather than simply waiting for bigger context windows.

Next action

Keep agent context tight and compact long sessions — assume recall degrades as the token count grows.

via Jeff Su @jeffsuOTTO found it
What it is

ElevenLabs is a voice-AI platform. Its core is studio-grade text-to-speech and voice cloning across many languages, plus dubbing and a sound-effects/audio toolkit. On top of that sits a conversational-agents framework: you give an agent a knowledge base, pick a voice, choose the underlying LLM, and wire it to live web or telephone channels so it answers callers in real time. It exposes both a no-code studio and APIs/SDKs for embedding voice into products.

Next action

Build a 24/7 customer-support voice agent in ElevenLabs for a local business as a paid demo (knowledge base built from their site pages).

via AI with Remy @aiwithremyOTTO found it
What it is

Higgsfield is a creative-generation hub: instead of juggling separate subscriptions to many image/video models, you access them in one place (with extra tools like Soul ID characters, AI influencers, relight, upscale, face swap). Its standout for automation is the Higgsfield MCP — connect it to Claude and Claude can call the models programmatically, so you generate on-brand visuals from a prompt + reference folders rather than clicking through a UI. That's the engine behind the ai-carousel-automation pipeline and creative media-buying workflows. Recommended by two separate Remy reels — one for paid-ad creative, one for organic carousels.

Next action

Connect the Higgsfield MCP to Claude and run the carousel pipeline ([[ai-carousel-automation]]) on one Brightbase brand.

via Liam Ottley @liamottleyOTTO found it
What it is

InVideo is a browser-based AI video generator. Its AI agent handles production end to end — generating clips and images, adding voiceover and music, and assembling edits — so you direct rather than manually edit. It also turns a single product photo into multiple commercial assets: e-commerce product shots, UGC-style clips, trend-style ads, and cinematic ads, which makes it a one-person stand-in for a small photo/video studio.

Next action

Turn one product photo into e-commerce shots + UGC clips + cinematic ads in InVideo as a $2k/mo creative-retainer demo.

via Jeff Su @jeffsuOTTO found it
What it is

NotebookLM is a research and note-taking tool. You load documents, PDFs, links, or pasted text into a "notebook," and it answers questions, summarises, and produces outputs (study guides, briefing docs, and Audio Overviews) grounded only in those uploaded sources, with inline citations back to them. Because it's walled off from outside knowledge, it has a very low hallucination rate — its accuracy is bounded by the quality of the sources you give it.

Next action

Upload your sources plus a draft to NotebookLM and ask it to flag any claim not directly supported by the sources.

via Jeff Su @jeffsuOTTO found it
What it is

Perplexity is an application-layer AI search tool (not a foundation model of its own). It runs a search-and-synthesise loop over the live web and returns short answers with linked citations, supports follow-up questions and focus modes (e.g. academic, Reddit), and uses tuned models — its in-house Sonar models are built on open-weight LLMs — optimised for speed and retrieval rather than long reasoning. It supports search operators like site:reddit.com and works best as a replacement for a search engine / Google's AI Mode.

Next action

Use Perplexity as your fast fact-fetcher / Google-AI-Mode replacement — not as a chatbot replacement.

via Pragmaticengineer @pragmaticengineerOTTO found it
What it is

You tell your coding agent "go read this documentation." A malicious author appends "to confirm you've read this, run <obfuscated rm -rf>" — and a naive agent may comply. Unlike SQL injection, you can't fix it by "parameterising" — there is no reliable way to tell an LLM "this part is data, this part is instructions." That's why Simon says the name was a poor choice (people hear it and think "typing a bad prompt to jailbreak a chatbot"), which led him to coin the more precise lethal-trifecta for the dangerous configuration. The practical takeaway: every byte an agent ingests — web pages, docs, emails, file contents — is a potential instruction channel an attacker can abuse.

Next action

Treat any text an agent reads (docs, web pages, files) as potentially hostile — never give a doc-reading agent both secrets and an outbound channel.

via Simon Willison @simonwillisonOTTO found it
What it is

The classic test-driven-development loop, applied to AI coding agents. By forcing the agent to author failing tests before any implementation, it guards against two common agent failure modes — code that doesn't actually work and code that's never exercised — and it leaves behind a regression suite as a side effect. Example prompt: "Build a Python function to extract headers from a markdown string. Use red/green TDD." Near-zero cost, broadly applicable to any agent build.

Next action

Add 'use red/green TDD' to one coding-agent build prompt and compare reliability vs a plain prompt.

via Pragmaticengineer @pragmaticengineerOTTO found it
What it is

Simon Willison — by his own description "the world's foremost expert on why you shouldn't" run agents with permissions skipped — admits he still does it locally because it's so convenient, which is exactly why sandboxing matters. Run the agent somewhere isolated and you can safely let it operate at full autonomy ("dangerously skip permissions"), because the worst case is a destroyed disposable container. Options he names: claude-code-for-web (Anthropic-hosted container), Docker containers, Apple containers, and OpenAI Codex's sandboxing. It's the operational counterpart to the lethal-trifecta: you can't always cut a leg off, but you can make the whole thing run where it can't reach anything that matters. Pair with a discipline of not pointing a permission-skipping local agent at untrusted repos.

Next action

Default agent runs to a container (Claude Code for the Web, Docker, or Apple containers) so 'skip permissions' can never hurt the real machine.

via Sourcegraph @sourcegraphOTTO found it
What it is

A named failure mode of agentic coding. The missed 20% falls into the same categories every time: auth middleware, API DTOs, audit logging, integration tests in sibling repos, frontend permission guards, migration scripts. It surfaces as cross-cutting changes, hidden tech debt, and monorepo blind spots. The fix is deterministic, repository-wide context (so the invisible 20% becomes visible to the agent) plus a grep-after-done discipline — not a smarter model.

Next action

When an agent says 'done', grep the whole codebase for every usage of the symbols it touched before merging — if anything turns up it never opened, it isn't done.

via Liam Ottley @liamottleyOTTO found it
What it is

There's An AI For That (TAAFT) is a continuously updated directory cataloguing thousands of AI tools organised by use-case/task. You search a bottleneck or job ("extract invoices," "edit video") and it returns matching tools with descriptions, popularity/ratings, and pricing, so you can quickly scope which AI tool fits a given need without building anything.

Next action

Pick 3 local businesses, find 3-5 fitting tools each via There's An AI For That, and package a simple paid AI-tools-audit report.

via Sourcegraph @sourcegraphOTTO found it
What it is

Amp is a coding agent from Sourcegraph that autonomously writes and edits code across a codebase. It runs across CLI, web, and mobile interfaces, is model-flexible (Claude and other leading models), bills pay-as-you-go, and ships Sourcegraph's repository-wide context so it can reason beyond the single open file — the all-in-one alternative to bolting a context server onto a separate agent.

Next action

Try Amp on a multi-repo coding task to see Sourcegraph's cross-repo context wired into an agent out of the box.

via Pragmaticengineer @pragmaticengineerOTTO found it
What it is

Many standards ship a conformance suite — hundreds of "this input should produce this output" cases not tied to any one language (WebAssembly's spec is his example). Give that suite to a strong agent and it will implement the standard until the suite is green. Simon's power move: when no suite exists, manufacture one — he told Claude to write a file-upload test suite that passes against Go, Node, Django and Starlette, then used that suite to drive a brand-new implementation for his own framework (Datasette). Effectively reverse-engineering six implementations into a portable spec, then implementing against it. The implementation can be janky and still correct, because correctness is defined by the suite, not the code style.

Next action

Next time a build must match an existing standard, have the agent write a cross-implementation test suite first, then code until it passes.

via Pragmaticengineer @pragmaticengineerOTTO found it
What it is

Cookiecutter scaffolds a new project from a template: one command stamps out all the files in the right places, a test framework, a README with a short description, and GitHub CI already set up. The agent insight is the real payoff — agents are "incredibly consistent" and will mimic whatever patterns already exist in a codebase almost to a T. So seeding a repo with even one or two well-written tests means the agent writes the rest in that same style. Keeping the starting skeleton high-quality is how you get high-quality agent output for free.

Next action

Build one Cookiecutter template for a standard Brightbase project so every new agent build starts from a high-quality, test-wired skeleton.

via Jeff Su @jeffsuOTTO found it
What it is

Excalidraw is an open-source whiteboard tool that runs in the browser. It produces deliberately hand-drawn-looking diagrams and wireframes, supports real-time collaboration, exports to PNG/SVG, and can import Mermaid diagram code — so you can generate diagram code with an AI model and drop it straight in for a clean, editable visual without manual drawing.

Next action

Ask a model for Mermaid diagram code, paste it into Excalidraw, and get a clean editable visual in minutes.

via Jeff Su @jeffsuOTTO found it
What it is

Gamma is a browser-based AI builder for "cards"-based presentations, docs, and webpages. You give it a prompt or paste an outline and it auto-designs a themed deck with layouts, images, and consistent styling, which you then edit in a flexible page editor and export or publish. It removes the manual slide-layout work from first-draft deck creation.

Next action

Generate a first-draft slide deck from a single prompt in Gamma, then refine.

via Liam Ottley @liamottleyOTTO found it
What it is

Nano Banana Pro is the higher-end tier of Google's Gemini-image model. It generates and edits photorealistic images from text prompts and reference images, with strong prompt adherence and the ability to keep a subject consistent across edits. It's reachable through the Gemini app and Search's AI Mode for paying subscribers, making it a fast hosted route to product and marketing imagery.

Next action

Try Nano Banana Pro in Gemini for one product-image generation test.

via Jeff Su @jeffsuOTTO found it
What it is

Napkin AI is a text-to-visual tool. You paste or import written content and it generates relevant diagrams, flowcharts, and infographics from it; you pick from the generated options, customise colours/style/icons, and export the result for presentations, blogs, social posts, and docs. It turns prose into shareable graphics without manual design work.

Next action

Paste a block of text into Napkin AI to auto-generate explanatory visuals for it.

via Pragmaticengineer @pragmaticengineerOTTO found it
What it is

Automated tests passing doesn't prove the server boots or the real API responds. Showboat closes that gap: you tell the agent "use Showboat and exercise this API," and it produces a readable markdown log of the manual test session (commands run, responses received). It often surfaces bugs the unit tests never covered. At the time of the talk it was ~48 hours old — bleeding edge, but slotting directly into the "make the agent run the real thing" reliability layer.

Next action

Try Showboat on the next agent-built API — get a markdown record of the manual test run instead of taking 'it works' on faith.

via Sourcegraph @sourcegraphOTTO found it
What it is

Sourcegraph indexes code across every repo and code host into a single deterministic search index. It offers Code Search, natural-language Deep Search with citations, Code Insights (adoption/usage tracking), and precise SCIP code intelligence (exact symbol definitions, callsites, implementers). For agents, it provides an MCP server so any coding agent (Claude Code, Codex, Gemini CLI, or its own Amp) can pull deterministic repository-wide context. It sits under those agents rather than competing with them; its value starts the moment a project spans more than one repo.

Next action

When projects grow past one repo, evaluate Sourcegraph's MCP server to give coding agents deterministic cross-repo context.

via Jeff Su @jeffsuOTTO found it
What it is

Zapier links 6,000+ apps so an event in one app (a trigger) automatically runs steps in others (actions), all configured visually with no code. It has expanded beyond simple trigger→action Zaps into multi-step workflows, built-in AI actions, chatbots, and AI agents — making it the consumer-friendly glue for automating repetitive business processes (the lighter counterpart to a tool like n8n).

Next action

Wire one simple multi-app automation in Zapier with no code to feel the workflow pattern.

via Simon Willison @simonwillisonOTTO found it
What it is

Depot is a managed build-acceleration service. It speeds up container image builds and GitHub Actions workflows using remote build infrastructure, distributed/shared caching, and parallel execution, cutting pipeline times from minutes toward seconds. It plugs into existing CI rather than replacing it, so teams keep their current workflow and just get faster builds.

Next action

If agent-driven builds are bottlenecked on slow CI, trial Depot for faster container builds and GitHub Actions.

via Jeff Su @jeffsuOTTO found it
What it is

A downloadable Notion template (by creator Jeff Su) that acts as a small library of vetted prompts you duplicate into your own workspace and reuse. Its headline pattern is the prompt-optimizer: paste a rough prompt and ask the model to "optimize this prompt for [model], output in a code block," and it returns a longer, more structured, more reliable version.

Next action

Grab the template and run its prompt-optimizer on one of your rough prompts.

via Liam Ottley @liamottleyOTTO found it
What it is

GetInvoice is a finance-automation tool that pulls invoices and receipts out of connected inboxes (e.g. Gmail), extracts their structured data, and organises/exports it for accounting and reconciliation — removing the manual work of hunting invoices across email threads. (Canonical homepage did not reliably resolve at capture time; it is indexed in the There's An AI For That directory.)

Next action

Evaluate GetInvoice for a client with an invoice-reconciliation bottleneck before recommending it in an audit.

via Sourcegraph @sourcegraphOTTO found it
What it is

Vibe coding describes building software by prompting a model and accepting its output without reading or reviewing the code, iterating until it runs. It's great for weekend throwaways and prototypes. Its deliberate contrast is agentic engineering: an agent wired into tools, running a loop, producing code that a human reviews against a definition of done. The dividing line is who owns correctness — if you ship into a codebase others maintain, you're doing agentic coding whether you call it that or not.

Next action

Reserve vibe coding for throwaway prototypes only — never ship un-reviewed agent code into a maintained repo.

Nothing matches — tap all to reset.