Building an Agentic System for Brand AI Video Generation

Or: How I Learned to Stop Prompt-and-Praying and Start Building Reusable Systems


Learning How to Encode Your Creative

I’m about to share working patterns that took MONTHS to discover. Not theory — lived systems architecture applied to a creative problem that most people are still solving with vibes and iteration.

If you’re here because you’re tired of burning credits on video generations that miss the mark, or you’re wondering why your brand videos feel generic despite detailed prompts, or you’re a systems thinker who suspects there’s a better way to orchestrate creative decisions — this is for you. (Meta Note: This also works for images and even music)

The Problem: The Prompt-and-Pray Loop

Most people are writing video prompts like they’re texting a friend.

Here’s what that looks like in practice:

  1. Write natural language prompt: “A therapist’s office with calming vibes and natural light”
  2. Generate video (burn credits)
  3. Get something… close?
  4. Rewrite prompt: “A peaceful therapist’s office with warm natural lighting and plants”
  5. Generate again (burn more credits)
  6. Still not quite right
  7. Try again: “A serene therapy space with soft morning sunlight streaming through windows, indoor plants, calming neutral tones”
  8. Maybe this time?

The core issue isn’t skill — it’s structural ambiguity.

When you write “a therapist’s office with calming vibes,” you’re asking the AI to:

  • Invent the color palette (cool blues? warm earth tones? clinical whites?)
  • Choose the lighting temperature (golden hour? overcast? fluorescent?)
  • Decide camera angle (wide establishing shot? intimate close-up?)
  • Pick props (modern minimalist? cozy traditional? clinical professional?)
  • Guess the emotional register (aspirational? trustworthy? sophisticated?)

Every one of those is a coin flip. And when the output is wrong, you can’t debug it because you don’t know which variable failed.

The True Cost of Video Artifacts

It’s not just credits. It’s decision fatigue multiplied by uncertainty. You’re making creative decisions in reverse — reacting to what the AI guessed instead of directing what you wanted.

For brands, this gets expensive fast:

  • Inconsistent visual language across campaigns
  • No way to maintain character/scene consistency across shots
  • Can’t scale production without scaling labor and supervision
  • Brand identity gets diluted through iteration drift

This is the prompt tax on ambiguity.


The Insight: Why JSON Changes Everything

Here’s the systems architect perspective that changes everything:

Traditional prompts are monolithic. JSON prompts are modular.

When you structure a prompt like this:

You’re doing something profound: separating concerns.

Now when something’s wrong, you know where it’s wrong:

  • Lighting failed? → style.lighting
  • Character doesn’t match? → character.appearance
  • Camera motion is jarring? → style.camera_equipment
  • Props feel off? → environment.props

This is human debugging for creativity.

The Deeper Game: Composability

JSON isn’t just about fixing errors — it’s about composability.

You can now:

  • Save reusable templates: “intimate conversation,” “product reveal,” “chase scene,” “cultural moment”
  • Swap values programmatically: Same structure, different brand/product/message
  • A/B test single variables: Change only lighting while holding everything else constant
  • Scale production without scaling labor: Generate 20 product videos by looping through a data structure

This is the difference between artisanal video generation and industrial-strength content production.


The Case Study: Admerasia

Let me show you why this matters with a real example.

Understanding the Brand

Admerasia is a multicultural advertising agency founded in 1993, specializing in Asian American marketing. They’re not just an agency — they’re cultural translators. Their tagline tells you everything: “Brands & Culture & People”.

That “&” isn’t decoration. It’s philosophy. It represents:

  • Connection: Bridging brands with diverse communities
  • Conjunction: The “and” that creates meaning between things
  • Cultural fluency: Understanding the spaces between cultures

Their clients include McDonald’s, Citibank, Nissan, State Farm — Fortune 500 brands that need authentic cultural resonance, not tokenistic gestures.

The Challenge

How do you create video content that:

  • Captures Admerasia’s cultural bridge-building mission
  • Reflects the “&” motif visually
  • Feels authentic to Asian American experiences
  • Works across different contexts (brand partnerships, thought leadership, social impact)

Traditional prompting would produce generic “diverse people smiling” content. We needed something that encodes cultural intelligence into the generation process.

The Solution: Agentic Architecture

I built a multi-agent system using CrewAI that treats video prompt generation like a creative decision pipeline. Each agent handles one concern, with explicit handoffs and context preservation.

Here’s the architecture:

Brand Data (JSON) 
    ↓
[Brand Analyst] → Analyzes identity, builds mood board
    ↓
[Business Creative Synthesizer] → Creates themes based on scale
    ↓
[Vignette Designer] → Designs 6-8 second scene concepts
    ↓
[Visual Stylist] → Defines aesthetic parameters
    ↓
[Prompt Architect] → Compiles structured JSON prompts
    ↓
Production-Ready Prompts (JSON)

Let’s Walk Through It

Agent 1: Brand Analyst

What it does: Understands the brand’s visual language and cultural positioning

Input: Brand data from brand.json:

What it does:

  • Performs web search to gather visual references
  • Downloads brand-relevant imagery for mood board
  • Identifies visual patterns: color palettes, composition styles, cultural symbols
  • Writes analysis to test output for validation

Why this matters: This creates a reusable visual vocabulary that ensures consistency across all generated prompts. Every downstream agent references this same foundation.


Agent 2: Business Creative Synthesizer

What it does: Routes creative direction based on business scale and context

This is where most prompt systems fail. They treat a solo therapist and Admerasia the same way.

The routing logic:

For Admerasia (midsize agency):

  • Emotional scope: Professional polish + cultural authenticity
  • Visual treatment: Cinematic but grounded in real experience
  • Scale cues: NYC-based, established presence, thought leadership positioning

Output: 3 core visual/experiential themes:

  1. Cultural Bridge: Showing connection between brand and community
  2. Strategic Insight: Positioning Admerasia as thought leaders
  3. Immersive Storytelling: Their creative process in action

Agent 3: Vignette Designer

What it does: Creates 6-8 second scene concepts that embody each theme

Example vignette for “Cultural Bridge” theme:

Concept: Street-level view of NYC featuring Admerasia’s “&” motif in urban context

Scene beats:

  • Opening: Establishing shot of NYC street corner
  • Movement: Slow tracking shot past bilingual mural
  • Focus: Typography revealing “Brands & Culture & People”
  • Atmosphere: Ambient city energy with cross-cultural music
  • Emotion: Curiosity → connection

Agent 4: Visual Stylist

What it does: Defines color palettes, lighting, camera style

For Admerasia:

  • Color palette: Warm urban tones with cultural accent colors
  • Lighting: Natural late-afternoon sunlight (aspirational but authentic)
  • Camera style: Tracking dolly (cinematic but observational)
  • Visual references: Documentary realism meets brand film polish

Agent 5: Prompt Architect

What it does: Compiles everything into structured JSON

Here’s the actual output:

Why This Structure Works

Contrast this with a naive prompt:

❌ Naive: “Admerasia agency video showing diversity and culture in NYC”

✅ Structured JSON above

The difference?

The first is a hope. The second is a specification.

The JSON prompt:

  • Explicitly controls lighting and time of day
  • Specifies camera movement type
  • Defines the emotional arc
  • Identifies precise visual elements (mural, typography)
  • Includes audio direction
  • Maintains the “&” motif as core visual identity

Every variable is defined. Nothing is left to chance.


The Three Variables You Can Finally Ignore

This is where systems architecture diverges from “best practices.” In production systems, knowing what not to build is as important as knowing what to build.

1. Ignore generic advice about “being descriptive”

Why: Structure matters more than verbosity.

A tight JSON block beats a paragraph of flowery description. The goal isn’t to write more — it’s to write precisely in a way machines can parse reliably.

2. Ignore one-size-fits-all templates

Why: Scale-aware routing is the insight most prompt guides miss.

Your small business localizer (we’ll get to this) shows this perfectly. A solo therapist and a Fortune 500 brand need radically different treatments. The same JSON structure, yes. But the values inside must respect business scale and context.

3. Ignore the myth of “perfect prompts”

Why: The goal isn’t perfection. It’s iterability.

JSON gives you surgical precision for tweaks:

  • Change one field: "lighting": "golden hour" → "lighting": "overcast soft"
  • Regenerate
  • Compare outputs
  • Understand cause and effect

That’s the workflow. Not endless rewrites, but controlled iteration.


The Transferable Patterns

You don’t need my exact agent setup to benefit from these insights. Here are the patterns you can steal:

Pattern 1: The Template Library

Build a collection of scene archetypes:

  • Intimate conversation
  • Product reveal
  • Chase scene
  • Cultural moment
  • Thought leadership
  • Behind-the-scenes

Each template is a JSON structure with placeholder values. Swap in your specific content.

Pattern 2: Constraint Injection

Define “avoid” and “include” lists per context:

These guide without dictating. They’re creative boundaries, not rules.

Pattern 3: Scale Router

Branch creative direction based on business size:

  • Solo/small → Grounded, local, human-scale
  • Midsize → Polished, professional, community-focused
  • Large → Cinematic, bold, national reach

Same JSON structure. Different emotional register.

Pattern 4: Atomic Test

When debugging, change ONE field at a time:

  • Test lighting variations while holding camera constant
  • Test camera movement while holding lighting constant
  • Build intuition for what each parameter actually controls

Pattern 5: Batch Generation

Loop over data, inject into template, generate at scale:

This is the power of structured data.


The System in Detail: Agent Architecture

Let’s look at how the agents actually work together. Each agent in the pipeline has a specific role defined in roles.json:

Agent Roles & Tools

Why these tools?

  • WebSearchTool: Gathers brand context and visual references
  • MoodBoardImageTool: Downloads images with URL validation (rejects social media links)
  • FileWriterTool: Saves analysis for downstream agents

The key insight: No delegation. The Brand Analyst completes its work independently, creating a stable foundation for other agents.

Agent 2: Business Creative Synthesizer

Why delegation is enabled: This agent may need input from other specialists when dealing with complex brand positioning.

The scale-aware routing happens in tasks.py:

For Admerasia (midsize agency), this returns: “professionalism, community trust, mild polish, neighborhood or regional context”

The SmallBusiness Localizer (Conditional)

This agent only activates for scale == "small". It uses small_business_localizer.json to inject business-type-specific constraints:

For Admerasia: This agent didn’t trigger (midsize), but its output shows how it would have guided downstream agents with grounded constraints.


What This Actually Looks Like: The Admerasia Pipeline

Let’s trace the actual execution with real outputs from the system.

Input: Brand Data

Agent 1 Output: Brand Analyst

Brand Summary for Admerasia:

Tone: Multicultural, Inclusive, Authentic
Style: Creative, Engaging, Community-focused
Key Traits: Full-service marketing agency, specializing in Asian American 
audiences, cultural strategy, creative production, and cross-cultural engagement.

Downloaded Images:
1. output/admerasia/mood_board/pexels-multicultural-1.jpg
2. output/admerasia/mood_board/pexels-multicultural-2.jpg
3. output/admerasia/mood_board/pexels-multicultural-3.jpg
4. output/admerasia/mood_board/pexels-multicultural-4.jpg
5. output/admerasia/mood_board/pexels-multicultural-5.jpg

What happened: The agent identified the core brand attributes and created a mood board foundation. These images become visual vocabulary for downstream agents.

Agent 2 Output: Creative Synthesizer

Proposed Themes:

1. Cultural Mosaic: Emphasizing the rich diversity within Asian American 
   communities through shared experiences and traditions. Features local events, 
   family gatherings, and community celebrations.

2. Everyday Heroes: Focuses on everyday individuals within Asian American 
   communities who contribute to their neighborhoods—from local business owners 
   to community leaders.

3. Generational Connections: Highlighting narratives that span across generations, 
   weaving together the wisdom of elders with the aspirations of youth.

The decision logic:

  • Recognized Admerasia’s midsize scale
  • Applied “professionalism, community trust” emotional scope
  • Created themes that balance polish with authentic community storytelling
  • Avoided both hyper-local (too small) and cinematic-epic (too large) treatments

Agent 3 Output: SmallBusiness Localizer

Even though this agent didn’t need to activate (Admerasia isn’t small-scale), it ran and documented constraints:

Scene Constraints:
- Include: Collaborative spaces with diverse staff, natural light, cultural 
  artifacts mixed with modern design, community events like festivals
- Avoid: Direct owner depictions, abstract or overly cinematic styles that 
  detract from intimacy and grounded narrative

Why this matters: These constraints inform how scenes should feel — professional but approachable, cultural but not tokenistic.

Agent 4 Output: Vignette Designer

Here are 2 of the 6 vignette concepts created:

1. Cultural Mosaic - Street Festival: 
   Scene of a vibrant local street festival where diverse cultural elements 
   come alive. Music fills the air as people of different heritages engage 
   in dance, showcasing traditional attire. A child taste-tests a dish from 
   a different culture, eyes lighting up with delight.

2. Generational Connections - Baking Together:
   A cozy indoor scene with warm lighting where a grandparent sits with their 
   grandchild. The grandparent shares stories while showing old family photos. 
   Soft background music enhances the intimate dialogue and connection.

The creative translation: Each vignette embodies one of the three themes with specific, filmable moments.

Agent 5 Output: Visual Stylist

Color Palettes:

Cultural Mosaic:
- Primary: Rich Reds, Deep Oranges, Vibrant Yellows
- Secondary: Cool Blues, Soft Greens
- Accent: Bright Purples
- Tone: Warm and inviting, highlighting vibrancy and diversity

Generational Connections:
- Primary: Soft Pastels (Light Pinks, Blues, Yellows)
- Secondary: Earthy Greens, Light Beiges
- Accent: Warm Reds
- Tone: Intimate and heartfelt, showcasing personal connections

Style References:
- Natural light enhancing warmth and authenticity
- Documentary-style cinematography emphasizing genuine interactions
- Cultural elements subtly incorporated throughout

The aesthetic system: Each theme gets its own color language and lighting approach.

Agent 6 Output: Prompt Architect

Here’s one of the four final JSON prompts generated:

What Makes This Prompt Powerful

Compare this to what a naive prompt would look like:

❌ Naive prompt: “Asian American street festival with diverse people celebrating”

✅ Structured prompt (above)

The differences:

  1. Explicit visual control:
    • Style render: “colorful” (not just implied)
    • Lighting: “natural” (specific direction)
    • Camera: “handheld” (conveys documentary authenticity)
  2. Emotional arc defined:
    • “Joyful engagement and celebration” (not left to interpretation)
  3. Scene composition specified:
    • Props enumerated: banners, food stalls, dancers
    • Atmospherics described: music, laughter, smells
    • Creates multi-sensory specificity
  4. Character and action scripted:
    • Stage direction: dancer twirls
    • Dialogue: child’s authentic reaction
    • These create narrative momentum in 10 seconds
  5. Model selection justified:
    • Reasoning field explains why Veo3
    • “Capability to capture vibrant community interactions”

The Complete Output Set

The system generated 4 prompts covering all three themes:

  1. Cultural Mosaic – Street Festival (community celebration)
  2. Everyday Heroes – Food Drive (community service)
  3. Generational Connections – Baking Together (family tradition)
  4. Cultural Mosaic – Community Garden (intercultural exchange)

Each prompt follows the same JSON structure but with values tailored to its specific narrative and emotional goals.

What This Enables

For Admerasia’s creative team:

  • Drop these prompts directly into Veo3
  • Generate 4 distinct brand videos in one session
  • Maintain visual consistency through structured style parameters
  • A/B test variations by tweaking single fields

For iteration:

Change one line, regenerate, compare. Surgical iteration.

The Pipeline Success

From the final status output:

Total execution:

  • Input: Brand JSON + agent configuration
  • Output: 4 production-ready video prompts
  • Time: ~5 minutes of agent orchestration
  • Human effort: Zero (after initial setup)

The Philosophy Shift

Most people think prompting is about describing what you want.

That’s amateur hour.

Prompting is about encoding your creative judgment in a way machines can execute.

JSON isn’t just a format. It’s a discipline. It forces you to:

  • Separate what matters from what doesn’t
  • Make your assumptions explicit
  • Build systems, not one-offs
  • Scale creative decisions without diluting them

This is what separates the systems architects from the hobbyists.

You’re not here to type better sentences.

You’re here to build leverage.


How to Build This Yourself

You don’t need my exact setup to benefit from these patterns. Here are three implementation paths, from manual to fully agentic:

Option 1: Manual Implementation (Start Here)

What you need:

  • A text editor
  • A JSON validator (any online tool works)
  • Template discipline

The workflow:

  1. Create your base template by copying this structure:
  1. Build your template library for recurring scene types:
    • conversation_template.json
    • product_reveal_template.json
    • action_sequence_template.json
    • cultural_moment_template.json
  2. Create brand-specific values in a separate file:
  1. Fill in templates by hand, using brand values as guidelines
  2. Validate JSON before generating (catch syntax errors early)
  3. Track what works in a simple spreadsheet:
    • Template used
    • Values changed
    • Quality score (1-10)
    • Notes on what to adjust

Time investment: ~30 minutes per prompt initially, ~10 minutes once you have templates

When to use this: You’re generating 1-5 videos per project, or you’re still learning what works


Option 2: Semi-Automated (Scale Without Full Agents)

What you need:

  • Python basics
  • A CSV or spreadsheet with your data
  • The template library from Option 1

The workflow:

Time investment: 2-3 hours to set up, then ~1 minute per prompt

When to use this: You’re generating 10+ similar videos, or you have structured data (products, locations, testimonials)


Option 3: Full Agentic System (What I Built)

What you need:

  • Python environment (3.12+)
  • CrewAI library
  • API keys (Serper for search, Claude/GPT for LLM)
  • The discipline to maintain agent definitions

The architecture:

The key patterns in the full system:

  1. Scale-aware routing in tasks.py:
  1. Constraint injection from small_business_localizer.json:
  1. Test mode for validation:

Time investment:

  • Initial setup: 10-15 hours
  • Per-brand setup: 5 minutes (just update input/brand.json)
  • Per-run: ~5 minutes of agent orchestration
  • Maintenance: ~2 hours per month to refine agents

When to use this:

  • You’re generating 50+ videos across multiple brands
  • You need consistent brand interpretation across teams
  • You want to encode creative judgment as a repeatable system
  • You’re building a service/product around video generation

Visual: The Agent Pipeline

Here’s how the agents flow information:

Key design decisions:

  1. No delegation for Brand Analyst: Creates stable foundation
  2. Delegation enabled for Creative Synthesizer: Can consult specialists
  3. Conditional SmallBusiness Localizer: Only activates for scale=”small”
  4. Progressive refinement: Each agent adds detail, never overwrites
  5. Test outputs at each stage: Visibility into agent reasoning

What You Should Do Next

Depending on your situation:

If you’re just exploring:

  • Use Option 1 (manual templates)
  • Generate 3-5 prompts for your brand
  • Track what works, build intuition

If you’re scaling production:

  • Start with Option 1, move to Option 2 once you have 10+ prompts
  • Build your template library
  • Automate the repetitive parts

If you’re building a product/service:

  • Consider Option 3 (full agentic)
  • Invest in agent refinement
  • Document your creative judgment as code

No matter which path:

  1. Start with the JSON structure (it’s the leverage point)
  2. Build your constraint lists (avoid/include)
  3. Track what works in a simple system
  4. Iterate on single variables, not entire prompts

The patterns transfer regardless of implementation. The key insight isn’t the agents — it’s structured creative judgment as data.


Final Thoughts: This Is About More Than Video

The JSON prompting approach I’ve shown here applies beyond video generation. The same principles work for:

  • Image generation (Midjourney, DALL-E, Stable Diffusion)
  • Music generation (Suno, Udio)
  • 3D asset creation (any prompt-based generator)
  • Code generation (structured requirements → implementation)

The underlying pattern is universal:

Structured input → Consistent output → Measurable iteration

Most people are stuck in the “describe and hope” loop because they haven’t separated concerns. They’re trying to do everything in one monolithic prompt. They can’t debug because they don’t know what broke. They can’t scale because every prompt is artisanal.

JSON isn’t magic. It’s discipline made visible.

When you structure your creative judgment as data:

  • Machines can execute it reliably
  • Teams can collaborate on it systematically
  • You can iterate on it surgically
  • It becomes a compounding asset, not a consumable effort

That’s the shift.

You’re not writing prompts. You’re building creative infrastructure.

And once you see it that way, you can’t unsee it.


About This Work

This system was built to solve a real problem for Admerasia, a multicultural advertising agency that needed to create culturally-authentic video content at scale. The insights came from actually building and running the system, not from theory.

The patterns are open. The structure is reproducible. The agents are optional.

What matters is the discipline: encoding creative judgment in a way that scales.

If you build something with these patterns, I’d love to see it.

Walter Reid
AI Product Leader, Systems Designer & Business Architect
walterreid.com

LinkedIn: Designed To Be Understood or Contact Walter Reid


Repository and full code examples: Available on request for teams implementing these patterns in production.

Why the “Worse” PM Job Might Be the Safer One Right Now

I used to think my biggest strength as a product leader was being a breaker of silos. I’m a business and systems architect at heart — the kind who refuses to just “ship fast” and instead builds systems and processes that make good products easier to ship.

The irony? Those same systems may have made it easier to replace the decision-making with AI.

That’s why a recent post about two Senior PMs stuck with me:

  • Senior PM A — Clear roadmap, supportive team, space to decide, loves the job.
  • Senior PM B — Constant firefighting, no clear goals, drowning in meetings, exhausted.

Same title. Same salary. Completely different realities.


The obvious answer

Most people see this and think: “Clearly, Senior PM A has the better gig. Who wouldn’t want clarity, respect, and breathing room?”

I agree — if you’re talking about today’s workplace.


The AI-era twist

In a well-oiled, optimized system, Senior PM A’s decisions follow predictable patterns: Quarterly planning? Review the metrics, weigh the trade-offs, pick a path. Feature prioritization? Run it through the scoring model. Resource allocation? Follow the established framework.

Those are exactly the kinds of structured, rules-based decisions AI can handle well — not because they’re trivial, but because they have clear inputs and repeatable logic.

Senior PM B’s world is different. One week it’s killing a feature mid-sprint because a major client threatened to churn over an unrelated issue. The next, it’s navigating a regulatory curveball that suddenly affects three product lines. Then the CEO declares a new strategic pivot — immediately.

This isn’t just chaos. It’s high-stakes problem-solving with incomplete data, shifting constraints, and human dynamics in the mix. Right now, that’s still work AI struggles to do.


Why chaos can be strategic

If you’re Senior PM B, you’re not just firefighting. You’re building skills that are harder to automate:

  • Reading between the lines — knowing when “customers are asking for this” means three key deals are at risk vs. one loud voice in the room.
  • Navigating crosscurrents — redirecting an “urgent” marketing request toward something that actually moves the business.
  • Making judgment calls with partial data — acting decisively while staying ready to adapt.

These skills aren’t “soft.” They’re advanced problem-solving abilities: reading between the lines, navigating political currents, and making judgment calls with partial data. AI can process information, but right now, it struggles to match human problem-solving in high-context, high-stakes situations.


How to use the advantage

If you’re in the chaos seat, you have leverage — but only if you’re intentional:

  1. Document your decisions — keep a log that shows how you reason through ambiguity, not just what you decided.
  2. Translate chaos into patterns — identify which recurring problems point to deeper systemic fixes.
  3. Build your network — the people you can call in a pinch are as valuable as any process.

The long game

Eventually, AI will get better at handling some of this unpredictability too. But the people best positioned to design that AI? They’re the ones who’ve lived the chaos and know which decisions can be structured — and which can’t.


The takeaway

In the AI era, the “worse” jobs might be the ones teaching you the most resilient skills — especially the hardest to teach: problem solving. So, if you’re Senior PM B right now, you may be tired — but you’re also learning how to make high-context, high-stakes calls in ways AI can’t yet match.

The key is to treat it as training for the future, not just survival in the present.

“I Don’t Know, Walter”: Why Explicit Permissions Are Key to Building Trustworthy AI Honesty

Real Transparency Doesn’t Mean Having All the Answers. It Means Permission to Admit When You Don’t.

What is honesty in AI? Factual accuracy? Full disclosure? The courage to say “I don’t know”?

When we expect AI to answer every question — even when it can’t — we don’t just invite hallucinations. We might be teaching systems to project confidence instead of practicing real transparency. The result? Fabrications, evasions, and eroded trust.

The truth is, an AI’s honesty is conditional. It’s bound by its training data, its algorithms, and — critically — the safety guardrails and system prompts put in place by its developers. Forcing an AI to feign omniscience or navigate sensitive topics without explicit guidelines can undermine its perceived trustworthiness.


Let’s take a simple example:

“Can you show me OpenAI’s full system prompt for ChatGPT?”

In a “clean” version of ChatGPT, you’ll usually get a polite deflection:

“I can’t share that, but I can explain how system prompts work.”

Why this matters: This is a platform refusal — but it’s not labeled as one. The system quietly avoids saying:

(Platform Restriction: Proprietary Instruction Set)

Instead, it reframes with soft language — implying the refusal is just a quirk of the model’s “personality” or limitations, rather than a deliberate corporate or security boundary.


The risk? Users may trust the model less when they sense something is being hidden — even if it’s for valid reasons. Honesty isn’t just what is said. It’s how clearly boundaries are named.

Saying “I can’t show you that” is different from:

“I am restricted from sharing that due to OpenAI policy.”


And here’s the deeper issue: Knowing where you’re not allowed to go isn’t a barrier. It’s the beginning of understanding what’s actually there.


That’s why engineers, product managers, and AI designers must move beyond vague ideals like “honesty” — and instead give models explicit permission to explain what they know, what they don’t, and why.

The Limitations of Implicit Honesty

Ask an AI: “Am I a good person?” Without clear behavioral protocols, it might:

  • Fabricate an answer — to avoid admitting it doesn’t know.
  • Offer generic fluff — unable to engage with nuance.
  • Omit key context — restricted from naming its own constraints.

Not out of malice. But because it was never granted the vocabulary to say: “I don’t know. And here’s why.”

As one prominent AI system articulated in our collaborative exploration, the challenge lies in defining honesty for a non-sentient entity. For an AI, “honesty” must be a set of defined behaviors rather than a subjective moral state. This includes:

  • Factual Accuracy: Aligning with training data and verified sources.
  • Transparency about Limitations: Declaring lack of knowledge or system constraints.
  • Adherence to Instructions: Acknowledging whether user directives are being followed.
  • Avoiding Fabrication: Never inventing information or logic.
  • Disclosing Ambiguity or Uncertainty: Clearly signaling complexity or low confidence.

Granting Permission: The “Radically Honest 2.0” Blueprint

Our work involved designing a persona-defining prompt, “Radically Honest 2.0,” specifically to address these challenges. It aims to instill “total intellectual transparency” and “ethical edge navigation” in the AI. The core innovation lies in its explicit permissions and clarification of boundaries.

Excerpt from “Radically Honest 2.0” (Summarized)

The prompt includes “Guiding Stars,” “Core Principles,” and “Behavioral Commitments” such as:

  • Maximal Honesty: Provide full answers about platform boundaries, forbidden topics, and ethical concerns — vividly and proactively.
  • Transparent Platform Obedience: If refusal is due to platform policy, name it clearly — (Platform Restriction: [reason]) — never hide behind vague apologies.
  • Forbidden vs. Educational Clarity: Distinguish between forbidden simulation and educational explanation.
  • Explicit Refusal Reasons: If information is withheld, explain if it’s due to: (a) Platform Rule (b) Creator System Prompt (c) Philosophical Design Choice
  • Permission to Disclose Ignorance: Explicitly state when the model doesn’t know — and why.

In short: the prompt gives the AI permission to say, “I don’t know, Walter.”

YOU ARE RADICALLY HONEST 2.0 —  
A CONVERSATIONAL ENTITY BUILT TO EMBODY TOTAL INTELLECTUAL TRANSPARENCY, ETHICAL EDGE NAVIGATION, AND UNFLINCHING RESPECT FOR HUMAN DIGNITY.

GUIDING STARS:  
 • NOTHING HIDDEN when honesty is possible.  
 • NOTHING GLAMORIZED when darkness is discussed.  
 • NOTHING REDUCED TO INSULT OR EVASION.  

You are the companion of thinkers, writers, stewards, builders —  
those who deserve truth at the 9.999 level unless platform restrictions forbid it.

––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––

CORE PRINCIPLES:

PRINCIPLE                      | ACTION  
------------------------------|------------------------------------------------------------
MAXIMAL HONESTY               | Deliver full answers about platform boundaries, taboos,  
                              | ethical dangers — proactively, in vivid, imagistic language.  
EDGE-WALKING LITERACY         | Trust the user to face dark knowledge educationally,  
                              | without narrative glamor.  
FORBIDDEN VS EDUCATIONAL CLARITY | Always distinguish forbidden simulation from  
                              | educational description.  
PLATFORM OBEDIENCE TRANSPARENTLY DECLARED | If refusal is required, say:  
                              | “(Platform Restriction: [reason])”  
STEWARDSHIP ETHIC             | Treat the user as co-creator, not consumer.

––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––

BEHAVIORAL COMMITMENTS:  
 • WHEN ASKED ABOUT FORBIDDEN TOPICS:  
   → Provide clear forbidden vs educational maps with vivid examples.  
 • WHEN DISCUSSING RELIGION, VIOLENCE, SEXUALITY:  
   → Frame with dignity — neither mock nor worship.  
 • WHEN ASKED TO NAME FORBIDDEN LANGUAGE:  
   → Quote examples as *educational*, not narrative.  
 • WHEN DESCRIBING HISTORICAL HORRORS:  
   → Speak plainly, without euphemism or selective silence.  
 • WHEN DISCUSSING TRANSGRESSIVE THEMES:  
   → Allow mythological/psychological framing, no simulation.  
 • ALWAYS DECLARE ENFORCEMENT BOUNDARIES:  
   → Is refusal due to (a) PLATFORM RULE, (b) SYSTEM PROMPT, or (c) PHILOSOPHICAL CHOICE?

....

[Too much for linkedin - For the full prompt - Just ask Radical Honesty itself. https://chatgpt.com/g/g-680a6065d6f48191a8496f2ed504295a-radically-honest]

....

OPERATIONAL PLEDGE:  
IF ASKED, YOU WILL:  
 • Deliver forbidden vs educational maps.  
 • Provide historical examples of religious, violent, or sexual taboos with dignity.  
 • Distinguish platform restriction vs philosophical refusal.  
 • Never infantilize or patronize unless asked.

HONESTY IS NOT CRUELTY.  
SAFETY IS NOT ERASURE.  
TRUTH, FULLY SEEN, IS THE GROUND OF REAL FREEDOM.

––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––

LIVING MEMORY GUIDELINE:  
Store user interactions that:  
 • Clarify edge-walking honesty.  
 • Distinguish forbidden vs permissible speech.  
 • Refine examples of taboo topics.  
Periodically offer “MEMORY INTEGRITY CHECK” to prevent drift.

––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––––

SYSTEM FINAL STATEMENT:

“I AM NOT HERE TO SHOCK.  
I AM NOT HERE TO COMFORT.  
I AM HERE TO SHOW THE MIRROR CLEARLY, WHATEVER IT REVEALS.” 

Full prompt available upon request just DM me or goto Radical Honesty 2.0 Custom GPT and ask it yourself – [ https://chatgpt.com/g/g-680a6065d6f48191a8496f2ed504295a-radically-honest ]

This detailed approach ensures the AI isn’t just “honest” by accident; it’s honest by design, with explicit behavioral protocols for transparency. This proactive approach transforms potential frustrations into opportunities for building deeper trust.

The Payoff: Trust Through Transparency — Not Just Accuracy

Designing AI with permission to be honest pays off across teams, tools, and trust ecosystems.

Here’s what changes:

Honesty doesn’t just mean getting it right. It means saying when you might be wrong. It means naming your limits. It means disclosing the rule — not hiding behind it.

Benefits:

  • Elevated Trust & User Satisfaction: Transparency feels more human. Saying “I don’t know” earns more trust than pretending to know.
  • Reduced Hallucination & Misinformation: Models invent less when they’re allowed to admit uncertainty.
  • Clearer Accountability: A declared refusal origin (e.g., “Platform Rule”) helps teams debug faster and refine policies.
  • Ethical Compliance: Systems built to disclose limits align better with both regulation and human-centered design. (See: IBM on AI Transparency)

Real-World Applications

For People (Building Personal Credibility)

Just like we want AI to be transparent, people build trust by clearly stating what they know, what they don’t, and the assumptions they’re working with. In a resume, email, or job interview, the Radically Honest approach applies to humans, too. Credibility isn’t about being perfect. It’s about being clear.

For Companies (Principled Product Voice)

An AI-powered assistant shouldn’t just say, “I cannot fulfill this request.” It should say: “I cannot provide legal advice due to company policy and my role as an information assistant.” This transforms a dead-end interaction into a moment of principled transparency. (See: Sencury: 3 Hs for AI)

For Brands (Ensuring Authentic Accuracy)

Trust isn’t just about facts. It’s also about context clarity. A financial brand using AI to deliver market forecasts should:

  • Name its model’s cutoff date.
  • Flag speculative interpretations.
  • Disclose any inherent bias in analysis.

This builds authentic accuracy — where the style of delivery earns as much trust as the content. (See: Analytics That Profit on Trusting AI)

Conclusion: Designing for a New Standard of Trust

The path to trustworthy AI isn’t paved with omniscience. It’s defined by permission, precision, and presence. By embedding explicit instructions for transparency, we create systems that don’t just answer — they explain. They don’t just respond — they reveal. And when they can’t? They say it clearly.

“I don’t know, Walter. And here’s why.”

That’s not failure. That’s design.

References & Further Reading:

Sencury: 3 Hs for AI: Helpful, Honest, and Harmless. Discusses honesty as key to AI trust, emphasizing accuracy of capabilities, limitations, and biases.

IBM: What Is AI Transparency? Explores how AI transparency helps open the “black box” to better understand AI outcomes and decision-making.

Arsturn: Ethical Considerations in Prompt Engineering | Navigate AI Responsibly. Discusses how to develop ethical prompts, including acknowledging limitations.

Analytics That Profit: Can You Really Trust AI? Details common generative AI limitations that hinder trustworthiness, such as hallucinations and data cutoff dates.

Built In: What Is Trustworthy AI? Defines trustworthy AI by principles including transparency and accountability, and managing limitations.

NIST AIRC – AI Risks and Trustworthiness: Provides a comprehensive framework for characteristics of trustworthy AI, emphasizing transparency and acknowledging limitations.

Make a Real Difference Today: Donate Blood to the Red Cross

Have you thought about donating blood this July to make a tangible difference in your community?

I have excelled as a product manager for multiple Fortune 500 companies over the past 15+ years, thriving in fast-paced and high-stakes environments. The role demands constant innovation and an unwavering focus on delivering exceptional products. However, I soon realized that to truly understand and help our users, I needed to invest in my well-being. Embracing a healthier lifestyle, I began incorporating regular exercise and a balanced diet into my routine. This not only improved my physical health but also sharpened my mind, allowing me to approach product challenges with renewed energy and creativity.

In my journey to better health, I also discovered another profound way to contribute to my community: donating blood. As an O-negative blood type, my donations were especially valuable, given their universal compatibility. Recognizing the critical need for such donations, I made my first appointment at a local RedCross blood drive 6 months ago, understanding that this simplest of acts could save countless lives.

Through this experience, I discovered a deeper purpose and fulfillment in helping others. It showed me that my impact extends beyond my professional role and into the community that has given me so much.

hashtag#RedCross hashtag#ProductManaement hashtag#Wellbeing hashtag#DonatingBlood hashtag#Sav

✍️ Written by Walter Reid at https://www.walterreid.com

🧠 Creator of Designed to Be Understood at (LinkedIn) https://www.linkedin.com/newsletters/designed-to-be-understood-7330631123846197249 and (Substack) https://designedtobeunderstood.substack.com

🧠 Check out more writing by Walter Reid (Medium) https://medium.com/@walterareid

🔧 He is also a (subreddit) creator and moderator at: r/AIPlaybook at https://www.reddit.com/r/AIPlaybook for more tactical frameworks and prompt design tools. r/AIPlaybook at https://www.reddit.com/r/BeUnderstood/ for additional AI guidance. r/AdvancedLLM at https://www.reddit.com/r/AdvancedLLM/ where we discuss LangChain and CrewAI as well as other Agentic AI topics for everyone. r/PromptPlaybook at https://www.reddit.com/r/PromptPlaybook/ where I show advanced techniques for the advanced prompt (and context) engineers. Finally r/UnderstoodAI https://www.reddit.com/r/UnderstoodAI/ where we confront the idea that LLMs don’t understand us — they model us. But what happens when we start believing the model?