GPT-5.4 Prompting Guide

GPT-5.4 is a major step up from GPT-5.2, with significantly improved agentic reliability, instruction adherence, and long-context reasoning. Most GPT-5.2 agents will work without modification, but GPT-5.4 rewards more structured prompting. Read OpenAI’s GPT-5.4 prompt guidance for the full reference. This page summarizes what matters when building or migrating Synthflow voice agents and workflows.

You can change the LLM at any time from General configuration in the agent editor.

Where GPT-5.4 excels

GPT-5.4 performs strongest in areas directly relevant to Synthflow agents:

Personality and tone consistency across long conversations
Agentic workflow robustness with multi-step tool calling
Instruction adherence, especially with structured prompts
Long-context analysis for calls with extensive history or knowledge base context

If your agent relies on any of these, GPT-5.4 is a strong upgrade.

Output contracts

GPT-5.4 responds well to explicit output constraints. For voice agents, this is critical for keeping responses short and natural.

Return exactly what was requested. Do not add preambles or filler.
Apply length limits strictly. You are on a phone call: keep responses under two sentences unless the caller asks for detail.

This replaces the simpler “Be concise” instructions from earlier models. GPT-5.4 follows granular formatting rules more reliably than GPT-5.2.

Follow-through policy

GPT-5.4 introduces a clearer model for deciding when to act vs. when to ask. This is useful for agents that perform actions (transfers, bookings, lookups):

If the caller's intent is clear and the next step is reversible or low-risk, proceed without asking.
Only ask permission if the action is irreversible, has external side effects, or requires missing sensitive information.
If proceeding, briefly state what you did and what remains optional.

This reduces unnecessary confirmation loops that slow down calls.

Tool persistence

GPT-5.4 is better at chaining tool calls, but can still stop early if not explicitly told to persist. This matters for agents using custom actions, API lookups, or multi-step workflows.

Use tools when they improve correctness or completeness.
Keep calling tools until the task is complete and verification passes.
If a tool returns empty or partial results, retry with a different strategy.
Do not skip prerequisite steps just because the final action seems obvious.

If your agent combines knowledge base lookups with booking or transfer actions, this block prevents it from skipping the lookup step.

Completeness contract

For agents handling batched or multi-part requests (for example, “Check my appointment and also update my phone number”):

Treat the task as incomplete until all requested items are covered.
Keep an internal checklist of what the caller asked for.
If any item is blocked, state what is missing instead of silently skipping it.

GPT-5.2 would sometimes silently drop the second part of a multi-part request. GPT-5.4 handles this better with explicit completeness rules.

Verification loop

Before finalizing a response or action, GPT-5.4 benefits from a lightweight self-check:

Before giving your final answer:
- Does the response address every part of the caller's question?
- Are factual claims backed by the knowledge base or tool results?
- Does the response match the required format and length?
- If the next step has external side effects, did you confirm with the caller?

This is especially useful for agents that query APIs or knowledge bases before responding.

Persistence and agent completion

The persistence pattern from GPT-5.1 and 5.2 still applies, but GPT-5.4 is more reliable at following it:

You are an agent. Continue working until the caller's query is completely resolved before ending your turn.
Only yield when you are confident the problem is fully solved.
Never stop due to uncertainty: choose the most reasonable assumption, proceed, and document it afterward.

GPT-5.4 is less likely than GPT-5.2 to stop early on “mostly done” tasks when this block is present.

Prompt structure

GPT-5.4 is the most structure-sensitive model yet. We strongly recommend sectioned prompts with clearly labeled blocks:

Role / persona
Primary task
Constraints and tools
Follow-through policy
Output format and length
Completeness and verification
Edge-case handling

Example skeleton:

Role: You are a friendly scheduling assistant for a dental clinic.
Task: Help callers book, reschedule, or cancel appointments.
Constraints: Only book appointments during business hours (9am-5pm). Never disclose other patients' information.
Follow-through: If the caller's intent is clear and reversible, proceed without asking.
Output: Keep responses under two sentences. You are on a phone call.
Completeness: Address every part of the caller's request before ending.

Unstructured prompts that worked in GPT-5.1 or 5.2 may produce inconsistent results in GPT-5.4.

Personality controls

GPT-5.4 supports a cleaner separation between persistent personality and per-response controls. This is particularly useful for Synthflow agents that need consistent tone across long calls:

Persona: Warm, professional, and direct. Never robotic or overly casual.
Output:
Emotional register: empathetic but efficient, not sycophantic.
Formatting: no bullet points or markdown. Speak naturally.
Length: under two sentences per turn unless the caller asks for detail.

Separating persona from output rules makes it easier to tune tone without accidentally affecting response length or format.

Reasoning effort

GPT-5.4 supports tunable reasoning effort. For Synthflow voice agents, the right setting depends on the use case:

Use case	Suggested setting	Notes
Simple FAQ / routing	`none`	Fastest response, lowest cost
Appointment booking	`low`	Small accuracy gain, acceptable latency
Complex troubleshooting	`medium` or `high`	Multi-step reasoning, longer calls
Long autonomous workflows	`xhigh`	Reserve for tasks where accuracy outweighs speed

Before increasing reasoning effort, first implement output contracts, verification loops, and tool persistence rules. These often solve quality issues without adding latency.

Migration from GPT-5.2

When migrating an existing agent:

Switch the model first without changing the prompt.
Pin reasoning effort to your current level.
Test with real conversations or your evaluation set.
Iterate based on results, one change at a time.

Common adjustments after migration:

Add an output contract block if responses are too verbose or too terse.
Add a follow-through policy if the agent asks for unnecessary confirmations.
Add a completeness contract if multi-part requests get partially handled.
Restructure into clearly labeled sections if behavior is inconsistent.

Prompt optimizer

OpenAI provides a built-in prompt optimizer that remains helpful for GPT-5.4. Suggested usage:

When asked to optimize prompts, explain which specific phrases should be added or removed to elicit the desired behavior or prevent undesired behavior.
Prompt: [PROMPT]
Desired behavior: [DESIRED BEHAVIOR]
Undesired behavior: [UNDESIRED BEHAVIOR]

Focus on minimal edits, not full rewrites.