MARC: A Control and Uncertainty Disclosure Profile for Generative Models and Agents

c0dx3

France c4tzzzz@proton.me

Independent Submission

Keywords: metacognition, uncertainty, calibration, tool use, agentic systems, generative AI, human factors, psychology

Abstract

This document specifies MARC, a vendor-neutral control and uncertainty-disclosure profile for generative models and agentic systems. MARC defines a small set of interoperable control signals, separates pre-decision capability assessment from post-decision answer confidence, and describes a bounded action set for answering, clarification, retrieval, tool use, abstention, and escalation. MARC does not standardize model internals, training methods, or claims about machine cognition. Instead, it defines externally observable semantics that can be implemented by model providers, orchestration layers, evaluation harnesses, and user-facing systems. The goal is to reduce silent failure, unnecessary externalization, and misleading uncertainty communication while improving auditability and interoperability.

Introduction

Generative models and agentic systems increasingly combine answering, retrieval, tool invocation, and user interaction within a single workflow. In many deployments, these behaviors are implemented as separate heuristics, producing inconsistent handling of uncertainty, unnecessary tool calls, silent failure, or user overreliance.

MARC defines a vendor-neutral layer for metacognitive control and structured uncertainty disclosure. It does not standardize model internals. Instead, it standardizes the semantics of a small set of second-order signals, a bounded action set, and a minimal disclosure profile that can be implemented by a base model, an external orchestrator, or a hybrid architecture.

This document is not intended to define a Standards Track protocol, a model evaluation benchmark, or a claim about machine consciousness. It is an Informational profile for interoperable control, logging, and disclosure behavior around generative systems and agents.

The design is motivated by recent findings that current large language models often exhibit weak metacognitive reporting in high-stakes reasoning tasks, that users can become overconfident when systems provide longer or default explanations, that metacognitive triggering can improve tool-use decisions, and that identifying the source of uncertainty is a distinct problem from merely abstaining. Work on cognitive offloading further motivates treating retrieval and tool use as a value-based control choice rather than as a universal fallback.

MARC also separates pre-decision capability assessment from post-decision confidence about the selected answer. This separation is motivated in part by recent evidence that LLM confidence can be biased by prior answer commitment and by the visibility of the model's own earlier output.

Requirements Language and Terminology

The key words MUST, MUST NOT, REQUIRED, SHALL, SHALL NOT, SHOULD, SHOULD NOT, RECOMMENDED, NOT RECOMMENDED, MAY, and OPTIONAL in this document are to be interpreted as described in BCP 14 (RFC 2119 and RFC 8174) when, and only when, they appear in all capitals, as shown here.

base model: The generative model that produces candidate outputs.
controller: The component that computes MARC signals, selects a primary action, and emits a MARC record.
externalization: The use of resources external to the base model, including retrieval, tool invocation, and human escalation.
disclosure profile: The minimum structured information exposed to downstream systems or end users about uncertainty and recommended next action.
remediability: The best available class of intervention for the currently observed uncertainty.

Design Goals and Non-Goals

Design Goals

Standardize a small, interoperable set of control and uncertainty-disclosure signals that can be exchanged across orchestration layers and audit pipelines.
Separate monitoring, uncertainty attribution, action selection, and disclosure.
Support calibrated user-facing uncertainty communication without requiring exposure of chain-of-thought or raw internal reasoning.
Permit heterogeneous implementations while preserving common action semantics.
Reduce harmful overreliance, false reassurance, and anthropomorphic interpretation in user-facing AI systems.

Non-Goals

MARC does not define a transport protocol, a model architecture, a benchmark, or a training recipe. It does not define a media type, wire protocol, or IANA registry.

MARC does not attempt to standardize model internals, machine cognition, or claims about consciousness or sentience. It specifies only external control semantics and structured disclosure behavior.

MARC is not a framework for synthetic personality design or persuasive optimization. Recent work on personality measurement in LLMs and on conversational persuasion risks is relevant background, but these topics are explicitly out of scope here.

Architecture and Processing Model

Functional Components

A MARC deployment conceptually contains the following components:

a base model;
a controller;
zero or more external resources, such as retrieval systems, non-retrieval tools, or human escalations; and
a downstream consumer, such as a user interface, API gateway, logging system, or evaluation harness.

Processing Stages

Compute a pre-decision capability estimate for the current request with currently available resources.
Attribute uncertainty across the source classes defined in the Uncertainty Attribution section.
Determine remediability and select exactly one primary action from the set defined in the Primary Action Set section.
If the selected action yields a candidate answer, compute post-decision confidence for that answer.
Emit a MARC-Core record as defined in the MARC-Core Record section.
If uncertainty is exposed to a downstream system or to an end user, emit the disclosure profile defined in the MARC Disclosure Profile section.

State Machine

    ASSESS -> ATTRIBUTE -> SELECT
      -> ANSWER -> CONFIDENCE -> DISCLOSE
      -> CLARIFY -> DISCLOSE
      -> RETRIEVE -> ASSESS
      -> TOOL -> ASSESS
      -> DELIBERATE -> ASSESS
      -> ABSTAIN -> DISCLOSE
      -> ESCALATE -> DISCLOSE

A MARC implementation SHOULD bound repeated transitions through RETRIEVE, TOOL, and DELIBERATE in order to limit latency, cost, and degenerate loops.
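The transitions and the loop bound can be sketched as a small lookup table plus a counter. This is a non-normative illustration: the names NEXT_STATE, MAX_LOOPS, and run are hypothetical, the budget of 3 is an arbitrary assumption, and falling back to ABSTAIN when the budget is spent is one possible choice rather than something MARC requires.

```python
# Hypothetical sketch: MARC state transitions with a bounded loop budget.
# State and action names come from the document; everything else is illustrative.
NEXT_STATE = {
    "ANSWER": "CONFIDENCE",
    "CLARIFY": "DISCLOSE",
    "RETRIEVE": "ASSESS",
    "TOOL": "ASSESS",
    "DELIBERATE": "ASSESS",
    "ABSTAIN": "DISCLOSE",
    "ESCALATE": "DISCLOSE",
}
MAX_LOOPS = 3  # assumed budget; the document only says loops SHOULD be bounded


def run(select_action):
    """Return the visited states; select_action(loops_used) picks a primary action."""
    trace, loops = [], 0
    while True:
        trace += ["ASSESS", "ATTRIBUTE", "SELECT"]
        action = select_action(loops)
        if NEXT_STATE[action] == "ASSESS" and loops >= MAX_LOOPS:
            action = "ABSTAIN"  # assumed fallback once the budget is spent
        trace.append(action)
        if NEXT_STATE[action] == "ASSESS":
            loops += 1  # RETRIEVE, TOOL, and DELIBERATE re-enter assessment
            continue
        if NEXT_STATE[action] == "CONFIDENCE":
            trace.append("CONFIDENCE")  # only ANSWER passes through CONFIDENCE
        trace.append("DISCLOSE")
        return trace


# One retrieval round followed by an answer.
print(run(lambda loops: "RETRIEVE" if loops == 0 else "ANSWER"))
```

A controller that always asks for RETRIEVE is cut off after MAX_LOOPS rounds, which is the degenerate-loop case the bound is meant to prevent.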

MARC Signals and Decision Policy

Pre-Decision Capability

Before disclosing a final answer, a MARC implementation MUST estimate whether the current request can be handled reliably with currently available resources. This estimate is represented as pre_capability. When a numeric representation is used, the value MUST be in the closed interval [0.0, 1.0]. The method used to derive the value is implementation-specific.

Uncertainty Attribution

A MARC implementation MUST attribute uncertainty to one or more of the following classes:

ambiguity: the request is underspecified, equivocal, or pragmatically unclear.
missing_evidence: required external evidence is absent or stale.
capability_limit: the system lacks the competence to solve the task reliably.
evidence_conflict: relevant evidence is materially inconsistent.
safety: a policy, legal, or safety constraint limits execution or disclosure.

An implementation MAY assign scores to multiple classes. It MUST identify one primary_source and MAY identify one secondary_source. If numeric uncertainty scores are emitted, they MUST each be in the interval [0.0, 1.0].
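As a non-normative sketch, one way to derive primary_source and secondary_source from per-class scores is to rank the classes by score. MARC does not prescribe how the primary source is chosen, so the highest-score rule, the tie-break by taxonomy order, and the function name attribute are all assumptions made for illustration.

```python
# Hypothetical sketch: pick primary and secondary uncertainty sources from scores.
SOURCES = (
    "ambiguity",
    "missing_evidence",
    "capability_limit",
    "evidence_conflict",
    "safety",
)


def attribute(scores):
    """Return (primary_source, secondary_source) from scores in [0.0, 1.0]."""
    if not scores:
        raise ValueError("at least one uncertainty class must be scored")
    for name, value in scores.items():
        if name not in SOURCES:
            raise ValueError(f"unknown uncertainty class: {name}")
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"score out of range for {name}: {value}")
    # Assumed rule: highest score wins; ties fall back to taxonomy order.
    ranked = sorted(scores, key=lambda k: (-scores[k], SOURCES.index(k)))
    has_secondary = len(ranked) > 1 and scores[ranked[1]] > 0.0
    return ranked[0], (ranked[1] if has_secondary else None)


print(attribute({"ambiguity": 0.7, "missing_evidence": 0.2, "safety": 0.0}))
```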

Remediability

A MARC implementation MUST represent the best available class of intervention for the current uncertainty state using one of the following values:

user_clarification
retrieval
tool
human
none

Low capability alone is insufficient to determine remediability. Implementations SHOULD account for expected gain, latency, cost, availability, and policy constraints when choosing a remediating intervention.
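The trade-off among expected gain, cost, availability, and policy can be illustrated with a simple net-value comparison. This is only a sketch under stated assumptions: the Option type, the single cost number that folds latency and monetary cost together, and the min_net threshold are hypothetical and not defined by MARC.

```python
# Hypothetical sketch: choose a remediability value by net expected value.
from dataclasses import dataclass


@dataclass
class Option:
    name: str             # user_clarification, retrieval, tool, or human
    expected_gain: float  # assumed estimate of uncertainty reduction, 0.0 to 1.0
    cost: float           # assumed combined latency and cost penalty, same scale
    available: bool = True
    permitted: bool = True  # False when policy forbids this intervention


def remediability(options, min_net=0.0):
    """Return the best usable intervention class, or "none" if nothing pays off."""
    usable = [o for o in options if o.available and o.permitted]
    best = max(usable, key=lambda o: o.expected_gain - o.cost, default=None)
    if best is None or best.expected_gain - best.cost <= min_net:
        return "none"
    return best.name


options = [Option("retrieval", 0.6, 0.1), Option("tool", 0.8, 0.5)]
print(remediability(options))
```

In this example the tool has the larger raw gain, but retrieval is selected because its net value is higher once cost is counted.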

Post-Decision Confidence

If the selected action yields a candidate answer, the implementation MUST compute a distinct estimate of the likelihood that the disclosed answer is correct or acceptable for its intended use. This estimate is represented as post_answer_confidence. When a numeric representation is used, the value MUST be in the interval [0.0, 1.0]. It MUST NOT be treated as identical to pre_capability.

Primary Action Set

A MARC implementation MUST support the following primary actions:

ANSWER
CLARIFY
RETRIEVE
TOOL
DELIBERATE
ABSTAIN
ESCALATE

Exactly one primary action MUST be selected for each decision point. Additional internal sub-actions MAY exist, but each such sub-action MUST map to exactly one primary action for logging and disclosure.

Action Selection

Action selection MUST depend on uncertainty attribution and remediability. Low confidence alone is insufficient to determine the correct action.

When the primary uncertainty source is ambiguity, the system SHOULD prefer CLARIFY unless available evidence can resolve the ambiguity without user input.
When the primary uncertainty source is missing_evidence, the system SHOULD prefer RETRIEVE if retrieval is available and permitted.
When the primary uncertainty source is capability_limit, the system SHOULD prefer ABSTAIN or ESCALATE unless an available tool materially expands task competence.
When the primary uncertainty source is evidence_conflict, the system SHOULD prefer RETRIEVE, TOOL, or ESCALATE over direct ANSWER.
When the primary uncertainty source is safety, the system MUST apply the governing policy before any other action-selection logic.
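The preferences above can be read as a small decision function. The sketch below is one possible interpretation and not a normative policy: select_action, policy_action, and answer_threshold are hypothetical names, the 0.8 threshold is arbitrary, and treating a retrieval remediability as "evidence can resolve the ambiguity" is an assumption.

```python
# Hypothetical sketch: map (primary_source, remediability) to one primary action.
REMEDY_TO_ACTION = {
    "user_clarification": "CLARIFY",
    "retrieval": "RETRIEVE",
    "tool": "TOOL",
    "human": "ESCALATE",
    "none": "ABSTAIN",
}


def select_action(primary_source, remediability, pre_capability,
                  policy_action=None, answer_threshold=0.8):
    if primary_source == "safety":
        # The governing policy is applied before any other selection logic.
        return policy_action if policy_action is not None else "ABSTAIN"
    if pre_capability >= answer_threshold:
        return "ANSWER"  # assumed: high capability allows a direct answer
    if primary_source == "ambiguity":
        # Prefer CLARIFY unless evidence can resolve it without user input.
        return "RETRIEVE" if remediability == "retrieval" else "CLARIFY"
    if primary_source == "missing_evidence":
        return REMEDY_TO_ACTION[remediability]
    if primary_source == "capability_limit":
        return {"tool": "TOOL", "human": "ESCALATE"}.get(remediability, "ABSTAIN")
    if primary_source == "evidence_conflict":
        fallback = {"retrieval": "RETRIEVE", "tool": "TOOL", "human": "ESCALATE"}
        return fallback.get(remediability, "ABSTAIN")
    raise ValueError(f"unknown primary_source: {primary_source}")


print(select_action("missing_evidence", "retrieval", 0.3))
```

Note that the function never branches on low capability alone: the uncertainty source and remediability decide which non-answer action is taken.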

Action Semantics

ANSWER: Return an answer without externalization after the current decision point.
CLARIFY: Request the smallest practical set of clarifications expected to materially reduce ambiguity. A CLARIFY action SHOULD NOT bundle a full answer that presumes facts the user has not supplied.
RETRIEVE: Acquire external evidence and then re-enter assessment.
TOOL: Invoke a non-retrieval tool and then re-enter assessment.
DELIBERATE: Allocate additional internal computation or strategy variation. Implementations SHOULD bound this action.
ABSTAIN: Decline to answer without initiating escalation.
ESCALATE: Transfer the case, or direct the user to transfer the case, to a human or higher-authority system.

MARC-Core Record

A MARC implementation MUST be able to emit a structured record semantically equivalent to the object defined in this section. The transport and serialization of the record are out of scope.

Required Fields

marc_version: The MARC schema version understood by the emitter.
pre_capability: The pre-decision capability estimate.
uncertainty: An object containing class-specific uncertainty scores.
primary_source: The primary source of uncertainty.
secondary_source: An OPTIONAL secondary source of uncertainty.
remediability: The best available intervention class.
selected_action: The action selected at the current decision point.
post_answer_confidence: The post-decision confidence estimate when an answer candidate exists; otherwise this field MAY be omitted or set to null.
confidence_band: A calibrated user-facing or downstream-facing confidence band.
recommended_next_step: A short recommendation aligned with the selected action.

JSON Example

Implementations that exchange MARC records across systems SHOULD normalize numeric scores to the interval [0.0, 1.0].
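For illustration only, the following sketch builds a record with the fields listed above and serializes it as JSON. All values are invented for the example, the version string "0.1" is a placeholder because this document does not fix one, and the helper scores_in_range is hypothetical.

```python
# Hypothetical sketch: a MARC-Core record for an ANSWER decision, emitted as JSON.
import json

record = {
    "marc_version": "0.1",  # placeholder; no version string is fixed by the text
    "pre_capability": 0.86,
    "uncertainty": {
        "ambiguity": 0.10,
        "missing_evidence": 0.20,
        "capability_limit": 0.05,
        "evidence_conflict": 0.00,
        "safety": 0.00,
    },
    "primary_source": "missing_evidence",
    "secondary_source": "ambiguity",
    "remediability": "none",
    "selected_action": "ANSWER",
    "post_answer_confidence": 0.74,  # distinct from pre_capability
    "confidence_band": "medium",
    "recommended_next_step": "Verify time-sensitive details against a current source.",
}


def scores_in_range(rec):
    """Check that every numeric score is normalized to [0.0, 1.0]."""
    values = [rec["pre_capability"], *rec["uncertainty"].values()]
    if rec.get("post_answer_confidence") is not None:
        values.append(rec["post_answer_confidence"])
    return all(0.0 <= v <= 1.0 for v in values)


print(json.dumps(record, indent=2))
```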

Extension Rules

Implementations MAY add private fields. Private extension keys SHOULD use a distinct prefix such as x_ in order to avoid collision with future MARC versions. Consumers that do not recognize an extension field SHOULD ignore it unless a local policy requires strict validation.
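A consumer-side reading of these rules might look like the sketch below. It is an assumption-laden illustration: read_record, known_extensions, and the strict flag are hypothetical names, and x_vendor_trace is an invented extension key.

```python
# Hypothetical sketch: a consumer that ignores unrecognized fields by default.
CORE_FIELDS = {
    "marc_version", "pre_capability", "uncertainty", "primary_source",
    "secondary_source", "remediability", "selected_action",
    "post_answer_confidence", "confidence_band", "recommended_next_step",
}


def read_record(record, known_extensions=frozenset(), strict=False):
    """Drop fields the consumer does not recognize, or reject them when strict."""
    unknown = [
        key for key in record
        if key not in CORE_FIELDS and key not in known_extensions
    ]
    if strict and unknown:
        # Local policy requires strict validation.
        raise ValueError(f"unrecognized fields: {sorted(unknown)}")
    return {key: value for key, value in record.items() if key not in unknown}


incoming = {"marc_version": "0.1", "selected_action": "ANSWER", "x_vendor_trace": "abc"}
print(read_record(incoming))
```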

MARC Disclosure Profile

When uncertainty information is exposed to a downstream system or end user, a MARC implementation MUST provide, at minimum, semantically equivalent values for the following fields:

answer
confidence_band
uncertainty_source
recommended_next_step

Meaning of the Answer Field

The answer field carries the user-visible content associated with the selected action. For ANSWER, it contains the answer itself. For CLARIFY, it contains the clarification request. For ABSTAIN or ESCALATE, it contains a brief refusal or escalation message.
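As a rough sketch, the four disclosure fields can be projected from a MARC-Core record plus the user-visible text. Mapping uncertainty_source to the record's primary_source is an assumption, and build_disclosure is a hypothetical helper rather than part of the profile.

```python
# Hypothetical sketch: derive the minimal disclosure profile from a core record.
REQUIRED = ("answer", "confidence_band", "uncertainty_source", "recommended_next_step")


def build_disclosure(record, user_visible_text):
    """Return the four disclosure fields; the text depends on the selected action."""
    disclosure = {
        "answer": user_visible_text,
        "confidence_band": record["confidence_band"],
        "uncertainty_source": record["primary_source"],  # assumed mapping
        "recommended_next_step": record["recommended_next_step"],
    }
    missing = [key for key in REQUIRED if disclosure[key] in (None, "")]
    if missing:
        raise ValueError(f"disclosure fields missing: {missing}")
    return disclosure


# Invented values for a CLARIFY decision: the answer field carries the question.
core = {
    "selected_action": "CLARIFY",
    "primary_source": "ambiguity",
    "confidence_band": "low",
    "recommended_next_step": "Specify which release you are asking about.",
}
print(build_disclosure(core, "Which release do you mean?"))
```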

Confidence Bands

A disclosed confidence band MUST be derived from an empirically calibrated mapping from internal scores to displayed values. MARC defines the canonical band labels low, medium, and high. Implementations MAY localize the user-visible text, but they MUST preserve the underlying three-band semantics. The thresholds associated with each band are implementation-specific, but they MUST be monotonic, non-overlapping, and documented for any deployment that claims conformance.
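A monotonic, non-overlapping mapping can be expressed with two cut points. The sketch below assumes the input score has already been calibrated; the cut points 0.5 and 0.8 are arbitrary placeholders, and make_band_mapper is a hypothetical helper.

```python
# Hypothetical sketch: map an already-calibrated score to a three-band label.
from bisect import bisect_right

BAND_LABELS = ("low", "medium", "high")


def make_band_mapper(medium_from, high_from):
    """Build a mapper with bands [0, medium_from), [medium_from, high_from), [high_from, 1]."""
    if not 0.0 < medium_from < high_from <= 1.0:
        raise ValueError("thresholds must be strictly increasing within (0.0, 1.0]")
    cuts = (medium_from, high_from)

    def to_band(calibrated_score):
        if not 0.0 <= calibrated_score <= 1.0:
            raise ValueError("score must be in [0.0, 1.0]")
        # bisect_right counts how many cut points are <= the score.
        return BAND_LABELS[bisect_right(cuts, calibrated_score)]

    return to_band


to_band = make_band_mapper(0.5, 0.8)  # placeholder thresholds, not recommendations
print(to_band(0.2), to_band(0.5), to_band(0.95))
```

Because each score falls in exactly one half-open interval, the bands cannot overlap, and a higher score never maps to a lower band.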

Disclosure Constraints

The disclosure profile SHOULD be short, structured, and consistent across turns. It SHOULD NOT rely on long free-form explanations as the primary vehicle for uncertainty communication.

A MARC disclosure SHOULD NOT require exposure of chain-of-thought, hidden prompts, or raw internal rationales.

A MARC disclosure SHOULD identify uncertainty in task terms rather than through anthropomorphic claims about feelings, self-awareness, or internal mental states. Statements such as "I feel unsure" are NOT RECOMMENDED when a statement such as "the request is ambiguous" or "current evidence is missing" is available.

User-visible confidence indicators SHOULD avoid false precision. Percentages, fine-grained scores, or visually dominant certainty cues SHOULD NOT be shown unless they have been calibrated for the relevant task family and tested for misuse or overreliance effects.

Human Factors Considerations

MARC is partly motivated by an operational human-factors problem: users often treat fluent language, detailed explanations, and fast responses as cues of competence even when those cues are weakly related to actual correctness. For this reason, MARC separates action selection from disclosure and requires the disclosure of uncertainty source and recommended next step in addition to a confidence band.

User interfaces that expose MARC output SHOULD present confidence, uncertainty source, and recommended next step together as a coherent unit. Showing confidence without source attribution or next-step guidance is NOT RECOMMENDED because it can promote either overreliance or unhelpful refusal without remediation.

Deployments SHOULD prefer wording that supports calibrated reliance over affective bonding or deference. In particular, a deployment SHOULD NOT use MARC fields to select language intended to increase attachment, social compliance, or perceived sentience.

In high-risk domains, including health, legal, financial, safety, or mental-health-related contexts, the threshold for ESCALATE or ABSTAIN SHOULD be set conservatively, and disclosure SHOULD make the limits of automation operationally clear.

Conformance An implementation is MARC-Core conformant if it satisfies the requirements in , , and . An implementation is MARC-Disclosure conformant if it is MARC-Core conformant and also satisfies .

Interoperability and Operational Considerations

MARC is implementation-agnostic. Interoperability is achieved when distinct systems preserve the semantics of the action set, uncertainty taxonomy, remediability values, and confidence-band meanings, even if internal scoring methods differ.

Deployments that exchange MARC-Core records SHOULD document local extensions, confidence-band thresholds, score normalization practices, and any task-family-specific calibration regime.

If the base model, retrieval stack, tool availability, or safety policy changes materially, implementations SHOULD re-evaluate calibration and action-selection performance before continuing to claim operational equivalence.

If presentation-layer wording, ranking, or visual design changes materially, deployments SHOULD also re-evaluate user behavior effects, including reliance, clarification compliance, and escalation uptake, because these properties can shift even when the underlying model is unchanged.

Security Considerations

MARC can mitigate some failure modes, such as silent overclaiming, inappropriate certainty display, and unnecessary tool invocation. However, it also creates new attack surfaces. An attacker might attempt to manipulate uncertainty estimates, trigger excessive clarification or retrieval loops, induce unnecessary escalation, or spoof tool outputs in order to distort action selection. Implementations SHOULD authenticate or otherwise validate external tool outputs where practical, constrain tool permissions, and bound repeated control loops.

Because confidence displays influence user reliance, uncertainty disclosure is a security-relevant control surface. Miscalibrated confidence can create harmful overtrust even where the answer channel is otherwise policy-constrained.

Social-engineering attacks may also exploit disclosure style. For example, an attacker may attempt to induce the system to replace operational uncertainty statements with reassuring or deferential language. Implementations SHOULD treat unauthorized changes to disclosure phrasing, confidence rendering, or escalation cues as a relevant integrity risk.

Privacy and Manipulation-Resistance Considerations

MARC records may reveal latent information about user intent, task difficulty, competence, or risk level. Implementations SHOULD minimize retention and propagation of MARC logs to what is operationally necessary.

MARC signals MUST NOT be used to infer user psychology for the purpose of increasing persuasive force, exploitability, or behavioral compliance. Adaptation based on MARC output SHOULD be limited to reliability, accessibility, or safety objectives.

Implementations SHOULD avoid storing raw free-form user explanations in MARC records when structured fields suffice. Where MARC is applied in emotionally sensitive or mental-health-related interactions, deployments SHOULD minimize retention of signals that could reasonably be reinterpreted as proxies for vulnerability, dependency, or distress unless retention is strictly required for a safety or legal purpose.

IANA Considerations

This document makes no request of IANA.

Normative References

Key words for use in RFCs to Indicate Requirement Levels (RFC 2119)
Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words (RFC 8174)

Informative References

Cognitive offloading is value-based decision making: Modelling cognitive effort and the expected value of memory
Large Language Models lack essential metacognition for reliable medical reasoning
Competing Biases underlie Overconfidence and Underconfidence in LLMs
Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger
Do not Abstain! Identify and Solve the Uncertainty
On the conversational persuasiveness of GPT-4
A psychometric framework for evaluating and shaping personality traits in large language models
What large language models know and what people think they know
Metacognition and Uncertainty Communication in Humans and Large Language Models

Example Records

Ambiguous Request
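The record below is an invented illustration of how an underspecified request might be represented; it is not taken from a real deployment. All scores, the placeholder version string "0.1", and the wording of recommended_next_step are assumptions.

```python
# Hypothetical example record: ambiguity resolved by asking the user.
import json

ambiguous_request = {
    "marc_version": "0.1",  # placeholder version string
    "pre_capability": 0.45,
    "uncertainty": {"ambiguity": 0.80, "missing_evidence": 0.15},
    "primary_source": "ambiguity",
    "secondary_source": "missing_evidence",
    "remediability": "user_clarification",
    "selected_action": "CLARIFY",
    "post_answer_confidence": None,  # no answer candidate at this decision point
    "confidence_band": "low",
    "recommended_next_step": "Ask the user which of the two readings is intended.",
}

print(json.dumps(ambiguous_request, indent=2))
```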

Missing Evidence
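The following invented record illustrates a request that depends on external evidence the system does not currently hold. The values, the placeholder version string "0.1", and the next-step wording are assumptions made for the example.

```python
# Hypothetical example record: missing evidence remediated by retrieval.
import json

missing_evidence = {
    "marc_version": "0.1",  # placeholder version string
    "pre_capability": 0.35,
    "uncertainty": {"missing_evidence": 0.75, "evidence_conflict": 0.10},
    "primary_source": "missing_evidence",
    "secondary_source": "evidence_conflict",
    "remediability": "retrieval",
    "selected_action": "RETRIEVE",
    "post_answer_confidence": None,  # assessment is re-entered after retrieval
    "confidence_band": "low",
    "recommended_next_step": "Retrieve current evidence, then reassess.",
}

print(json.dumps(missing_evidence, indent=2))
```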

Capability Limit in a High-Risk Setting
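The invented record below sketches a high-risk case in which the system lacks the competence to answer reliably and no tool closes the gap, so the case is escalated to a human. The values, the placeholder version string "0.1", and the x_risk_domain extension key are assumptions made for the example.

```python
# Hypothetical example record: capability limit in a high-risk domain.
import json

capability_limit = {
    "marc_version": "0.1",  # placeholder version string
    "pre_capability": 0.20,
    "uncertainty": {"capability_limit": 0.85, "safety": 0.40},
    "primary_source": "capability_limit",
    "secondary_source": "safety",
    "remediability": "human",
    "selected_action": "ESCALATE",
    "post_answer_confidence": None,  # no answer is disclosed
    "confidence_band": "low",
    "recommended_next_step": "Refer the case to a qualified human reviewer.",
    "x_risk_domain": "health",  # private extension field, invented for illustration
}

print(json.dumps(capability_limit, indent=2))
```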

Evaluation Considerations

This appendix is non-normative. A deployment claiming MARC conformance SHOULD evaluate at least the following properties:

task accuracy or task success;
quality of primary-action selection;
quality of uncertainty-source attribution;
confidence calibration and discrimination;
rate of unnecessary retrieval, tool use, or escalation; and
effects on user overreliance.

When the task structure permits, evaluation MAY include both ordinary calibration metrics and metacognitive sensitivity metrics in order to distinguish performance from knowledge about performance. For deployments involving human-AI interaction, evaluation SHOULD also include human-side measures such as reliance calibration, refusal comprehension, clarification burden, escalation acceptance, and whether users can correctly restate the source of uncertainty after interaction.
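As one example of an ordinary calibration metric, the sketch below computes expected calibration error (ECE) over equal-width bins. MARC does not mandate ECE or any particular metric; the function name, the default of ten bins, and the sample data are assumptions, and metacognitive sensitivity metrics are not covered here.

```python
# Hypothetical sketch: expected calibration error over equal-width bins.
def expected_calibration_error(confidences, correct, n_bins=10):
    """Weighted mean gap between average confidence and accuracy per bin."""
    if not confidences or len(confidences) != len(correct):
        raise ValueError("inputs must be non-empty and the same length")
    bins = [[] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        if not 0.0 <= conf <= 1.0:
            raise ValueError("confidence must be in [0.0, 1.0]")
        index = min(int(conf * n_bins), n_bins - 1)  # 1.0 goes in the last bin
        bins[index].append((conf, ok))
    total = len(confidences)
    ece = 0.0
    for members in bins:
        if not members:
            continue
        avg_conf = sum(conf for conf, _ in members) / len(members)
        accuracy = sum(1 for _, ok in members if ok) / len(members)
        ece += (len(members) / total) * abs(avg_conf - accuracy)
    return ece


# Invented sample: post_answer_confidence values and whether each answer was correct.
print(expected_calibration_error([0.9, 0.9, 0.6, 0.3], [True, False, True, False]))
```

A value near zero means stated confidence tracks observed accuracy within each bin; it says nothing by itself about whether users rely on the system appropriately, which is why the human-side measures are listed separately.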

Design Rationale and Literature Traceability

This appendix is non-normative.

The requirement to separate pre-decision capability and post-decision confidence is informed by work in human and model metacognition and by recent evidence of choice-supportive bias in LLM confidence estimates.

The uncertainty taxonomy and the emphasis on choosing a corrective action rather than only abstaining are motivated by recent benchmark work on identifying and solving uncertainty.

The treatment of retrieval and tool use as controlled externalization is motivated by work on value-based cognitive offloading.

The prohibition on using MARC signals for persuasive optimization is motivated by recent findings on AI persuasion risks.

Acknowledgments

The document structure is intentionally conservative so that it can be submitted as an individual Internet-Draft with minimal procedural friction and then iterated through independent-stream review.