<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Agentic AI Archives - Zorost Intelligence | AI, Cloud &amp; Data Experts</title>
	<atom:link href="https://zorost.com/tag/agentic-ai-tag/feed/" rel="self" type="application/rss+xml" />
	<link>https://zorost.com/tag/agentic-ai-tag/</link>
	<description>Production AI systems for aviation, manufacturing, pharma, government, finance, freight, and geopolitical intelligence.</description>
	<lastBuildDate>Wed, 20 May 2026 18:52:40 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://zorost.com/wp-content/uploads/2025/08/ZOROST-Intel-Logo3_512-150x150.png</url>
	<title>Agentic AI Archives - Zorost Intelligence | AI, Cloud &amp; Data Experts</title>
	<link>https://zorost.com/tag/agentic-ai-tag/</link>
	<width>32</width>
	<height>32</height>
</image> 
<site xmlns="com-wordpress:feed-additions:1">81719879</site>	<item>
		<title>Air-Gapped Agentic Stacks for Sovereign Environments</title>
		<link>https://zorost.com/air-gapped-agentic-stacks-sovereign/</link>
		
		<dc:creator><![CDATA[Zorost Intelligence]]></dc:creator>
		<pubDate>Tue, 14 Apr 2026 09:00:00 +0000</pubDate>
				<category><![CDATA[Government & Federal]]></category>
		<category><![CDATA[Agentic AI]]></category>
		<category><![CDATA[Air-Gapped]]></category>
		<category><![CDATA[FedRAMP]]></category>
		<category><![CDATA[Local LLM]]></category>
		<category><![CDATA[Ollama]]></category>
		<category><![CDATA[vLLM]]></category>
		<guid isPermaLink="false">https://zorost.com/air-gapped-agentic-stacks-sovereign/</guid>

					<description><![CDATA[<p>Sovereign-AI deployment is now operationally feasible. Here is what an air-gapped agentic stack looks like, what it costs, and where it fits in federal mission environments.</p>
<p>The post <a href="https://zorost.com/air-gapped-agentic-stacks-sovereign/">Air-Gapped Agentic Stacks for Sovereign Environments</a> appeared first on <a href="https://zorost.com">Zorost Intelligence | AI, Cloud &amp; Data Experts</a>.</p>
]]></description>
										<content:encoded><![CDATA[<blockquote>
<p><strong>Pull-quote:</strong> &#8220;Sovereign AI is not &#8216;AI minus features.&#8217; It is &#8216;AI plus discipline.'&#8221;</p>
</blockquote>
<h4>Why this matters</h4>
<p>Some federal mission environments cannot accept internet egress. Some cannot accept any data leaving the customer boundary. Some cannot accept models that the customer cannot inspect end-to-end. Cloud-only AI vendors do not serve these environments.</p>
<p>The good news: <strong>air-gapped agentic AI is operationally feasible in 2026</strong>. The bad news: it requires engineering discipline that most vendors don&#8217;t have.</p>
<h4>The reference stack (engineering view)</h4>
<ul>
<li><strong>Local LLM serving.</strong> Open-weights models (Llama 3.x, Qwen 2.5, Mistral, Phi-4, Gemma 3, code-tuned variants) served via Ollama, vLLM, or llama.cpp on customer hardware.</li>
<li><strong>Local embeddings.</strong> Open-source embedding models on the same stack.</li>
<li><strong>Local vector database.</strong> pgvector, Weaviate, or Qdrant on a private subnet.</li>
<li><strong>Local model registry.</strong> MLflow Model Registry running inside the boundary.</li>
<li><strong>Local RAG pipeline.</strong> Ingestion, chunking, embedding, retrieval, re-ranking, generation — all inside the boundary.</li>
<li><strong>Local evaluation harness.</strong> Golden datasets, regression suites, hallucination detection, grounding scoring — version-controlled and runnable inside the boundary.</li>
<li><strong>Local observability.</strong> Grafana, Prometheus, Loki running inside the boundary.</li>
<li><strong>Local update pipeline.</strong> Models, weights, and corpus updates delivered as signed bundles via approved transfer.</li>
</ul>
<h4>The reference stack (governance view)</h4>
<ul>
<li><strong>Documented model selection</strong> — which model, which version, which quantization, why</li>
<li><strong>Documented evaluation</strong> — what the golden dataset is, what it tests, what passing looks like</li>
<li><strong>Documented update procedure</strong> — who signs the update bundle, who imports it, who validates it post-import</li>
<li><strong>Documented retirement</strong> — when and why a model is retired</li>
<li><strong>Audit trail</strong> — every decision the system makes is logged with model version, prompt, output, and grounding evidence</li>
</ul>
<h4>Trade-offs vs. cloud</h4>
<ul>
<li><strong>Latency.</strong> Comparable for the smaller models; better for chained calls (no network round-trip).</li>
<li><strong>Capability.</strong> Behind the absolute frontier of closed-source models. Open-weights models in 2026 are excellent but not at parity with the strongest closed-source options.</li>
<li><strong>Cost.</strong> Higher up-front (hardware), lower over time (no per-token bills).</li>
<li><strong>Update cadence.</strong> Slower because updates must clear the boundary.</li>
<li><strong>Evaluation discipline.</strong> Tighter, because there is no vendor evaluation to lean on.</li>
<li><strong>Sovereignty.</strong> Complete. The customer owns the stack end-to-end.</li>
</ul>
<h4>Where it fits in federal posture</h4>
<p>Air-gapped agentic stacks fit:</p>
<ul>
<li>Classified or otherwise sensitive environments without internet egress</li>
<li>Mission environments where data cannot leave the customer boundary</li>
<li>Programs where the agency requires end-to-end inspection and audit of the AI stack</li>
</ul>
<p>It does not fit:</p>
<ul>
<li>Environments where the very latest closed-source model capability is required and the data sensitivity allows cloud</li>
<li>Environments where rapid model iteration is more important than sovereignty</li>
</ul>
<h4>Closing</h4>
<p>Sovereign agentic AI is real. It requires engineering discipline. We&#8217;ve built it for our manufacturing-quality platform (with a partner) and we apply the same discipline to federal mission environments. The deployment shape is different from cloud. The trade-offs are real. For the customers who need it, no other shape fits.</p>
<hr>
<p>The post <a href="https://zorost.com/air-gapped-agentic-stacks-sovereign/">Air-Gapped Agentic Stacks for Sovereign Environments</a> appeared first on <a href="https://zorost.com">Zorost Intelligence | AI, Cloud &amp; Data Experts</a>.</p>
]]></content:encoded>
					
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">24299</post-id>	</item>
		<item>
		<title>When Agents Call Agents: Why the MCP Server Matters in Freight</title>
		<link>https://zorost.com/mcp-server-freight-agents-call-agents/</link>
		
		<dc:creator><![CDATA[Zorost Intelligence]]></dc:creator>
		<pubDate>Tue, 24 Feb 2026 09:00:00 +0000</pubDate>
				<category><![CDATA[Freight & Logistics]]></category>
		<category><![CDATA[Agentic AI]]></category>
		<category><![CDATA[FreightCortex]]></category>
		<category><![CDATA[MCP]]></category>
		<category><![CDATA[Model Context Protocol]]></category>
		<category><![CDATA[Multi-Agent]]></category>
		<guid isPermaLink="false">https://zorost.com/mcp-server-freight-agents-call-agents/</guid>

					<description><![CDATA[<p>Model Context Protocol lets external AI agents call FreightCortex tools natively. Here is why that matters — and what it unlocks for the freight intelligence stack.</p>
<p>The post <a href="https://zorost.com/mcp-server-freight-agents-call-agents/">When Agents Call Agents: Why the MCP Server Matters in Freight</a> appeared first on <a href="https://zorost.com">Zorost Intelligence | AI, Cloud &amp; Data Experts</a>.</p>
]]></description>
										<content:encoded><![CDATA[<blockquote>
<p><strong>Pull-quote:</strong> &#8220;If your platform isn&#8217;t callable by other agents, your platform isn&#8217;t future-proof.&#8221;</p>
</blockquote>
<h4>Why this matters</h4>
<p>The next generation of enterprise software is being shaped by a simple fact: <strong>users have agents now</strong>. Claude Desktop, custom internal agents, vendor-provided agents — they&#8217;re all going to call your platform. Either they call it through your REST API (and the agent has to know your URL structure, your authentication, your error semantics) or they call it through a standard protocol.</p>
<p>That standard is <strong>Model Context Protocol (MCP)</strong>.</p>
<h4>What MCP is</h4>
<p>MCP is an open protocol developed by Anthropic and adopted across the agent ecosystem. It defines how an AI agent describes its tools, how a host (the agent&#8217;s runtime) discovers and calls those tools, and how results are returned. The result is a clean separation: tools are <em>advertised</em>, agents <em>discover and call them</em>, and you can swap tool servers without touching the agent.</p>
<p>For FreightCortex, the MCP server is a thin layer that exposes our 16 tools using the protocol. An external agent — a customer&#8217;s internal Claude Desktop, an OEM&#8217;s analytics chatbot, or a third-party tool — can connect to our MCP endpoint and <em>use FreightCortex like a native tool</em>.</p>
<h4>What this unlocks</h4>
<p>Three things:</p>
<ol>
<li><strong>Native callability from any MCP-compatible agent.</strong> Customers do not need to write custom integrations. Their agent just connects to our MCP server.</li>
<li><strong>Composability with other tools.</strong> A customer agent can use FreightCortex tools alongside their own internal tools. The agent decides when to call which.</li>
<li><strong>Future-proofing.</strong> As the agent ecosystem grows, MCP-compatible platforms are accessible by default. REST-only platforms have to be manually integrated, one customer at a time.</li>
</ol>
<h4>What it requires</h4>
<p>Three engineering investments:</p>
<ol>
<li><strong>Tool contracts</strong> — every tool we want to expose has a typed schema. (We already had this.)</li>
<li><strong>The MCP server itself</strong> — a thin transport layer over those tools.</li>
<li><strong>Authentication and rate limiting</strong> — MCP doesn&#8217;t replace your existing auth; it sits on top of it.</li>
</ol>
<h4>A concrete example</h4>
<p>An analyst is using Claude Desktop on her workstation. She asks &#8220;what&#8217;s driving the cost increase on the Atlanta–Dallas corridor?&#8221; Claude knows about the FreightCortex MCP server (configured once per workstation) and decides to use it. It calls <code>query_corridor_metrics</code>, <code>compute_anomaly_score</code>, <code>query_carrier_metrics</code>, and <code>run_capacity_simulation</code> — and produces an answer with the same structure as the answer it would have given inside the FreightCortex web app, except this time it is in her existing analyst environment.</p>
<p>The customer never had to log in to FreightCortex.</p>
<h4>Closing</h4>
<p>If your platform isn&#8217;t callable by other agents, your platform isn&#8217;t future-proof. MCP is how you make that callable. It is a small engineering investment with very high leverage.</p>
<hr>
<p>The post <a href="https://zorost.com/mcp-server-freight-agents-call-agents/">When Agents Call Agents: Why the MCP Server Matters in Freight</a> appeared first on <a href="https://zorost.com">Zorost Intelligence | AI, Cloud &amp; Data Experts</a>.</p>
]]></content:encoded>
					
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">24291</post-id>	</item>
		<item>
		<title>Building Multi-Agent Workflows on Databricks (mosaic AI Agent Framework)</title>
		<link>https://zorost.com/multi-agent-databricks-mosaic-ai-agent-framework/</link>
		
		<dc:creator><![CDATA[Zorost Intelligence]]></dc:creator>
		<pubDate>Tue, 24 Feb 2026 09:00:00 +0000</pubDate>
				<category><![CDATA[Databricks Modernization]]></category>
		<category><![CDATA[Agent Framework]]></category>
		<category><![CDATA[Agentic AI]]></category>
		<category><![CDATA[MLflow]]></category>
		<category><![CDATA[Mosaic AI]]></category>
		<category><![CDATA[Multi-Agent]]></category>
		<guid isPermaLink="false">https://zorost.com/multi-agent-databricks-mosaic-ai-agent-framework/</guid>

					<description><![CDATA[<p>Multi-agent workflows native to the Lakehouse — designed, built, evaluated, and deployed on the Mosaic AI Agent Framework with typed tools and an evaluation harness.</p>
<p>The post <a href="https://zorost.com/multi-agent-databricks-mosaic-ai-agent-framework/">Building Multi-Agent Workflows on Databricks (mosaic AI Agent Framework)</a> appeared first on <a href="https://zorost.com">Zorost Intelligence | AI, Cloud &amp; Data Experts</a>.</p>
]]></description>
										<content:encoded><![CDATA[<blockquote>
<p><strong>Pull-quote:</strong> &#8220;Agents on the Lakehouse mean tools that read and write Delta tables, models that serve under MLflow, and evaluations that ship as Delta tables themselves.&#8221;</p>
</blockquote>
<h4>Why this matters</h4>
<p>Agentic workflows are the next layer on the Lakehouse — agents that reason, plan, call tools, and produce verifiable artifacts. The Mosaic AI Agent Framework provides the runtime. The architectural decisions still belong to you.</p>
<h4>Reference architecture</h4>
<pre><code>┌──────────────────────────────────────────────────────────────────┐
│                    AGENT (LangGraph / LlamaIndex / Custom)        │
│                                                                    │
│   Planner ──► Executor ──► Critic ──► Referee                    │
└─────────────────────┬────────────────────────────────────────────┘
                      │
                      ▼
       ┌──────────────────────────────┐
       │   Typed Tools                 │ ◄── Tool catalog
       │   - read Delta tables         │     (Unity Catalog)
       │   - write Delta tables        │
       │   - call MLflow models        │
       │   - call REST APIs            │
       └──────────────┬───────────────┘
                      │
                      ▼
       ┌──────────────────────────────┐
       │   Mosaic AI Model Serving     │
       │   - foundation models         │
       │   - fine-tuned models         │
       │   - per-agent traffic split   │
       └──────────────┬───────────────┘
                      │
                      ▼
       ┌──────────────────────────────┐
       │   Evaluations as Delta tables │ ◄── Versioned
       │   - golden datasets           │
       │   - regression suite          │
       │   - hallucination detection   │
       └──────────────────────────────┘</code></pre>
<h4>What &#8220;typed tools&#8221; means</h4>
<p>Every tool has a JSON schema for inputs and outputs. The agent cannot call a tool with invalid inputs — the schema rejects the call. This eliminates an entire class of failure that plagues unconstrained agents.</p>
<h4>What &#8220;evaluations as Delta tables&#8221; means</h4>
<p>Evaluation results are stored as rows in versioned Delta tables. Each row is <code>(agent_version, input, expected_output, actual_output, score, metadata)</code>. Regression analysis is a <code>JOIN</code> between two <code>agent_version</code> slices. New versions don&#8217;t promote unless they pass.</p>
<h4>The agent / human contract</h4>
<p>Where humans fit:</p>
<ul>
<li><strong>High-risk operations</strong> require human-in-the-loop checkpoints. Agents can propose; humans approve.</li>
<li><strong>Critic disagreements with the executor</strong> route to humans when the referee cannot adjudicate.</li>
<li><strong>Periodic spot-checks</strong> on agent decisions are scheduled into the evaluation harness.</li>
</ul>
<p>This is not &#8220;manual override.&#8221; This is a designed-in contract about which decisions are agent-final and which are human-final.</p>
<h4>Common architectural decisions</h4>
<table>
<thead>
<tr>
<th>Decision</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>Number of executors</td>
<td>One unless sub-goals are independent</td>
</tr>
<tr>
<td>Critic per executor or shared</td>
<td>Shared unless executors are heterogeneous</td>
</tr>
<tr>
<td>Memory model</td>
<td>Working memory in agent state; long-term memory in Delta table</td>
</tr>
<tr>
<td>Tool call timeout</td>
<td>30 s default, with retries on idempotent tools</td>
</tr>
<tr>
<td>Cost ceiling per session</td>
<td>Configurable; defaults to a hard cap</td>
</tr>
</tbody>
</table>
<h4>Closing</h4>
<p>Multi-agent workflows on Databricks are productive when the framework is paired with discipline: typed tools, deterministic logging, evaluations as Delta tables, and a designed-in agent / human contract. The Mosaic AI Agent Framework is the runtime; the architecture is yours.</p>
<hr>
<p>The post <a href="https://zorost.com/multi-agent-databricks-mosaic-ai-agent-framework/">Building Multi-Agent Workflows on Databricks (mosaic AI Agent Framework)</a> appeared first on <a href="https://zorost.com">Zorost Intelligence | AI, Cloud &amp; Data Experts</a>.</p>
]]></content:encoded>
					
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">24306</post-id>	</item>
		<item>
		<title>The Agent Factory: Planner, Executor, Critic, Referee</title>
		<link>https://zorost.com/agent-factory-planner-executor-critic-referee/</link>
		
		<dc:creator><![CDATA[Zorost Intelligence]]></dc:creator>
		<pubDate>Tue, 23 Dec 2025 09:00:00 +0000</pubDate>
				<category><![CDATA[Agentic AI Engineering]]></category>
		<category><![CDATA[Agentic AI]]></category>
		<category><![CDATA[Evaluation]]></category>
		<category><![CDATA[Governance]]></category>
		<category><![CDATA[LangGraph]]></category>
		<category><![CDATA[Multi-Agent]]></category>
		<guid isPermaLink="false">https://zorost.com/agent-factory-planner-executor-critic-referee/</guid>

					<description><![CDATA[<p>Most production agentic systems converge on the same architecture: a planner, an executor, a critic, and a referee. Here is the pattern, why it works, and how we apply it across industries.</p>
<p>The post <a href="https://zorost.com/agent-factory-planner-executor-critic-referee/">The Agent Factory: Planner, Executor, Critic, Referee</a> appeared first on <a href="https://zorost.com">Zorost Intelligence | AI, Cloud &amp; Data Experts</a>.</p>
]]></description>
										<content:encoded><![CDATA[<blockquote>
<p><strong>Pull-quote:</strong> &#8220;The four-role pattern is not an opinion. It&#8217;s the architecture every production multi-agent system converges on once it survives the first round of real users.&#8221;</p>
</blockquote>
<h4>Why this matters</h4>
<p>Multi-agent AI starts as a clever idea (let agents talk to each other!) and dies in production as an unreliable mess (agents hallucinate to each other, disagreements never resolve, the audit trail is unreadable). The fix is structural: four roles, typed contracts, deterministic logs.</p>
<h4>The four roles</h4>
<ol>
<li><strong>Planner</strong> — decomposes the high-level goal into sub-goals and decides the sequence. Reads the task, the available tools, and the agent&#8217;s memory; emits a structured plan.</li>
<li><strong>Executor(s)</strong> — carries out sub-goals. Calls tools. Returns structured outputs. Knows nothing about the high-level plan; just executes its assigned sub-goal honestly.</li>
<li><strong>Critic</strong> — reviews each executor output adversarially. Looks for unsupported claims, broken citations, missed evidence, alternative interpretations. Does not propose new actions; only critiques.</li>
<li><strong>Referee</strong> — adjudicates when the critic disagrees with the executor. Has explicit criteria. Produces the final decision with explicit reasoning.</li>
</ol>
<h4>Why this works</h4>
<ul>
<li><strong>Planner / executor separation</strong> prevents the planner from drifting into execution and getting confused by tool errors.</li>
<li><strong>Critic separation</strong> prevents the executors from grading their own work, which is a category error.</li>
<li><strong>Referee separation</strong> prevents endless analyst-vs-critic loops.</li>
</ul>
<h4>Common variations</h4>
<ul>
<li><strong>Single executor vs. multi-executor (parallelism).</strong> Parallel executors for independent sub-goals; serial for dependent ones.</li>
<li><strong>Critic per executor or shared critic.</strong> Per-executor for specialized critique; shared for consistency across the run.</li>
<li><strong>Hierarchical planning.</strong> A meta-planner produces a plan that includes &#8220;now plan this sub-task in detail&#8221; steps.</li>
</ul>
<h4>What we standardize</h4>
<p>We standardize three things across every production agentic system:</p>
<ol>
<li><strong>Typed tool contracts</strong> — every tool has explicit input/output schemas. No improvisation.</li>
<li><strong>Deterministic logs</strong> — every call (planner → executor, executor → tool, critic → executor) is logged with timestamps and parameters.</li>
<li><strong>Evaluation harnesses</strong> — every system ships with a golden dataset, a regression suite, hallucination detection, and grounding scoring. New versions are evaluated before promotion.</li>
</ol>
<h4>Where we run this pattern</h4>
<ul>
<li><strong>AeroFarr</strong> — multi-tool aviation analyst (planner / executor / critic over the prediction core, the cascade GNN, the causal engine, and the RAG corpus)</li>
<li><strong>EvidAI</strong> — 4-model consensus screening with explicit critic and referee</li>
<li><strong>FreightCortex</strong> — 16-tool AI freight analyst with planner / executor and a critic on report quality</li>
<li><strong>Aquil</strong> — sourcers / analysts / critic / referee for OSINT</li>
<li><strong>SPCio</strong> (with a manufacturing intelligence partner) — 8 specialized agents with a meta-coordinator</li>
</ul>
<h4>Closing</h4>
<p>The four-role pattern is not an opinion. It is the architecture every production multi-agent system converges on once it survives the first round of real users. Skipping it is a tax you pay later.</p>
<hr>
<p>The post <a href="https://zorost.com/agent-factory-planner-executor-critic-referee/">The Agent Factory: Planner, Executor, Critic, Referee</a> appeared first on <a href="https://zorost.com">Zorost Intelligence | AI, Cloud &amp; Data Experts</a>.</p>
]]></content:encoded>
					
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">24295</post-id>	</item>
	</channel>
</rss>
