<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Agentic AI Engineering Archives - Zorost Intelligence | AI, Cloud &amp; Data Experts</title>
	<atom:link href="https://zorost.com/category/agentic-ai/feed/" rel="self" type="application/rss+xml" />
	<link>https://zorost.com/category/agentic-ai/</link>
	<description>Production AI systems for aviation, manufacturing, pharma, government, finance, freight, and geopolitical intelligence.</description>
	<lastBuildDate>Wed, 20 May 2026 18:52:40 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://zorost.com/wp-content/uploads/2025/08/ZOROST-Intel-Logo3_512-150x150.png</url>
	<title>Agentic AI Engineering Archives - Zorost Intelligence | AI, Cloud &amp; Data Experts</title>
	<link>https://zorost.com/category/agentic-ai/</link>
	<width>32</width>
	<height>32</height>
</image> 
<site xmlns="com-wordpress:feed-additions:1">81719879</site>	<item>
		<title>Hybrid Retrieval: Why Vector Alone Isn&#8217;t Enough</title>
		<link>https://zorost.com/hybrid-retrieval-vector-alone-not-enough/</link>
		
		<dc:creator><![CDATA[Zorost Intelligence]]></dc:creator>
		<pubDate>Tue, 17 Feb 2026 09:00:00 +0000</pubDate>
				<category><![CDATA[Agentic AI Engineering]]></category>
		<category><![CDATA[BM25]]></category>
		<category><![CDATA[Evaluation]]></category>
		<category><![CDATA[Hybrid Retrieval]]></category>
		<category><![CDATA[RAG]]></category>
		<category><![CDATA[Vector Search]]></category>
		<guid isPermaLink="false">https://zorost.com/hybrid-retrieval-vector-alone-not-enough/</guid>

					<description><![CDATA[<p>Vector search is excellent at semantic similarity and bad at named entities. BM25 is the opposite. Production-grade retrieval is hybrid — and the architecture decisions matter.</p>
<p>The post <a href="https://zorost.com/hybrid-retrieval-vector-alone-not-enough/">Hybrid Retrieval: Why Vector Alone Isn&#8217;t Enough</a> appeared first on <a href="https://zorost.com">Zorost Intelligence | AI, Cloud &amp; Data Experts</a>.</p>
]]></description>
										<content:encoded><![CDATA[<blockquote>
<p><strong>Pull-quote:</strong> &#8220;Pure vector retrieval is the most common production-grade RAG mistake. Pure BM25 is the second most common.&#8221;</p>
</blockquote>
<h4>Why this matters</h4>
<p>A pattern repeats in every RAG project that goes wrong: someone embeds the corpus, runs vector search, and ships. The system works in demos and disappoints in production. The fix is a structural architecture change: <strong>hybrid retrieval</strong>.</p>
<h4>The components</h4>
<pre><code>Query
  │
  ├──► Dense (vector)   — pgvector / Weaviate / Qdrant + an embedding model
  │
  ├──► Sparse (BM25)    — Postgres FTS / Elasticsearch / OpenSearch
  │
  ├──► Optional filters — date range, source, entity tags
  │
  └──► Merge (RRF or weighted) ──► Cross-encoder re-rank ──► Top-K
                                                                │
                                                                ▼
                                                Citation-grounded generation</code></pre>
<h4>Why each piece matters</h4>
<ul>
<li><strong>Vector</strong> is excellent at <em>semantic similarity</em> — finding documents that are about the same topic in different words. It is bad at <em>named entities</em> — exact terms, IDs, dates.</li>
<li><strong>BM25</strong> is the opposite — excellent at named entities, weaker on semantic similarity.</li>
<li><strong>Filters</strong> — when the question is bounded (&#8220;just look at 2024 reports about Boeing 737&#8221;), filters dramatically reduce the candidate set before ranking.</li>
<li><strong>Merge</strong> — Reciprocal Rank Fusion (RRF) is a clean default. Weighted merges work with calibrated scores.</li>
<li><strong>Cross-encoder re-rank</strong> — sees the query and the candidate document together and scores them jointly. More expensive than bi-encoder vector search, but the precision improvement on the top-K is large enough to pay for itself.</li>
</ul>
<h4>What changes when you do this right</h4>
<ul>
<li>Hallucination rate drops. The model has better evidence to ground in.</li>
<li>Citation precision goes up. The cited documents actually support the claim.</li>
<li>Edge cases (rare entity queries, exact-quote queries) work properly.</li>
<li>Generation latency stays low because the model only sees the top-K (typically 6–10), not the top-100.</li>
</ul>
<h4>Common mistakes</h4>
<ul>
<li><strong>No re-ranker.</strong> Top-50 from vector + top-50 from BM25 with RRF is a starting point, but without a re-ranker the top-K still contains noise.</li>
<li><strong>No filtering.</strong> Filtering before retrieval is essentially free if your data is properly indexed.</li>
<li><strong>Skip evaluation.</strong> Without a golden Q&amp;A dataset and grounding scoring, you have no way to compare retrieval architectures.</li>
</ul>
<h4>Closing</h4>
<p>Pure vector retrieval is the most common production-grade RAG mistake. Hybrid retrieval — vector + sparse + filters + re-rank — is the boring, reliable, production answer. Every Zorost RAG system runs this architecture.</p>
<hr>
<p>The post <a href="https://zorost.com/hybrid-retrieval-vector-alone-not-enough/">Hybrid Retrieval: Why Vector Alone Isn&#8217;t Enough</a> appeared first on <a href="https://zorost.com">Zorost Intelligence | AI, Cloud &amp; Data Experts</a>.</p>
]]></content:encoded>
					
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">24296</post-id>	</item>
		<item>
		<title>The Agent Factory: Planner, Executor, Critic, Referee</title>
		<link>https://zorost.com/agent-factory-planner-executor-critic-referee/</link>
		
		<dc:creator><![CDATA[Zorost Intelligence]]></dc:creator>
		<pubDate>Tue, 23 Dec 2025 09:00:00 +0000</pubDate>
				<category><![CDATA[Agentic AI Engineering]]></category>
		<category><![CDATA[Agentic AI]]></category>
		<category><![CDATA[Evaluation]]></category>
		<category><![CDATA[Governance]]></category>
		<category><![CDATA[LangGraph]]></category>
		<category><![CDATA[Multi-Agent]]></category>
		<guid isPermaLink="false">https://zorost.com/agent-factory-planner-executor-critic-referee/</guid>

					<description><![CDATA[<p>Most production agentic systems converge on the same architecture: a planner, an executor, a critic, and a referee. Here is the pattern, why it works, and how we apply it across industries.</p>
<p>The post <a href="https://zorost.com/agent-factory-planner-executor-critic-referee/">The Agent Factory: Planner, Executor, Critic, Referee</a> appeared first on <a href="https://zorost.com">Zorost Intelligence | AI, Cloud &amp; Data Experts</a>.</p>
]]></description>
										<content:encoded><![CDATA[<blockquote>
<p><strong>Pull-quote:</strong> &#8220;The four-role pattern is not an opinion. It&#8217;s the architecture every production multi-agent system converges on once it survives the first round of real users.&#8221;</p>
</blockquote>
<h4>Why this matters</h4>
<p>Multi-agent AI starts as a clever idea (let agents talk to each other!) and dies in production as an unreliable mess (agents hallucinate to each other, disagreements never resolve, the audit trail is unreadable). The fix is structural: four roles, typed contracts, deterministic logs.</p>
<h4>The four roles</h4>
<ol>
<li><strong>Planner</strong> — decomposes the high-level goal into sub-goals and decides the sequence. Reads the task, the available tools, and the agent&#8217;s memory; emits a structured plan.</li>
<li><strong>Executor(s)</strong> — carries out sub-goals. Calls tools. Returns structured outputs. Knows nothing about the high-level plan; just executes its assigned sub-goal honestly.</li>
<li><strong>Critic</strong> — reviews each executor output adversarially. Looks for unsupported claims, broken citations, missed evidence, alternative interpretations. Does not propose new actions; only critiques.</li>
<li><strong>Referee</strong> — adjudicates when the critic disagrees with the executor. Has explicit criteria. Produces the final decision with explicit reasoning.</li>
</ol>
<h4>Why this works</h4>
<ul>
<li><strong>Planner / executor separation</strong> prevents the planner from drifting into execution and getting confused by tool errors.</li>
<li><strong>Critic separation</strong> prevents the executors from grading their own work, which is a category error.</li>
<li><strong>Referee separation</strong> prevents endless analyst-vs-critic loops.</li>
</ul>
<h4>Common variations</h4>
<ul>
<li><strong>Single executor vs. multi-executor (parallelism).</strong> Parallel executors for independent sub-goals; serial for dependent ones.</li>
<li><strong>Critic per executor or shared critic.</strong> Per-executor for specialized critique; shared for consistency across the run.</li>
<li><strong>Hierarchical planning.</strong> A meta-planner produces a plan that includes &#8220;now plan this sub-task in detail&#8221; steps.</li>
</ul>
<h4>What we standardize</h4>
<p>We standardize three things across every production agentic system:</p>
<ol>
<li><strong>Typed tool contracts</strong> — every tool has explicit input/output schemas. No improvisation.</li>
<li><strong>Deterministic logs</strong> — every call (planner → executor, executor → tool, critic → executor) is logged with timestamps and parameters.</li>
<li><strong>Evaluation harnesses</strong> — every system ships with a golden dataset, a regression suite, hallucination detection, and grounding scoring. New versions are evaluated before promotion.</li>
</ol>
<h4>Where we run this pattern</h4>
<ul>
<li><strong>AeroFarr</strong> — multi-tool aviation analyst (planner / executor / critic over the prediction core, the cascade GNN, the causal engine, and the RAG corpus)</li>
<li><strong>EvidAI</strong> — 4-model consensus screening with explicit critic and referee</li>
<li><strong>FreightCortex</strong> — 16-tool AI freight analyst with planner / executor and a critic on report quality</li>
<li><strong>Aquil</strong> — sourcers / analysts / critic / referee for OSINT</li>
<li><strong>SPCio</strong> (with a manufacturing intelligence partner) — 8 specialized agents with a meta-coordinator</li>
</ul>
<h4>Closing</h4>
<p>The four-role pattern is not an opinion. It is the architecture every production multi-agent system converges on once it survives the first round of real users. Skipping it is a tax you pay later.</p>
<hr>
<p>The post <a href="https://zorost.com/agent-factory-planner-executor-critic-referee/">The Agent Factory: Planner, Executor, Critic, Referee</a> appeared first on <a href="https://zorost.com">Zorost Intelligence | AI, Cloud &amp; Data Experts</a>.</p>
]]></content:encoded>
					
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">24295</post-id>	</item>
	</channel>
</rss>
