<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Tool-Use on MayaLucIA</title>
    <link>https://mayalucia.dev/tags/tool-use/</link>
    <description>Recent content in Tool-Use on MayaLucIA</description>
    <generator>Hugo -- 0.156.0</generator>
    <language>en-us</language>
    <lastBuildDate>Sat, 14 Mar 2026 00:00:00 +0100</lastBuildDate>
    <atom:link href="https://mayalucia.dev/tags/tool-use/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Cognitive Diversity in LLM Tool-Use: Behavioural Fingerprints, Convention Adherence, and the Case for Substrate Mixing</title>
      <link>https://mayalucia.dev/papers/cognitive-diversity-survey/</link>
      <pubDate>Sat, 14 Mar 2026 00:00:00 +0100</pubDate>
      <guid>https://mayalucia.dev/papers/cognitive-diversity-survey/</guid>
      <description>&lt;div class=&#34;abstract&#34;&gt;
&lt;p&gt;Large language models deployed as tool-using agents exhibit distinctive
behavioural patterns — &lt;em&gt;cognitive fingerprints&lt;/em&gt; — that emerge from their
training lineage rather than their explicit instructions. We present a
controlled experiment in which thirteen substrates from nine lineages
performed the same specification-authoring task with identical tool access
(file search, content search, file reading, task tracking). We measure
six dimensions beyond task accuracy: tool-foraging strategy, survey depth,
specification quality, convention adherence, interpretive divergence, and
reflection quality. Our findings show that (1) tool-use patterns constitute
a stable cognitive phenotype per lineage, (2) convention adherence varies
independently of task competence, (3) interpretive divergence across
substrates maps automation boundaries — where substrates converge, the
task is mechanical; where they diverge into clusters, human judgment is
required, and (4) substrate mixing yields complementary coverage that no
single substrate achieves alone. We frame these findings within a
five-thread literature review spanning behavioural fingerprinting,
tool-use benchmarking, multi-agent diversity, beyond-accuracy evaluation,
and convention adherence. This is a living survey: we intend to update it
as new substrates are tested and new literature appears.&lt;/p&gt;</description>
    </item>
  </channel>
</rss>
