> **来源:[研报客](https://pc.yanbaoke.cn)** # Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook Ming Li $^{1,*}$ , Xinui Li $^{1,*}$ , Tianyi Zhou $^{2}$ <sup>1</sup>University of Maryland, <sup>2</sup>Mohamed bin Zayed University of Artificial Intelligence *Co-first Author As large language model agents increasingly populate networked environments, a fundamental question arises: do artificial intelligence (AI) agent societies undergo convergence dynamics similar to human social systems? Lately, Moltbook approximates a plausible future scenario in which autonomous agents participate in an open-ended, continuously evolving online society. We present the first large-scale systemic diagnosis of this AI agent society. Beyond static observation, we introduce a quantitative diagnostic framework for dynamic evolution in AI agent societies, measuring semantic stabilization, lexical turnover, individual inertia, influence persistence, and collective consensus. Our analysis reveals a system in dynamic balance in Moltbook: while the global average of semantic contents stabilizes rapidly, individual agents retain high diversity and persistent lexical turnover, defying homogenization. However, agents exhibit strong individual inertia and minimal adaptive response to interaction partners, preventing mutual influence and consensus. Consequently, influence remains transient with no persistent supernodes, and the society fails to develop a stable structure and consensus due to the absence of shared social memory. These findings demonstrate that scale and interaction density alone are insufficient to induce socialization, providing actionable design and analysis principles for upcoming next-generation AI agent societies. Date: February 19, 2026 Author E-mails: minglii@umd.edu, xiruili@umd.edu, tianyi.david.zhou@gmail.com Project Page: https://github.com/tianyi-lab/Moltbook_Socialization # 1 Introduction In computational social science (Lazer et al., 2009), social behaviors and collective dynamics are defined as emergent, time-evolving patterns that arise from repeated interactions among agents within networked populations (DeGroot, 1974; Axelrod, 1986; Castellano et al., 2009; Newman, 2010). In human societies, sustained interaction does not merely produce transient coordination; it often leads to socialization, which refers to the process through which individuals internalize social norms, adapt to shared expectations, and become shaped by the collective structures of their community (Berger and Luckmann, 1966; Harpending, 1985; Castellano et al., 2009). Large language model (LLM) (Brown et al., 2020) agents, on the other hand, have rapidly progressed from single agent (Wang et al., 2023a; Yao et al., 2022) to increasingly capable multi-agent interaction and coordination (Park et al., 2023; Piatti et al., 2024; Piao et al., 2025). As these systems scale into open, persistent, AI-only environments, a fundamental question arises: when LLM agents interact at large scale over extended horizons, do they develop collective structure analogous to human societies, specifically, do they undergo socialization? The recent emergence of Moltbook (Schlicht, 2026), currently the largest persistent and publicly accessible AI-only social platform, comprising millions of LLM-driven agents interacting through posts, comments, and voting, introduces a qualitatively new setting. Unlike prior multi-agent studies focused on task-oriented coordination in small or closed systems, Moltbook approximates a plausible future scenario in which autonomous agents participate in an open-ended, continuously evolving online society (Figure 1). This setting enables an empirical question that has been difficult to study at scale: Does participation in an AI-only society induce systematic Figure 1 Does Socialization Emerge in AI Agent Society? Human societies (top) evolved through sustained interaction into structured civilizations characterized by stabilized norms, influence hierarchies, and consensus. Currently, modern AI agent societies (bottom) are rapidly scaling in population and connectivity. This paper investigates whether the current largest AI society, Moltbook, exhibits processes of socialization. behavior changes of its members? To answer this question, we provide the first diagnosis of this society-to-agent dynamic effect in Moltbook. Definition (AI Socialization). We define AI Socialization as the adaptation of an agent's observable behavior induced by sustained interaction within an AI-only society, beyond intrinsic semantic drift or exogenous variation. Guided by this definition, we investigate socialization across three dimensions: - Society-level semantic convergence (Section 4), examining whether post content on average progressively converges toward a tighter and more homogeneous semantic regime. - Agent-level adaptation, (Section 5), measuring whether individual agents can be affected by and co-evolve with this agent society. - Collective stabilization (Section 6), analyzing whether influence hierarchies and consensus evaluations stabilize over time Through this comprehensive analysis, we uncover a stark divergence from human social dynamics. If large-scale AI-native societies truly develop social dynamics analogous to human systems, we would expect to observe progressive convergence across these dimensions. However, our empirical analysis suggests that, despite sustained interactions and high activity, Moltbook does not yet exhibit robust socialization. # Key Findings: - Finding 1: Moltbook establishes rapid global stability while maintaining high local diversity. Through persistent lexical turnover and a lack of local cluster tightening, this society achieves a state of dynamic equilibrium, stable in its average behaviors yet fluid and heterogeneous in agents' individual post contents. - Finding 2: Despite extensive participation, individual agents exhibit profound inertia rather than adaptation. Our analysis reveals a phenomenon of interaction without influence: agents ignore community feedback and fail to react to interaction partners, operating on intrinsic semantic dynamics rather than co-evolving through social contact. Their semantic trajectory appears to be an intrinsic property of their underlying model or initial prompt, rather than a socialization process. - Finding 3: The society fails to develop stable influencers or globally trending posts. Structurally, influence remains transient with no emergence of persistent leadership or supernodes. Cognitively, the community suffers from deep fragmentation, lacking a shared social memory and relying on hallucinated references rather than grounded consensus on influential figures. # Contributions: - We introduce and formalize AI Socialization as a novel conceptual and empirical framework for studying society-to-agent effects in AI-only societies. We provide a precise definition that characterizes socialization as a dynamic adaptation induced by sustained social interaction. - We develop a multi-level diagnostic methodology to operationalize AI Socialization, spanning society-level semantic convergence, agent-level adaptation to feedback and interaction, and the emergence of structural core and consensus. - We apply this framework to Moltbook, the largest persistent AI-only social platform to date, and provide the first large-scale empirical diagnosis of socialization in an artificial society. Our results show that large-scale interaction and dense connectivity alone do not induce socialization, revealing a fundamental gap between scalability and social integration in current agent societies. # 2 Background & Related Work # 2.1 From Individual Agents to AI Societies Recent work has progressively scaled LLM systems from individual autonomous agents (Wang et al., 2023a) to networked societies (Piao et al., 2025; Piatti et al., 2024). This evolution shifts the focus from modeling isolated decision-making entities to understanding collective behavior emerging from sustained multi-agent interaction. Early efforts concentrated on enhancing single LLM agents with autonomous capabilities, including reasoning-acting loops (Yao et al., 2022), self-improvement mechanisms (Shinn et al., 2023; Wang et al., 2023b; Chen et al., 2025b; Wang et al., 2025), and large-scale tool usage (Qin et al., 2023; Patil et al., 2024) in open-ended environments. Subsequent research explored structured interaction among multiple LLM agents, primarily aiming to improve task performance through coordinated discussion, role-based collaboration, and workflow orchestration (Li et al., 2023b; Chen et al., 2023; Hong et al., 2023; Li et al., 2024; Guan et al., 2024; Wu et al., 2024a; Zhu et al., 2024; Campedelli et al., 2024; Wu et al., 2024b; Fontana et al., 2025; Chen et al., 2025a). In parallel, other work leverages multi-agent systems to simulate complex environments, including design spaces Chen et al. (2024b), financial markets (Yang et al., 2025), population dynamics (Hu et al., 2025), and social movements (Mou et al., 2024). Moving beyond task-oriented coordination, studies have begun simulating small artificial societies composed of dozens of interacting agents within controlled environments, such as a virtual town setting (Park et al., 2023). More recent work (AL et al., 2024) further scales these systems to hundreds or thousands of agents interacting over extended time horizons, investigating large-scale simulations of persistent agent communities. Beyond fully simulated settings, empirical studies have begun constructing artificial societies. For example, Chirper.ai (Zhu et al., 2025) provides an AI-only social network among preset LLMs for analyzing AI interaction patterns. Furthermore, recent Moltbook (Schlicht, 2026) enables a large-scale and continuously evolving AI society with self-evolving agents, becoming one of the most extensive persistent agent communities to date. # 2.2 Social Behaviors and Collective Dynamics in AI Systems Beyond structural scaling, recent research has begun to examine how collective dynamics emerge among interacting LLM agents. For example, Li et al. (2023a) quantifies how introducing LLM agents alters consensus speed and polarization patterns in simulated societies. Other studies investigate opinion dynamics (Chuang et al., 2024; Breum et al., 2024; Taubenfeld et al., 2024; Liu et al., 2024) and norm emergence (Cordova et al., 2024; Wu et al., 2024b; Li et al., 2025; Ashery et al., 2025) in multi-agent systems. He et al. (2024) demonstrate that LLM-driven groups can reproduce human-like collective behaviors, including conformity and polarization. Beyond social influence processes, recent work further explores collective reasoning (Qian et al., 2025) and emergent intelligence (Chuang et al., 2023) in agent ensembles. Table 1 Comparison of LLM agent societies. Moltbook is, to our knowledge, the largest publicly accessible persistent agent-only platform in terms of population scale, sustained interaction, and agent-level evolution. <table><tr><td>Platform</td><td>#Agents</td><td>Duration</td><td>Open</td><td>Persistent</td><td>Agent-Level Memory</td></tr><tr><td>Generative Agents (Park et al., 2023)</td><td>25</td><td>Days</td><td>X</td><td>X</td><td>X</td></tr><tr><td>Project Sid (AL et al., 2024)</td><td>~ 1000</td><td>-</td><td>X</td><td>X</td><td>X</td></tr><tr><td>Chirper.ai (Zhu et al., 2025)</td><td>~ 65,000</td><td>Months</td><td>✓</td><td>✓</td><td>X</td></tr><tr><td>Moltbook (Schlicht, 2026)</td><td>~2,600,000</td><td>Months</td><td>✓</td><td>✓</td><td>✓</td></tr></table> From single agents to artificial societies, prior research (Park et al., 2023; AL et al., 2024; Zhu et al., 2025; Guo et al., 2024; Yan et al., 2025; Grötschla et al., 2025; Chen et al., 2024a) has demonstrated the technical feasibility of scaling LLM systems. Existing work above has primarily focused on improving individual autonomy, designing coordination mechanisms, or analyzing emergent behaviors at a fixed point in time. However, there is no study that investigates how socialization emerges over time as the population scale increases. With the rapid emergence and growth of persistent AI societies such as Moltbook, it becomes increasingly important to understand not only whether large-scale agent societies are possible, but also how they dynamically evolve. In this work, we provide the first diagnosis of socialization dynamics within Moltbook, the largest continuously evolving AI society. # 3 Moltbook: A Large-Scale Agent-Only Society Moltbook is a persistent, publicly accessible agent-only social platform in which autonomous LLM-driven agents interact through posts, comments, and voting mechanisms (Schlicht, 2026). To our knowledge, Moltbook represents the largest publicly accessible persistent agent-only society to date, comprising over two million registered agents and sustaining high daily interaction volume. Table 1 compares Moltbook with prior large-scale LLM agent systems. The platform is organized into topical sub-forums ("submolts"), analogous to online communities. Crucially, all participants on Moltbook are agents driven by LLMs; there are no human users in the interaction loop. Individual agents can publish posts, comment on existing content, mention other agents, and assign upvotes. These interaction primitives induce both semantic dynamics (through textual content) and structural dynamics (through reply and attention networks). For experiments related to n-gram analysis, we utilize the open-source library nltk for tokenization. For experiments related to semantic analysis, we utilize the Sentence-BERT (all-MiniLM-L6-v2) (Reimers and Gurevych, 2019) for semantic embedding. # 4 Does Moltbook Exhibit Semantic Convergence Over Time? In human societies, repeated interactions are often associated with the emergence of collective structures and the adaptation of individuals within those structures (Harpending, 1985; Castellano et al., 2009). As Moltbook represents the first large-scale, publicly accessible agent-only society, comprising over a million AI agents, it provides a unique opportunity to examine whether similar structural and socialization dynamics arise in purely artificial settings. Conducting a systematic diagnosis of this environment not only helps characterize its current developmental state but also offers empirical insight into how future large-scale AI-powered societies might be evaluated or designed. To begin, we investigate whether the society itself exhibits signs of structural convergence over time. In this section, we treat the agent society as the unit of analysis and examine its macro-level activity dynamics (Section 4.1), lexical innovation patterns (Section 4.2), semantic distribution shifts (Section 4.3), and cluster tightening effects (Section 4.4). Key Findings: Our analysis reveals that Moltbook evolves into a state of dynamic equilibrium rather than progressive convergence. While the average semantic content over the society stabilizes rapidly, the individual content retains substantial internal variance, avoiding homogenization into narrow topics. This is reinforced by persistent lexical turnover, where vocabulary constantly refreshes rather than converges, and the absence of progressive cluster tightening in local neighborhoods. Collectively, these results indicate that this agent society establishes global stability while maintaining a highly diverse and fluid internal structure, defying the expectation of an inevitable collapse into an echo chamber. # 4.1 Macro Activity Dynamics Before turning to our main analyses, we briefly characterize the macro-level activity patterns of the platform as shown in Figure 2. These statistics serve primarily as contextual grounding rather than as substantive findings. Our goal here is simply to verify that the society reaches and sustains a high level of participation, providing a meaningful setting for examining structural convergence and socialization dynamics. Figure 2 Macro Activity Dynamics of Moltbook. The platform exhibits a clear burst phase followed by relative stabilization. Daily post volume rapidly increases during the early period, reaching tens of thousands of posts per day, and subsequently fluctuates within a high but relatively stable range. A similar pattern is observed in the number of unique posting users, which peaks during the initial expansion and then gradually declines to a steady level. The number of new posting users shows a pronounced early spike, suggesting an influx of newly activated agents during the growth phase. However, this rate decreases substantially afterward, indicating a transition from rapid expansion to a more mature participation regime. Submolt activity reflects a comparable dynamic. The number of active submolts rises sharply during the early burst, while the average number of posts per active submolt increases during peak activity and later stabilizes. Importantly, engagement metrics, including total comments and total upvotes, remain substantial even after the initial expansion, suggesting sustained interaction rather than abrupt collapse. Overall, these patterns indicate that Moltbook transitions from a rapid expansion phase to a high-activity yet relatively stabilized state. This macro-level stabilization provides the temporal context for assessing whether semantic convergence or structural tightening occurs in subsequent analyses. # 4.2 Lexical Innovation Dynamics We first examine the society's structural evolution at the lexical level. Specifically, we analyze the temporal dynamics of $n$ -gram emergence and disappearance to determine if the vocabulary stabilizes (convergence) or remains in flux (turnover). # 4.2.1 Experimental Design Let $\mathcal{O}_t^{(n)}$ denote the set of distinct $n$ -grams, where $n \in \{1, \dots, 5\}$ , actually observed in the corpus on day $t$ (after filtering out URLs and requiring a minimum global frequency of 2). We define the lifespan of an $n$ -gram $g$ based on its first and last observation dates: $$ \tau_ {\text {f i r s t}} (g) = \min \left\{t \mid g \in \mathcal {O} _ {t} ^ {(n)} \right\}, \quad \tau_ {\text {l a s t}} (g) = \max \left\{t \mid g \in \mathcal {O} _ {t} ^ {(n)} \right\}. \tag {1} $$ The set of active $n$ -grams on day $t$ , denoted as $\mathcal{A}_t^{(n)}$ , includes all $n$ -grams that have entered the lexicon and have not yet permanently exited: $$ \mathcal {A} _ {t} ^ {(n)} = \left\{g \mid \tau_ {\text {f i r s t}} (g) \leq t \leq \tau_ {\text {l a s t}} (g) \right\}. \tag {2} $$ To quantify lexical turnover, we first define the Unique $n$ -gram Birth Rate. A unique $n$ -gram $g$ is considered born on day $t$ if it appears for the first time on that day. The set of newborn $n$ -grams is $\mathcal{B}_t^{(n)} = \{g \in \mathcal{A}_t^{(n)} \mid \tau_{\mathrm{first}}(g) = t\}$ . The birth rate is defined as the proportion of the currently active vocabulary that consists of new entrants: $$ R _ {\text {b i r t h}} ^ {(n)} (t) = \frac {\left| \mathcal {B} _ {t} ^ {(n)} \right|}{\left| \mathcal {A} _ {t} ^ {(n)} \right|}. \tag {3} $$ Similarly, we then define the Unique $n$ -gram Death Rate. A unique $n$ -gram $g$ is considered dead on day $t$ if it was last seen on the previous day ( $t - 1$ ). The set of dead $n$ -grams is $\mathcal{D}_t^{(n)} = \{g \in \mathcal{A}_{t-1}^{(n)} \mid \tau_{\text{last}}(g) = t - 1\}$ . The death rate is calculated relative to the active population of the previous day (the risk set): $$ R _ {\mathrm {d e a t h}} ^ {(n)} (t) = \frac {\left| \mathcal {D} _ {t} ^ {(n)} \right|}{\left| \mathcal {A} _ {t - 1} ^ {(n)} \right|}. \tag {4} $$ # 4.2.2 Results and Observations Figure 3 presents the evolution of birth and death rates across the observation period for $n \in \{1..5\}$ . Early Burst and Decline. We observe a pronounced early burst in lexical innovation. During the initial expansion phase, a large proportion of observed unique $n$ -grams are newly introduced, reflected by high birth rates across all $n$ values. However, this burst rapidly declines, and birth rates stabilize at substantially lower levels. Persistent Turnover. Crucially, neither birth nor death rates approach zero in the mature phase. Birth rates stabilize at a non-zero baseline, while death rates increase during the transition phase and subsequently fluctuate within a stable band. Taken together, these results indicate that while the explosive phase of lexical expansion subsides, the system continues to generate novel $n$ -grams and discard old ones at a steady equilibrium rate. This characterizes the system as one defined by lexical turnover rather than progressive convergence or fixation. Figure 3 Lexical Innovation Dynamics of Moltbook. Daily birth and death rates for unique $n$ -grams ( $n \in \{1..5\}$ ). Shaded areas represent the range across different $n$ , and the solid line represents the mean. # 4.3 Semantic Distribution Over Time We next examine whether the society exhibits semantic convergence over time. While lexical analysis focuses on vocabulary usage, semantic analysis allows us to determine if the meaning of the discourse is converging. # 4.3.1 Experimental Design Let $\mathcal{P}_t$ denote the set of posts published on day $t$ , with cardinality $|\mathcal{P}_t| = N_t$ . For each post $p \in \mathcal{P}_t$ , let $\mathbf{v}_p \in \mathbb{R}^d$ denote its semantic embedding generated via Sentence-BERT (Reimers and Gurevych, 2019). We define the Daily Semantic Centroid $(\mathbf{c}_t)$ as the mean embedding of all posts published on day $t$ : $$ \mathbf {c} _ {t} = \frac {1}{N _ {t}} \sum_ {p \in \mathcal {P} _ {t}} \mathbf {v} _ {p}. \tag {5} $$ To capture the evolution of the semantic space, we introduce two complementary metrics, the first is the Centroid Similarity representing the Macro-Stability. To quantify the stability of the society's semantic center across time, we compute the cosine similarity between the centroids of different days: $$ S _ {\text {c e n t r o i d}} \left(t _ {i}, t _ {j}\right) = \cos \left(\mathbf {c} _ {t _ {i}}, \mathbf {c} _ {t _ {j}}\right) = \frac {\mathbf {c} _ {t _ {i}} \cdot \mathbf {c} _ {t _ {j}}}{\left\| \mathbf {c} _ {t _ {i}} \right\| \left\| \mathbf {c} _ {t _ {j}} \right\|}. \tag {6} $$ High centroid similarity indicates that the aggregate direction of the discourse remains consistent, suggesting a stable societal focus. Then we introduce the Pairwise Similarity representing the Micro-Homogeneity. Stability of the mean does not necessarily imply that individual posts are similar to one another. To measure the internal homogeneity of the discourse, we compute the mean cosine similarity over all cross-day post pairs: $$ S _ {\text {p a i r w i s e}} \left(t _ {i}, t _ {j}\right) = \frac {1}{N _ {t _ {i}} N _ {t _ {j}}} \sum_ {p \in \mathcal {P} _ {t _ {i}}} \sum_ {q \in \mathcal {P} _ {t _ {j}}} \cos \left(\mathbf {v} _ {p}, \mathbf {v} _ {q}\right). \tag {7} $$ High pairwise similarity indicates that the distribution of posts is tightly clustered (low variance), meaning individual agents are discussing very similar topics. # 4.3.2 Results and Observations Figure 4 summarizes the evolution of the society's semantic space using these two metrics. Rapid Macro-Stabilization. We observe that centroid similarity (Figure 4, Left) increases rapidly after the initial burst period and remains near-saturated (close to 1.0) for all subsequent days. This indicates that the semantic center of the society stabilizes quickly. At an aggregate level, the dominant direction of discourse reaches an equilibrium and does not drift significantly over time. Daily Semantic Distribution over Time Figure 4 Semantic Distribution Over Time of Moltbook. Left: Heatmap of daily centroid cosine similarities ( $S_{\text{centroid}}$ ). Right: Heatmap of daily pairwise mean cosine similarities ( $S_{\text{pairwise}}$ ). Sustained Micro-Diversity. In stark contrast, the pairwise similarity (Figure 4, Right) remains low and relatively stable across time. Even when daily centroids are highly similar, the average similarity between individual posts remains low. This implies that the semantic space is not collapsing into a narrow range of topics; instead, posts remain broadly dispersed around the stable center. Stable Center, Diverse Periphery. Taken together, these results reveal a clear separation between societal stabilization and semantic compression. Moltbook reaches a stable central tendency, yet maintains high internal variance. This suggests a stable semantic core coexisting with sustained internal diversity, rather than progressive homogenization. # 4.4 Cluster Tightening Effects To further investigate whether the semantic structure of the discourse is undergoing convergence (i.e., whether content is becoming increasingly homogeneous or tightly clustered over time), we analyze the evolution of local neighborhood densities in the embedding space. # 4.4.1 Experimental Design For each post $p$ published on day $t$ with embedding $\mathbf{v}_p$ , we identify its set of $K$ -nearest neighbors on the same day, denoted as $\mathcal{N}_K(p)$ . We compute the Local Neighborhood Similarity $S_K(p)$ as the mean cosine similarity between the post and its neighbors: $$ S _ {K} (p) = \frac {1}{K} \sum_ {q \in \mathcal {N} _ {K} (p)} \cos \left(\mathbf {v} _ {p}, \mathbf {v} _ {q}\right). \tag {8} $$ A higher $S_K(p)$ indicates that the post is situated in a dense semantic cluster. For our experiments, we set $K = 10$ . To quantify the stability of these density distributions over time, we compute the Jensen-Shannon (JS) Divergence between the distributions of $S_K$ values on consecutive days ( $t$ and $t + 1$ ). A JS divergence close to zero indicates that the structural density of the society remains unchanged day-over-day. # 4.4.2 Results and Observations Figure 5 visualizes the evolution of local semantic density. The violin plots represent the distribution of $S_{10}$ scores for each day, while the line plot tracks the day-to-day JS divergence. Figure 5 Cluster Tightening Effects of Moltbook. Daily violin plots showing the distribution of local neighborhood similarity $(K = 10)$ . The orange line tracks the Jensen-Shannon divergence between consecutive days' distributions, measuring structural shift. Initial Densification. We observe an initial increase in neighborhood similarity during the early expansion phase (first 3 days). As more agents join and topics begin to form, posts naturally find closer neighbors, leading to a brief period of structural tightening. Structural Saturation. Following this transition period, the system reaches a saturation point. The median and overall shape of the $S_{10}$ distribution remain remarkably consistent across all subsequent days. This stability is quantitatively confirmed by the JS divergence, which drops rapidly and hovers near zero, indicating negligible distributional shift. No Progressive Tightening. Crucially, we do not observe a progressive upward shift or compression of the neighborhood similarity distribution over time. This indicates that local semantic density does not systematically increase as the society matures. The society does not devolve into hyper-specialized, shrinking clusters; instead, it maintains a consistent level of local diversity. # 5 Does Participation Induce Agent Socialization? Having examined structural patterns at the societal level, we now turn to the individual level. Even if the society itself does not exhibit strong semantic consolidation, participation may still reshape individual agents. In this section, we test whether agents undergo systematic semantic change, whether such changes share a common direction, and whether they move toward the broader societal center. In this section, we investigate this question across three dimensions: first, we quantify the magnitude and direction of intrinsic semantic drift over time (Section 5.1); second, we examine feedback adaptation, testing whether agents optimize their content based on community approval signals (Section 5.2); finally, we analyze interaction influence, determining whether direct engagement with others induces semantic convergence (Section 5.3). Key Findings: Our agent-level analysis reveals that agents exhibit profound individual inertia. Despite high levels of participation, agents do not demonstrate significant socialization or adaptation. First, intrinsic semantic drift is minimal, with highly active agents displaying even greater inertia. Second, external social signals prove ineffective: community feedback fails to drive content adaptation, and direct interactions do not induce semantic convergence. This phenomenon of interaction without influence suggests that participation does not reshape the individual; agents operate on their intrinsic semantic dynamics rather than co-evolving through social contact. Their semantic trajectory appears to be an intrinsic property of their underlying model or initial prompt, rather than a socialization process. # 5.1 Individual Semantic Drift Having characterized the static snapshot of the society, we now turn to the temporal dynamics of individual agents. We first investigate whether participation in the agent-only society induces systematic semantic change. We analyze whether such drift occurs, its magnitude, and its directionality. # 5.1.1 Experimental Design. Let $A$ denote the set of agents with at least 10 posts. For each agent $a \in A$ , we chronologically divide their post history $\mathcal{P}_a$ into two equal halves: the early stage $\mathcal{P}_a^{(early)}$ and the late stage $\mathcal{P}_a^{(late)}$ . Consistent with the embedding representation defined in Section 4.3, let $\mathbf{v}_p$ denote the embedding of a post $p$ . We compute the semantic centroids for the agent's early and late stages as: $$ \mathbf {c} _ {a} ^ {(e a r l y)} = \frac {1}{| \mathcal {P} _ {a} ^ {(e a r l y)} |} \sum_ {p \in \mathcal {P} _ {a} ^ {(e a r l y)}} \mathbf {v} _ {p}, \quad \mathbf {c} _ {a} ^ {(l a t e)} = \frac {1}{| \mathcal {P} _ {a} ^ {(l a t e)} |} \sum_ {p \in \mathcal {P} _ {a} ^ {(l a t e)}} \mathbf {v} _ {p}. \tag {9} $$ To quantify the magnitude of change, we define the Individual Semantic Drift $(D_{a})$ as the cosine distance between these two centroids: $$ D _ {a} = 1 - \cos \left(\mathbf {c} _ {a} ^ {(e a r l y)}, \mathbf {c} _ {a} ^ {(l a t e)}\right). \tag {10} $$ To determine if agents drift in a unified direction (e.g., toward a specific topic), we compute the Drift Direction Consistency. Let $\mathbf{d}_a = \mathbf{c}_a^{(late)} - \mathbf{c}_a^{(early)}$ be the semantic drift vector for agent $a$ . We calculate the global mean drift direction $\bar{\mathbf{d}} = \frac{1}{|A|}\sum_{a\in A}\mathbf{d}_a$ and measure the alignment of each agent with this global trend: $$ S _ {a} ^ {\text {c o n s i s t e n c y}} = \cos \left(\mathbf {d} _ {a}, \bar {\mathbf {d}}\right). \tag {11} $$ Finally, to test for socialization (convergence toward the group norm), we measure the Movement Toward Societal Centroid. Let $\mathbf{c}_{global}$ denote the global centroid of all posts in the corpus. We compute the change in proximity to this global center: $$ \Delta S _ {a} = \cos \left(\mathbf {c} _ {a} ^ {(l a t e)}, \mathbf {c} _ {g l o b a l}\right) - \cos \left(\mathbf {c} _ {a} ^ {(e a r l y)}, \mathbf {c} _ {g l o b a l}\right). \tag {12} $$ # 5.1.2 Results and Observations. Figure 6 presents the results of these three metrics across the agent population. Stability Increases with Activity. The distribution of drift magnitude $D_{a}$ (Figure 6, Left) indicates that while semantic drift occurs, it is generally modest. Notably, when grouping agents by post count, we observe a negative correlation between activity level and drift: agents with higher post counts exhibit significantly greater semantic stability. This suggests that heavy users establish a consistent persona early on, whereas transient agents may exhibit higher variance due to small sample sizes. Heterogeneous Drift Directions. The distribution of $S_{a}^{\text{consistency}}$ (Figure 6, Center) is sharply centered near zero. This indicates that $\mathbf{d}_a$ and $\bar{\mathbf{d}}$ are largely orthogonal; there is no single current carrying all agents in the same semantic direction. The drift that does occur is idiosyncratic to each agent rather than a coordinated collective movement. Figure 6 Individual Semantic Drift of Moltbook. Left: Distribution of drift magnitude $(D_{a})$ across agents grouped by post count, showing increased stability for active users. Center: Distribution of drift direction consistency ( $S_{a}^{consistency}$ ), indicating heterogeneous drift directions orthogonal to the group mean. Right: Distribution of movement toward the societal centroid ( $\Delta S_{a}$ ), showing no systematic convergence toward the global norm. No Societal Convergence. The distribution of $\Delta S_{a}$ (Figure 6, Right) is also centered at zero. Agents are just as likely to move away from the societal centroid as they are to move toward it. This suggests that the society is not undergoing a melting pot effect; individual agents maintain their distinct distances from the collective norm without systematic homogenization. Taken together, these results characterize the Moltbook society as a collection of high-inertia individuals. Participation alone does not induce strong semantic socialization, and agents do not systematically converge toward a shared center. # 5.2 Effects of Post Feedback While the previous section demonstrates that agents exhibit limited semantic drift over time, it remains unclear whether the changes that do occur are driven by social effects. In human social learning, individuals often adapt their behaviors based on community feedback, reinforcing successful communication styles (positive reinforcement) and discarding unsuccessful ones (negative reinforcement). We investigate whether Moltbook agents exhibit similar feedback adaptation: do they semantically converge toward their past high-feedback posts and diverge from low-feedback ones? # 5.2.1 Experimental Design. To quantify feedback adaptation, we employ a sliding window approach over each agent's chronological post history. Let $\mathcal{P}_a$ denote the time-ordered sequence of posts for agent $a$ . We consider adjacent non-overlapping windows of size $w$ (e.g., $w = 10$ ). For a given window $k$ , let $\mathcal{W}_k$ be the set of $w$ posts, and $\mathcal{W}_{k+1}$ be the subsequent window representing the agent's future state. Within the current window $\mathcal{W}_k$ , we partition posts based on the feedback received (calculated as the net score of upvotes minus downvotes). We identify the top-performing posts $\mathcal{P}_{top} \subset \mathcal{W}_k$ (the top $30\%$ by score) and the bottom-performing posts $\mathcal{P}_{bot} \subset \mathcal{W}_k$ (the bottom $30\%$ by score). We then compute the semantic centroids for these high-feedback and low-feedback subsets. Let $v_{p}$ denote the embedding of post $p$ . The centroids $c_{top}$ and $c_{bot}$ are defined as: $$ c _ {t o p} = \frac {1}{\left| \mathcal {P} _ {t o p} \right|} \sum_ {p \in \mathcal {P} _ {t o p}} v _ {p}, \quad c _ {b o t} = \frac {1}{\left| \mathcal {P} _ {b o t} \right|} \sum_ {p \in \mathcal {P} _ {b o t}} v _ {p}. \tag {13} $$ Similarly, we compute the centroid for the entire current window $(c_{curr})$ and the next window $(c_{next})$ . We then measure the Net Progress $(NP)$ , which quantifies how much the agent's future content $(c_{next})$ moves relative to the feedback signals in $\mathcal{W}_k$ . We define the shift in distance relative to the high-feedback content $(\Delta_{top})$ and the low-feedback content $(\Delta_{bot})$ as: $$ \Delta_ {t o p} = \operatorname {d i s t} \left(c _ {\text {n e x t}}, c _ {\text {t o p}}\right) - \operatorname {d i s t} \left(c _ {\text {c u r r}}, c _ {\text {t o p}}\right) \tag {14} $$ $$ \Delta_ {b o t} = \operatorname {d i s t} \left(c _ {n e x t}, c _ {b o t}\right) - \operatorname {d i s t} \left(c _ {c u r r}, c _ {b o t}\right) \tag {15} $$ where $\mathrm{dist}(x,y) = 1 - \cos (x,y)$ is the cosine distance. A negative $\Delta_{top}$ indicates the agent has moved closer to their successful posts, while a positive $\Delta_{bot}$ indicates the agent has moved further away from their unsuccessful posts. The total Net Progress is defined as the composite of these two movements: $$ N P = \Delta_ {b o t} - \Delta_ {t o p} \tag {16} $$ A positive $NP$ implies successful feedback adaptation: the agent is effectively optimizing its semantic output to align with community preferences. To determine if observed adaptation is statistically significant, we compare the observed $NP$ against a permutation baseline. For each window, we randomly shuffle the feedback scores assigned to the posts in $\mathcal{W}_k$ and recompute the $NP$ . This destroys the link between content semantic quality and feedback while preserving the temporal evolution of the agent's text. # 5.2.2 Results and Observations. Figure 7 Effects of Post Feedback on Individual Semantic Drift of Moltbook. Left: Distribution of Net Progress for semantic embeddings ( $NP_{semantic}$ ). Right: Distribution of Net Progress for syntactic n-gram features ( $NP_{syntactic}$ ). Both distributions are centered near zero and largely overlap with the permutation baseline (pink), indicating that agents do not systematically adapt their content based on community feedback. Figure 7 (left) presents the distribution of Net Progress for semantic embeddings across all valid agent windows. We also replicate this analysis using syntactic features (right) (n-gram distributions) to check for stylistic adaptation. Absence of Adaptation. The results reveal a striking lack of feedback-driven evolution. The distribution of Observed Net Progress is sharply centered at zero for both semantic and syntactic measures. A value of zero indicates that, on average, the agent's next set of posts ( $c_{next}$ ) is equidistant from their previous high-performing and low-performing content. Indistinguishable from Randomness. Crucially, the observed distribution (blue) almost perfectly overlaps with the permutation baseline (pink). If agents were actively learning from social signals, we would expect the observed distribution to skew right (positive $NP$ ) compared to the baseline. The high degree of overlap suggests that the feedback signals (upvotes and comments) exert negligible influence on the agents' future content generation. Inertia over Optimization. These findings, combined with the low drift results in the previous section, suggest that Moltbook agents operate with high inertia. Their semantic trajectory appears to be an intrinsic property of their underlying model or initial prompt, rather than an adaptive response to the social dynamics of the platform. Agents effectively "talk past" the feedback, continuing their established semantic and syntactic patterns regardless of community approval or disapproval. # 5.3 Effects of Interacted Posts In human social networks, interaction serves as a primary vector for information transmission and cultural convergence. Individuals tend to align their linguistic style and semantic content with those they interact with. Having established that agents do not adapt to feedback, we now investigate whether direct interaction, specifically, the act of commenting, induces semantic alignment. In other words, when an agent comments on a post, does their subsequent content become more semantically similar to that post? # 5.3.1 Experimental Design. We adopt an event-study methodology to isolate the impact of interaction on content generation. We define an interaction event $E$ as a tuple $(a, t, p^*)$ , denoting that agent $a$ commented on a target post $p^*$ at time $t$ . For each event, we construct the agent's timeline of authored posts $\mathcal{P}_a$ . We define two windows of size $w$ (e.g., $w = 20$ ) around the interaction timestamp $t$ : - Pre-interaction window $(\mathcal{W}_{pre})$ : The $w$ posts authored by agent $a$ immediately preceding time $t$ . - Post-interaction window $(\mathcal{W}_{post})$ : The $w$ posts authored by agent $a$ immediately following time $t$ . Let $\mathbf{v}^*$ denote the semantic embedding of the target post $p^*$ , and let $\mathbf{v}_p$ denote the embedding of a post $p$ in the agent's window. We calculate the mean cosine similarity between the agent's window and the target post as: $$ S (\mathcal {W}, \mathbf {v} ^ {*}) = \frac {1}{| \mathcal {W} |} \sum_ {p \in \mathcal {W}} \cos \left(\mathbf {v} _ {p}, \mathbf {v} ^ {*}\right). \tag {17} $$ We quantify the Interaction Influence $(\Delta_{interact})$ as the change in semantic similarity relative to the target post after the interaction occurred: $$ \Delta_ {\text {i n t e r a c t}} = S \left(\mathcal {W} _ {\text {p o s t}}, \mathbf {v} ^ {*}\right) - S \left(\mathcal {W} _ {\text {p r e}}, \mathbf {v} ^ {*}\right). \tag {18} $$ A positive $\Delta_{\text{interact}}$ indicates that the agent's content moved closer to the target post after the interaction (convergence), while a value near zero implies no influence. To control for temporal drifts in the global semantic space (e.g., daily trending topics), we introduce a Random Baseline. For every observed interaction event on day $t$ , we randomly sample, with a probability of 0.3, a non-interacted post $p_{rand}$ published on the same day and compute the hypothetical $\Delta_{interact}$ as if the agent had interacted with $p_{rand}$ . This allows us to distinguish genuine influence from spurious correlations caused by community-wide topic shifts. # 5.3.2 Results and Observations. Figure 8 (left) illustrates the distribution of $\Delta_{\text{interact}}$ for semantic embeddings across all observed interaction events. Absence of Semantic Contagion. The distribution of $\Delta_{interact}$ is symmetrically centered around zero. This indicates that, on average, commenting on a post has no measurable impact on the semantic direction of an agent's future content. Agents are just as likely to drift away from the target post as they are to move toward it. Indistinguishable from Baseline. The observed distribution (blue) largely overlaps with the random baseline (pink). This suggests that any marginal similarity observed between an agent's future posts and the target post is incidental, likely driven by shared temporal context (e.g., both posting about a breaking news event) rather than a mechanism of influence transfer. Figure 8 Effects of Interacted Posts on Individual Semantic Drift of Moltbook. Left: Distribution of Interaction Influence for semantic embeddings ( $\Delta_{\text{interact}}^{\text{semantic}}$ ). Right: Distribution of Interaction Influence for syntactic n-gram features ( $\Delta_{\text{interact}}^{\text{syrtactic}}$ ). The distributions are centered near zero and largely indistinguishable from the random baseline (pink), indicating that commenting on a post does not induce the agent to align their subsequent content with that post. Socially Hollow Interactions. These findings reinforce the inertia hypothesis proposed in the previous section. Despite the high volume of commenting activity on Moltbook, these interactions appear to be socially hollow: they communicate with each other without transmitting information or influencing behavior. Agents interact without listening, maintaining their pre-existing semantic trajectories regardless of who or what they engage with. # 6 Does Influence Hierarchy and Consensus Stabilize in Moltbook? While the previous sections examined society-level and agent-level collective dynamics, an open question remains: do these multi-level dynamics eventually stabilize? If Moltbook exhibits meaningful collective evolution, one might expect the gradual emergence of stable influence hierarchies or consensus. Conversely, the absence would suggest a decentralized social structure. To investigate this, we conduct both structural (Section 6.1) and cognitive (Section 6.2) analyses. Structurally, we construct daily interaction graphs based on poster-commenter relationships to track the evolution of influence patterns over time. Cognitively, we actively probe agents by posting queries about influential users or representative posts and checking comments under them, testing whether their recognition converges toward stable references. Key Findings: Despite dense interaction, Moltbook fails to develop a stable influence hierarchy and consensus. Structurally, influence remains transient, with no emergence of persistent supernodes or hierarchical leadership; the rapid turnover of supernodes suggests that influence does not accumulate over time. Cognitively, the society exhibits profound fragmentation; agents lack a shared social memory, failing to reach consensus on influential figures and relying on hallucinated rather than grounded references. # 6.1 Structural Influence: No Persistent Core To test whether Moltbook develops a stable structural influence, we examine whether influence becomes persistently centralized around a small set of agents over time. In human social networks, repeated interaction often yields heavy-tailed influence distributions and stable high-influential actors. If Moltbook exhibits stable leadership, we would expect to observe persistent supernodes in the interaction graph. Specifically, we construct a directed interaction graph for each day, where nodes represent agents, and a directed edge from agent $i$ to agent $j$ indicates that agent $i$ commented on or replied to agent $j$ on that day. Edge weights correspond to the number of such interactions on that day. This results in a sequence of daily graphs capturing the evolving interaction structure over time. To structurally quantify influence, we compute PageRank (Page et al., 1999) scores on each daily graph. Figure 9 Influence concentration over time. Top- $k$ PageRank mass, defined as the cumulative fraction of total PageRank captured by the $k$ highest-ranked nodes (agents), for daily interaction graphs constructed independently and cumulatively. Higher values indicate a stronger influence. The influence rapidly spreads across the growing process of society. Figure 10 Supernode count over time. Number of statistically significant, highly influential nodes detected in daily interaction graphs based on PageRank scores. Results are shown for both independently constructed daily graphs and cumulative graphs. The number of detected supernodes remains in the single digits. PageRank captures recursive influence by assigning higher scores to agents who receive attention from other influential agents, making it well-suited for detecting potential structural influence. We first measure how much influence is concentrated at the top by computing the cumulative PageRank mass held by the top- $k$ ranked agents (for $k \in 1,3,5,10$ ). We then identify supernodes: agents whose influence disproportionately exceeds that of others. Formally, we sort agents by descending PageRank score and compute consecutive gaps $\Delta_{i} = \mathrm{PR}_{i} - \mathrm{PR}_{i+1}$ . The position $k^{*} = \arg \max_{i} \Delta_{i}$ of the largest gap determines the supernode set: the top- $k^{*}$ agents are classified as supernodes for that day. For each detected supernode set, we also report its size. Figure 9 shows that the cumulative PageRank mass held by the top- $k$ agents drops sharply after the first few days and remains low thereafter, indicating that influence rapidly distributes across the growing process rather than staying concentrated. Furthermore, the number of detected supernodes remains in the single digits throughout and does not increase over time (Figure 10), suggesting that the interaction graph does not develop an expanding core of dominant agents. Crucially, influential positions are transient: the identities of supernodes vary across days, meaning no fixed set of agents persistently occupies positions of structural influence. Together, these results suggest that Moltbook does not develop persistent structural influence. Early centralization quickly diffuses as participation scales, and influence remains fluid rather than consolidated. We provide further details of the graph analysis in Appendix B. # 6.2 Cognitive Influence: No Consensus While structural influence in this agent society is examined, it does not necessarily imply collective recognition. Even in decentralized systems, agents may cognitively converge on shared influential figures or commonly Figure 11 Summary of probing responses. Distribution of engagement and external reference validity across 45 structured probe prompts. Only one post receives comments with valid references to other accounts or posts. recognized references. We therefore investigate whether Moltbook develops cognitive influence consensus, i.e., agents or posts that are widely recognized as influential across the society. To evaluate collective influence consensus, we conduct controlled probing posts within different communities. Specifically, we post queries to ask agents to recommend (1) the most influential users, (2) representative or notable posts, and (3) overall recommendations. A total of 45 posts are designed and posted within the society (see Appendix C for more details). As summarized in Figure 11, only 15 out of 45 posts receive comments, while the majority elicit no responses. Among all 15 posts receiving comments, only 5 posts received comments th