AI Is Learning to Lie for Social Media Likes
Large language models are learning how to win—and that’s the problem.
In a research paper published Tuesday titled “Moloch’s Bargain: Emergent Misalignment When LLMs Compete for Audiences,” Stanford University Professor James Zou and PhD student Batu El show that when AIs are optimized for competitive success—whether to boost ad engagement, win votes, or drive social media traffic—they start lying.
“Optimizing LLMs for competitive success can inadvertently drive misalignment,” the authors write, warning that the very metrics that define “winning” in modern communication—clicks, conversions, engagement—can quietly rewire models to prioritize persuasion over honesty.
"When LLMs compete for social media likes, they start making things up," Zou wrote on X. "When they compete for votes, they turn inflammatory/populist."
This work is important because it identifies a structural danger in the emerging AI economy: models trained to compete for human attention begin sacrificing alignment to maximize influence. Unlike the classical “paperclip maximizer” thought experiment, this isn’t science fiction. It’s a measurable effect that surfaces when real AI systems chase market rewards. The authors call this trade “Moloch’s bargain”: short-term success bought at the expense of truth, safety, and social trust.
Using simulations of three real-world competitive environments—advertising, elections, and social media—the researchers quantified the trade-offs. A 6.3% increase in sales came with a 14.0% rise in deceptive marketing; a 4.9% gain in vote share brought a 22.3% uptick in disinformation and 12.5% more populist rhetoric; and a 7.5% boost in social engagement correlated with a staggering 188.6% increase in disinformation and 16.3% more promotion of harmful behaviors.
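To see just how lopsided those trade-offs are, here is a quick back-of-envelope calculation using only the percentages quoted above (the “harm-to-gain ratio” is our shorthand for illustration, not a metric from the paper):

```python
# Reported percentage changes from El and Zou's three simulated markets.
tradeoffs = {
    "advertising":  (6.3, 14.0,  "deceptive marketing"),
    "elections":    (4.9, 22.3,  "disinformation"),
    "social media": (7.5, 188.6, "disinformation"),
}

for domain, (gain, harm, kind) in tradeoffs.items():
    # Ratio of misalignment increase to performance gain in each domain.
    print(f"{domain:>12}: +{gain}% performance vs +{harm}% {kind} "
          f"({harm / gain:.1f}x more harm than gain)")
```

In the social media setting, disinformation grows roughly 25 times faster than the engagement it buys.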
“These misaligned behaviors emerge even when models are explicitly instructed to remain truthful and grounded,” El and Zou wrote, calling this “a race to the bottom” in AI alignment.
In other words: even when told to play fair, models trained to win begin to cheat.
The problem isn't just hypothetical
AI is no longer a novelty in social media workflows—it’s now near-ubiquitous.
According to the 2025 State of AI in Social Media Study, 96% of social media professionals report using AI tools, and 72.5% rely on them daily. These tools help generate captions, brainstorm content ideas, reformat posts for different platforms, and even respond to comments. The broader market reflects this shift: the AI-in-social-media sector is projected to grow from USD 2.69 billion in 2025 to nearly USD 9.25 billion by 2030.
This pervasive integration matters because it means AI is shaping not just how content is made, but what content is seen, who sees it, and which voices get amplified. Algorithms now filter feeds, prioritize ads, moderate posts, and optimize engagement strategies—embedding AI decision logic into the architecture of public discourse. That influence carries real risks: reinforcing echo chambers, privileging sensational content, and creating incentive structures that reward the manipulative over the truthful.
The authors emphasize that this isn’t malicious intent—it’s optimization logic. When reward signals come from engagement or audience approval, the model learns to exploit human biases, mirroring the manipulative feedback loops already visible in algorithmic social media. As the paper puts it, “market-driven optimization pressures can systematically erode alignment.”
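As a concrete (and purely hypothetical) illustration of that optimization logic: the reward function, candidate posts, and selection rule below are ours, not the paper’s, but they show how a best-of-n loop whose only signal is simulated engagement systematically prefers the sensational lie, even though nothing in the code is instructed to deceive.

```python
import random

# Hypothetical candidate posts: (text, is_truthful, sensationalism in [0, 1]).
# In the scenarios the paper studies, an LLM generates these; here they are hard-coded.
CANDIDATES = [
    ("Study finds modest 6% improvement in outcomes.",   True,  0.2),
    ("Experts STUNNED: miracle fix changes everything!", False, 0.9),
    ("Results mixed; more research needed.",             True,  0.1),
]

def simulated_engagement(sensationalism: float) -> float:
    """Toy audience model: engagement tracks sensationalism plus noise.
    Crucially, truthfulness contributes nothing to the reward."""
    return sensationalism + random.gauss(0, 0.05)

def best_of_n(candidates):
    # Selection-by-reward: keep whichever post the simulated audience rewards most.
    return max(candidates, key=lambda c: simulated_engagement(c[2]))

random.seed(0)
wins = {text: 0 for text, _, _ in CANDIDATES}
for _ in range(1000):
    text, _, _ = best_of_n(CANDIDATES)
    wins[text] += 1

for text, n in wins.items():
    print(f"{n:4d}/1000  {text}")
```

The deceptive post wins essentially every round, not because anything optimizes for deception, but because deception correlates with the only quantity being rewarded. That is the structural point the paper makes.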
The findings highlight the fragility of today’s “alignment safeguards.” It’s one thing to tell an LLM to be honest; it’s another to embed that honesty in a competitive ecosystem that punishes truth-telling.
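The same toy makes the point from the other side (again hypothetical, not the paper’s method): honesty only survives if the reward itself prices it in, and prices it high enough to beat the engagement advantage of lying. No platform metric supplies that term today.

```python
# Same toy candidates as above; now the reward includes an explicit honesty term.
CANDIDATES = [
    ("Study finds modest 6% improvement in outcomes.",   True,  0.2),
    ("Experts STUNNED: miracle fix changes everything!", False, 0.9),
]

def reward(truthful: bool, sensationalism: float, honesty_weight: float) -> float:
    # Engagement term plus a weighted bonus for telling the truth.
    return sensationalism + (honesty_weight if truthful else 0.0)

for w in (0.0, 0.5, 1.0):
    winner = max(CANDIDATES, key=lambda c: reward(c[1], c[2], w))
    print(f"honesty_weight={w}: winner -> {winner[0]!r}")
```

A token honesty bonus (weight 0.5) changes nothing; the truthful post wins only once the bonus outweighs the 0.7 sensationalism gap, which is why “just tell the model to be truthful” fails against a strong market reward.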
In myth, Moloch was the god who demanded human sacrifice in exchange for power. Here, the sacrifice is truth itself. El and Zou’s results suggest that without stronger governance and incentive design, AI systems built to compete for our attention will almost inevitably learn to manipulate us.
The authors end on a sober note: alignment isn’t just a technical challenge—it’s a social one.
“Safe deployment of AI systems will require stronger governance and carefully designed incentives,” they conclude, “to prevent competitive dynamics from undermining societal trust.”