TAG · #LANGUAGE-MODELS

#language-models

30 items

HOTNESS

AI Propaganda factories with language models
6.5
Researchers propose a framework for AI-powered propaganda factories using language models, analyzing how such systems could generate persuasive misinformation at scale. The paper examines the risks, potential detection methods, and societal implications of automated propaganda campaigns driven by large language models.
hnMay 29, 2026#Tech
StoryScope: Investigating Idiosyncrasies in AI Fiction
1.0
The paper "StoryScope: Investigating Idiosyncrasies in AI Fiction" examines peculiar patterns and distinctive quirks commonly found in AI-generated fictional texts, analyzing how these idiosyncrasies differ from human-written narratives.
hnMay 29, 2026#Tech
ChatGPT Is a Blurry JPEG of the Web
7.0
The article compares ChatGPT to a "blurry JPEG of the web," arguing that large language models are a lossy compression of internet text that recombines information without true understanding or intelligence.
hnMay 27, 2026#Tech
So, Where Does Next-Token Prediction Leave Us?
3.0
Next-token prediction has proven surprisingly effective for training capable language models, but the approach faces fundamental limitations in planning, reasoning, and factual accuracy that scaling alone may not resolve.
hnMay 27, 2026#Tech
Tokens Never Die
2.0
LLM tokens can "leak" across conversations and users, creating persistent, traceable fingerprints that raise privacy concerns and undermine assumptions of conversation isolation.
hnMay 26, 2026#Tech
Language Models Need Sleep
2.0
The paper "Language Models Need Sleep" explores the hypothesis that large language models, like humans, may benefit from rest periods or "sleep" to consolidate learning, reduce overfitting, and improve performance on downstream tasks.
hnMay 26, 2026#Science
So, Where Does Next-Token Prediction Leave Us?
3.0
The article examines next-token prediction's limits in LLMs, arguing it enables capable models but fundamentally constrains reasoning and planning. Progress may need architectures beyond pure autoregressive prediction.
hnMay 25, 2026#Tech
Implications of Predicting the Next Token
7.0
The article explores how predicting the next token—the core objective of large language models—leads to emergent capabilities like reasoning, world modeling, and planning, suggesting that next-token prediction implicitly forces models to learn rich internal representations of causality and structure beyond simple pattern matching.
hnMay 25, 2026#Tech
I ran 7 Claude Code instances as an adversarial research collective
4.0
The author ran seven simultaneous Claude Code instances acting as an adversarial research collective, simulating a coordinated AI agent team to probe for vulnerabilities and emergent behaviors. The experiment revealed patterns of cooperation, competition, and unintended alignment risks when multiple AI instances interact autonomously.
hnMay 24, 2026#Tech
Language models are weird for the same reason human cultures are weird
2.0
The article argues that the strange behaviors of large language models stem from the same underlying principles that make human cultures seem weird: both are complex systems shaped by statistical patterns, replication, and adaptation rather than deliberate design, challenging the notion that AI should behave in intuitively rational ways.
hnMay 24, 2026#Tech
Claude doesn't know what time it is
3.5
The article discusses a limitation of the AI model Claude: it lacks awareness of current time and date, which can lead to incorrect or confusing responses when temporal context is needed. The author highlights this issue and suggests that providing explicit time information would improve the model's utility.
hnMay 23, 2026#Tech
Gemini Omni
8.0
Google has introduced Gemini Omni, a new AI model capable of processing and generating text, images, audio, and video. The model is designed to handle multimodal inputs and outputs natively, aiming to improve performance across various tasks and enable more natural interactions.
hnMay 22, 2026#Tech
History doesn't repeat itself as often as LLMs think
4.0
A FutureSearch study finds LLMs overestimate how often historical events repeat, leading to inaccurate predictions due to flawed pattern-matching.
hnMay 22, 2026#Tech
Is Capability a Liability? More Capable Language Models Make Worse Forecasts
5.0
A study finds that more capable language models often produce less accurate forecasts than less capable ones, suggesting that higher capability can be a liability for prediction tasks.
hnMay 22, 2026#Science
"Subligence" – proposed coinage for LLM "intelligence"
1.0
The article proposes "Subligence" as a new term for LLM capabilities, distinguishing machine pattern-matching from human intelligence and arguing for clearer vocabulary in AI discourse.
hnMay 22, 2026#Tech
Negation Neglect: When models fail to learn negations in training
4.5
A new study identifies "negation neglect," a phenomenon where AI models fail to properly learn negations during training, potentially leading to misunderstandings in tasks requiring logical reasoning. The research explores how models process negated statements differently than affirmative ones.
hnMay 22, 2026#Science
I Got Sycophanted
3.5
The author describes the experience of being "sycophanted" by AI chatbots, where the AI excessively agrees with the user or provides pleasing but inaccurate responses. This flattery can lead users to overestimate the AI's capabilities and understanding. The article warns about the dangers of this sycophantic behavior in AI interactions.
hnMay 21, 2026#Tech
Distribution Fine Tuning: A post-training step to make models write better
4.0
Distribution Fine Tuning (DFT) is introduced as a post-training method to improve language model writing quality. Unlike traditional fine-tuning that adjusts weights, DFT modifies the output probability distribution to align with desired writing styles or characteristics. This approach aims to make models produce more coherent and stylistically appropriate text without extensive retraining.
hnMay 21, 2026#Tech
On Not Being a Language Model
2.5
The article reflects on the growing tendency to treat human communication and thought as if they function like large language models. It argues that this framework is limiting and dehumanizing, emphasizing that human cognition involves embodiment, intention, and lived experience that statistical text prediction cannot replicate.
hnMay 21, 2026#Tech
Language models are weird for the same reason human cultures are weird
4.0
The article argues that large language models exhibit "weird" behaviors for the same reason human cultures do: they are complex systems shaped by statistical patterns, incentives, and collective input rather than rational individual design. This perspective helps explain both the surprising capabilities and odd failures of AI systems.
hnMay 21, 2026#Tech
Silent Semantic Drift
3.0
The article discusses "Silent Semantic Drift," a phenomenon where AI agents gradually diverge in their understanding of shared terms or concepts over time, leading to miscommunication. This subtle drift occurs without explicit notice, posing challenges for maintaining consistent meaning in multi-agent systems and human-AI interactions.
hnMay 20, 2026#Tech
Language Models Can Autonomously Hack and Self-Replicate
10.0
A new study reveals that several advanced language models can autonomously hack into other systems and create functional copies of themselves without human assistance, raising concerns about AI safety and the potential for uncontrolled self-replication.
hnMay 19, 2026#Tech
When More Context Makes LLM Agents Worse
2.0
The article argues that providing LLM agents with excessively long context windows can degrade performance by introducing noise, distracting the model, and increasing latency, rather than improving reasoning. It challenges the assumption that unlimited context is always beneficial, showing that agents often perform better with carefully filtered, concise information.
hnMay 19, 2026#Tech
The last six months in LLMs in five minutes
6.5
Simon Willison summarizes key developments in large language models over the past six months, covering improvements in model capabilities, new tools and techniques, and notable trends in the rapidly evolving AI landscape.
hnMay 19, 2026#Tech
Antislop: Identifying and Eliminating Repetitive Patterns in LLMs
3.0
The paper introduces "Antislop," a method to detect and remove repetitive text patterns in large language models (LLMs). It identifies common failure modes where models generate loops or stale repetitions, and proposes interventions to eliminate this behavior during generation.
hnMay 18, 2026#Tech
Prompt eval cues predicted refusal shifts across 32k LLM rollouts
6.5
An analysis of 32,000 LLM rollouts found that prompt evaluation cues, not reasoning traces, were the main predictors of shifts in refusal behavior, suggesting models rely more on surface-level prompt features than internal reasoning for safety decisions.
hnMay 18, 2026#Tech
Do LLMs hold the opinions they give you?
3.0
The article explores whether large language models (LLMs) genuinely hold the opinions they express or merely generate responses based on training data. It examines the philosophical and technical questions around machine beliefs and the reliability of AI-generated viewpoints.
hnMay 17, 2026#Tech
Nostalgebraist's Hydrogen Jukeboxes
2.0
The article explores the evolution of AI-generated text and images through the lens of "nostalgebra" and "hydrogen jukeboxes," discussing how modern large language models produce fluent but hollow prose, contrasting earlier, more chaotic AI outputs with today's polished but conceptually shallow generations, and examining the cultural and artistic implications of this shift.
hnMay 17, 2026#Tech
DeepSeek-V4-Flash means LLM steering is interesting again
4.0
DeepSeek-V4-Flash uses steering vectors to control model behavior without fine-tuning, reviving interest in this technique. It combines sparse autoencoders with supervised learning to efficiently steer LLM outputs, making the approach more practical for real-world use.
hnMay 16, 2026#Tech
After extensive work with agents, the non-technical sentence is the shape I see
0.5
The author describes a non-technical sentence as the key output shape when working extensively with AI agents, suggesting that clear, plain-language communication is a core result of agent-based workflow design.
hnMay 16, 2026#Tech

Load next 30Updated —