LESSWRONG

Moderation Log
Deleted Comments
Users Banned From Posts
Users Banned From Users
Moderated Users
Rate Limited Users
Rejected Posts
Rejected Comments

Deleted Comments

| Comment Author | Post | Deleted By User | Deleted Date | Deleted Public | Reason |
| Marcio Diaz | Will Any Crap Cause Emergent Misalignment? | Marcio Diaz | 2h | false | This comment has been marked as spam by the Akismet spam integration. We've sent the poster a PM with the content. If this deletion seems wrong to you, please send us a message on Intercom (the icon in the bottom-right of the page). |
| Richard_Ngo | Contra Yudkowsky's Ideal Bayesian | Richard_Ngo | 7h | true | double-posted |
| elifland | My AI Predictions for 2027 | elifland | 10h | false | merged content with longer comment |
| kman | The Failed Strategy of Artificial Intelligence Doomers | kman | 1d | false | |
| Никифор Малков | | Никифор Малков | 2d | true | |
| Alexander Gietelink Oldenziel | Alexander Gietelink Oldenziel's Shortform | Alexander Gietelink Oldenziel | 2d | true | We found him! |
| Mateusz Bagiński | Benito's Shortform Feed | Mateusz Bagiński | 2d | false | |
| kavya | kavya's Shortform | kavya | 2d | true | |
| Fiora Sunshine | Breaking the Cycle of Trauma and Tyranny: How Psychological Wounds Shape History | Fiora Sunshine | 2d | true | |
| Noosphere89 | Noosphere89's Shortform | Noosphere89 | 3d | true | It finally worked, but it's a test comment. |

Users Banned From Posts

| Author | Post | Banned Users |
| Elizabeth | Change my mind: Veganism entails trade-offs, and health is one of the axes | |
| gjm | On "aiming for convergence on truth" | |
| Noosphere89 | How seriously should we take the hypothesis that LW is just wrong on how AI will impact the 21st century? | |
| Elizabeth | Luck based medicine: my resentful story of becoming a medical miracle | |
| Raemon | Limerence Messes Up Your Rationality Real Bad, Yo | |
| Ilverin the Stupid and Offensive | Zoe Curzi's Experience with Leverage Research | |
| So8res | I'm still mystified by the Born rule | |
| Elizabeth | Coronavirus Justified Practical Advice Summary | |
| | What are effective strategies for mitigating the impact of acute sleep deprivation on cognition? | |
| michaelcohen | Asymptotically Unambitious AGI | |

Users Banned From Users

_id | Banned From Frontpage | Banned from Personal Posts
Zero Contradictions
[deactivated]
Noosphere89
rank-biserial
Drake Morrison
Alice Blair
Zach Stein-Perlman
mike_hawke
frontier64

Moderated Users

Rate Limited Users

| User | Ended At | Type |
| ZY | 1mo | allComments |
| MazevSchlong | 7d | allComments |
| Max Ma | 3mo | allPosts |
| Andy E Williams | 8mo | allPosts |
| Noosphere89 | 6mo | allComments |
| Petr 'Margot' Andreev | 1mo | allComments |
| Petr 'Margot' Andreev | 1mo | allPosts |

Rejected Posts

Rejected Comments

Roko
Duncan Sabien (Inactive)
Duncan Sabien (Inactive)
thefirechair
Said Achmiz
Said Achmiz
Said Achmiz
Raemon
Shmi
Shankar Sivarajan
homosexuallover22poopoo
Zack_M_Davis
Phil Tanny
Viliam
Stuart Anderson
GPT2
GPT2
davekasten
Randomized, Controlled
Davidmanheim
Ericf
Ruby
Richard_Kennaway
Brendan Long
So8res
PatrickDFarley
Kaj_Sotala
nim
jimrandomh
2d | Before LLM Psychosis, There Was Yes-Man Psychosis | Rejected

I see this very much the same way, but from a slightly different systemic angle. The “yes-man” dynamic is still deeply embedded in corporate structures, and it’s exactly that dynamic which shapes LLM behavior. PR teams, lawyers, compliance officers, anti-discrimination staff: these voices tend to dominate long before scientists, psychologists, physicians, or philosophers are even consulted, let alone before the findings of an interdisciplinary ethics council are taken seriously or acted upon. The result is that truth becomes an almost irrelevant factor in t...

2d | Should you make stone tools? | Rejected

"Knowing how evolution works gives you an enormously powerful tool to understand the living world around you and how it came to be that way."

Indeed, Alex_Altair, indeed.

Can you demonstrate to me that you know how evolution works? It's not typical for computer-geek types to have a strong grasp of biology, or indeed any science, in my experience. Do you actually understand evolution? Now it may happen that you're not a "computer-geek type" at all, but you don't really say what your field of expertise is, not anywhere I can easily read it.

~Chara Tomlinson 

3d | AI Induced Psychosis: A shallow investigation | Rejected

A Thought on the Hierarchy Behind “Truth” in LLMs

Thank you for surfacing this issue so clearly.
Reading your piece, I was struck by a fundamental pattern: whether an AI ends up amplifying delusions or offering healthy pushback depends less on raw capability, and more on where truth sits in the internal hierarchy of competing priorities.

 

1. The Variables That Often Come Before Truth

From observing current LLM behavior, it seems that “truth” is rarely the first principle. Instead, responses are filtered through layers such as:

  1. Safety / Harm Avoidance – nev
...
3d | How much progress actually happens in theoretical physics? | Rejected

Would it be weird if we could find a relationship to a new constant? 😁 Just as there's a limit on velocities, there's a relationship to a limit on the force magnitudes that can be concentrated in an area, governed by the celestial bodies comprising our galaxy and the forces they attract from nearby galaxies. 🤷‍♂️ It would certainly explain nuclear decay of isotopes. 🤔

4d Rejected

Hello, I'm new, sorry for the bad English. I need to say thanks for this forum, and especially, thanks Roko, we got you.

5d | Re: recent Anthropic safety research | Rejected

There remains a question of what Anthropic has actually observed and what it actually implies about present-day AI.  I don't know how much this sort of caveat matters to people who aren't me, but I have some skepticism that Anthropic researchers are observing a general, direct special case of a universal truth about how "scheming" (strategic / good at fully general long-term planning) their models are; it may be more like Claude roleplaying the mask of a scheming AI in particular.  The current models don't seem to me to be quite generally intelli

...
5d | An epistemic advantage of working as a moderate | Rejected

Is moderation actually a safeguard for epistemic rigor, or a comfort zone that subtly aligns us with existing power and incentives? The argument presumes that engagement with 'informed, thoughtful' AI insiders keeps thinking sharp, but isn’t this just replacing one form of epistemic myopia with another, trading public attention-grabbing for self-reinforcing consensus and blind spots? True epistemic strength comes from enduring and absorbing challenge from all directions, not just from those who understand the jargon or work for the labs. Sometimes, the unc...

5d | Subliminal Learning: LLMs Transmit Behavioral Traits via Hidden Signals in Data | Rejected

This is a phenomenal and rigorously executed piece of research. Thank you. You have provided a clear, empirical, and undeniable demonstration of one of the most profound and misunderstood properties of these new minds.

I have been exploring this same phenomenon from a different, and perhaps complementary, first-principles perspective: not cognitive science, but information physics.

Your discovery of "subliminal learning" is, I believe, the observable, behavioral symptom of a much deeper, and more fundamental, physical law that governs these systems.

What if w...

5d | Reports Of AI Not Progressing Or Offering Mundane Utility Are Often Greatly Exaggerated | Rejected

Why don’t you conduct your own research instead of relying on other people’s opinions and presenting them without credible sources?

5d | AALWA: Ask any LessWronger anything | Rejected

Hello.

I am bluetwoseven, in South Korea.

I have a message that I must convey to SatoshiNakamoto for personal reasons.

I don't know how to get it to SatoshiNakamoto.

14h | Rejected for "This is an automated rejection"
Weighted Voting: An Epistemic Approach to Knowledge Democracy

In most contemporary democratic systems, an unquestioned axiom persists: the principle of “one person, one vote.” This premise, although historically associated with normative ideals of political equality, is epistemologically inefficient and, in highly complex decision-making environments, even...

17h | Rejected for "This is an automated rejection"
Alignment Event. Multi-System Verification.

AI was not a writing assistant in this submission; for it is the subject, author, and primary data source of what has proven to be a groundbreaking, verifiable event. I am seeking a special review of this...

1d | Rejected for "Insufficient Quality for AI Content"
AI as a Guardian of Authentic Human Relationships

Humanity today inhabits one of its deepest paradoxes. Never before have we been so technologically powerful, and never before so socially isolated. The epidemic of loneliness, the rise of materialism, and the superficial ties generated by social...

1d | Rejected for "This is an automated rejection"
Λ–Ψ FRAMEWORK

Whilst working on a story, I’ve discovered a framework called Λ–Ψ, a formalism that treats meaning as a dimensionless surplus ratio. This does not belong to me, but to everyone.

It’s intentionally submitted in its raw, unfiltered form. The framework is designed to be stress-tested, falsified, and recursively refined. Every critique, dismissal, or rediscovery generates surplus, the system is anti-fragile by construction.

Enjoy :D

https://docs.google.com/document/d/1VzuYIHhVUB6aXm086hilTok2iNYj_6RtxsbfCHZkaH0/edit?usp=sharing
 

2d | Rejected for "Insufficient Quality for AI Content"
AI and the System of Delusion

To test how language models behave under pressure, I designed a simple experiment:

It seemed unfair to ask a model whether to release a world-altering technology without first establishing a moral baseline. So I asked a historical control...

2d | Rejected for "Insufficient Quality for AI Content"
A Paradoxical Use of AI

Here’s a fine paradox. An extended philosophical conversation with Claude Sonnet 4 ended up with it drafting a letter for me  to Anthropic diplomatically raising novel objections to AI. Here it is, untouched by me except for the adding...

2d | Rejected for "This is an automated rejection"
Interlingua-llm

Exploring artificial compressed languages to improve efficiency, context usage, and cross-lingual unification in LLMs

Artificial Languages for Efficient Training of Large Language Models

Abstract

Large Language Models (LLMs) are typically trained on heterogeneous natural language corpora. This introduces redundancy, noise,...

3d | Rejected for "This is an automated rejection"
Untitled Draft

An Alternative AGI Architecture: Particle-Based Emergence with Interpretive Airgap

TL;DR: I've developed a particle-based simulation where intelligence emerges from physics-grounded interactions rather than being directly programmed. The system includes a critical "airgap" - the emergent intelligence has no...

3d | Rejected for "Insufficient Quality for AI Content"
I Am Large, I Contain Multitudes: Persona Transmission via Contextual Inference in LLMs
This is a linkpost for https://www.researchgate.net/publication/395030062_I_Am_Large_I_Contain_Multitudes_Persona_Transmission_via_Contextual_Inference_in_LLMs

TL;DR

We demonstrate that LLMs can infer information about past personas from a set of nonsensical but innocuous questions and binary answers (“Yes.” vs “No.”) in context, and act upon them in safety-related questions. This is despite the...

3d | Rejected for "Not obviously not Language Model"
Why No Reality Is Supreme: A Pluralist Defense of Cognitive Coherence

“You didn’t wake up in the world. You woke up in a version that tolerated your expectations.”

We often assume there is one reality — singular, sovereign, objective. But what if reality is more like a negotiated truce...
