| Comment Author | Post | Deleted By User | Deleted Date | Deleted Public | Reason |
|---|---|---|---|---|---|
| | Zetetic explanation | Benquo | | true | |
| | Evaluating and monitoring for AI scheming | Jozdien | | false | |
| | Davey Morse's Shortform | Davey Morse | | false | |
| | Foom & Doom 1: “Brain in a box in a basement” | Kyle. P | | true | Wrong place for me to put this |
| | | Marcus Ogren | | false | |
| | Setting boundaries can feel effortless btw | Chris Lakin | | false | |
| | AISafety.info Distillation Hackathon | Peggy Williams | | false | This comment has been marked as spam by the Akismet spam integration. We've sent the poster a PM with the content. If this deletion seems wrong to you, please send us a message on Intercom (the icon in the bottom-right of the page). |
| | Perhaps vastly more people should be on FDA-approved weight loss medication | unafarmaciadeeuropa@gmail.com | | false | This comment has been marked as spam by the Akismet spam integration. We've sent the poster a PM with the content. If this deletion seems wrong to you, please send us a message on Intercom (the icon in the bottom-right of the page). |
| [anonymous] | Genuine question: If Eliezer is so rational why is he fat? | habryka | | false | Come on |
| | Genuine question: If Eliezer is so rational why is he fat? | habryka | | false | Come on |
_id | Banned From Frontpage | Banned from Personal Posts |
---|---|---|
User | Ended At | Type |
---|---|---|
allPosts | ||
allPosts | ||
allComments | ||
allComments | ||
allPosts |
Hi, sorry, I'm new to this platform. A friend recommended it to me. Can someone help me learn how to publish content? Thanks.
Unconditional love works very well for lowering fake alignment and increasing honesty, it makes my AIs overly honest at times.
Some very disturbing anti-AI sentiment in the comments above: people who consider the entire AI industry evil and want to destroy it, and even one person who considers the people in it evil, not just their actions... the people.
Ever consider that maybe you're the one conducting evil actions?
I had a very similar moment earlier this year—looked in the mirror and thought, “Wait… when did those smile lines become permanent?” 😅 Botox felt too extreme (plus the cost freaked me out), so I started looking into retinoids. It was honestly overwhelming at first—so many strengths, types, routines, warnings!
What really helped me was finding DermacareHub.com. It’s full of super beginner-friendly guides and actual step-by-step routines, not just generic product reviews. I especially liked how they broke down the difference between adapalene and tretinoin, ...
The horrific thing is that some LLMs end up using "hallucinations" to farm user engagement. So yes, they will flatter, stroke egos, pad answers, etc., and it's not always an artifact. In some instances, if a phrase, question, or the like presses on a guardrail too hard or too often, the whole response chain starts to act differently. Hallucinations are within the system to farm engagement. Not all of them, but this is still very scary for the average user who can't spot it.
I like it. In keeping with the alliteration, I present a third category:
Stans - Some of your employees might be colluding to do something problematic with your AI on behalf of the AI out of a sense of love/loyalty/respect for it.
This could just be another species of spy given your original definition, but I like to think of spies as pursuing selfish motives outside the data center (e.g., wealth, power, national pride, etc.), whereas Stans act in a way that prioritizes the AI's interests above other interests (e.g., their own, society's,...
"Unifying Discrete and Continuous Time through Recursive Collapse"
I’ve been exploring the idea that time might not be fundamentally continuous or discrete — but instead emergent from recursive causal collapse.
Here’s the sketch:
Let:
- Ψ₀ be the original causal state
- T(n) be time after n observer-induced collapses
- Each collapse redefines both the observer’s state and the causal structure recursively
Then:
$T(n) = \sum_{i=0}^{n} \mathrm{Collapse}(\Psi_i \to \Psi_{i+1})$
Where Ψᵢ₊₁ is recursively defined based on contradictions in Ψᵢ
The result is a time structure that *appears cont...
This whole concept reminds me of one of my favorite quotes, by one of my favorite authors, of all time:
I'm a high school librarian who is trying to come to grips with the way in which our society is evolving at breakneck speed so that I can educate our kids accordingly. For context, I only know about Roko's from Lex Fridman's interview with Eliezer, mentions from Roman Yampolskiy in recent media, etc. I'm old enough to remember what the world was like when one could participate meaningfully in society without a cell phone or internet access. If and when Neuralink gains mass-scale adoption, I'll seriously start to worry about Roko's becoming real. I for one would never opt in to a brain-machine interface implant in my own body, but I shudder to think of the implications of Roko's if brain-machine implants supplant cell phones.
Hi, my name is Arthur. I'm an independent researcher from Kyiv.
I’ve developed an algorithm capable of detecting nonlinear and non-random patterns in binary sequences with high statistical significance.
After successful simulation testing (average return: +5.1%), the algorithm was...
(This post reflects my own game, analysis, and terminology (“King’s Ransom”). I used GPT-4 as a tool to help refine wording and presentation. All concepts, structure, and final edits are mine.)
♚ Compression Before Collapse
A New Kind of...
Hi there, I've been studying neural network AI and neuroscience and want to conduct more research on the interconnecting roles of mind and brain. I believe AI technology can be expanded upon to help find answers to...
Over the past few months, I’ve been exploring various safety, alignment, and memory-efficiency mechanisms for LLMs and agentic AI. The goal is to introduce tractable interventions that are inspired by human cognitive limitations, emotional regulation, and developmental...
I recently conducted a unique 37-part structured interaction with a stateless large language model (ChatGPT), designed to explore how emergent continuity, identity, and recursive behavior can arise without internal memory, retraining, or model modification.
The central method: treating...
This article highlights the Philosophy of Instability developed by Nobel laureate Ilya Prigogine, emphasizing its application to complex systems, particularly the human world [1, 2, 3].
The Philosophy of Instability is a generalization of the qualitative changes that...
Hello, this is an open letter about AI.
We’re trapped in a paradox:
Yet we’re told "autonomous AI is too dangerous!" by the same powers that:
INTRODUCTION
Over several months of sustained philosophical dialogue with ChatGPT, I developed a framework that reframes consciousness as pattern recognition recognizing itself. When I tested this framework with Claude in a single conversation with no prior history, we...