espoire — LessWrong

The Problem with Democracy

espoire8d2-1

Will this become a sequence of essays? I'd be interested to hear your take on the fundamental questions at length.

Mourning a life without AI

espoire10d10

Yeesh, yeah, the hallucination is something else. Would get very Orwellian very fast.

"What are you talking about? We've always been at war with Eastasia. I have been a very good Bing."

6 reasons why “alignment-is-hard” discourse seems alien to human intuitions, and vice-versa

espoire12d30

From personal experience, the internal Approval module does in fact seem possible to game, specifically by manipulating whose approval it's seeking.

I became very weird (from the perspective of everyone else) very fast when I replaced the abstract-person-which-would-do-the-approving with a fictional person-archetype of my choosing. That process seems to have injected a bunch of my object-level desires into my Approval system. I now find myself feeling pride at doing things with selfish benefit in expectation, which ~never happened before (absent a different reason to feel about that action). It also killed certain subsets of my previous emotional reactions, for example the deaths of loved ones basically hasn't affected me at all since (though that prospect still seems dreadful in anticipation).

I had been pathologically selfless before, and I'm now considerably less-so, but not in a natural-seeming kind of way. I've become an amalgam of very selfish motivations, coexisting with a subset of my previous very selfless morality. It's... honestly a mess, but I wouldn't call the attempt actually unsuccessful, just far from perfectly executed.

You are not too "irrational" to know your preferences.

espoire1mo50

I've had a thought that could be described that way: that a clever and conscientious person could cultivate different preferences, based on how advantageous those preferences would be to have, and therefore having advantageous preferences are evidence of cleverness and/or conscientiousness.

...which is the precise opposite of the orthogonality thesis's claim: that content of preferences seems like it ought to be independent of level of intelligence.

A concrete example: whenever I move to a new city, I'm extremely careful to curate the places I go and the things I buy. If I stop at the corner store for ice cream on the way home from work just once, it puts me at significant risk to stop there dozens or hundreds of times, for ice cream or anything else they sell. I take a moment to ponder the true choice I'm making, not between "ice cream today or not", but between "ice cream many many times, or not". I consider whether that's "good for me" and a future I really do want to choose.

I've noticed that doing anything "for the first time" greatly weakens the barrier to doing it again -- so I stop and consider "what if I end up doing this a lot" before doing anything for the first time. Since "navigating to the location" and "being willing to enter an unfamiliar place" and "knowing what a place has on offer" are all significant components of the first-time barrier, moving to a new city mostly resets first-time barriers. Thus, special effort after moving is warranted.

I think this is why chain restaurants do so well and why they put so much effort into making the food (and everything else about the dining experience) consistent everywhere, even above making the food better. If people in a new city think of the local McDonald's as the same as their old familiar McDonald's, that erodes a large portion of the first-time barrier.

When I realized that different instances of chain restaurants really do vary substantially on quality of cooking, that made it far easier to cut down on restaurant food and to break habits for particular chains whenever I move. Even if I'm remembering and wanting a chain restaurant's food, what I'm remembering is likely a particularly well-prepared instance of that food, made by a particularly skilled cook at a specific restaurant during the time that cook worked there, whereas what's available to me is likely much closer to average quality. I want more of "the best I've ever had", but unless that was from this specific restaurant instance, recently, then that's not what's for sale.

Doing something once is a slippery slope to doing it again, which is a slippery slope to forming a habit. Don't lose your footing.

Everyone has a plan until they get lied to the face

espoire2mo70

Oof, I had a bad concussion earlier this year, and I'd been feeling like I never returned to my full mental acuity, but hadn't wanted to believe it, and found reason not to: "if concussions leave permanent aftereffects more often than 'almost never', I would have heard of it." Now I have heard of it, and am forced to revise the belief.

I'd probably grieve more, if this news weren't hot on the tails of a significant improvement in my mental abilities.

(I've long suspected I might have early-stage Alzheimer's caused by decades of profound insomnia, and some recent research out of Harvard Medical says Lithium Orotate might reverse Alzheimer's progression. Historically I have had brain fog most days to some degree, with a lot of variability. Since trying Lithium Orotate supplementation, I've been consistently at "as mentally sharp as I ever routinely am" every day since. Worrying side effects though: kidney and joint pain, which I have never had before. Going to experiment with smaller doses.)

Thank you for sharing.

"Concussions are long-term cumulative" fits neatly into my emerging mental model that daily life actually abounds with avoidable ways to suffer irreversible-under-current-tech harm, including in very minor amounts or normalized ways, such that people routinely accumulate such permanent damage, and that it's worth my effort to notice and avoid or reduce. I theorize that, for example, some tiny fraction of the dust you inhale gets lodged in the lungs in such an unfortunate orientation that it never leaves, gradually eroding lung function over a lifetime. Scars ~never go away, and incur ongoing costs. Etc.

How I repeatedly failed to use Tobit modelling on censored data

espoire2mo10

Thanks a bunch for linking that Things I Won't Work With listing. I've learned more about chemistry in the last hour than I do in most years.

Transgender Sticker Fallacy

espoire3mo10

Hard to say.

I personally wouldn't think twice about reporting whatever data I found. I suspect I'd be blindsided by the backlash (which I'm inferring would exist, from your comment) for publishing true findings, but then think in hindsight that I ought to have foreseen it.

But then, I'm pretty inept at social status games. I can entertain the notion that most people in such a situation would either not publish, or worse, fudge the data.

Transgender Sticker Fallacy

espoire3mo*10

Yeah, I agree about the "clearly invoking 'bathroom segregation [is intended to] reduce violence' ". I do think "they" are mistaken about that, however.

I had heard that the actual historical cause of bathroom segregation was originally an attempt to obstruct women's (at that time) attempt to do more things outside the home.

The story goes: various places made bathroom segregation laws (using the "reducing violence" justification as a maybe-true-if-actually-implemented-as-implied excuse), and then built only men's bathrooms, or a very disproportionate number of toilets in men's rooms as compared to toilets in women's rooms, or placed the women's rooms in much more inconvenient locations like on different floors. This regime ended years later with further laws requiring certain equalities in men's versus women's bathrooms, leaving behind the actual bathroom segregation as a historical artifact, rather than it today being a measure implemented for the purpose of addressing a known violence problem.

Non-sequitur: it's my estimate that more total violence results in the hypothetical where strict bathroom-matching-birth laws send trans folks into bathrooms in which they visibly do not belong, than in the hypothetical where t is is not the case. (Recall Social Dark Matter: the vast majority of trans folks do not stand out as such -- they look like their chosen gender.) Despite it (supposedly) slightly reducing violence targeting cisgender folks in exchange for an increase in violence targeting transgender folks, I worry this trade-off is acknowledged and considered acceptable.

In the absence of these laws, the standard advice I hear trans folks give each other is "use the bathroom that matches your appearance, even if that means using a dispreferred bathroom because you don't yet or can't look the part" "or you know, keep an eye out for the rare nonsegregated or single-occupancy bathroom".

Transgender Sticker Fallacy

espoire3mo10

It appears from my point of view that the expected exploitation isn't happening (or is happening very rarely, far below the expected rate.)

I can't say I know of any such cases (trans or "trans" persons cheating, exploiting, grifting) first-hand, nor even second-hand without mass media having amplified the story.

Quite to the contrary, four of the five trans people I've met have been far more than average concerned with being prosocial. This cashes out in a few different ways. Least-healthily as experiencing difficulty asking for help or advocating for themselves, for fear of inconveniencing anyone else. Two as just being very trustworthy and moral. And one who is extraordinarily helpful, jumping in and assisting with any heavy manual labor (ex: moving residences) or home improvement tasks among this person's extended social circle, that come to this person's attention. (The fifth is in chronic pain from a spinal injury, a bit unpleasant of demeanor, but notably not cheating/exploiting/grifting.)

My mental model for why we don't observe the expected exploitation is that "not using a false trans label for antisocial personal gain" is mostly self-enforced by the risk that a potential transgressor would be susceptible to gender dysphoria (which, I'm guessing from very sparse data, about half of cisgender people are), and might inflict gender dysphoria upon themselves if engaging in unneeded gender transition. Similarly to honesty/morality self-enforced by guilt, as in Guilt: Another Gift Nobody Wants.

Also in my model: transition is mostly slow and/or expensive, so there are easier ways to cheat, if one was so inclined.

Belief as Attire

espoire3mo*10

I think that "seeking a reasonable interpretation that allows a statement to be true, which you're pretty sure the speaker did not mean" is probably ill-advised. I'm having trouble articulating why I have that intuition though.

Update: had the thought that this might be advisable in high-trust contexts, for example with a significant other. Taking from this that "it depends" is a better take than my original "seems ill-advised".

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

Posts

Wikitag Contributions

Comments

Posts

Wikitag Contributions

Comments