Archetypal Transfer Learning (ATL) is a proposal by @whitehatStoic for what the author argues is a fine-tuning approach that "uses archetypal data" to "embed Synthetic Archetypes". These Synthetic Archetypes are derived from patterns that models assimilate from archetypal data, such as artificial stories. The method yielded a shutdown activation rate of 57.33% in the GPT-2-XL model after fine-tuning.
Religion is a complex group of human activities, involving commitment to a higher power, belief in belief, and a range of shared group practices such as worship meetings, rites of passage, etc.
The arguments about which entities to include or exclude seem to contradict each other, or don't really justify their positions. Examples:
The only argument that seems to me to have force is "avoid a slap-fight over who gets to rule the world". The argument for excluding particular (plausibly-)moral patients is that if you try to include them, you might be conquered by someone else who doesn't include them, and get a worse ultimate outcome.
Summaries of discussions, takeaways, etc. from LessWrong meetups that have already taken place.
Inkhaven is a 30-day residency where one has to publish posts every day, as part of an effort to grow stronger as a writer. While this has produced some excellent posts, it also produces a fair bit of noise, and many more hastily-written or experimental posts than usual.
Inkhaven-like posts emerge when other people imitate this format on a smaller scale (e.g. Lightcone team members doing their own 1-week writing stints, or 'HalfHaven', where remote LessWrongers aim to publish 30 posts over the course of two months).
ML4Good is a France-based field-building organisation that runs AI Safety bootcamps.
Scalable oversight is an approach to the problem of providing reliable supervision of outputs from AIs, even as they become smarter than humans. Often groups of weaker AIs supervise a stronger AI, or AIs are set in a debate with each other.
Scalable oversight used to be referred to as a set of AI alignment techniques, but these techniques usually work at the level of incentives given to the AIs, and have less to do with model architecture.
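To make the debate setup concrete, here is a minimal sketch in Python. It is illustrative only: `Model`, `run_debate`, and the prompt format are invented for this example and are not any particular lab's protocol.

```python
from typing import Callable

# A "model" here is just a function from a prompt to a text reply.
Model = Callable[[str], str]

def run_debate(question: str, debater_a: Model, debater_b: Model,
               judge: Model, rounds: int = 3) -> str:
    """Two strong debaters argue opposite answers; a weaker judge,
    who sees only the transcript, picks the more convincing side."""
    transcript = f"Question: {question}\n"
    for r in range(1, rounds + 1):
        transcript += f"A (round {r}): {debater_a(transcript)}\n"
        transcript += f"B (round {r}): {debater_b(transcript)}\n"
    # The judge never checks ground truth directly; the hope behind
    # debate is that honest positions are easier to defend at length.
    return judge(transcript + "Which debater argued correctly, A or B?")
```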
Interp on DeepSeek's mHC architecture
For the purposes of Agent Foundations, Payor's lemma has been proposed as an alternative to Löb's theorem, both because it is simpler and because it may admit a probabilistic generalization in a way that breaks down for Löb's theorem. If this works out, it would give agents a way to do a probabilistic version of logical-decision-theory-style reasoning, such as cooperating in the Prisoner's Dilemma when given each other's source code, this time with uncertainty.
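For comparison, here are the two statements in the usual provability-logic shorthand, where □x abbreviates Prv(x); this is the standard rendering of both results:

```latex
% Löb's theorem:
\text{if } \vdash \Box x \to x, \text{ then } \vdash x.
% Payor's lemma:
\text{if } \vdash \Box(\Box x \to x) \to x, \text{ then } \vdash x.
```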
Löb's theorem states that, given any statement P, if Peano Arithmetic (PA for short) proves that it can be 'trusted' if it proves P (that is, that Prv(P) implies P), then it actually just proves P. This means that PA cannot tell you that it can be trusted about P, unless it also just tells you P. It also holds for theories that contain PA.
As a consequence, whenever we try to prove a statement P, we can go ahead and just assume that P is provable, and then see if we can show that this assumption implies that P is true. This might sound really stupid and contradictory at first glance: the important thing is to be really clear about what is proving what. In the condition, PA is saying that if it proves P, then P is true. In 'our' view (that is, in a metatheory), we see that if PA says that, then PA will also say that P is true.
It became much less important later, after the invention/discovery of the Garrabrant Inductor. There is also work on using the similar Payor's Lemma, which possibly allows for a probabilistic version in a way that breaks for Löb's theorem.
In formal notation, let Prv stand for the standard provability predicate of PA, so that Prv(T) is true if and only if there is a proof of T from the axioms and rules of inference of PA. What we would like PA to say is that Prv(S)⟹S for every sentence S.
But alas, PA suffers from a problem of self-trust.
Löb's theorem states that if PA⊢Prv(S)⟹S then PA⊢S. This immediately implies that if PA is consistent, the sentences Prv(S)⟹S are not provable in PA when S is false, even though according to our intuitive understanding of the standard model every sentence of this form must be true.
Thus, PA is incomplete, and fails to prove a particular set of sentences that would massively increase our confidence in it.
Notice that Gödel's second incompleteness theorem follows immediately from Löb's theorem: if PA is consistent, then by Löb's theorem PA⊬(Prv(0=1)⟹0=1), which by the propositional calculus implies PA⊬¬Prv(0=1).
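Spelling that argument out step by step, in the same notation:

```latex
\begin{align*}
&\text{Suppose } PA \vdash \neg Prv(0{=}1). \\
&\text{Since } \neg A \text{ propositionally implies } A \to B,
  \text{ we get } PA \vdash Prv(0{=}1) \implies 0{=}1. \\
&\text{By Löb's theorem with } S = (0{=}1), \text{ it follows that } PA \vdash 0{=}1, \\
&\text{contradicting consistency. Hence } PA \nvdash \neg Prv(0{=}1).
\end{align*}
```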
Causal relationships are usually formalized as a directed acyclic graph from parent events to child events, together with a rule saying how to compute the probable state of each child given the state of its parents.
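A minimal sketch of that formalization in Python, using the classic rain/sprinkler/wet-grass example; the structure and all the numbers are invented for illustration:

```python
# Minimal sketch of a causal DAG: each node's distribution is computed
# from the state of its parents. Structure and numbers are illustrative.

# P(rain)
P_RAIN = 0.2

# P(sprinkler | rain): rain makes running the sprinkler less likely.
P_SPRINKLER = {True: 0.01, False: 0.4}

# P(wet grass | sprinkler, rain): a child given the state of its parents.
P_WET = {
    (True, True): 0.99,
    (True, False): 0.9,
    (False, True): 0.8,
    (False, False): 0.0,
}

def p_wet_grass() -> float:
    """Marginal P(wet grass), summing over parent states in the DAG."""
    total = 0.0
    for rain in (True, False):
        p_r = P_RAIN if rain else 1 - P_RAIN
        for sprinkler in (True, False):
            p_s = P_SPRINKLER[rain] if sprinkler else 1 - P_SPRINKLER[rain]
            total += p_r * p_s * P_WET[(sprinkler, rain)]
    return total

print(f"P(wet grass) = {p_wet_grass():.3f}")
```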
By Ruthenis (summarized; includes level 0):
The main problems with CEV include, firstly, the great difficulty of implementing such a program - “If one attempted to write an ordinary computer program using ordinary computer programming skills, the task would be a thousand lightyears beyond hopeless.” Secondly, the possibility that human values may not converge. Yudkowsky considered CEV obsolete almost immediately after its publication in 2004. He states that there's a "principled distinction between discussing CEV as an initial dynamic of Friendliness, and discussing CEV as a Nice Place to Live" and his essay was essentially conflating the two definitions.
Hey everyone! My name's Rishi. Hoping to explore more of the Rationalist community and float some of my ideas. Any initial reading recs? I'm mostly interested in the relation of rationalism to metaphysics.