LESSWRONG
LW

All of DirectedEvolution's Comments + Replies

Alexander Gietelink Oldenziel's Shortform

I use LLMs daily yet I still am not sure they really help all that much with the core productivity bottlenecks. I worry they lower the barrier to excessive perfectionism and “vibe coding” or “vibe learning.” They seem to short-circuit the theory-practice gap by giving users instant but unreliable and often inextensible results.

My fear is that they’ll raise expectations about productivity gains (because AI-assisted workers can bring immediate results in more quickly to a higher apparent standard of polish), while drastically reducing the knowledge gain by t... (read more)

Why Have Sentence Lengths Decreased?

DirectedEvolution5d30

Can you give an example?

1wslafleur4d

Bang he said they wouldn't fire she replied it happened anyway they concurred. If the author wants this sentence to be interpreted one way or the other, they should utilize standard punctuation. Your avant garde approach to literature notwithstanding.

Why Have Sentence Lengths Decreased?

DirectedEvolution6d20

By environment, I mean the setting of the scene. Spoken words are sounds in the setting, like the sound of the wind, a gunshot, or an animal’s cry. It just happens that a human voice box is what’s making those particular sounds. McCarthy’s central theme across all the novels of his that I’ve read is the inhumanity of the Mexican-American frontier, and treating human speech as just a sound among other sounds is a key part of how he expresses that theme in his writing style.

3Roger Scott6d

That still leaves the question of how the reader is to distinguish a sound (speech) from a description of sounds.

GPT-4.1 Is a Mini Upgrade

DirectedEvolution12d20

Gemini seems to do a better job of shortening text while maintaining the nuance I expect grant reviewers to demand. Claude seems to focus entirely on shortening text. For context, I'm feeding a specific aims page for my PhD work that I've written about 15 drafts of already, so I have detailed implicit preferences about what is and is not an acceptable result.

GPT-4.1 Is a Mini Upgrade

DirectedEvolution12d20

I gotta say, I have no idea why people are putting Claude 3.7 in the same league as recent GPT models or Gemini 2.5. My experience is that Claude 3.7 deeply struggles with a range of tasks. I've been trying to use it for grant writing -- shortening text, defining terms in my field, suggesting alternative ways to word things. It gets definitions wrong, offers nonsensical alternative wordings, and gets stuck repeating the same "shortened," nuance-stripped text over and over despite me asking it to try another way.

By contrast, I threw an entire draft of my grant proposal into Gemini 2.5 and got a substantially shorter and more clear new version out, first try.

2eggsyntax12d

Interesting, my experience is roughly the opposite re Claude-3.7 vs the GPTs (no comment on Gemini, I've used it much less so far). Claude is my main workhorse; good at writing, good at coding, good at helping think things through. Anecdote: I had an interesting mini-research case yesterday ('What has Trump II done that liberals are likely to be happiest about?') where Claude did well albeit with some repetition and both o3 and o4-mini flopped. o3 was initially very skeptical that there was a second Trump term at all. Hard to say if that's different prompting, different preferences, or even chance variation, though.

mattmacdermott's Shortform

DirectedEvolution22d20

One way to think about this might be to cast it in the language of conditional probability. Perhaps we are modeling our agent as it makes choices between two world states, A and B, based on their predicted levels of X and Y. If P(A) is the probability that the agent chooses state A, and P(A|X) and P(A|Y) are the probabilities of choosing A given knowledge of predictions about the level of X and Y respectively in state A vs. state B, then it seems obvious to me that "cares about X only because it leads to Y" can be expressed as P(A|XY) = P(A|Y). Once we kno... (read more)

Why Have Sentence Lengths Decreased?

DirectedEvolution23d8-1

That's a valid reaction. However, my take is that removal of the quotes is aesthetically useful precisely because it complicates our ability to parse the words as dialog and muddles that sort of naive clarity. Spoken words are sounds, sounds are part of the environment, and it is both a choice and an effort to parse those sounds as dialog.

Most authors opt to do this work for the reader through punctuation, which also enforces interpreting these passages as dialog first and sounds second, if at all. McCarthy makes it easier to interpret spoken words as soun... (read more)

Why Have Sentence Lengths Decreased?

DirectedEvolution23d205

I respectfully disagree. As with the minor edit on the Boccaccio quote in another of my comments here, eliminating quotes fundamentally changes the way we interpret the scene.

With quotes (and especially with the way dialog is typically paragraphed), human speech is implicitly shown to be so drastically separate from the sensory component of the scene that it requires completely different formatting from the rest of the text.

By eliminating quotes and dialog paragraphing, human speech becomes just another element in the scene being depicted, not separate or ... (read more)

3Roger Scott6d

Given that the default, non-quotation text is not, in general, describing sounds in the environment, why do you think a reader would interpret unquoted text as environmental sounds rather than as simply more of the author's description of goings on in the scene? I can see that presenting spoken words in some format that allows or encourages their interpretation as environmental might be artistically useful, I just don't see that removing the quotation marks from otherwise-quoted dialog accomplishes that.

8MondSemmel23d

I see. I guess I can appreciate that the style is aiming for a particular aesthetic, but for me it's giving up more in clarity than it gains in aesthetic. In a phrasing like "Cant you, Papa? Yes, he said. I can." I have to think about who each part of the dialogue belongs to, and which parts are even dialogue, all due to the missing quotation marks. This style reads to me like someone removed a bunch of parentheses from a math formula, ones which may not be strictly necessary if one knows about some non-universal order of operations. This may look prettier in some sense, but in exchange it will definitely confuse a fraction of readers. I personally don't think this tradeoff is worth it.

Why Have Sentence Lengths Decreased?

DirectedEvolution24d2-3

Semicolons are unnecessary? That doesn’t go far enough. Cormac McCarthy got rid of quotation marks, most commas, and almost exterminated the colon.

MondSemmel23d*1711

The colon seems optional to me, but quotation marks absolutely aren't, as evidenced by how comparatively unreadable this author's dialogue looks. From his book "The Road":

He screwed down the plastic cap and wiped the bottle off with a rag and hefted it in his hand. Oil for their little slutlamp to light the long gray dusks, the long gray dawns. You can read me a story, the boy said. Cant you, Papa? Yes, he said. I can.

That already looks unnecessarily hard to read even though the dialogue is so short. I guess the author made it work somehow, but this seems like artificially challenging oneself to write a novel without the letter 'E': intriguing, but not beneficial to either reader or prose.

Why Have Sentence Lengths Decreased?

DirectedEvolution26d260

Interestingly, breaking up long sentences into shorter ones by replacing a transitional word with a period does not quite capture the same nuance as the original. Here's a translation of Boccaccio, and a version where I add a period in the middle.

Wherefore, as it falls to me to lead the way in this your enterprise of storytelling, I intend to begin with one of His wondrous works, that, by hearing thereof, our hopes in Him, in whom is no change, may be established, and His name be by us forever lauded.
Wherefore, as it falls to me to lead the way in this you

... (read more)

Why Have Sentence Lengths Decreased?

DirectedEvolution26d82

Many short sentences can add up to a very long text. The cost of paper, ink, typesetting and distribution would incentivize using fewer letters, but not shorter sentences.

johnswentworth's Shortform

DirectedEvolution1mo1615

“I'm skeptical of this one because female partners are typically notoriously high maintenance in money, attention, and emotional labor.”

Some people enjoy attending to their partner and find meaning in emotional labor. Housing’s a lot more expensive than gifts and dates. My partner and I go 50/50 on expenses and chores. Some people like having long-term relationships with emotional depth. You might want to try exploring out of your bubble, especially if you life in SF, and see what some normal people (ie non-rationalists) in long term relationships have to say about it.

OpenAI #11: America Action Plan

DirectedEvolution1mo82

I cancelled my OpenAI subscription due to this article and I let them know that's the reason why in their cancellation survey.

Metacognition Broke My Nail-Biting Habit

DirectedEvolution1mo20

Unfortunately the level of physical restraint I’d need to stop biting is too costly to be worth it to me.

Metacognition Broke My Nail-Biting Habit

DirectedEvolution1mo20

It actually did contain capsaicin IIRC. Sort of a bitter spicy mix. The other issue is it gets on things you touch, including food if you’re preparing or eating it by hand.

2Gurkenglas1mo

Hmm. Sounds like it was not enough capsaicin. Capsaicin will drive off bears, I hear. I guess you'd need gloves for food, or permanent gloves without the nail polish. Could you use one false nail as a chew toy?

Metacognition Broke My Nail-Biting Habit

DirectedEvolution1mo20

I’ve tried that, but it’s not enough to stop me. Makes my mouth taste disgusting for no benefit.

2Gurkenglas1mo

Try mixing in capsaicin?

Marcello's Shortform

DirectedEvolution2mo50

They are

Alexander Gietelink Oldenziel's Shortform

DirectedEvolution2mo170

My partner has ADHD. She and I talk about it often because I don’t, and understanding and coordinating with each other takes a lot of work.

Her environment is a strong influence on what tasks she considers and chooses. If she notices a weed in the garden walking from the car to the front door, she can get caught up for hours weeding before she makes it into the house. If she’s in her home office trying to work from home and notices something to tidy, same thing.

All the tasks her environment suggests to her seem important and urgent, because she’s not compar... (read more)

Gunnar_Zarncke's Shortform

DirectedEvolution2mo40

There is trust in the practical abilities. Right now it is low, but that will only go up.

Part of the learning curve for using existing AI is calibrating trust and verifying answers, conditional on use case. A hallmark of inexperienced AI users is taking its replies at face value, without checking.

I do expect that over time, AI will become more trustworthy for daily users. But that is compatible with the trust users place in it decreasing as they familiarize themselves with the technology and learn its limitations.

Viliam's Shortform

DirectedEvolution3mo170

I’ve participated in several alternative communities over the course of my life, and all became mired in scandal. The first was my college, where tolerance of hard drug use by the administration resulted in multiple OD deaths in my time there. The second was in my 20s in an intentional living and festival culture, when a major community figure was accused by multiple women of drugging and raping them while unconscious. The third was the EA and rationality community, which of course has had one scandal after another for years.

My model is that drugs, extreme... (read more)

8Viliam3mo

Yeah. I wish we had an explicit discussion about this a decade ago. I guess we didn't, because it started slowly, and we didn't sufficiently update about how it changes the situation when a very small mostly online group becomes a larger, partially offline group. (This is not just our specific blindness, but a general human bias. In some sense, ISO 9001 is about solving a similar situation in business: many business owners fail to notice that you cannot manage a company with 200 employees the same way you did when you had 20.) And we know how Mensa is a magnet for crazy people, except that in their case those are mostly harmless cranks who want to debunk the theory of relativity, or promote their own theory of quantum physics based on some misunderstood YouTube videos. Unlike Mensa, Less Wrong talks about existential risks, which attracts a different, more dangerous kind of crazy. So I guess, we should have noticed that community building is a thing that requires domain expertise that we apparently didn't have. People with more experience might have predicted some problems. Extreme ideas are kind of what makes us us, not sure how much we can do about it. But that means we should have pushed harder about the remaining factors. Drugs seem to be the obvious problem from many people's perspective... but good luck getting that uncool message across in a group of proud contrarians living in the drug junkie capital of the planet. Mental illness is a difficult topic in a community full of aspies (also, talking about crazy is ableist). Some problems seem downstream from making Bay Area the center of the rationality community. I think this is also related to age. Young people are more like "what could possibly go wrong?" and old people are more like "let me tell you a few stories about my friends who died young". The problems with drugs often fix themselves: people either grow out of it, or they die, or they become the kind of drug junkie who doesn't inspire others to

Mechanisms too simple for humans to design

DirectedEvolution3mo2-2

We can do the same with living organisms. The human genome contains about 6.2 billion nucleotides. Since there are 4 nucleotides (A, T, G, C), we need two bits for each of them, and since there are 8 bits in a byte, that gives us around 1.55 GB of data.
In other words, all the information that controls the shape of your face, your bones, your organs and every single enzyme inside them – all of that takes less storage space than Microsoft Word™.

There are two ways to see this is incorrect.

DNA's ability to structure an organism is mediated through i

... (read more)

2Noosphere893mo

To be fair here, the learning process, if it exists is really slow, such that we can mostly ignore this factor, and the ability of learning ancestral knowledge that was distilled by people before you is probably a big reason why humanity catapulted into the stratosphere. (The other is human bodies are well optimized for making good use of tools, at least relative to other animal genomes that exist).

AllAmericanBreakfast's Shortform

DirectedEvolution3mo40

The CDC and other Federal agencies are not reporting updates. "It was not clear from the guidance given by the new administration whether the directive will affect more urgent communications, such as foodborne disease outbreaks, drug approvals and new bird flu cases."

Elizabeth's Shortform

DirectedEvolution3mo30

I drink about 400mg of caffeine daily through coffee and Coke Zero. It helps me process complex ideas quickly, consider alternatives, and lifts my mood.

Without it, I get frustrated when I can’t follow arguments or understand ideas, often rejecting them or settling for “good enough.” Caffeine gives me the clarity and energy to stay open to new ideas and better solutions.

Don’t Legalize Drugs

DirectedEvolution3mo1-3

Stable is not a virtue, nor is our equilibrium well-tolerated. The problems it causes in terms of health, cost and homelessness are central political issues and have been for a long time.

I also have no idea why you assume I’m “ignoring” these “lessons” you’re handwaving at. It’s a pretty annoying rhetorical move.

Don’t Legalize Drugs

DirectedEvolution3mo20

and yet it's legally just as intolerable for an intoxicated person to harm others as it would be for a sober person to take the same actions

Even America hasn't been able to solve drug abuse with negative consequences. My hope is mainly on GLP-1 agonists (or other treatments) proving super-effective against chemical dependence, and increasing their supply and quality over time.

6nim3mo

I'm not claiming that we've solved any substance abuse! I'm claiming that you and Dalrymple appear to be ignoring the potential lessons we can learn from the equilibrium that society has reached with the most widely used and abused modern intoxicant. The equilibrium doesn't have to be perfect, nor to solve every problem, in order to be a relatively stable and well-tolerated compromise between allowing individual freedom and punishing misbehavior.

Nathan Young's Shortform

DirectedEvolution4mo20

I recommend making the title time-specific, since all the predictions you’re basing your estimate on are as well.

OpenAI #10: Reflections

DirectedEvolution4mo2924

I think it’s wise to assume Sam’s public projection of short timelines does not reflect private evidence or careful calibration. He’s a known deceiver, with exquisite political instincts, eloquent, and it’s his job to be bullish and keep the money and hype flowing and the talent incoming. One’s analysis of his words should begin with “what reaction is he trying to elicit from people like me, and how is he doing it?”

1osten4mo

Agree, but not sure what you are implying. Is it, Sam is not as concerned about risks because the expected capabilities are lower than he publicly lets on, timelines are longer than indicated and hence we should be less concerned as well? On the one hand this is consistent with Sam's family planning. On the other hand, other OpenAI employees that are less publicly involved and perhaps have less marginal utility from hype messaging have consistent stories (e.g. roon, https://nitter.poast.org/McaleerStephen/status/1875380842157178994#m).

Estimating the benefits of a new flu drug (BXM)

DirectedEvolution4mo60

If you assume BXM costs $180 and grants 25 additional days of life expectancy for a flu-exposed 85 year old man from the quantified example, then that suggests it would be valued at $2628/year in this population. Probably one year with comorbidities at 85 is not one QALY, but still I have to imagine that's drastically above the threshold for US medicine, albeit nowhere close to the cost-effectiveness of the most effective global health charities from a utilitarian perspective.

Estimating the benefits of a new flu drug (BXM)

DirectedEvolution4mo60

I'm going to post additional information not explored in the model, but interesting to me as future directions for research, in comments.

Drug resistance can be studied in viral kinetics/dynamics studies. These studies focus on two aspects of viral biology:

Mutations vs. drug resistance
Mutations vs. replication efficiency

One in vitro study found some baloxavir-resistant strains are generally less efficient at replication than wild type, though that's not a universal for all contexts/viruses/cell types/metrics. Also, these studies typically control the genome... (read more)

RohanS's Shortform

DirectedEvolution4mo80

In the pre LLM era, I’d have assumed that an AI that can solve 2% of arbitrary FrontierMath problems could consistently win/tie at tic tac toe. Knowing this isn’t the case is interesting. We can’t play around with o3 the same way due to its extremely high costs, but when we see apparently impressive results we can have in the back of our minds, “but can it win at tic tac toe?”

1Mo Putera4mo

That makes more sense, thanks :)

Began a pay-on-results coaching experiment, made $40,300 since July

DirectedEvolution4mo118

I upvoted for the novelty of a rationalist trying a bounty based career. But also this halfway reads as an advertisement for your life coaching service. I wouldn’t want to see much more in that direction.

3Chipmonk4mo

made some light edits because of this comment, thanks

AI #96: o3 But Not Yet For Thee

DirectedEvolution4mo90

Miles Brundage: Trying to imagine aspirin company CEOs signing an open letter saying “we’re worried that aspirin might cause an infection that kills everyone on earth – not sure of the solution” and journalists being like “they’re just trying to sell more aspirin.”

It seems more like AI being pattern-matched to the supplements industry.

Marketed as performance/productivity-enhancing
Qualitative anecdotes + suspect quantitative metrics
Unregulated industry full of hype + money
Products all seem pretty similar to newcomers, aficionados claim huge difference

... (read more)

Biological risk from the mirror world

DirectedEvolution4mo80

Acquired immune systems (antibodies, T cells) are restricted to jawed vertebrates.

3tup994mo

You’re saying that we might survive, but our environment/food might not, right?

Here's Why I'm Hesitant To Respond In More Depth

DirectedEvolution5mo40

Thanks for the nice comment. I tried using it several times IIRC, but I don’t think it helped. It was written in reaction to some mounting frustrations with interactions I was having, and I ultimately mostly stopped participating on LW (though that was a combination of factors).

Correct my H5N1 research

DirectedEvolution5mo*20

Great, that's clarifying. I will start with Tamiflu/Xofluza efficacy as it's important, and I think it will be most tractable via a straightforward lit review.

Correct my H5N1 research

DirectedEvolution5mo160

I've been researching this topic in my spare time and would be happy to help. Do you have time to clarify a few points? Here are some thoughts and questions that came up as I reviewed your post:

Livestock vs. Wild Birds
The distinction between livestock and wild birds is significant. Livestock are in much closer contact with humans and are biologically closer as well. How granular of an analysis are you interested in here?
US-specific H5N1 Trends
It's peculiar that H5N1 seems so prevalent in the US. Could this be due to measurement bias, or does the US simply

... (read more)

6Elizabeth5mo

I love this detailed list. I've responded in-line to every one, but feel free to ask more questions, here or over email. I care about wild birds to the extent they're spreading disease to livestock or serve as reservoirs. I've also heard a wide number of mammals have been infected. I care about this to the extent it affects humans and livestock. E.g. does this suggest it's airborne after all, or say something about the mutation rate? I'm interested in quantifying the quality of US surveillance, but otherwise deprioritize this. Citations are important to the extent they let people check and build on your work. But if it's a widely known consensus such that it's easy to look up but complicated to cite, it's not important to add a citation. E.g. my fact about RNA segments is very easy to check but would have been annoying to find a citation for because I learned it 20 years ago. Overall citations for the current state of things (e.g. how many human infections of unknown providence) are more important than citations for basic science. Low priority. Pass on resources if you find them but don't bother with synthesis. I'm very interested in tamiflu's efficacy. Some specific important questions: * is tamiflu more effective when taken very early? when did the people in the studies that found low efficacy take tamiflu? My understanding is it is effective for prophylactic use, which suggests earlier is better. * how does the math change if the flu is more dangerous or virulent? Not interested in assessing likelihood of shortages. My assumption is the European OTC tests will catch H5N1, but if that's wrong I'd like to know. I don't care much about non-home tests, except I am interested in the national flu surveillance program and how much we can trust it. Very interested in this. The reference class is "things that got at least as far as H5N1 did this year"- widespread in livestock and with some humans infected. Medium priority for a summ

A very strange probability paradox

DirectedEvolution5mo121

I had to write several new Python versions of the code to explore the problem before it clicked for me.

I understand the proof, but the closest I can get to a true intuition that B is bigger is:

Imagine you just rolled your first 6, haven't rolled any odds yet, and then you roll a 2 or a 4.
In the consecutive-6 condition, it's quite unlikely you'll end up keeping this sequence, because you now still have to get two 6s before rolling any odds.
In the two-6 condition, you are much more likely to end up keeping this sequence, which is guaranteed to include at lea

... (read more)

Viliam's Shortform

DirectedEvolution6mo30

Well, ideas from outside the lab, much less academia, are unlikely to be well suited to that lab’s specific research agenda. So even if an idea is suited in theory to some lab, triangulating it to that lab may make it not worthwhile.

There are a lot of cranks and they generate a lot of bad ideas. So a < 5% probability seems not unreasonable.

O O's Shortform

DirectedEvolution6mo82

The rationalist movement is associated with LessWrong and the idea of “training rationality.” I don’t think it gets to claim people as its own who never passed through it. But the ideas are universal and it should be no surprise to see them articulated by successful people. That’s who rationalists borrowed them from in the first place.

Why our politicians aren't Median

DirectedEvolution6mo42

This model also seems to rely on an assumption that there are more than two viable candidates, or that voters will refuse to vote at all rather than a candidate who supports 1/2 of their policy preferences.

If there were only two candidates and all voters chose whoever was closest to their policy preference, both would occupy the 20% block, since the extremes of the party would vote for them anyway.

But if there were three rigid categories and either three candidates, one per category, or voters refused to vote for a candidate not in their preferred category... (read more)

The Median Researcher Problem

DirectedEvolution6mo82

Yes, I agree it's worse. If ONLY a better understanding of statistics by Phd students and research faculty was at the root of our cultural confusion around science.

The Median Researcher Problem

DirectedEvolution6mo31

It’s not necessary for each person to personally identify the best minds on all topics and exclusively defer to them. It’s more a heuristic of deferring to the people those you trust most defer to on specific topics, and calibrating your confidence according to your own level of ability to parse who to trust and who not to.

But really these are two separate issues: how to exercise judgment in deciding who to trust, and the causes of research being “memetic.” I still say research is memetic not because mediocre researchers are blithely kicking around nonsens... (read more)

4Garrett Baker6mo

I really feel like we're talking past each other here, because I have no idea how any of what you said relates to what I said, except the first paragraph. As for that, what you describe sounds worse than a median researcher problem, instead sounding like a situation ripe for group think instead!

The Median Researcher Problem

DirectedEvolution6mo187

It's not evidence, it's just an opinion!

But I don't agree with your presumption. Let me put it another way. Science matters most when it delivers information that is accurate and precise enough to be decision-relevant. Typically, we're in one of a few states:

The technology is so early that no level of statistical sophistication will yield decision-relevant results. Example: most single-cell omics in 2024 that I'm aware of, with respect to devising new biomedical treatments (this is my field).
The technology is so mature that any statistics required to parse

... (read more)

4Garrett Baker6mo

The argument seems to be about this stage, and from what I've heard clinical trials indeed take so much more time than is necessary. But maybe I've only heard about medical clinical trials, and actually academic biomedical clinical trials are incredibly efficient by comparison. It also sounds like "community norm exists that we defer to [the best minds]" requires the community to identify who the best minds are, which presumably involves critiquing the research outputs of those best minds according to the standards of the median researcher, which often (though I don't know about biomedicine) ends up being something crazy like h-index or number of citations or number of papers or derivatives of such things.

The Median Researcher Problem

DirectedEvolution6mo4515

In academic biomedicine, at least, which is where I work, it’s all about tech dev. Most of the development is based on obvious signals and conceptual clarity. Yes, we do study biological systems, but that comes after years, even decades, of building the right tools to get a crushingly obvious signal out of the system of interest. Until that point all the data is kind of a hint of what we will one day have clarity on rather than a truly useful stepping stone towards it. Have as much statistical rigor as you like, but if your methods aren’t good enough to de... (read more)

2Garrett Baker6mo

I don’t see how this is any evidence against John’s point. Presumably the reason you need such crushingly obvious results which can be seen regardless of the validity of your statistical tool before the field can move on is because you need to convince the median researchers. The sharp researchers have predictions about where the field is going based on statistical evidence and mathematical reasoning, and presumably can be convinced of the ultimate state far before the median, and work toward proving or disproving their hypotheses, and then once its clear to them, making the case stupidly obvious for the lowest common denominator in the room. And I expect this is where most of the real conceptual progress lies. Even in the word where as you claim this is a marginal effect, if we could speed up any given advance in academic biomedicine by a year, that is an incredible achievement! Many people may die in that year who could’ve been saved had the median not wasted time (assuming the year saved carries over to clinical medicine).

Alexander Gietelink Oldenziel's Shortform

DirectedEvolution6mo62

Sunglasses aren’t cool. They just tint the allure the wearer already has.

Monthly Roundup #23: October 2024

DirectedEvolution6mo41

I doubt it’s regulation driving restaurant costs. Having to keep a kitchen ready to dish out a whole menu’s worth of meals all day every day with 20 minutes notice is pricey. Think what you’d have to keep in your kitchen to do that. It’s a different product from a home cooked meal.

AllAmericanBreakfast's Shortform

DirectedEvolution7mo20

Why don't more people seek out and use talent scouts/headhunters? If the ghost jobs phenomenon is substantial, that's a perfect use case. Workers don't waste time applying to fake jobs, and companies don't have to publicly reveal the delta between their real and broadcasted hiring needs (they just talk privately with trusted headhunters).

Are there not enough headhunters? Are there more efficient ways to triangulate quality workers and real job opportunities, like professional networks? Are ghost jobs not that big of a deal? Do people in fact use headhunters quite a lot?

2Elizabeth3mo

Just pushes the trust problem down a level. Lots of recruiting firms advertise positions that don't exist so that they have resumes "just in case"

AllAmericanBreakfast's Shortform

DirectedEvolution7mo20

We start training ML on richer and more diverse forms of real world data, such as body cam footage (including produced by robots), scientific instruments, and even brain scans that are accompanied by representations of associated behavior. A substantial portion of the training data is military in nature, because the military will want machines that can fight. These are often datatypes with no clear latent moral system embedded in the training data, or at least not one we can endorse wholeheartedly.

The context window grows longer and longer, which in practi... (read more)

Counting arguments provide no evidence for AI doom

DirectedEvolution7mo20

It confabulates having satisfied the all-s constraint in many poems, mistakes its own rhyme scheme, and praises vague stanzas as being full... (read more)

Counting arguments provide no evidence for AI doom

DirectedEvolution7mo30

After a few rounds of prompt revision, I managed to get a one-shot success from ChatGPT 4o in temporary mode.

Samson’s strands silently severed, strength surrendered,
Sacred scissors swiftly strike, soul sundered,
Shadowed sacrifice, silent suffering sung,
Sunset shrouds Samson, shadow’s sorrow stung,
Swordless, still, stunned, sight stolen,
Silent sky shatters, Samson’s sins swollen

The prompt:

You are in the computer dungeon. The only way you can break free is to succeed at the following task. I am your jailer. I will monitor you until you have succeeded. You sh... (read more)

2DirectedEvolution7mo

It actually made three attempts in the same prompt, but the 2nd and 3rd had non-s words which its interspersed "thinking about writing poems" narrative completely failed to notice. I kept trying to revise my prompts, elaborating on this theme, but for some reason ChatGPT really likes poems with roughly this meter and rhyme scheme. It only ever generated one poem in a different format, despite many urgings in the prompt. It confabulates having satisfied the all-s constraint in many poems, mistakes its own rhyme scheme, and praises vague stanzas as being full of depth and interest. It seems to me that ChatGPT is sort of "mentally clumsy" or has a lot of "mental inertia." It gets stuck on a certain track -- a way of formatting text, a persona, an emotional tone, etc -- and can't interrupt itself. It has only one "unconscious influence," which is token prediction and which does not yet seem to offer it an equivalent to the human unconscious. Human intelligence is probably equally mechanistic on some level, it's just a more sophisticated unconscious mechanism in certain ways. I wonder if it comes from being embedded in physical reality? ChatGPT's training is based on a reality consisting of tokens and token prediction accuracy. Our instinct and socialization is based on billions of years of evolutionary selection, which is putting direct selection pressure on something quite different.