Example of OpenErrata nitting the Sequences I just published OpenErrata, a browser extension that investigates the posts you read using your OpenAI API key, and underlines any factual claims that are sourceably incorrect. It then saves the results of the investigation so that whenever anybody else using the extension visits...
Almost one year ago now, a company named XBOW announced that their AI had achieved "rank one" on the HackerOne leaderboard. HackerOne is a crowdsourced "bug bounty" platform, where large companies like Anthropic, SalesForce, Uber, and others pay out bounties for disclosures of hacks on their products and services. Bug...
Suppose Fred opens up a car repair shop in a city which has none already. He offers to fix the vehicles of Whoville and repair them for money; being the first to offer the service to the town, he has lots of happy customers. In an abstract sense Fred is...
Beware LLMs' pathological guardrailing Modern large language models go through a battery of reinforcement learning where they are trained not to produce code that fails in specific, easily detectable ways, like crashing the program or causing failed unit tests. Almost universally, this means these models have learned to produce code...
[CW: Partial nudity] I'm a straight man. If you're a straight man who befriends other straight men, you will occasionally have conversations that sound like this: * A: I hate how Hollywood is pushing all of these unattractive actors lately. I want to see more people like [Celebrity Name]. *...
Before I voice some really normie, stereotypically overblown concerns on LessWrong, let me state a couple things about myself: * I'm a non-Jewish white man * I've never been concerned about race relations before * I don't think of myself as the kind of person who gets overly politically activated...
About nine months ago, I and three friends decided that AI had gotten good enough to monitor large codebases autonomously for security problems. We started a company around this, trying to leverage the latest AI models to create a tool that could replace at least a good chunk of the...