Critique of some recent philosophy of LLMs’ minds
I structure this post as a critique of some recent papers on the philosophy of mind as applied to LLMs: specifically, on whether we can say that LLMs think, reason, understand language, refer to the real world when producing language, have goals and intents, etc. I also use this discussion as a springboard to express some of my views on the ontology of intelligence, agency, and alignment.

* Mahowald, Ivanova, et al., “Dissociating language and thought in large language models: a cognitive perspective” (Jan 2023). Note that this is a broad review paper, synthesising findings from computational linguistics, cognitive science, and neuroscience, as well as offering an engineering vision (perspective) for building an AGI (primarily in section 5). I don’t argue with these aspects of the paper’s content (although I disagree with parts of their engineering perspective, I think that engaging in this disagreement would be infohazardous). I argue with the philosophical content of the paper, which is revealed in the language the authors use and the conclusions they draw, as well as in the ontology of linguistic competencies that the authors propose.
* Shanahan, “Talking About Large Language Models” (Dec 2022).

Dissociating language and thought in large language models: a cognitive perspective

In this section, I briefly summarise the gist of the paper by Mahowald, Ivanova, et al. for the convenience of the reader. Abstract:

> Today’s large language models (LLMs) routinely generate coherent, grammatical and seemingly meaningful paragraphs of text. This achievement has led to speculation that these networks are—or will soon become—“thinking machines”, capable of performing tasks that require abstract knowledge and reasoning. Here, we review the capabilities of LLMs by considering their performance on two different aspects of language use: ‘formal linguistic competence’, which includes knowledge of rules and patterns of a given language, and ’functional lingu
I don't understand why people rave so much about Claude Code etc., nor how they really use these agents. The problem is not capability--sure, today agents can go far without stumbling or losing the plot. The problem is that they won't go in the direction I want.
It's because my product vision, architectural vision, and code quality "functions" are complex: very tedious to express in CLAUDE.md/AGENTS.md, and often hardly expressible in language at all. "I know it when I see it." Hence I keep the agent "on a short leash" (Karpathy)--in Cursor.
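To illustrate what "tedious to express" means in practice, here is a hypothetical excerpt of the kind of rules file one ends up writing (the file layout and every rule below are my own invented examples, not from any real project). Even a long list like this only approximates the actual quality "function", because each rule has exceptions I can't enumerate in advance:

```markdown
# CLAUDE.md (hypothetical excerpt)

## Code quality
- Prefer small, pure functions; avoid classes unless state is genuinely shared.
- No speculative abstractions: introduce an interface only at the second call site.
- Return errors for expected failures; reserve exceptions for programmer bugs.
- Comments explain *why*, never *what* the code already says.

## Architecture
- Keep the domain layer free of I/O; adapters live in `infra/`.
- New modules must not import from `legacy/` (but `legacy/` may import new code).

## Taste (hard to operationalize)
- "Don't make it clever." -- but sometimes clever is exactly what I want,
  and I can only tell which case applies when I see the diff.
```

The last section is the crux: the rules that matter most resist being written down at all, which is why the leash stays short.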
This makes me think that at least in coding (also, probably some other types of engineering, design, soon perhaps content creation, perhaps...