LESSWRONG
LW

Pablo Villalobos — LessWrong

Replying toMost successful entrepreneurship is unproductive

Most successful entrepreneurship is unproductive

This applies to basically any form of competition beyond the amount needed to avoid pricing power.

For example, if you are a job seeker, you can dedicate your time to learning actual skills and looking for employers with unmet needs, or you can rack up your credentials to be candidate #1 instead of #2 in an oversubscribed hiring process.

That doesn't make these activities unproductive in most cases: if customers or employers pick you it's because you are providing some marginal value, but the marginal value might be low compared to the amount you capture

Replying toStatus Is The Game Of The Losers' Bracket

Pablo Villalobos3mo

Status Is The Game Of The Losers' Bracket

I think this is partly cope. Middle management at large companies might have a significant political component but large companies still have much higher labor productivity than small ones, they still represent like a quarter of OECD economic output, and the average middle manager is probably still creating very significant amounts of marginal value despite the political infighting.

Yes, there might be very few middle managers at the top of Forbes' list. But look at millionaires, the people at the top 10% of the wealth distribution in the US. Most middle managers will probably be there, along with other highly paid professionals. And if you found a startup and it fails, you won't... (read more)

Pablo Villalobos10mo

Personal view as an employee: Epoch has always been a mix of EAs/safety-focused people and people with other views. I don't think our core mission was ever explicitly about safety, for a bunch of reasons including that some of us were personally uncertain about AI risk, and that an explicit commitment to safety might have undermined the perceived neutrality/objectiveness of our work. The mission was raising the standard of evidence for thinking about AI and informing people to hopefully make better decisions.

My impression is that Matthew, Tamay and Ege were among the most skeptical about AI risk and had relatively long timelines more or less from the beginning. They have contributed enormously... (read more)

Replying toMadrid – ACX Meetups Everywhere Spring 2025

Pablo Villalobos10mo

Madrid – ACX Meetups Everywhere Spring 2025

We're in the nearby bar, Casa Remigio, since the theater is occupied

Replying toHow to Make Superbabies

Pablo Villalobos1y

How to Make Superbabies

I suspect the analogy does not really work that well. Much of human genetic variation is just bad mutations that take a while to be selected out. For example, maybe a gene variant slightly decreases the efficiency of your neurons and makes everything in your brain slightly slower

Replying toMarket Capitalization is Semantically Invalid

Pablo Villalobos1y

Market Capitalization is Semantically Invalid

I stand corrected. Although the broader point about share prices noisily approximating a discounted expected cash flow which can be added or multiplied still holds

Replying toMarket Capitalization is Semantically Invalid

Pablo Villalobos1y

Market Capitalization is Semantically Invalid

There is a sense in which the price approximates an intrinsic property of the shares that you can add up or multiply by the number of shares. Each share gives you a vote in the shareholder assembly and an equal portion of the dividends. If you had all the shares, you would own the company and in principle could pay yourself as much as the company can afford in dividends.

How much the company can afford to pay in dividends in the future is basically how much net operating profit after taxes (NOPAT) the company will have.

If you have a prediction of the future NOPAT of the company, it implies a present value... (read more)

-2

Madrid - ACX Meetups Everywhere Fall 2024

Pablo Villalobos

This year's Fall ACX Meetup in Madrid.

Location: El Retiro Park, puppet theatre (https://www.esmadrid.com/en/tourist-information/teatro-de-titeres-de-el-retiro) – https://plus.codes/8CGRC897+F8M

Contact: pvillalobos@proton.me

Replying toA Step Against Land Value Tax

Pablo Villalobos2y

A Step Against Land Value Tax

The arguments you make seem backwards to me.

All this to say, land prices represent aggregation effects / density / access / proximity of buildings. They are the cumulative result of being surrounded by positive externalities which necessarily result from other buildings not land. It is the case that as more and more buildings are built, the impact of a single building to its land value diminishes although the value of its land is still due to the aggregation of and proximity to the buildings that surround it.

Yes, this is the standard Georgist position, and it's the reason why land owners mainly capture (positive and negative) externalities from land use around them, not... (read 463 more words →)

Data on AI

Robi Rahman

Robi Rahman, Jaime Sevilla Molina, Pablo Villalobos, Ben Cottier

Epoch AI collects key data on machine learning models from 1950 to the present to analyze historical and contemporary progress in AI.

This is a big update to the website, and the datasets have substantially expanded since last year.

Announcing Epoch's newly expanded Parameters, Compute and Data Trends in Machine Learning database

Robi Rahman

Robi Rahman, Jaime Sevilla Molina, Tamay, Ege Erdil, Pablo Villalobos, Ben Cottier, Matthew Barnett

The performance of machine learning models is closely related to their amount of training data, compute, and number of parameters. At Epoch, we’re investigating the key inputs that enable today’s AIs to reach new heights.

Our recently expanded Parameter, Compute and Data Trends database traces these details for hundreds of landmark ML systems and research papers.

In the past six months, we’ve added 240 new language models and 170 compute estimates. We will be maintaining this dataset, updating it with more historical information, and adding new significant releases. It's a valuable resource for journalists, academics, policymakers, and anyone interested in understanding the trajectory of AI.

Explore the interactive visualization, check out the documentation, and access the data for your own research at epochai.org/data/pcd.

EA Madrid social

Pablo Villalobos

¡Desde Altruismo Eficaz Madrid hemos preparado un evento social para que podamos conocernos y charlar en un ambiente más relajado! Es un momento perfecto para

Hablar informalmente de cualquier tema sobre el que quieras saber más.
Conocernos mejor y compartir ideas y proyectos.
Y pasar un buen rato con personas que comparten tu interés por las formas más efectivas de mejorar el mundo.
Disfrutar de buena comida vegana.

¡Te esperamos!

----------------------------------------------------------------------------

We at Effective Altruism Madrid have organized a social event so we can get to know each other and chat in a more relaxed atmosphere! It's a perfect time to:

Casually discuss any topic you want to know more about.
Get to know each other better and share ideas and projects.
Enjoy a great time with people who share your interest in the most effective ways to make the world better.
Savor delicious vegan food.

We look forward to seeing you!

Trading off compute in training and inference (Overview)

Pablo Villalobos

Summary: Some techniques allow to increase the performance of Machine Learning models at the cost of more expensive inference, or reduce inference compute at the cost of lower performance. This possibility induces a tradeoff between spending more resources on training or on inference. We explore the characteristics of this tradeoff and outline some implications for AI governance.

Key takeaways

In current Machine Learning systems, the performance of a system is closely related to how much compute is spent during the training process. However, it is also possible to augment the capabilities of a trained model at the cost of increasing compute usage during inference or reduce compute usage during inference at the cost of... (read 1950 more words →)

Replying toACX Meetup Madrid

Pablo Villalobos3y

ACX Meetup Madrid

We'll be at the ground floor!

Replying toRevisiting the Horizon Length Hypothesis

Pablo Villalobos3y

Revisiting the Horizon Length Hypothesis

Not quite. What you said is a reasonable argument, but the graph is noisy enough, and the theoretical arguments convincing enough, that I still assign >50% credence that data (number of feedback loops) should be proportional to parameters (exponent=1).

My argument is that even if the exponent is 1, the coefficient corresponding to horizon length ('1e5 from multiple-subjective-seconds-per-feedback-loop', as you said) is hard to estimate.

There are two ways of estimating this factor

Empirically fitting scaling laws for whatever task we care about
Reasoning about the nature of the task and how long the feedback loops are

Number 1 requires a lot of experimentation, choosing the right training method, hyperparameter tuning, etc. Even OpenAI made some mistakes... (read more)

Revisiting the Horizon Length Hypothesis

Pablo Villalobos

Summary: As part of my work at Epoch, I investigated the horizon length hypothesis - the idea that the horizon length of a task is predictive of the training compute needed to learn that task. My current (weak) conclusion is that the horizon length hypothesis can't be used in practice to estimate the compute requirements for training transformative AI because of the difficulty of 1) measuring horizon length accurately and 2) accounting for compute-saving techniques like curriculum learning. The evidence is weak, so we decided to publish a summary of our research so far here for public scrutiny and gather feedback while we decide whether we want to work more on the... (read 743 more words →)

ACX Meetup Madrid

Pablo Villalobos

Come join us for a social!

I have an intuition that any system that can be modeled as a committee of subagents can also be modeled as an agent with Knightian uncertainty over its utility function. This goal uncertainty might even arise from uncertainty about the world.

This is similar to how in Infrabayesianism an agent with Knightian uncertainty over parts of the world is modeled as having a set of probability distributions with an infimum aggregation rule.

Scaling Laws Literature Review

Pablo Villalobos

Common shape of a scaling law, taken from Hestness et al. (2017)

Executive summary

Scaling laws are predictable relations between the scale of a mode and performance or other useful properties.
I have collected a database of scaling laws for different tasks and architectures, and reviewed dozens of papers in the scaling law literature.
My main takeaways are:
- Functional forms: a basic power law can effectively model the scaling behavior in the power-law region but not the transitions to the other two regions. For this, either the M4 estimator or the BNSL estimator introduced below seem to be the best options right now.
- Transfer learning: there is not a simple universal scaling law for transfer learning between arbitrary tasks. When

... (read 956 more words →)

Causal abstractions vs infradistributions

Pablo Villalobos

Summary: I illustrate the relationship between Wentworth-style causal abstractions and infradistributions, and how they both deal with nonrealizability by throwing away information. If you have a basic intuition for causal abstractions, this might help you understand infradistributions better. And if you are comfortable with infradistributions, this might help you translate abstraction-related concepts to a more rigorous setting.

The core idea of this post is the following:

Causal abstractions are "forgetful" (non-injective) functions of probability distributions.
Infradistributions are (not exactly) sets of probability distributions.
Given a forgetful function of a probability distribution, we can form the set of all distributions that are equivalent in a certain sense. Similarly, given a set of distributions we can define a

... (read 1645 more words →)

Will we run out of ML data? Evidence from projecting dataset size trends

Pablo Villalobos

Summary: Based on our previous analysis of trends in dataset size, we project the growth of dataset size in the language and vision domains. We explore the limits of this trend by estimating the total stock of available unlabeled data over the next decades.

Read the full paper in arXiv.

Our projections predict that we will have exhausted the stock of low-quality language data by 2030 to 2050, high-quality language data before 2026, and vision data by 2030 to 2060. This might slow down ML progress.

All of our conclusions rely on the unrealistic assumptions that current trends in ML data usage and production will continue and that there will be no major innovations in data... (read 511 more words →)

I'm not sure if using the Lindy effect for forecasting x-risks makes sense. The Lindy effect states that with 50% probability, things will last as long as they already have. Here is an example for AI timelines.

The Lindy rule works great on average, when you are making one-time forecasts of many different processes. The intuition for this is that if you encounter a process with lifetime T at time t<T, and t is uniformly random in [0,T], then on average T = 2*t.

However, if you then keep forecasting the same process over time, then once you surpass T/2 your forecast becomes worse and worse as time goes by. Just when t is... (read more)

LESSWRONG
LW

LESSWRONG
LW

Pablo Villalobos

Announcing Epoch: A research organization investigating the road to Transformative AI

Compute Trends Across Three eras of Machine Learning

Will we run out of ML data? Evidence from projecting dataset size trends

Parameter counts in Machine Learning

Pablo Villalobos

Data on AI

Announcing Epoch's newly expanded Parameters, Compute and Data Trends in Machine Learning database

Trading off compute in training and inference (Overview)

Revisiting the Horizon Length Hypothesis

Scaling Laws Literature Review

Causal abstractions vs infradistributions

Will we run out of ML data? Evidence from projecting dataset size trends

Pablo Villalobos

Announcing Epoch: A research organization investigating the road to Transformative AI

Compute Trends Across Three eras of Machine Learning

Will we run out of ML data? Evidence from projecting dataset size trends

Parameter counts in Machine Learning

Pablo Villalobos

Data on AI

Announcing Epoch's newly expanded Parameters, Compute and Data Trends in Machine Learning database

Trading off compute in training and inference (Overview)

Revisiting the Horizon Length Hypothesis

Scaling Laws Literature Review

Causal abstractions vs infradistributions

Will we run out of ML data? Evidence from projecting dataset size trends

Key takeaways

Executive summary