Open source software has long differentiated between “free as in speech” (libre) and “free as in beer” (gratis). In the first case, libre software has a license that allows the user freedom to view the source and modify it, understand it, and remix it. In the second case, gratis software does not need to be paid for, but the user doesn’t necessarily have access to the pieces, can’t make new versions, and cannot remix or change it.
...
If Open Source AI is neither gratis or libre, then those calling free model weights “Open Source,” should figure out what free means to them. Perhaps it’s “free as in oxygen” (dangerous due to reactions it can cause), or “free as in birds” (wild, without any person responsible).
I’m not necessarily opposed to judicious release of model weights, though as with any technology, designers and developers should consider the impact of their work before making or releasing it, as LeCun has recently agreed. But calling this new competitive strategy by Facebook “Open Source” without insisting on the actual features of open source is an insult to the name.
Have you met Mistral, Phi-2, Falcon, MPT, etc ... ? There are plenty of freely remixable models out there; some even link to their datasets and recipes involved in processing them (though I wouldn't be surprised if some relevant thing got left out because no one researched that it was relevant yet).
Though I'm reasonably sure Llama license isn't preventing viewing the source (though of course not the training data), modifying it, understanding it and remixing it. It's a less open license than others, but Facebook didn't just free-as-in-beer release a compiled black box you put on your computer and can never change; research was part of the purpose, and needs to do that. It's not the best open source license, but I'm not sure if being a good example of something is required to meet the definition.
This is technically correct but irrelevant. Meta doesn't provide any source code, by which I mean the full set of precursor steps (including the data and how to process it).
Generally speaking, a license defines usage rights; it has nothing to do with if/how the thing (e.g. source code) is made available.
As a weird example, one could publish a repository with a license but no source code. This would be odd. The license would have no power to mandate the code be released; th... (read more)