The Spectrum of Openness
There is no easy binary that opposes “open” and “closed” in the case of artificial intelligence (AI) models. Instead, openness should be viewed as a spectrum.1 This more flexible understanding of openness fosters productive conversations about both the varied benefits of openness and the marginal risks associated with open models relative to closed models or what is already publicly available online.2
“Open-source AI” is a term with a definition that is still in flux and likely to be defined differently by different stakeholders. The Open Source Initiative (OSI), a nonprofit that advocates for the benefits of open source and acts as a multistakeholder standards body that maintains a widely-used definition of open source software as a public benefit, just released its first definition of the term “open source AI.”3 The OSI definition clarifies that to be open, an entire AI system must be considered—both in its “fully functional structure and its discrete structural elements.” OSI further notes that “the requirements are the same, whether applied to a system, a model, weights and parameters, or other structural elements.”4
Efforts such as OSI’s to align on a specific definition of open-source AI that is based on certain criteria are helpful, as different actors are currently using the term in different, often self-serving, ways. In a paper last year, David Gray Widder, Sarah West, and Meredith Whittaker described this phenomenon of “openwashing,” in which a model developer misleadingly claims the mantle of openness for public relations gains while actually providing access to their model in a way that “should be understood as ‘closed.’”5 They suggest that models fall on “gradients of openness” in which the term “open” can describe models that “offer vastly differing levels of access.”
The authors offer three attributes for understanding the openness of models: (1) transparency, (2) reusability, and (3) extensibility. Transparency denotes “the ability to access and vet source code, documentation and data;” reusability is “the ability and licensing needed to allow third parties to reuse source code and/or data;” and extensibility is “the ability to build on top of extant off-the-shelf models, ‘tuning’ them for one or another specific purpose.”6 These attributes are a useful framework for examining a model’s software components.
These attributes also highlight many of the same principles required by OSI’s definition. For OSI, an open-source AI system must allow use “for any purpose and without having to ask permission,” the ability to study how the system works, the ability to modify it, and to share it “with or without modification.” These overlaps suggest a growing agreement around the term “open,” particularly the need to include transparency, access, and modification in the definition.
Relevant to the ongoing discussions about how to define open models is the question of access to training data. While some models describe themselves as open and provide code and model weights, they do not provide access to the data used when training the model. A group of scholars recently suggested using the term “open-access AI” in this context, arguing that “‘open-source AI’ is a misnomer for such models” due to “meaningful differences in access, control, and development.”7 OSI’s definition similarly regards access to training data as an essential test in determining whether or not a model is truly open source.
We believe there are at least five key ways in which a model manifests openness, whether it is a large foundation model or a more narrowly tailored one:
- Open code that can be downloaded, modified, shared, and used by others;
- Open licenses that allow third parties to use the model;
- Transparency about model inputs (data sources, model weights8);
- Transparency about envisioned threats from models and ways to mitigate against undesirable downstream effects (e.g., malicious actors fine-tuning the model to cause clear harms); and
- Open standards for interconnection and communication among AI models that allow people and companies to switch between models (portability) and for models to interoperate with one another.
To illustrate the concept of a spectrum of openness, we offer a simplified breakdown with examples. We have chosen this range of attributes as an exercise in illustratively drawing the line between models, recognizing that the spectrum could consist of many more attributes and reflect greater nuance.
The exercise of defining open-source AI, or even placing AI models along such a spectrum, demonstrates that the emerging requirements for an AI model to be considered open are similar to those in earlier free and open-source software projects. To examine—as OSI suggests—the entire system of an AI, we must investigate more attributes than simply the code or the weights. We must think more broadly of models as software projects. The history of open-source software is full of instructive examples of how to (and not to) structure and maintain large software projects, and it is also full of examples of unintended consequences that have shaped tech.
The term “open source” is used throughout this report in a way that includes consideration of all software licensing that meets both the Free Software Foundation’s definition of free software9 and the Open Source Initiative’s “Open Source Definition.” (OSD).10 We have chosen to use “open-source” to refer to code as it resonates with the current discourse around open models and not because of a particular preference or recommendation for existing open software licenses. We use the term “open model” throughout this report to echo that discussion but recognize that the lexicon around AI and openness is changing and may ultimately need more terminology—like “open access”—to meaningfully distinguish among model types in the future.
Much of the prevailing discourse around open models focuses on risks and fails to fully account for the significant societal benefits of open models to public transparency and accountability, to unexpected innovation and competition, to education and research, and to security. The following sections explore each of these benefits in further detail.
Citations
- “Several speakers challenged the notion of a binary between ‘open’ and ‘closed’ models, pointing toward a spectrum of options regarding the level of access to system components such as datasets, code, model cards, and model weights.” Amanda Leal, Towards Effective Governance of Foundation Models and Generative AI: Takeaways from the Fifth Edition of The Athens Roundtable on AI and the Rule of Law (Future Society, March 2024), 32, source. See also Dual-Use Foundation Models, source.
- “One thing we have already learned is the importance of focusing on the marginal or differential risks and benefits of open weights. For example, we need to measure the risks of open-weight models relative to the risks that already exist today from widely-available information, or from closed models. We have also been encouraged to hear that this is not a binary choice of ‘open’ vs. ‘closed.’ Rather there is a broader ‘gradient of openness’ that we need to consider and that may offer broader options for policy.” Alan Davidson, “National Security and Open Weight Models: Remarks of Alan Davidson,” National Telecommunications and Information Administration, March 22, 2024, source.
- “The Open Source AI Definition 1.0,” Open Source Initiative, October 28, 2024, source.
- “The Open Source AI Definition 1.0,” source.
- David Gray Widder, Sarah West, and Meredith Whittaker, “Open (For Business): Big Tech, Concentrated Power, and the Political Economy of Open AI,” SSRN, August 17, 2023, source.
- Widder, West, and Whittaker, “Open (For Business),” source.
- Parth Nobel, Alan Z. Rozenshtein, and Chinmayi Sharma, “Open-Access AI: Lessons From Open-Source Software,” Lawfare, October 25, 2024, source.
- Model weights refer to the numerical value an AI model gives to a piece of information to show the relative strength between it and another piece of information.
- “What is Free Software? The Free Software Definition,” Free Software Foundation, December 27, 2016, source.
- “The Open Source Definition,” Open Source Initiative, March 22, 2007, source.