Introduction

Discrimination and bias in AI remain unresolved problems in the current information civilization (Zuboff, 2019). While AI has the potential to facilitate the achievement of all United Nations Sustainable Development Goals, it can also widen existing social gaps by reproducing and often aggravating societal bias (Vinuesa et al., 2020). ML systems in particular have often been found to exacerbate representational and allocational harms to vulnerable salient groups (Suresh & Guttag, 2021). As a result, these groups not only receive demeaning treatment, but also fewer resources and opportunities. AI-amplified bias has been identified in critical services such as education, health and justice (Floridi, 2020).

ML bias has other particularities that deserve special attention. Users are often not aware of it and developers cannot always explain it (Barocas & Selbst, 2016). In this context, governmental efforts to regulate AI have gained traction in the past few years (White House, 2016; European Commission, 2021). In addition, there has been a proliferation of ethical guidelines (Algorithm Watch, 2021) in what has been described as a “moral panic” (Ess, 2020). These guidelines have been found to converge on specific topics (Hagendorff, 2020; Zeng et al., 2019) and have been summarized as five ethical principles: transparency, justice and fairness, non-maleficence, responsibility and privacy (Jobin, 2019). This article focuses on the principle of justice and fairness and facilitates the principle of transparency by indicating which specific steps of the fairness decision making process should be openly disclosed.

In the nascent field of AI ethics, these ethical principles have been qualified as appropriate but too abstract, and AI development teams find them difficult to apply in practice. Existing AI ethics guidelines focus on the “what” and fall short of clarifying the operationalization of AI ethics (Floridi, 2019; Morley et al., 2021a; Morley et al., 2021; Vakkuri et al., 2020; Vakkuri and Kemell, 2019). As a result, counterproductive practices such as ethics shopping, ethics bluewashing, ethics lobbying, ethics dumping or ethics shirking are prone to flourish (Floridi, 2019). There is an identified urgency to translate theoretical principles into practical, inclusive processes (Harrison et al., 2020). This article aims to answer the question: how can the principle of justice and fairness be translated into the practice of ML development? With that aim, the paper first provides a conceptual framework, rooted in the social and cognitive sciences, describing different interpretations of fairness. Then, we present the Stakeholders’ Agreement on Fairness (SAF), an iterative process that aims to support ML development teams in fairness decision making at each stage of ML design, testing and deployment, with the active participation of stakeholders. Finally, the article gives guidelines to facilitate the disclosure of fairness decision making and its trade-offs.

In parallel to the production of ethical guidelines, the principle of fairness has been tackled from a technical perspective through bias mitigation. A large body of work has been produced in recent years on bias identification and debiasing of ML systems, especially in Natural Language Processing (NLP) (Bolukbasi et al., 2016; Caliskan et al., 2017; Garg et al., 2018; Guo et al., 2022; Manzini et al., 2019; Nadeem et al., 2020; Zhao et al., 2021). Intersectional biases are gradually being incorporated into the analysis (Lalor et al., 2022), considering the cumulative effect of multiple biases on what Hoffmann (2019) describes as multi-oppressed groups. Approaches such as the “Delphi” project (Jiang et al., 2021) have been presented to explicitly train state-of-the-art ML models on moral judgements, weighing competing moral concerns between broad ethical norms and personal values. Emphasis is also gradually shifting from the quantity to the quality of “greener” data sets (Schick & Schütze, 2020) and to the use of synthetic data that can be aligned with specific value systems (Watson et al., 2019). However, algorithms can also be biased in the way they learn or, more appropriately, in the way they are programmed to learn (Mittelstadt et al., 2016; Tsamados et al., 2021). Additionally, AI solutions are deployed in complex real-world systems, and it is difficult to predict the social impact of an algorithmic system before actually deploying it (Morley et al., 2020).

In spite of all the technical efforts in place to mitigate bias and the proliferation of principle-based AI ethics guidelines (Morley et al., 2021), 79% of tech workers report that they would like practical resources to assist them with ethical considerations (Miller & Coldicott, 2019). It has been suggested that a more holistic approach to AI bias mitigation is required, with a focus not only on data and algorithms but also on the procedures carried out by developers (Floridi & Taddeo, 2016; Morley et al., 2021a). While there is an important body of work describing ML system design processes (Lehr & Ohm, 2017), the quality management of such systems (Horch, 1996) and auditing practices at each stage of end-to-end ML development (Raji et al., 2020), to date there is no process describing how to manage fairness decision making throughout ML design. This article fills this gap.

While participatory approaches to ML design are gaining momentum (Martin et al., 2020) and a multi-stakeholder approach has been identified as appropriate for making AI principles actionable (Stix, 2021), there is a lack of clarity on how to implement inclusive AI in practice (Birhane et al., 2022). This article pursues an instrumental objective: to reach acceptable agreements among stakeholders to manage bias in a specific ML system. In addition, the paper pursues an intrinsic objective: to challenge power imbalances in fairness decision making during ML design. The Stakeholders’ Agreement on Fairness (SAF) process described in this article aims to go beyond societal value alignment design (Dobbe et al., 2018; Gabriel & Ghazavi, 2021). It intends to encourage reflection on societal values by providing a framework in which representatives of vulnerable salient groups can express their needs and have an impact on ML design (Sloane et al., 2020).

Finally, there is an identified need in the AI ethics literature to analyze ML bias on the basis of social science research. ML bias is often described as “statistical bias”, or a mismatch between the sample used to train the model and the world as it currently is (Mitchell et al., 2021). However, biases are not exclusive to the online world (Card & Smith, 2020). Considering bias purely as a technical problem misses part of the picture (Crawford, 2017); bias in AI should be considered a socio-technical problem (Dignum, 2022). Blodgett et al. (2020) analyzed 146 papers describing bias in NLP systems and found that the proposed quantitative techniques do not engage with the relevant literature outside ML. In this article, we offer a method to reflect on and mitigate bias in ML systems that is not limited to statistical bias but also tackles societal bias.

This paper aims to fill the identified gaps in the area of ML fairness by, first of all, providing the conceptual background outside AI needed to engage critically with what constitutes bias. For that purpose, the article describes the nature of prejudice, discrimination and bias both in the online and the offline world. The paper then proposes the SAF process, a hands-on, end-to-end methodology that translates the principle of justice and fairness into practice with specific actions to assist development teams at each step of the design, building, testing, deployment and monitoring stages of the ML lifecycle. This article does not aim to describe the ML process of ideation and development, but the process to manage bias within the ideation and development of ML systems. The SAF process foresees the identification and participation of legitimate stakeholders, including representatives of socially salient groups. As a result, it paves the way for the disclosure of the always imperfect trade-offs involved in fairness decision making and allows for distributed responsibility for such decisions. The article discusses the challenges and limitations of the proposed process and concludes by identifying further research actions, such as assessing the suitability of the SAF process in practice through case studies, developing guidelines to evaluate compliance with it, and exploring potential adaptations of the methodology to other ethical principles and AI fields.

Conceptual Background: Fairness as an Agreement Among Stakeholders

Biases have been described as systematic and predictable errors in decision making based on available heuristics (Kahneman, 2011). Biases take place when there is an action, such as a decision-making process or an act of speech, and have their cognitive root in prejudices (Ely, 1980; Greenawalt & Dworkin, 1987). Allport (1954) suggests that prejudices are overgeneralized (and therefore erroneous) beliefs that lead to an attitude of favor or disfavor. Prejudices are part of the human learning process, during which we put information into categories and generalize based on previous experience. The only way to question them is by becoming aware of them through knowledge acquisition, which allows for critical thought and empathy (Cortina, 2007; Morgado, 2017).

Prejudices can trigger different degrees of action, defined by Allport as antilocution, avoidance, discrimination, physical attack and even extermination (Fig. 1). One of the consequences of these actions is social stigma, which is associated with feelings of shame on the side of the discriminated (Goffman, 1963) and beliefs of deservingness on the side of the discriminators. When prejudices are socially shared, we talk about stereotypes. These can be transmitted through language, in what is known as linguistic bias, creating a self-perpetuating cycle in which prejudices are shared and maintained (Beukeboom & Burgers, 2019; Maass, 1999) (Fig. 2).

Fig. 1 Prejudices are overgeneralized and erroneous beliefs; the resulting actions can be classified according to different degrees of action (Allport, 1954), which generate social stigma (Goffman, 1963)

Fig. 2 When prejudices are shared within a specific culture we can talk about stereotypes and, when these are transmitted through linguistic bias, prejudices are reinforced in a self-perpetuating cycle

It needs to be clarified that, while discrimination has become a morally laden term (Silvers, 1998), it has no built-in moral status (Eidelson, 2015). In the sense of differential treatment, discrimination is a necessary concept in the legal framework when assigning rights and responsibilities, for example when defining a minimum age to apply for a driving license. This paper is only concerned with wrongful discrimination, which demeans the persons affected in the sense that it denies equal moral worth to individuals (Hellman, 2008). Mitigating discrimination, however, should not be interpreted as impartiality or equal treatment. In fact, Young describes the ideal of impartiality as the keystone of a “mechanical interpretation of fairness” (1990, p. 11), which suppresses the difference that needs to be acknowledged in public policy. The mathematical approaches to AI fairness currently being implemented correspond to this mechanical interpretation. They can be grouped as: fairness through unawareness (Card & Smith, 2020; Hardt et al., 2016), demographic or statistical parity (Dwork et al., 2011), individual fairness (Green & Hu, 2018), randomisation (Kroll et al., 2017), and equality of odds/opportunity (Hardt et al., 2016). These mathematical approximations to fairness are mutually incompatible (Card & Smith, 2020; Kleinberg et al., 2016; Tsamados et al., 2021).
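To make this incompatibility concrete, two of the most common criteria can be written in the notation usual in the algorithmic fairness literature, where \hat{Y} denotes the model's prediction, Y the true outcome and A a protected attribute (the formulation below is a standard illustration from that literature, not part of the SAF process itself):

    Demographic parity:  P(\hat{Y} = 1 \mid A = a) = P(\hat{Y} = 1 \mid A = b)
    Equalized odds:      P(\hat{Y} = 1 \mid A = a, Y = y) = P(\hat{Y} = 1 \mid A = b, Y = y), \quad y \in \{0, 1\}

When the base rates P(Y = 1 \mid A = a) differ across groups, an imperfect classifier cannot satisfy both criteria at the same time (Kleinberg et al., 2016), so choosing among such criteria is a normative decision rather than a purely technical one.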

In fact, there is no silver bullet to solve discrimination and bias in AI (Crawford, 2017), and there is no single universal and absolute interpretation of fairness. Young (1990) clarifies that the diverse interpretations of fairness contain premises drawn from the actual social context. Indeed, it has been argued that the widely influential conception of fairness deriving from Rawls’s “distributive justice” (1971) needs to be understood in the context of liberal capitalist societies (Wolfe, 1977; Young, 1981). Coeckelbergh (2022) provides some insights on how some of the main theories of fairness in the social sciences could be translated to AI environments. A distributive justice approach (Rawls, 1971) would require, for example, that algorithms in recruiting apps give priority to individuals who live in worse-off areas. According to an identitarian approach to justice (Fraser & Honneth, 2003), algorithms would implement positive discrimination in favor of vulnerable salient social groups, as described by the “algorithmic reparation” concept (Davis et al., 2021).

Since ML systems cannot be subjected to a single interpretation of fairness, in this paper we propose a methodology to work towards a consensual view of fairness among stakeholders for a specific ML system. The focus should be not only on the result, but on the process of reaching such a consensus, in which pluralism and the participation of stakeholders are key (Tasioulas, 2022). This approach can be found in the literature. For instance, Cortina describes fairness as “an agreement that human beings could discover through dialogue if they were really taken into account” (Cortina, 2011, p. 148). For his part, Lyotard states that “there are language games in which the important thing is to listen, in which the rule deals with audition. Such a game is the game of the just” (1984, p. 71), and Young explains that “rational reflection on justice begins in a hearing, in heeding a call, rather than in asserting and mastering a state of affairs” (Young, 1990, p. 5). The methodology described in this article seeks to provide practical recommendations for reaching agreements with stakeholders on fairness decision making, in order to manage the phenomenon of bias at each stage of ML design.

If ML is to benefit society as a whole, it is essential to understand the specific backgrounds of the stakeholders (the agents affected by the system) (Whittlestone et al., 2019). It has to be noted that the body of work on AI bias focuses on specific demographic dimensions, mainly gender and race (Bolukbasi et al., 2016; Manzini et al., 2019; Nadeem et al., 2020), or on intersectional biases across multiple demographic dimensions (Lalor et al., 2022). This paper argues that these classifications are too coarse and incomplete. A more detailed analysis should be performed to identify those who are going to be adversely affected by the ML system (Stix, 2021). Special attention should be paid to vulnerable salient groups, but stakeholders’ representation should not be limited to these groups only. The corporation that develops and aims to use or market the AI system is responsible not only for the value-ladenness of the resulting technology, but also for identifying the relevant stakeholders (Martin, 2018), as defined in the first step of the Stakeholders’ Agreement on Fairness (SAF) process (Fig. 3).

Fig. 3 The SAF process translates the principle of justice and fairness into the practice of ML development. It is aimed at assisting AI multidisciplinary development teams to understand their own biases. Stakeholders, including vulnerable salient groups (VSG), have an active role in the process

In fact, one could argue that considering only the bias suffered by vulnerable salient groups could be a form of discrimination itself. Indeed, the members of dominant groups can also be victims of discrimination (even though they enjoy unfair advantages) and are therefore included in this process when they are legitimate stakeholders. However, wrongs done to persons in a dominant group are not the same as the discriminatory wrongs that combine to create serious systemic injustice (Stanford Encyclopedia of Philosophy, 2011). Therefore, it is important to pay special attention to the groups that have been identified as being specifically vulnerable to structural discrimination when they are affected by the ML system. Which salient groups count for the purpose of determining an act of discrimination is at the heart of many political and legal debates. We are referring to vulnerable salient groups on the grounds of sex, race, color, language, religion, political or other opinion, national or social origin, association with a national minority, property, birth or other status (European Convention on Human Rights, 2010; International Covenant on Civil and Political Rights, 1966).

The cultural specificity of the AI system in relation to the perception of fairness also needs to be considered. It has been documented that ML systems need to be congruent not only with the personal moral beliefs of developers, but also with the values of the societies where they operate (Carman & Rosman, 2021). Other proposals suggest working towards an intercultural citizenship and universal values (Jiang et al., 2021), and the ethical pluralism approach (Ess, 2020; Wong, 2020) acknowledges the coexistence of universally valid values with an international cultural diversity of moral codes. In practice, Chan (2021) notes that, out of the top 100 universities and companies by publication index, none is from Africa or Latin America. It is therefore essential that the stakeholders participating in the process represent the cultures where the ML system will be used, with special attention to the inclusion of the Global South. Although stakeholders might dissent in terms of values, they all share the capacity for communicative reasoning (Habermas, 1990) to reach agreements on how to manage AI fairness decision making in a specific context.

Bringing the Principle of Justice and Fairness to the Design Level

The Stakeholders’ Agreement on Fairness (SAF) process, shown in Fig. 3, not only aims to align ML technology with human values that receive widespread endorsement (Gabriel & Ghazavi, 2021); it also accompanies stakeholders in a reflective process about their own subjectivity in a specific scenario (Terzis, 2020), questions societal values (Dobbe et al., 2018) and fosters the inclusion of a broader taxonomy of biases beyond those pre-existing in the data. The SAF therefore does not constrain the choices of stakeholders, but encourages them to make informed choices, in line with the concept of pro-ethical design (Floridi, 2016).

Clarifications for each step of the SAF:

0. Identify stakeholders, agree on fairness objectives and decide if AI is required. As a first step, both a landscape assessment and a preliminary impact assessment will be performed by the ML Development Multidisciplinary Team (MLMDT) before consultations with external stakeholders. The aim of the landscape assessment is to describe the contextual environment in which the ML system will be implemented (geopolitical, societal, legal) (Stix, 2021); it will contribute to identifying the stakeholders. In turn, the preliminary impact assessment will help the MLMDT identify potential risks and challenges resulting from the ML development and implementation in that particular landscape, and will guide the consultation process with stakeholders. Secondly, the MLMDT needs to identify the legitimate stakeholders, i.e., all the agents affected by the ML system (both internal and external to the company or institution developing it), including vulnerable salient groups. Within the stakeholders’ group, it is important to balance the need for subject-matter technological expertise with the diversity of perspectives, and to manage power imbalances (Hollis & Whittlestone, 2021). Once the stakeholders’ representatives have been identified, the aim of stage 0 of the SAF process is to question whether the project goals contribute to the human objectives shared by the stakeholders. An explicit agreement on the objectives of the ML system in terms of fairness should be reached among stakeholders. As a result of stage 0, stakeholders must feel empowered to conclude that the ML system is not required at all, in which case it should not be developed (Pasquale, 2019; Russell, 2019). Corporate objectives such as efficiency, performance, accuracy, novelty and state-of-the-art results are to be questioned, and societal benefits in line with the requirement of “diversity, non-discrimination and fairness” (HLEGAI, 2019, p. 14) should be agreed upon and assessed at the end of the ongoing process, when the outputs of the project are obtained (Hollis & Whittlestone, 2021).

1. Define users’ needs, including those of vulnerable salient groups (VSG). Once stakeholders are identified, the MLMDT will focus on documenting the users’ needs that the ML system aims to address, which are key to human-centered design (IDEO.org, 2015). The deeper the MLMDT gets into the users’ reality (including that of VSG), the better it will be able to understand users’ beliefs and values and therefore question social assumptions and prejudices. There are trade-offs in all development processes, and the MLMDT needs to justify the ranking of users’ preferences in order to provide explainability.

2. Start the bias-aware project. Building a diverse team is an integral part of the project in order to achieve ethical pluralism (Ess, 2020). The practical operationalization of AI ethics is not about external impositions but about procedural regularity. Therefore, the process aims to accompany the MLMDT in continuously learning from its own subjectivity and biases, adapting the process across contexts, and reaching agreements with stakeholders (Kroll et al., 2017). The MLMDT should identify the bias risks, which should be taken into account in the business model canvas.

3. Frame the design & discrimination challenges. The needs of the VSG should be taken into account in the brainstorming sessions, paying special attention to personal experiences. Having detailed information on users (including real-life situations of exclusion) provides knowledge of other cultures and contexts that helps identify values and assumptions and counteract prejudices. Stakeholders should be invited into the design team in co-creation sessions (IDEO.org, 2015), in line with the Trustworthy AI requirement of “human agency and oversight” (HLEGAI, 2019, p. 15) and the capability approach to agency (Nussbaum, 2012; Sen, 2001).

4. Train the model and minimize bias in data. Enormous amounts of data tend to include low-quality information and higher levels of bias. Therefore, the focus should not only be on obtaining the maximum quantity of data, but on ensuring the maximum quality of the data, which will often mean working with smaller datasets (Schick & Schütze, 2020). Data needs to be analyzed to challenge assumptions, prejudices and the resulting bias by differentiating direct information from proxies, identifying human influence as well as blind spots (Sampson & Chapman, 2021). Existing approaches to identify and measure bias in data can be explored (Bolukbasi et al., 2016; Garg et al., 2018; Kiritchenko et al., 2014; Manzini et al., 2019; Nadeem et al., 2020; Zhao et al., 2021); a minimal sketch of such a data-level bias audit is given after this list. In addition, the MLMDT needs to bear in mind that value should be provided to users who share data in order to comply with fairness criteria.

5. Program a bias-aware model. Data from the “real world” cannot be assumed to reflect the values agreed within the SAF process. Therefore, algorithms to debias the system are to be foreseen (Bolukbasi et al., 2016; Manzini et al., 2019; Zhao et al., 2021); a sketch of one simple pre-processing technique, reweighing, is given after this list. The development team can also consider using methods to train algorithms to detect bias (Jiang et al., 2021; Sap et al., 2020).

6. Test & iterate to mitigate unfairness. Identified bias risks are to be tested in isolation to ensure unfairness is mitigated; the group-level fairness checks sketched after this list illustrate how such tests can be automated. Users’ feedback can be integrated into several iterations of prototype testing, in line with the concept of non-bias engineering of negotiated ethics (Morley et al., 2021). In addition, emergent bias in the system should be identified by studying the feedback mechanisms between the algorithms and the environment they act upon (Dobbe et al., 2018).

7. Implement, ensuring value to stakeholders. Indicators are to be defined in order to measure the impact of the ML model on stakeholders and monitor the achievement of the agreed objectives on fairness and bias mitigation. This information needs to be publicly available at the launch of the ML model and thereafter, in line with the transparency principle. Target users and other stakeholders, including VSG, are invited to participate in the launch validation, and their feedback is solicited after deployment, in line with the “stakeholder participation” approach recommended in the EU Ethics Guidelines for Trustworthy AI (HLEGAI, 2019, p. 19).

8. Monitoring, assessment and ongoing improvement to mitigate bias. AI ethics focuses on procedural regularity (Morley et al., 2021), and the SAF process is not to be applied as a “one-off” test but re-applied on an ongoing basis as ML systems are revised and re-tuned. The agreed objectives in terms of fairness should be revisited and enriched in line with the critical maturity of society. Consultation of stakeholders post-implementation is foreseen, with a feedback loop to ensure continuous process improvement (Stix, 2021). The SAF is an ongoing process that will evolve and keep track of biases in the context of its time. Since not all biases will be eliminated, the MLMDT will need to perform bias forensics (Crawford, 2017) in order to report on biases in an open and transparent way. For the SAF process to be effectively integrated into the ML development practices of a company or institution, it is recommended to implement it first on a pilot project with clearly identified stakeholders. This will allow the organization to build the in-house expertise needed to address more ambitious projects (Whittlestone & Clark, 2021).
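As a concrete illustration of the data-level bias audit mentioned in step 4, the sketch below (Python with the pandas library; the column names, such as gender and hired, and the file name are hypothetical) reports group representation, per-group outcome rates and crude proxy indicators for one protected attribute. It is a minimal starting point for the MLMDT’s analysis and does not reproduce any of the specific methods cited above.

    import pandas as pd

    def audit_training_data(df: pd.DataFrame, protected: str, outcome: str) -> None:
        """Print simple representation, outcome-rate and proxy indicators for one protected attribute."""
        # 1. Representation: share of each group in the training data.
        print("Group representation:\n", df[protected].value_counts(normalize=True), "\n")

        # 2. Outcome rate per group: large gaps may signal historical or sampling bias.
        print("Outcome rate per group:\n", df.groupby(protected)[outcome].mean(), "\n")

        # 3. Candidate proxies: numeric features strongly associated with group membership,
        #    measured crudely via correlation with a one-hot encoding of the groups.
        groups = pd.get_dummies(df[protected], prefix=protected).astype(float)
        numeric = df.select_dtypes("number").drop(columns=[outcome], errors="ignore")
        proxy_scores = numeric.apply(lambda col: groups.corrwith(col).abs().max())
        print("Possible proxy features (max |corr| with group membership):\n",
              proxy_scores.sort_values(ascending=False).head(10))

    # Hypothetical usage:
    # audit_training_data(pd.read_csv("applicants.csv"), protected="gender", outcome="hired")

Such an audit does not decide anything by itself; it surfaces the disparities and candidate proxies that the MLMDT and the stakeholders then have to interpret.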
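Many pre-processing and in-processing debiasing techniques exist for step 5. As one hedged example, and not one of the methods cited in that step, the following sketch implements the well-known reweighing idea from the fairness literature (often attributed to Kamiran and Calders): each training instance receives a weight that makes the protected attribute and the outcome statistically independent in the weighted sample (column names are hypothetical).

    import pandas as pd

    def reweighing_weights(df: pd.DataFrame, protected: str, outcome: str) -> pd.Series:
        """Instance weights that decouple `protected` from `outcome` in the weighted data."""
        n = len(df)
        p_group = df[protected].value_counts() / n              # P(A = a)
        p_label = df[outcome].value_counts() / n                # P(Y = y)
        p_joint = df.groupby([protected, outcome]).size() / n   # P(A = a, Y = y)

        def weight(row):
            expected = p_group[row[protected]] * p_label[row[outcome]]   # under independence
            observed = p_joint[(row[protected], row[outcome])]           # as found in the data
            return expected / observed

        return df.apply(weight, axis=1)

    # The weights can be passed to most estimators, e.g. (hypothetical names):
    # model.fit(X, y, sample_weight=reweighing_weights(df, "gender", "hired"))

Whether such a re-balancing is appropriate at all is exactly the kind of trade-off that should be agreed with stakeholders and later disclosed.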
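Likewise, the group-level checks referred to in steps 6 and 8 can be automated so that they run on every prototype iteration and again on production data after deployment. The sketch below (plain Python; all inputs are hypothetical) reports demographic-parity and equalized-odds gaps; which gap matters, and which threshold is acceptable, remains a decision for the stakeholders rather than for the code.

    from typing import Dict, Sequence

    def fairness_gaps(y_true: Sequence[int], y_pred: Sequence[int],
                      group: Sequence[str]) -> Dict[str, float]:
        """Largest between-group differences in selection rate (demographic parity)
        and in true/false positive rates (equalized odds)."""
        groups = sorted(set(group))

        def positive_rate(g, keep):
            # Share of positive predictions within group g, restricted to rows where keep(y) holds.
            rows = [p for t, p, a in zip(y_true, y_pred, group) if a == g and keep(t)]
            return sum(rows) / len(rows) if rows else 0.0

        selection = [positive_rate(g, lambda t: True) for g in groups]   # P(pred=1 | A=g)
        tpr = [positive_rate(g, lambda t: t == 1) for g in groups]       # P(pred=1 | Y=1, A=g)
        fpr = [positive_rate(g, lambda t: t == 0) for g in groups]       # P(pred=1 | Y=0, A=g)

        return {
            "demographic_parity_gap": max(selection) - min(selection),
            "tpr_gap": max(tpr) - min(tpr),
            "fpr_gap": max(fpr) - min(fpr),
        }

    # Toy example (illustrative values only):
    # print(fairness_gaps([1, 0, 1, 0], [1, 0, 0, 0], ["a", "a", "b", "b"]))

Re-running the same function on post-deployment data provides the kind of ongoing indicator foreseen in steps 7 and 8.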

Facilitating Trade-offs Disclosure

The SAF process creates the grounds to openly disclose the agreements reached on the principle of justice and fairness. Since decisions are reflected upon and agreed among stakeholders, they are easier to communicate. The SAF process, therefore, facilitates the Trustworthy AI requirement of “transparency, including traceability, explainability and communication” (HLEGAI, 2019, p. 14).

Transparency has been described as a second-order principle because it can be directly addressed from a programming perspective, tackling the black-box effect (Carman & Rosman, 2021; Floridi et al., 2018). It allows organizations to communicate the always imperfect trade-offs (Whittlestone et al., 2019). In fact, it has been argued that when a system is explainable and interpretable it is inherently fairer, since it allows stakeholders to take informed decisions on whether to use the ML system (Binns et al., 2018). AI systems need to be designed to be transparent (Ananny & Crawford, 2018), and Fig. 4 defines the minimum information from the SAF process that should be explicitly communicated in order to disclose the fairness decisions taken throughout the development of the ML system.

Fig. 4 Description of the minimum information throughout the SAF process that needs to be disclosed so that users can take ethically informed decisions on whether or not to use the system, in line with the transparency requirement (HLEGAI, 2019, p. 18)
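In practice, this disclosure can be maintained as a simple, versioned record that is updated at each SAF stage and published alongside the system. The sketch below is only an illustrative guess at the kind of fields Fig. 4 covers, not a normative schema.

    from dataclasses import dataclass, field
    from typing import List

    @dataclass
    class SAFDisclosureRecord:
        """Publishable record of the fairness decisions taken during the SAF process
        (field names are illustrative, not prescribed by the SAF)."""
        system_name: str
        fairness_objectives: List[str]                 # agreed with stakeholders in stage 0
        stakeholders_consulted: List[str]              # including vulnerable salient groups
        identified_bias_risks: List[str]               # stage 2
        fairness_criteria_adopted: List[str]           # e.g. statistical criteria chosen and why
        known_tradeoffs_and_residual_bias: List[str]   # the always imperfect trade-offs
        monitoring_indicators: List[str] = field(default_factory=list)  # stages 7-8
        version: str = "0.1"

Keeping such a record machine-readable makes it easier for external parties, such as auditors or civil society organizations, to track how the disclosed decisions evolve over time.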

Challenges and Limitations

The SAF process is grounded in the state of the art on fairness both in AI and in the social sciences. However, it has not yet been empirically validated. Testing it in a real-world scenario will be required to prove its applicability, confirm its benefits for bias mitigation and address its drawbacks. Secondly, all approaches aiming to tackle complex ethical principles have limitations, and this one is no exception. Table 1 identifies the main challenges and risks as well as the proposed mitigating actions.

Table 1 Challenges and risks of the SAF process and proposed mitigating actions

Conclusions

In recent years numerous studies have acknowledged that ML systems can have harmful consequences in terms of human rights and discrimination. A growing number of voices describe the need to translate the ethical principle of justice and fairness into practice and call for the participation of social science researchers to clarify and contextualize the concept of bias. To fill the existing gaps, this paper provides, first of all, a descriptive framework for the concepts of prejudice, discrimination and bias. Since prejudices originate in the way human beings interpret reality, bias cannot be mitigated completely; rather, it should be managed, not only in the data and algorithms but also in the practices of ML development. With that aim, the SAF process constitutes an end-to-end inclusive framework that encourages an ongoing reflective approach to bias management and mitigation by suggesting specific actions to be taken by a multi-disciplinary team with the active involvement of stakeholders, including VSG. As a result, the SAF process facilitates the disclosure of the trade-offs involved in managing bias, giving users the necessary information to take ethically informed decisions. In addition, the transparency provided by the SAF process facilitates external assessment, because it provides explainability of the decisions taken (Dearden & Rizvi, 2008). The SAF process can therefore constitute a useful tool for NGOs, community organizations and government officials to monitor ML and encourage its alignment with broad civic goals rather than narrow commercial interests (Whittlestone & Clark, 2021).

Societies where ML systems operate are becoming better informed and more critically aware of the challenges raised by AI ethics. Stakeholders expect ML systems to benefit the communities where they operate, and AI ethics is increasingly becoming a business need. However, socially beneficial ML systems cannot be achieved as a one-shot activity or with technical solutions alone; rather, they are the result of procedural regularity and inclusive participation. Further work should be performed to verify the suitability of the SAF process in practice through case studies. A multi-disciplinary ethics advisory board should evaluate the appropriateness and comprehensiveness of the SAF process (Morley et al., 2021), and guidelines should be developed to evaluate compliance with it. Finally, further research should explore potential adaptations of the SAF process to other Trustworthy AI requirements and broaden its scope to other AI fields such as robotics.