In summary, Interpretability refers to the user's ability to understand model outputs, while Model Transparency includes Simulatability (reproducibility of predictions), Decomposability (intuitive explanations for parameters), and Algorithmic Transparency (explaining how algorithms work). Model Functionality focuses on textual descriptions, visualization, and local explanations, which clarify specific outputs or instances rather than entire models. All these concepts aim to enhance the comprehensibility and usability of AI systems.
If algorithms fulfill these principles, they provide a basis for justifying decisions, tracking them and thereby verifying them, improving the algorithms, and exploring new facts.
Sometimes it is also possible to achieve a high-accuracy result with white-box ML algorithms. These algorithms have an interpretable structure that can be used to explain predictions. Concept Bottleneck Models, which use concept-level abstractions to explain model reasoning, are examples of this and can be applied in both image and text prediction tasks. This is especially important in domains like medicine, defense, finance, and law, where it is crucial to understand decisions and build trust in the algorithms. Many researchers argue that, at least for supervised machine learning, the way forward is symbolic regression, where the algorithm searches the space of mathematical expressions to find the model that best fits a given dataset.
AI systems optimize behavior to satisfy a mathematically specified goal system chosen by the system designers, such as the command "maximize the accuracy of assessing how positive film reviews are in the test dataset." The AI may learn useful general rules from the test set, such as "reviews containing the word "horrible" are likely to be negative." However, it may also learn inappropriate rules, such as "reviews containing 'Daniel Day-Lewis' are usually positive"; such rules may be undesirable if they are likely to fail to generalize outside the training set, or if people consider the rule to be "cheating" or "unfair." A human can audit rules in an XAI to get an idea of how likely the system is to generalize to future real-world data outside the test set.
AI systems sometimes learn undesirable tricks that do an optimal job of satisfying explicit pre-programmed goals on the training data but do not reflect the more nuanced implicit desires of the human system designers or the full complexity of the domain data. For example, a 2017 system tasked with image recognition learned to "cheat" by looking for a copyright tag that happened to be associated with horse pictures rather than learning how to tell if a horse was actually pictured. In another 2017 system, a supervised learning AI tasked with grasping items in a virtual world learned to cheat by placing its manipulator between the object and the viewer in a way such that it falsely appeared to be grasping the object.
There is a subtle difference between the terms explainability and interpretability in the context of AI.
Some explainability techniques don't involve understanding how the model works, and may work across various AI systems. Treating the model as a black box and analyzing how marginal changes to the inputs affect the result sometimes provides a sufficient explanation.
Explainability is useful for ensuring that AI models are not making decisions based on irrelevant or otherwise unfair criteria. For classification and regression models, several popular techniques exist:
Systems that are expert or knowledge based are software systems that are made by experts. This system consists of a knowledge based encoding for the domain knowledge. This system is usually modeled as production rules, and someone uses this knowledge base which the user can question the system for knowledge. In expert systems, the language and explanations are understood with an explanation for the reasoning or a problem solving activity.
Interpretability research often focuses on generative pretrained transformers. It is particularly relevant for AI safety and alignment, as it may enable to identify signs of undesired behaviors such as sycophancy, deceptiveness or bias, and to better steer AI models.
By the 1990s researchers began studying whether it is possible to meaningfully extract the non-hand-coded rules being generated by opaque trained neural networks. Researchers in clinical expert systems creating neural network-powered decision support for clinicians sought to develop dynamic explanations that allow these technologies to be more trusted and trustworthy in practice. In the 2010s public concerns about racial and other bias in the use of AI for criminal sentencing decisions and findings of creditworthiness may have led to increased demand for transparent artificial intelligence. As a result, many academics and organizations are developing tools to help detect bias in their systems.
Explainable AI has been recently a new topic researched amongst the context of modern deep learning. Modern complex AI techniques, such as deep learning, are naturally opaque. To address this issue, methods have been developed to make new models more explainable and interpretable. This includes layerwise relevance propagation (LRP), a technique for determining which features in a particular input vector contribute most strongly to a neural network's output. Other techniques explain some particular prediction made by a (nonlinear) black-box model, a goal referred to as "local interpretability". We still today cannot explain the output of today's DNNs without the new explanatory mechanisms, we also can't by the neural network, or external explanatory components There is also research on whether the concepts of local interpretability can be applied to a remote context, where a model is operated by a third-party.
There are various techniques to extract compressed representations of the features of given inputs, which can then be analysed by standard clustering techniques. Alternatively, networks can be trained to output linguistic explanations of their behaviour, which are then directly human-interpretable. Model behaviour can also be explained with reference to training data—for example, by evaluating which training inputs influenced a given behaviour the most, or by approximating its predictions using the most similar instances from the training data.
The use of explainable artificial intelligence (XAI) in pain research, specifically in understanding the role of electrodermal activity for automated pain recognition: hand-crafted features and deep learning models in pain recognition, highlighting the insights that simple hand-crafted features can yield comparative performances to deep learning models and that both traditional feature engineering and deep feature learning approaches rely on simple characteristics of the input time-series data.
As regulators, official bodies, and general users come to depend on AI-based dynamic systems, clearer accountability will be required for automated decision-making processes to ensure trust and transparency. The first global conference exclusively dedicated to this emerging discipline was the 2017 International Joint Conference on Artificial Intelligence: Workshop on Explainable Artificial Intelligence (XAI). It has evolved over the years, with various workshops organised and co-located to many other international conferences, and it has now a dedicated global event, "The world conference on eXplainable Artificial Intelligence", with its own proceedings.
Despite ongoing endeavors to enhance the explainability of AI models, they persist with several inherent limitations.
By making an AI system more explainable, we also reveal more of its inner workings. For example, the explainability method of feature importance identifies features or variables that are most important in determining the model's output, while the influential samples method identifies the training samples that are most influential in determining the output, given a particular input. Adversarial parties could take advantage of this knowledge.
For example, competitor firms could replicate aspects of the original AI system in their own product, thus reducing competitive advantage. An explainable AI system is also susceptible to being “gamed”—influenced in a way that undermines its intended purpose. One study gives the example of a predictive policing system; in this case, those who could potentially “game” the system are the criminals subject to the system's decisions. In this study, developers of the system discussed the issue of criminal gangs looking to illegally obtain passports, and they expressed concerns that, if given an idea of what factors might trigger an alert in the passport application process, those gangs would be able to “send guinea pigs” to test those triggers, eventually finding a loophole that would allow them to “reliably get passports from under the noses of the authorities”.
Many approaches that it uses provides explanation in general, it doesn't take account for the diverse backgrounds and knowledge level of the users. This leads to challenges with accurate comprehension for all users. Expert users can find the explanations lacking in depth, and are oversimplified, while a beginner user may struggle understanding the explanations as they are complex. This limitation downplays the ability of the XAI techniques to appeal to their users with different levels of knowledge, which can impact the trust from users and who uses it. The quality of explanations can be different amongst their users as they all have different expertise levels, including different situation and conditions.
A fundamental barrier to making AI systems explainable is the technical complexity of such systems. End users often lack the coding knowledge required to understand software of any kind. Current methods used to explain AI are mainly technical ones, geared toward machine learning engineers for debugging purposes, rather than toward the end users who are ultimately affected by the system, causing “a gap between explainability in practice and the goal of transparency”. Proposed solutions to address the issue of technical complexity include either promoting the coding education of the general public so technical explanations would be more accessible to end users, or providing explanations in layperson terms.
The solution must avoid oversimplification. It is important to strike a balance between accuracy – how faithfully the explanation reflects the process of the AI system – and explainability – how well end users understand the process. This is a difficult balance to strike, since the complexity of machine learning makes it difficult for even ML engineers to fully understand, let alone non-experts.
The goal of explainability to end users of AI systems is to increase trust in the systems, even “address concerns about lack of ‘fairness’ and discriminatory effects”. However, even with a good understanding of an AI system, end users may not necessarily trust the system. In one study, participants were presented with combinations of white-box and black-box explanations, and static and interactive explanations of AI systems. While these explanations served to increase both their self-reported and objective understanding, it had no impact on their level of trust, which remained skeptical.
This outcome was especially true for decisions that impacted the end user in a significant way, such as graduate school admissions. Participants judged algorithms to be too inflexible and unforgiving in comparison to human decision-makers; instead of rigidly adhering to a set of rules, humans are able to consider exceptional cases as well as appeals to their initial decision. For such decisions, explainability will not necessarily cause end users to accept the use of decision-making algorithms. We will need to either turn to another method to increase trust and acceptance of decision-making algorithms, or question the need to rely solely on AI for such impactful decisions in the first place.
However, some emphasize that the purpose of explainability of artificial intelligence is not to merely increase users' trust in the system's decisions, but to calibrate the users' level of trust to the correct level. According to this principle, too much or too little user trust in the AI system will harm the overall performance of the human-system unit. When the trust is excessive, the users are not critical of possible mistakes of the system and when the users do not have enough trust in the system, they will not exhaust the benefits inherent in it.
Some scholars have suggested that explainability in AI should be considered a goal secondary to AI effectiveness, and that encouraging the exclusive development of XAI may limit the functionality of AI more broadly. Critiques of XAI rely on developed concepts of mechanistic and empiric reasoning from evidence-based medicine to suggest that AI technologies can be clinically validated even when their function cannot be understood by their operators.
Some researchers advocate the use of inherently interpretable machine learning models, rather than using post-hoc explanations in which a second model is created to explain the first. This is partly because post-hoc models increase the complexity in a decision pathway and partly because it is often unclear how faithfully a post-hoc explanation can mimic the computations of an entirely separate model. However, another view is that what is important is that the explanation accomplishes the given task at hand, and whether it is pre or post-hoc doesn't matter. If a post-hoc explanation method helps a doctor diagnose cancer better, it is of secondary importance whether it is a correct/incorrect explanation.
Peters, Procaccia, Psomas and Zhou present an algorithm for explaining the outcomes of the Borda rule using O(m2) explanations, and prove that this is tight in the worst case.
Yang, Hausladen, Peters, Pournaras, Fricker and Helbing present an empirical study of explainability in participatory budgeting. They compared the greedy and the equal shares rules, and three types of explanations: mechanism explanation (a general explanation of how the aggregation rule works given the voting input), individual explanation (explaining how many voters had at least one approved project, at least 10000 CHF in approved projects), and group explanation (explaining how the budget is distributed among the districts and topics). They compared the perceived trustworthiness and fairness of greedy and equal shares, before and after the explanations. They found out that, for MES, mechanism explanation yields the highest increase in perceived fairness and trustworthiness; the second-highest was Group explanation. For Greedy, Mechanism explanation increases perceived trustworthiness but not fairness, whereas Individual explanation increases both perceived fairness and trustworthiness. Group explanation decreases the perceived fairness and trustworthiness.
Longo, Luca; et al. (2024). "Explainable Artificial Intelligence (XAI) 2.0: A manifesto of open challenges and interdisciplinary research directions". Information Fusion. 106. arXiv:2310.19775. doi:10.1016/j.inffus.2024.102301. https://www.sciencedirect.com/science/article/pii/S1566253524000794
Mihály, Héder (2023). "Explainable AI: A Brief History of the Concept" (PDF). ERCIM News (134): 9–10. https://ercim-news.ercim.eu/images/stories/EN134/EN134-web.pdf
Phillips, P. Jonathon; Hahn, Carina A.; Fontana, Peter C.; Yates, Amy N.; Greene, Kristen; Broniatowski, David A.; Przybocki, Mark A. (2021-09-29). "Four Principles of Explainable Artificial Intelligence". NIST. doi:10.6028/nist.ir.8312. https://doi.org/10.6028/NIST.IR.8312
Vilone, Giulia; Longo, Luca (2021). "Notions of explainability and evaluation approaches for explainable artificial intelligence". Information Fusion. December 2021 - Volume 76: 89–106. doi:10.1016/j.inffus.2021.05.009. https://www.sciencedirect.com/science/article/pii/S1566253521001093
Confalonieri, Roberto; Coba, Ludovik; Wagner, Benedikt; Besold, Tarek R. (January 2021). "A historical perspective of explainable Artificial Intelligence". WIREs Data Mining and Knowledge Discovery. 11 (1). doi:10.1002/widm.1391. hdl:11577/3471605. ISSN 1942-4787. https://wires.onlinelibrary.wiley.com/doi/10.1002/widm.1391
Castelvecchi, Davide (2016-10-06). "Can we open the black box of AI?". Nature. 538 (7623): 20–23. Bibcode:2016Natur.538...20C. doi:10.1038/538020a. ISSN 0028-0836. PMID 27708329. S2CID 4465871. http://www.nature.com/articles/538020a
Sample, Ian (5 November 2017). "Computer says no: why making AIs fair, accountable and transparent is crucial". The Guardian. Retrieved 30 January 2018. https://www.theguardian.com/science/2017/nov/05/computer-says-no-why-making-ais-fair-accountable-and-transparent-is-crucial
Alizadeh, Fatemeh (2021). "I Don't Know, Is AI Also Used in Airbags?: An Empirical Study of Folk Concepts and People's Expectations of Current and Future Artificial Intelligence". Icom. 20 (1): 3–17. doi:10.1515/icom-2021-0009. S2CID 233328352. https://www.researchgate.net/publication/352638184
Edwards, Lilian; Veale, Michael (2017). "Slave to the Algorithm? Why a 'Right to an Explanation' Is Probably Not the Remedy You Are Looking For". Duke Law and Technology Review. 16: 18. SSRN 2972855. /wiki/SSRN_(identifier)
Do Couto, Mark (February 22, 2024). "Entering the Age of Explainable AI". TDWI. Retrieved 2024-09-11. https://tdwi.org/Articles/2024/02/22/ADV-ALL-Entering-the-Age-of-Explainable-AI.aspx
Gunning, D.; Stefik, M.; Choi, J.; Miller, T.; Stumpf, S.; Yang, G.-Z. (2019-12-18). "XAI-Explainable artificial intelligence". Science Robotics. 4 (37): eaay7120. doi:10.1126/scirobotics.aay7120. ISSN 2470-9476. PMID 33137719. https://openaccess.city.ac.uk/id/eprint/23405/
Rieg, Thilo; Frick, Janek; Baumgartl, Hermann; Buettner, Ricardo (2020-12-17). "Demonstration of the potential of white-box machine learning approaches to gain insights from cardiovascular disease electrocardiograms". PLOS ONE. 15 (12): e0243615. Bibcode:2020PLoSO..1543615R. doi:10.1371/journal.pone.0243615. ISSN 1932-6203. PMC 7746264. PMID 33332440. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7746264
Vilone, Giulia; Longo, Luca (2021). "Classification of Explainable Artificial Intelligence Methods through Their Output Formats". Machine Learning and Knowledge Extraction. 3 (3): 615–661. doi:10.3390/make3030032. https://doi.org/10.3390%2Fmake3030032
Loyola-González, O. (2019). "Black-Box vs. White-Box: Understanding Their Advantages and Weaknesses From a Practical Point of View". IEEE Access. 7: 154096–154113. Bibcode:2019IEEEA...7o4096L. doi:10.1109/ACCESS.2019.2949286. ISSN 2169-3536. https://doi.org/10.1109%2FACCESS.2019.2949286
Roscher, R.; Bohn, B.; Duarte, M. F.; Garcke, J. (2020). "Explainable Machine Learning for Scientific Insights and Discoveries". IEEE Access. 8: 42200–42216. arXiv:1905.08883. Bibcode:2020IEEEA...842200R. doi:10.1109/ACCESS.2020.2976199. ISSN 2169-3536. https://doi.org/10.1109%2FACCESS.2020.2976199
Murdoch, W. James; Singh, Chandan; Kumbier, Karl; Abbasi-Asl, Reza; Yu, Bin (2019-01-14). "Interpretable machine learning: definitions, methods, and applications". Proceedings of the National Academy of Sciences of the United States of America. 116 (44): 22071–22080. arXiv:1901.04592. doi:10.1073/pnas.1900654116. PMC 6825274. PMID 31619572. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6825274
Lipton, Zachary C. (June 2018). "The Mythos of Model Interpretability: In machine learning, the concept of interpretability is both important and slippery". Queue. 16 (3): 31–57. doi:10.1145/3236386.3241340. ISSN 1542-7730. https://doi.org/10.1145%2F3236386.3241340
"Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI". DeepAI. 2019-10-22. Retrieved 2021-01-13. https://deepai.org/publication/explainable-artificial-intelligence-xai-concepts-taxonomies-opportunities-and-challenges-toward-responsible-ai
Roscher, R.; Bohn, B.; Duarte, M. F.; Garcke, J. (2020). "Explainable Machine Learning for Scientific Insights and Discoveries". IEEE Access. 8: 42200–42216. arXiv:1905.08883. Bibcode:2020IEEEA...842200R. doi:10.1109/ACCESS.2020.2976199. ISSN 2169-3536. https://doi.org/10.1109%2FACCESS.2020.2976199
Montavon, Grégoire; Samek, Wojciech; Müller, Klaus-Robert (2018-02-01). "Methods for interpreting and understanding deep neural networks". Digital Signal Processing. 73: 1–15. arXiv:1706.07979. Bibcode:2018DSP....73....1M. doi:10.1016/j.dsp.2017.10.011. ISSN 1051-2004. /wiki/Klaus-Robert_M%C3%BCller
Notovich, Aviv., Chalutz-Ben Gal, Hila, & Ben-Gal, Irad (2023). "Explainable Artificial Intelligence (XAI): Motivation, Terminology, and Taxonomy" (PDF). In Machine Learning for Data Science Handbook: Data Mining and Knowledge Discovery Handbook (pp. 971-985). Cham: Springer International Publishing.{{cite web}}: CS1 maint: multiple names: authors list (link) CS1 maint: numeric names: authors list (link) https://www.iradbengal.sites.tau.ac.il/_files/ugd/901879_fcdc9f3f44dc41968ba476a66f60f078.pdf
Adadi, A.; Berrada, M. (2018). "Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)". IEEE Access. 6: 52138–52160. Bibcode:2018IEEEA...652138A. doi:10.1109/ACCESS.2018.2870052. ISSN 2169-3536. https://doi.org/10.1109%2FACCESS.2018.2870052
Rudin, Cynthia (2019). "Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead". Nature Machine Intelligence. 1 (5): 206–215. arXiv:1811.10154. doi:10.1038/s42256-019-0048-x. ISSN 2522-5839. PMC 9122117. PMID 35603010. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9122117
Koh, P. W.; Nguyen, T.; Tang, Y. S.; Mussmann, S.; Pierson, E.; Kim, B.; Liang, P. (November 2020). "Concept bottleneck models". International Conference on Machine Learning. PMLR. pp. 5338–5348.
Ludan, J. M.; Lyu, Q.; Yang, Y.; Dugan, L.; Yatskar, M.; Callison-Burch, C. (2023). "Interpretable-by-Design Text Classification with Iteratively Generated Concept Bottleneck". arXiv:2310.19660 [cs.CL]. /wiki/ArXiv_(identifier)
Gunning, D.; Stefik, M.; Choi, J.; Miller, T.; Stumpf, S.; Yang, G.-Z. (2019-12-18). "XAI-Explainable artificial intelligence". Science Robotics. 4 (37): eaay7120. doi:10.1126/scirobotics.aay7120. ISSN 2470-9476. PMID 33137719. https://openaccess.city.ac.uk/id/eprint/23405/
Wenninger, Simon; Kaymakci, Can; Wiethe, Christian (2022). "Explainable long-term building energy consumption prediction using QLattice". Applied Energy. 308. Elsevier BV: 118300. Bibcode:2022ApEn..30818300W. doi:10.1016/j.apenergy.2021.118300. ISSN 0306-2619. S2CID 245428233. /wiki/Bibcode_(identifier)
Christiansen, Michael; Wilstrup, Casper; Hedley, Paula L. (2022). "Explainable "white-box" machine learning is the way forward in preeclampsia screening". American Journal of Obstetrics and Gynecology. 227 (5). Elsevier BV: 791. doi:10.1016/j.ajog.2022.06.057. ISSN 0002-9378. PMID 35779588. S2CID 250160871. /wiki/Doi_(identifier)
Wilstup, Casper; Cave, Chris (2021-01-15), Combining symbolic regression with the Cox proportional hazards model improves prediction of heart failure deaths, Cold Spring Harbor Laboratory, doi:10.1101/2021.01.15.21249874, S2CID 231609904 /wiki/Doi_(identifier)
"How AI detectives are cracking open the black box of deep learning". Science. 5 July 2017. Retrieved 30 January 2018.. https://www.science.org/content/article/how-ai-detectives-are-cracking-open-black-box-deep-learning
Dosilovic, Filip; Brcic, Mario; Hlupic, Nikica (2018-05-25). "Explainable Artificial Intelligence: A Survey" (PDF). MIPRO 2018 - 41st International Convention Proceedings. MIPRO 2018. Opatija, Croatia. pp. 210–215. doi:10.23919/MIPRO.2018.8400040. ISBN 978-953-233-095-3. Archived from the original (PDF) on 2018-12-10. Retrieved 2018-12-09. 978-953-233-095-3
Bernal, Jose; Mazo, Claudia (2022-10-11). "Transparency of Artificial Intelligence in Healthcare: Insights from Professionals in Computing and Healthcare Worldwide". Applied Sciences. 12 (20): 10228. doi:10.3390/app122010228. ISSN 2076-3417. https://doi.org/10.3390%2Fapp122010228
Antoniadi, Anna Markella; Du, Yuhan; Guendouz, Yasmine; Wei, Lan; Mazo, Claudia; Becker, Brett A.; Mooney, Catherine (January 2021). "Current Challenges and Future Opportunities for XAI in Machine Learning-Based Clinical Decision Support Systems: A Systematic Review". Applied Sciences. 11 (11): 5088. doi:10.3390/app11115088. ISSN 2076-3417. https://doi.org/10.3390%2Fapp11115088
Sample, Ian (5 November 2017). "Computer says no: why making AIs fair, accountable and transparent is crucial". The Guardian. Retrieved 30 January 2018. https://www.theguardian.com/science/2017/nov/05/computer-says-no-why-making-ais-fair-accountable-and-transparent-is-crucial
"DeepMind Has Simple Tests That Might Prevent Elon Musk's AI Apocalypse". Bloomberg.com. 11 December 2017. Retrieved 30 January 2018. https://www.bloomberg.com/news/articles/2017-12-11/deepmind-has-simple-tests-that-might-prevent-elon-musk-s-ai-apocalypse
"Learning from Human Preferences". OpenAI Blog. 13 June 2017. Retrieved 30 January 2018. https://blog.openai.com/deep-reinforcement-learning-from-human-preferences/
"Explainable Artificial Intelligence (XAI)". DARPA. Retrieved 17 July 2017. https://www.darpa.mil/program/explainable-artificial-intelligence
Biecek, Przemyslaw (23 June 2018). "DALEX: explainers for complex predictive models". Journal of Machine Learning Research. 19: 1–5. arXiv:1806.08915. /wiki/ArXiv_(identifier)
Rai, Arun. "Explainable AI: From black box to glass box." Journal of the Academy of Marketing Science 48 (2020): 137-141.
Sokol, Kacper; Flach, Peter (2018). "Glass-Box: Explaining AI Decisions With Counterfactual Statements Through Conversation With a Voice-enabled Virtual Assistant". Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence. pp. 5868–5870. doi:10.24963/ijcai.2018/865. ISBN 9780999241127. S2CID 51608978. 9780999241127
Broniatowski, David A. (2021). "Psychological Foundations of Explainability and Interpretability in Artificial Intelligence". NIST Pubs. https://www.nist.gov/publications/psychological-foundations-explainability-and-interpretability-artificial-intelligence
ISO/IEC TR 29119-11:2020, Software and systems engineering, Software testing, Part 11: Guidelines on the testing of AI-based systems. ISO. 2020. Retrieved 25 November 2023. https://www.iso.org/standard/79016.html
ISO/IEC TR 29119-11:2020, Software and systems engineering, Software testing, Part 11: Guidelines on the testing of AI-based systems. ISO. 2020. Retrieved 25 November 2023. https://www.iso.org/standard/79016.html
Verma, Yugesh (2021-12-25). "Complete Guide to SHAP - SHAPley Additive exPlanations for Practitioners". Analytics India Magazine. Retrieved 2024-07-10. https://analyticsindiamag.com/a-complete-guide-to-shap-shapley-additive-explanations-for-practitioners/
Rothman, Denis (2020-10-07). "Exploring LIME Explanations and the Mathematics Behind It". Codemotion Magazine. Retrieved 2024-07-10. https://www.codemotion.com/magazine/ai-ml/lime-explainable-ai/
Christian, Brian (2020). "TELL ME EVERYTHING: MULTITASK NETS". The Alignment Problem: Machine learning and human values. W. W. Norton & Company. ISBN 978-0-393-86833-3. 978-0-393-86833-3
Sharma, Abhishek (2018-07-11). "What Are Saliency Maps In Deep Learning?". Analytics India Magazine. Retrieved 2024-07-10. https://analyticsindiamag.com/what-are-saliency-maps-in-deep-learning/
Confalonieri, Roberto; Coba, Ludovik; Wagner, Benedikt; Besold, Tarek R. (January 2021). "A historical perspective of explainable Artificial Intelligence". WIREs Data Mining and Knowledge Discovery. 11 (1). doi:10.1002/widm.1391. hdl:11577/3471605. ISSN 1942-4787. https://wires.onlinelibrary.wiley.com/doi/10.1002/widm.1391
Luo, Haoyan; Specia, Lucia (2024-02-21). "From Understanding to Utilization: A Survey on Explainability for Large Language Models". arXiv:2401.12874 [cs.CL]. /wiki/ArXiv_(identifier)
Olah, Chris (June 27, 2022). "Mechanistic Interpretability, Variables, and the Importance of Interpretable Bases". www.transformer-circuits.pub. Retrieved 2024-07-10. https://www.transformer-circuits.pub/2022/mech-interp-essay
Mittal, Aayush (2024-06-17). "Understanding Sparse Autoencoders, GPT-4 & Claude 3 : An In-Depth Technical Exploration". Unite.AI. Retrieved 2024-07-10. https://www.unite.ai/understanding-sparse-autoencoders-gpt-4-claude-3-an-in-depth-technical-exploration/
Ropek, Lucas (2024-05-21). "New Anthropic Research Sheds Light on AI's 'Black Box'". Gizmodo. Retrieved 2024-05-23. https://gizmodo.com/new-anthropic-research-sheds-light-on-ais-black-box-1851491333
Perrigo, Billy (2024-05-21). "Artificial Intelligence Is a 'Black Box.' Maybe Not For Long". Time. Retrieved 2024-05-24. https://time.com/6980210/anthropic-interpretability-ai-safety-research/
Barber, Gregory. "Inside the 'Black Box' of a Neural Network". Wired. ISSN 1059-1028. Retrieved 2024-07-10. https://www.wired.com/story/inside-black-box-of-neural-network/
Fagan, L. M.; Shortliffe, E. H.; Buchanan, B. G. (1980). "Computer-based medical decision making: from MYCIN to VM". Automedica. 3 (2): 97–108.
Clancey, William (1987). Knowledge-Based Tutoring: The GUIDON Program. Cambridge, Massachusetts: The MIT Press.
Brown, John S.; Burton, R. R.; De Kleer, Johan (1982). "Pedagogical, natural language, and knowledge engineering techniques in SOPHIE I, II, and III". Intelligent Tutoring Systems. Academic Press. ISBN 0-12-648680-8. 0-12-648680-8
Bareiss, Ray; Porter, Bruce; Weir, Craig; Holte, Robert (1990). "Protos: An Exemplar-Based Learning Apprentice". Machine Learning. Vol. 3. Morgan Kaufmann Publishers Inc. pp. 112–139. ISBN 1-55860-119-8. 1-55860-119-8
Bareiss, Ray (1989). Exemplar-Based Knowledge Acquisition: A Unified Approach to Concept Representation, Classification, and Learning. Perspectives in Artificial Intelligence. Academic Press. /wiki/Academic_Press
Van Lent, M.; Fisher, W.; Mancuso, M. (July 2004). "An explainable artificial intelligence system for small-unit tactical behavior". Proceedings of the National Conference on Artificial Intelligence. San Jose, CA: AAAI Press. pp. 900–907. ISBN 0262511835. 0262511835
Bareiss, Ray (1989). Exemplar-Based Knowledge Acquisition: A Unified Approach to Concept Representation, Classification, and Learning. Perspectives in Artificial Intelligence. Academic Press. /wiki/Academic_Press
Russell, Stuart; Norvig, Peter (2003). Artificial Intelligence: A Modern Approach. Prentice Hall Series in Artificial Intelligence (Second ed.). Upper Saddle River, New Jersey: Prentice Hall, Pearson Education. ISBN 0-13-790395-2. 0-13-790395-2
Forbus, Kenneth; De Kleer, Johan (1993). Building Problem Solvers. Cambridge, Massachusetts: The MIT Press. ISBN 0-262-06157-0. 0-262-06157-0
Tickle, A. B.; Andrews, R.; Golea, M.; Diederich, J. (November 1998). "The truth will come to light: directions and challenges in extracting the knowledge embedded within trained artificial neural networks". IEEE Transactions on Neural Networks. 9 (6): 1057–1068. doi:10.1109/72.728352. ISSN 1045-9227. PMID 18255792. S2CID 11111578. /wiki/Doi_(identifier)
Edwards, Lilian; Veale, Michael (2017). "Slave to the Algorithm? Why a 'Right to an Explanation' Is Probably Not the Remedy You Are Looking For". Duke Law and Technology Review. 16: 18. SSRN 2972855. /wiki/SSRN_(identifier)
Sample, Ian (5 November 2017). "Computer says no: why making AIs fair, accountable and transparent is crucial". The Guardian. Retrieved 30 January 2018. https://www.theguardian.com/science/2017/nov/05/computer-says-no-why-making-ais-fair-accountable-and-transparent-is-crucial
"Accenture Unveils Tool to Help Companies Insure Their AI Is Fair". Bloomberg.com. June 2018. Retrieved 5 August 2018. https://www.bloomberg.com/news/articles/2018-06-13/accenture-unveils-tool-to-help-companies-insure-their-ai-is-fair
Minsky, et al., "The Society of Intelligent Veillance" IEEE ISTAS2013, pages 13-17.
Mukherjee, Siddhartha (27 March 2017). "A.I. Versus M.D." The New Yorker. Retrieved 30 January 2018. https://www.newyorker.com/magazine/2017/04/03/ai-versus-md
Csiszár, Orsolya; Csiszár, Gábor; Dombi, József (2020-07-08). "Interpretable neural networks based on continuous-valued logic and multicriteria decision operators". Knowledge-Based Systems. 199: 105972. arXiv:1910.02486. doi:10.1016/j.knosys.2020.105972. ISSN 0950-7051. https://doi.org/10.1016%2Fj.knosys.2020.105972
Lipton, Zachary C. (June 2018). "The Mythos of Model Interpretability: In machine learning, the concept of interpretability is both important and slippery". Queue. 16 (3): 31–57. doi:10.1145/3236386.3241340. ISSN 1542-7730. https://doi.org/10.1145%2F3236386.3241340
Murdoch, W. James; Singh, Chandan; Kumbier, Karl; Abbasi-Asl, Reza; Yu, Bin (2019-01-14). "Interpretable machine learning: definitions, methods, and applications". Proceedings of the National Academy of Sciences of the United States of America. 116 (44): 22071–22080. arXiv:1901.04592. doi:10.1073/pnas.1900654116. PMC 6825274. PMID 31619572. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6825274
Doshi-Velez, Finale; Kim, Been (2017-02-27). "Towards A Rigorous Science of Interpretable Machine Learning". arXiv:1702.08608 [stat.ML]. /wiki/ArXiv_(identifier)
Abdollahi, Behnoush, and Olfa Nasraoui. (2016). "Explainable Restricted Boltzmann Machines for Collaborative Filtering". arXiv:1606.07129 [stat.ML].{{cite arXiv}}: CS1 maint: multiple names: authors list (link) /wiki/ArXiv_(identifier)
Dombi, József; Csiszár, Orsolya (2021). Explainable Neural Networks Based on Fuzzy Logic and Multi-criteria Decision Tools. Studies in Fuzziness and Soft Computing. Vol. 408. doi:10.1007/978-3-030-72280-7. ISBN 978-3-030-72279-1. ISSN 1434-9922. S2CID 233486978. 978-3-030-72279-1
Bach, Sebastian; Binder, Alexander; Montavon, Grégoire; Klauschen, Frederick; Müller, Klaus-Robert; Samek, Wojciech (2015-07-10). Suarez, Oscar Deniz (ed.). "On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation". PLOS ONE. 10 (7): e0130140. Bibcode:2015PLoSO..1030140B. doi:10.1371/journal.pone.0130140. ISSN 1932-6203. PMC 4498753. PMID 26161953. /wiki/Klaus-Robert_M%C3%BCller
Sample, Ian (5 November 2017). "Computer says no: why making AIs fair, accountable and transparent is crucial". The Guardian. Retrieved 5 August 2018. https://www.theguardian.com/science/2017/nov/05/computer-says-no-why-making-ais-fair-accountable-and-transparent-is-crucial
Martens, David; Provost, Foster (2014). "Explaining data-driven document classifications" (PDF). MIS Quarterly. 38: 73–99. doi:10.25300/MISQ/2014/38.1.04. S2CID 14238842. http://pages.stern.nyu.edu/~fprovost/Papers/MartensProvost_Explaining.pdf
""Why Should I Trust You?" | Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining". doi:10.1145/2939672.2939778. S2CID 13029170. {{cite journal}}: Cite journal requires |journal= (help) /wiki/Doi_(identifier)
Lundberg, Scott M; Lee, Su-In (2017), Guyon, I.; Luxburg, U. V.; Bengio, S.; Wallach, H. (eds.), "A Unified Approach to Interpreting Model Predictions" (PDF), Advances in Neural Information Processing Systems 30, Curran Associates, Inc., pp. 4765–4774, arXiv:1705.07874, retrieved 2020-03-13 /wiki/Su-In_Lee
Carter, Brandon; Mueller, Jonas; Jain, Siddhartha; Gifford, David (2019-04-11). "What made you do this? Understanding black-box decisions with sufficient input subsets". The 22nd International Conference on Artificial Intelligence and Statistics: 567–576. http://proceedings.mlr.press/v89/carter19a.html
Shrikumar, Avanti; Greenside, Peyton; Kundaje, Anshul (2017-07-17). "Learning Important Features Through Propagating Activation Differences". International Conference on Machine Learning: 3145–3153. http://proceedings.mlr.press/v70/shrikumar17a.html
"Axiomatic attribution for deep networks | Proceedings of the 34th International Conference on Machine Learning - Volume 70". dl.acm.org. Icml'17: 3319–3328. 6 August 2017. Retrieved 2020-03-13. https://dl.acm.org/doi/abs/10.5555/3305890.3306024
Xu, Feiyu; Uszkoreit, Hans; Du, Yangzhou; Fan, Wei; Zhao, Dongyan; Zhu, Jun (2019), Tang, Jie; Kan, Min-Yen; Zhao, Dongyan; Li, Sujian (eds.), "Explainable AI: A Brief Survey on History, Research Areas, Approaches and Challenges", Natural Language Processing and Chinese Computing, vol. 11839, Cham: Springer International Publishing, pp. 563–574, doi:10.1007/978-3-030-32236-6_51, ISBN 978-3-030-32235-9, retrieved 2024-12-03 978-3-030-32235-9
Le Merrer, Erwan; Trédan, Gilles (September 2020). "Remote explainability faces the bouncer problem". Nature Machine Intelligence. 2 (9): 529–539. arXiv:1910.01432. doi:10.1038/s42256-020-0216-z. ISSN 2522-5839. S2CID 225207140. https://www.nature.com/articles/s42256-020-0216-z
Aivodji, Ulrich; Arai, Hiromi; Fortineau, Olivier; Gambs, Sébastien; Hara, Satoshi; Tapp, Alain (2019-05-24). "Fairwashing: the risk of rationalization". International Conference on Machine Learning. PMLR: 161–170. arXiv:1901.09749. http://proceedings.mlr.press/v97/aivodji19a.html
Rudin, Cynthia (2019). "Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead". Nature Machine Intelligence. 1 (5): 206–215. arXiv:1811.10154. doi:10.1038/s42256-019-0048-x. ISSN 2522-5839. PMC 9122117. PMID 35603010. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9122117
Singh, Chandan; Nasseri, Keyan; Tan, Yan Shuo; Tang, Tiffany; Yu, Bin (4 May 2021). "imodels: a python package for fitting interpretable models". Journal of Open Source Software. 6 (61): 3192. Bibcode:2021JOSS....6.3192S. doi:10.21105/joss.03192. ISSN 2475-9066. S2CID 235529515. https://joss.theoj.org/papers/10.21105/joss.03192
Vidal, Thibaut; Schiffer, Maximilian (2020). "Born-Again Tree Ensembles". International Conference on Machine Learning. 119. PMLR: 9743–9753. arXiv:2003.11132. http://proceedings.mlr.press/v119/vidal20a.html
Ustun, Berk; Rudin, Cynthia (1 March 2016). "Supersparse linear integer models for optimized medical scoring systems". Machine Learning. 102 (3): 349–391. arXiv:1502.04269. doi:10.1007/s10994-015-5528-6. ISSN 1573-0565. S2CID 207211836. https://link.springer.com/article/10.1007/s10994-015-5528-6
Bostrom, N., & Yudkowsky, E. (2014). The ethics of artificial intelligence. The Cambridge Handbook of Artificial Intelligence, 316-334. https://intelligence.org/files/EthicsofAI.pdf
"FAT* Conference". https://fatconference.org/
"Computer programs recognise white men better than black women". The Economist. 2018. Retrieved 5 August 2018. https://www.economist.com/science-and-technology/2018/02/15/computer-programs-recognise-white-men-better-than-black-women
Olah, Chris; Cammarata, Nick; Schubert, Ludwig; Goh, Gabriel; Petrov, Michael; Carter, Shan (10 March 2020). "Zoom In: An Introduction to Circuits". Distill. 5 (3): e00024.001. doi:10.23915/distill.00024.001. ISSN 2476-0757. https://distill.pub/2020/circuits/zoom-in/
Li, Yixuan; Yosinski, Jason; Clune, Jeff; Lipson, Hod; Hopcroft, John (8 December 2015). "Convergent Learning: Do different neural networks learn the same representations?". Feature Extraction: Modern Questions and Challenges. PMLR: 196–212. http://proceedings.mlr.press/v44/li15convergent.html
Hendricks, Lisa Anne; Akata, Zeynep; Rohrbach, Marcus; Donahue, Jeff; Schiele, Bernt; Darrell, Trevor (2016). "Generating Visual Explanations". Computer Vision – ECCV 2016. Lecture Notes in Computer Science. Vol. 9908. Springer International Publishing. pp. 3–19. arXiv:1603.08507. doi:10.1007/978-3-319-46493-0_1. ISBN 978-3-319-46492-3. S2CID 12030503. 978-3-319-46492-3
Koh, Pang Wei; Liang, Percy (17 July 2017). "Understanding Black-box Predictions via Influence Functions". International Conference on Machine Learning. PMLR: 1885–1894. arXiv:1703.04730. http://proceedings.mlr.press/v70/koh17a.html
Schmaltz, Allen (23 December 2021). "Detecting Local Insights from Global Labels: Supervised & Zero-Shot Sequence Labeling via a Convolutional Decomposition". Computational Linguistics. 47 (4): 729–773. doi:10.1162/coli_a_00416. https://doi.org/10.1162/coli_a_00416
Gouverneur, Philip; Li, Frédéric; Shirahama, Kimiaki; Luebke, Luisa; Adamczyk, Wacław M.; Szikszay, Tibor M.; Luedtke, Kerstin; Grzegorzek, Marcin (2023-02-09). "Explainable Artificial Intelligence (XAI) in Pain Research: Understanding the Role of Electrodermal Activity for Automated Pain Recognition". Sensors. 23 (4): 1959. Bibcode:2023Senso..23.1959G. doi:10.3390/s23041959. ISSN 1424-8220. PMC 9960387. PMID 36850556. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9960387
"IJCAI 2017 Workshop on Explainable Artificial Intelligence (XAI)" (PDF). Earthlink. IJCAI. Archived from the original (PDF) on 4 April 2019. Retrieved 17 July 2017. https://web.archive.org/web/20190404131609/http://www.intelligentrobots.org/files/IJCAI2017/IJCAI-17_XAI_WS_Proceedings.pdf
Explainable Artificial Intelligence, First World Conference, xAI 2023, Lisbon, Portugal, July 26–28, 2023, Proceedings, Parts I/II/III. Communications in Computer and Information Science. Vol. 1903. springer. 2023. doi:10.1007/978-3-031-44070-0. ISBN 978-3-031-44070-0. 978-3-031-44070-0
Explainable Artificial Intelligence, Second World Conference, xAI 2024, Valletta, Malta, July 17–19, 2024, Proceedings, Part I/II/III/IV. Communications in Computer and Information Science. Vol. 2153. springer. 2024. doi:10.1007/978-3-031-63787-2. ISBN 978-3-031-63787-2. 978-3-031-63787-2
Kahn, Jeremy (12 December 2018). "Artificial Intelligence Has Some Explaining to Do". Bloomberg Businessweek. Retrieved 17 December 2018. https://www.bloomberg.com/news/articles/2018-12-12/artificial-intelligence-has-some-explaining-to-do
Bhatt, Umang; Xiang, Alice; Sharma, Shubham; Weller, Adrian; Taly, Ankur; Jia, Yunhan; Ghosh, Joydeep; Puri, Richir; M.F. Moura, José; Eckersley, Peter (2022). "Explainable Machine Learning in Deployment". Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency. pp. 648–657. doi:10.1145/3351095.3375624. ISBN 9781450369367. S2CID 202572724. 9781450369367
Burrel, Jenna (2016). "How the machine 'thinks': Understanding opacity in machine learning algorithms". Big Data & Society. 3 (1). doi:10.1177/2053951715622512. S2CID 61330970. https://journals.sagepub.com/doi/pdf/10.1177/2053951715622512
Veale, Michael; Van Kleek, Max; Binns, Reuben (2018). "Fairness and Accountability Design Needs for Algorithmic Support in High-Stakes Public Sector Decision-Making". Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems. Vol. 40. pp. 1–14. doi:10.1145/3173574.3174014. ISBN 9781450356206. S2CID 3639135. 9781450356206
Yang, Wenli; Wei, Yuchen; Wei, Hanyu; Chen, Yanyu; Huang, Guan; Li, Xiang; Li, Renjie; Yao, Naimeng; Wang, Xinyi; Gu, Xiaotong; Amin, Muhammad Bilal; Kang, Byeong (2023-08-10). "Survey on Explainable AI: From Approaches, Limitations and Applications Aspects". Human-Centric Intelligent Systems. 3 (3): 161–188. doi:10.1007/s44230-023-00038-y. ISSN 2667-1336. https://link.springer.com/10.1007/s44230-023-00038-y
Bhatt, Umang; Xiang, Alice; Sharma, Shubham; Weller, Adrian; Taly, Ankur; Jia, Yunhan; Ghosh, Joydeep; Puri, Richir; M.F. Moura, José; Eckersley, Peter (2022). "Explainable Machine Learning in Deployment". Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency. pp. 648–657. doi:10.1145/3351095.3375624. ISBN 9781450369367. S2CID 202572724. 9781450369367
Burrel, Jenna (2016). "How the machine 'thinks': Understanding opacity in machine learning algorithms". Big Data & Society. 3 (1). doi:10.1177/2053951715622512. S2CID 61330970. https://journals.sagepub.com/doi/pdf/10.1177/2053951715622512
Bhatt, Umang; Xiang, Alice; Sharma, Shubham; Weller, Adrian; Taly, Ankur; Jia, Yunhan; Ghosh, Joydeep; Puri, Richir; M.F. Moura, José; Eckersley, Peter (2022). "Explainable Machine Learning in Deployment". Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency. pp. 648–657. doi:10.1145/3351095.3375624. ISBN 9781450369367. S2CID 202572724. 9781450369367
Burrel, Jenna (2016). "How the machine 'thinks': Understanding opacity in machine learning algorithms". Big Data & Society. 3 (1). doi:10.1177/2053951715622512. S2CID 61330970. https://journals.sagepub.com/doi/pdf/10.1177/2053951715622512
Hu, Tongxi; Zhang, Xuesong; Bohrer, Gil; Liu, Yanlan; Zhou, Yuyu; Martin, Jay; LI, Yang; Zhao, Kaiguang (2023). "Crop yield prediction via explainable AI and interpretable machine learning: Dangers of black box models for evaluating climate change impacts on crop yield". Agricultural and Forest Meteorology. 336: 109458. doi:10.1016/j.agrformet.2023.109458. S2CID 258552400. https://www.sciencedirect.com/science/article/pii/S0168192323001508
Cheng, Hao-Fei; Wang, Ruotang; Zhang, Zheng; O’Connell, Fiona; Gray, Terrance; Harper, F. Maxwell; Zhu, Haiyi (2019). Explaining Decision-Making Algorithms through UI: Strategies to Help Non-Expert Stakeholders. Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. Vol. 559. pp. 1–12. doi:10.1145/3290605.3300789. ISBN 9781450359702. S2CID 140281803. 9781450359702
Cheng, Hao-Fei; Wang, Ruotang; Zhang, Zheng; O’Connell, Fiona; Gray, Terrance; Harper, F. Maxwell; Zhu, Haiyi (2019). Explaining Decision-Making Algorithms through UI: Strategies to Help Non-Expert Stakeholders. Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. Vol. 559. pp. 1–12. doi:10.1145/3290605.3300789. ISBN 9781450359702. S2CID 140281803. 9781450359702
Liel, Yotam; Zalmanson, Lior (August 2023). "Turning Off Your Better Judgment – Conformity to Algorithmic Recommendations". Academy of Management Proceedings. 2023 (1). doi:10.5465/AMPROC.2023.277bp. ISSN 0065-0668. S2CID 260212999. http://journals.aom.org/doi/full/10.5465/AMPROC.2023.277bp
McCoy, Liam G.; Brenna, Connor T. A.; Chen, Stacy S.; Vold, Karina; Das, Sunit (2021-11-05). "Believing in black boxes: machine learning for healthcare does not need explainability to be evidence-based". Journal of Clinical Epidemiology. 142 (Online ahead of print): 252–257. doi:10.1016/j.jclinepi.2021.11.001. ISSN 0895-4356. PMID 34748907. S2CID 243810442. https://www.jclinepi.com/article/S0895-4356(21)00354-1/abstract
Ghassemi, Marzyeh; Oakden-Rayner, Luke; Beam, Andrew L. (2021-11-01). "The false hope of current approaches to explainable artificial intelligence in health care". The Lancet Digital Health. 3 (11): e745 – e750. doi:10.1016/S2589-7500(21)00208-9. ISSN 2589-7500. PMID 34711379. S2CID 239963176. https://www.thelancet.com/journals/landig/article/PIIS2589-7500(21)00208-9/abstract
McCoy, Liam G.; Brenna, Connor T. A.; Chen, Stacy S.; Vold, Karina; Das, Sunit (2021-11-05). "Believing in black boxes: machine learning for healthcare does not need explainability to be evidence-based". Journal of Clinical Epidemiology. 142 (Online ahead of print): 252–257. doi:10.1016/j.jclinepi.2021.11.001. ISSN 0895-4356. PMID 34748907. S2CID 243810442. https://www.jclinepi.com/article/S0895-4356(21)00354-1/abstract
Rudin, Cynthia (2019). "Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead". Nature Machine Intelligence. 1 (5): 206–215. arXiv:1811.10154. doi:10.1038/s42256-019-0048-x. ISSN 2522-5839. PMC 9122117. PMID 35603010. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9122117
Sarkar, Advait (2022). "Is explainable AI a race against model complexity?" (PDF). Workshop on Transparency and Explanations in Smart Systems (TeXSS), in Conjunction with ACM Intelligent User Interfaces (IUI 2022): 192–199. arXiv:2205.10119 – via CEUR Workshop Proceedings. http://ceur-ws.org/Vol-3124/paper22.pdf
Procaccia, Ariel D. (2019), Laslier, Jean-François; Moulin, Hervé; Sanver, M. Remzi; Zwicker, William S. (eds.), "Axioms Should Explain Solutions", The Future of Economic Design: The Continuing Development of a Field as Envisioned by Its Researchers, Studies in Economic Design, Cham: Springer International Publishing, pp. 195–199, doi:10.1007/978-3-030-18050-8_27, ISBN 978-3-030-18050-8, retrieved 2024-03-21 978-3-030-18050-8
Cailloux, Olivier; Endriss, Ulle (2016-05-09). "Arguing about Voting Rules". Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems. AAMAS '16. Richland, SC: International Foundation for Autonomous Agents and Multiagent Systems: 287–295. ISBN 978-1-4503-4239-1. 978-1-4503-4239-1
Peters, Dominik; Procaccia, Ariel D; Psomas, Alexandros; Zhou, Zixin (2020). "Explainable Voting". Advances in Neural Information Processing Systems. 33. Curran Associates, Inc.: 1525–1534. https://proceedings.neurips.cc/paper/2020/hash/10c72a9d42dd07a028ee910f7854da5d-Abstract.html
Yang, Joshua C.; Hausladen, Carina I.; Peters, Dominik; Pournaras, Evangelos; Regula Häenggli Fricker; Helbing, Dirk (2024). "Designing Digital Voting Systems for Citizens: Achieving Fairness and Legitimacy in Participatory Budgeting". Digital Government: Research and Practice. 5 (3): 1–30. arXiv:2310.03501. doi:10.1145/3665332. /wiki/ArXiv_(identifier)
Nizri, Meir; Hazon, Noam; Azaria, Amos (2022-06-28). "Explainable Shapley-Based Allocation (Student Abstract)". Proceedings of the AAAI Conference on Artificial Intelligence. 36 (11): 13023–13024. doi:10.1609/aaai.v36i11.21648. ISSN 2374-3468. S2CID 250296641. https://ojs.aaai.org/index.php/AAAI/article/view/21648