Search CORE

8 research outputs found

Unexplainability and Incomprehensibility of Artificial Intelligence

Author: Yampolskiy Roman
Publication venue
Publication date
Field of study

Explainability and comprehensibility of AI are important requirements for intelligent systems deployed in real-world domains. Users want and frequently need to understand how decisions impacting them are made. Similarly it is important to understand how an intelligent system functions for safety and security reasons. In this paper, we describe two complementary impossibility results (Unexplainability and Incomprehensibility), essentially showing that advanced AIs would not be able to accurately explain some of their decisions and for the decisions they could explain people would not understand some of those explanations

PhilPapers

Comprehending software correctness implies comprehending an intelligence-related limitation

Author: Arthur Charlesworth
Barrow J. D.
Chalmers D. J.
Gödel K.
Gödel K.
McDermott D.
Minsky M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Impossibility Results in AI: A Survey

Author: Brcic Mario
Yampolskiy Roman
Publication venue: ThinkIR: The University of Louisville\u27s Institutional Repository
Publication date: 01/09/2021
Field of study

An impossibility theorem demonstrates that a particular problem or set of problems cannot be solved as described in the claim. Such theorems put limits on what is possible to do concerning artificial intelligence, especially the super-intelligent one. As such, these results serve as guidelines, reminders, and warnings to AI safety, AI policy, and governance researchers. These might enable solutions to some long-standing questions in the form of formalizing theories in the framework of constraint satisfaction without committing to one option. In this paper, we have categorized impossibility theorems applicable to the domain of AI into five categories: deduction, indistinguishability, induction, tradeoffs, and intractability. We found that certain theorems are too specific or have implicit assumptions that limit application. Also, we added a new result (theorem) about the unfairness of explainability, the first explainability-related result in the induction category. We concluded that deductive impossibilities deny 100%-guarantees for security. In the end, we give some ideas that hold potential in explainability, controllability, value alignment, ethics, and group decision-making. They can be deepened by further investigation

arXiv.org e-Print Archive

University of Louisville

The Comprehensibility Theorem and the Foundations of Artificial Intelligence

Author: A Charlesworth
AJ Wiles
Arthur Charlesworth
AW Appel
B Franklin
D Hofstadter
D Hofstadter
DE Knuth
DE Knuth
E Hamilton
G LaForte
G Lynch
GE Forsythe
H Moravec
H Wang
J Harrison
JH Lint van
K Appel
K Gödel
K Gödel
L Kirby
M Minsky
P Cohen
PM Dorin
RE Hodel
S Mac Lane
SJ Russell
T Franzén
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

On Controllability of Artificial Intelligence

Author: Yampolskiy Roman
Publication venue
Publication date
Field of study

Invention of artificial general intelligence is predicted to cause a shift in the trajectory of human civilization. In order to reap the benefits and avoid pitfalls of such powerful technology it is important to be able to control it. However, possibility of controlling artificial general intelligence and its more advanced version, superintelligence, has not been formally established. In this paper, we present arguments as well as supporting evidence from multiple domains indicating that advanced AI can’t be fully controlled. Consequences of uncontrollability of AI are discussed with respect to future of humanity and research on AI, and AI safety and security. This paper can serve as a comprehensive reference for the topic of uncontrollability

PhilPapers