11,558 research outputs found
Clue: Cross-modal Coherence Modeling for Caption Generation
We use coherence relations inspired by computational models of discourse to
study the information needs and goals of image captioning. Using an annotation
protocol specifically devised for capturing image--caption coherence relations,
we annotate 10,000 instances from publicly-available image--caption pairs. We
introduce a new task for learning inferences in imagery and text, coherence
relation prediction, and show that these coherence annotations can be exploited
to learn relation classifiers as an intermediary step, and also train
coherence-aware, controllable image captioning models. The results show a
dramatic improvement in the consistency and quality of the generated captions
with respect to information needs specified via coherence relations.Comment: Accepted as a long paper to ACL 202
Recommended from our members
But What Do You Need To Produce a But?
We study the problem of appropriately generating connectives (e.g., 'but', 'because', 'since', 'however') in a discourse. We claim that connectives operate at the discourse level rather than the semantic level, and that they indicate pragmatic features of the units they connect. Therefore, in order to choose the appropriate connective, a surface generator must find in its input a set of pragmatic features that affect or are affected by the choice of a connective. We present such a set of features and show their role in a variety of examples of the connective 'but'
What Do Paraconsistent, Undecidable, Random, Computable and Incomplete mean? A Review of Godel's Way: Exploits into an undecidable world by Gregory Chaitin, Francisco A Doria, Newton C.A. da Costa 160p (2012) (review revised 2019)
In āGodelās Wayā three eminent scientists discuss issues such as undecidability, incompleteness, randomness, computability and paraconsistency. I approach these issues from the Wittgensteinian viewpoint that there are two basic issues which have completely different solutions. There are the scientific or empirical issues, which are facts about the world that need to be investigated observationally and philosophical issues as to how language can be used intelligibly (which include certain questions in mathematics and logic), which need to be decided by looking at how we actually use words in particular contexts. When we get clear about which language game we are playing, these topics are seen to be ordinary scientific and mathematical questions like any others. Wittgensteinās insights have seldom been equaled and never surpassed and are as pertinent today as they were 80 years ago when he dictated the Blue and Brown Books. In spite of its failingsāreally a series of notes rather than a finished bookāthis is a unique source of the work of these three famous scholars who have been working at the bleeding edges of physics, math and philosophy for over half a century. Da Costa and Doria are cited by Wolpert (see below or my articles on Wolpert and my review of Yanofskyās āThe Outer Limits of Reasonā) since they wrote on universal computation, and among his many accomplishments, Da Costa is a pioneer in paraconsistency.
Those wishing a comprehensive up to date framework for human behavior from the modern two systems view may consult my book āThe Logical Structure of Philosophy, Psychology, Mind and Language in Ludwig Wittgenstein and John Searleā 2nd ed (2019). Those interested in more of my writings may see āTalking Monkeys--Philosophy, Psychology, Science, Religion and Politics on a Doomed Planet--Articles and Reviews 2006-2019 3rd ed (2019), The Logical Structure of Human Behavior (2019), and Suicidal Utopian Delusions in the 21st Century 4th ed (2019
Learning Features that Predict Cue Usage
Our goal is to identify the features that predict the occurrence and
placement of discourse cues in tutorial explanations in order to aid in the
automatic generation of explanations. Previous attempts to devise rules for
text generation were based on intuition or small numbers of constructed
examples. We apply a machine learning program, C4.5, to induce decision trees
for cue occurrence and placement from a corpus of data coded for a variety of
features previously thought to affect cue usage. Our experiments enable us to
identify the features with most predictive power, and show that machine
learning can be used to induce decision trees useful for text generation.Comment: 10 pages, 2 Postscript figures, uses aclap.sty, psfig.te
Review of 'The Outer Limits of Reason' by Noson Yanofsky 403p (2013) (review revised 2019)
I give a detailed review of 'The Outer Limits of Reason' by Noson Yanofsky from a unified perspective of Wittgenstein and evolutionary psychology. I indicate that the difficulty with such issues as paradox in language and math, incompleteness, undecidability, computability, the brain and the universe as computers etc., all arise from the failure to look carefully at our use of language in the appropriate context and hence the failure to separate issues of scientific fact from issues of how language works. I discuss Wittgenstein's views on incompleteness, paraconsistency and undecidability and the work of Wolpert on the limits to computation. To sum it up: The Universe According to Brooklyn---Good Science, Not So Good Philosophy.
Those wishing a comprehensive up to date framework for human behavior from the modern two systems view may consult my book āThe Logical Structure of Philosophy, Psychology, Mind and Language in Ludwig Wittgenstein and John Searleā 2nd ed (2019). Those interested in more of my writings may see āTalking Monkeys--Philosophy, Psychology, Science, Religion and Politics on a Doomed Planet--Articles and Reviews 2006-2019 3rd ed (2019) and Suicidal Utopian Delusions in the 21st Century 4th ed (2019
Session 5: Development, Neuroscience and Evolutionary Psychology
Proceedings of the Pittsburgh Workshop in History and Philosophy of Biology, Center for Philosophy of Science, University of Pittsburgh, March 23-24 2001 Session 5: Development, Neuroscience and Evolutionary Psycholog
Determining intended evidence relations in natural language arguments
Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/72555/1/j.1467-8640.1991.tb00386.x.pd
- ā¦