Search CORE

11,558 research outputs found

Clue: Cross-modal Coherence Modeling for Caption Generation

Author: Alikhani Malihe
Li Shengjie
Sharma Piyush
Soricut Radu
Stone Matthew
Publication venue
Publication date: 02/05/2020
Field of study

We use coherence relations inspired by computational models of discourse to study the information needs and goals of image captioning. Using an annotation protocol specifically devised for capturing image--caption coherence relations, we annotate 10,000 instances from publicly-available image--caption pairs. We introduce a new task for learning inferences in imagery and text, coherence relation prediction, and show that these coherence annotations can be exploited to learn relation classifiers as an intermediary step, and also train coherence-aware, controllable image captioning models. The results show a dramatic improvement in the consistency and quality of the generated captions with respect to information needs specified via coherence relations.Comment: Accepted as a long paper to ACL 202

arXiv.org e-Print Archive

Recommended from our members

But What Do You Need To Produce a But?

Author: Elhadad Michael
McKeown Kathleen
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/1988
Field of study

We study the problem of appropriately generating connectives (e.g., 'but', 'because', 'since', 'however') in a discourse. We claim that connectives operate at the discourse level rather than the semantic level, and that they indicate pragmatic features of the units they connect. Therefore, in order to choose the appropriate connective, a surface generator must find in its input a set of pragmatic features that affect or are affected by the choice of a connective. We present such a set of features and show their role in a variety of examples of the connective 'but'

Columbia University Academic Commons

What Do Paraconsistent, Undecidable, Random, Computable and Incomplete mean? A Review of Godel's Way: Exploits into an undecidable world by Gregory Chaitin, Francisco A Doria, Newton C.A. da Costa 160p (2012) (review revised 2019)

Author: Starks Michael
Publication venue
Publication date: 01/01/2019
Field of study

In ‘Godel’s Way’ three eminent scientists discuss issues such as undecidability, incompleteness, randomness, computability and paraconsistency. I approach these issues from the Wittgensteinian viewpoint that there are two basic issues which have completely different solutions. There are the scientific or empirical issues, which are facts about the world that need to be investigated observationally and philosophical issues as to how language can be used intelligibly (which include certain questions in mathematics and logic), which need to be decided by looking at how we actually use words in particular contexts. When we get clear about which language game we are playing, these topics are seen to be ordinary scientific and mathematical questions like any others. Wittgenstein’s insights have seldom been equaled and never surpassed and are as pertinent today as they were 80 years ago when he dictated the Blue and Brown Books. In spite of its failings—really a series of notes rather than a finished book—this is a unique source of the work of these three famous scholars who have been working at the bleeding edges of physics, math and philosophy for over half a century. Da Costa and Doria are cited by Wolpert (see below or my articles on Wolpert and my review of Yanofsky’s ‘The Outer Limits of Reason’) since they wrote on universal computation, and among his many accomplishments, Da Costa is a pioneer in paraconsistency. Those wishing a comprehensive up to date framework for human behavior from the modern two systems view may consult my book ‘The Logical Structure of Philosophy, Psychology, Mind and Language in Ludwig Wittgenstein and John Searle’ 2nd ed (2019). Those interested in more of my writings may see ‘Talking Monkeys--Philosophy, Psychology, Science, Religion and Politics on a Doomed Planet--Articles and Reviews 2006-2019 3rd ed (2019), The Logical Structure of Human Behavior (2019), and Suicidal Utopian Delusions in the 21st Century 4th ed (2019

PhilPapers

Learning Features that Predict Cue Usage

Author: Di Eugenio Barbara
Moore Johanna D.
Paolucci Massimo
Publication venue
Publication date: 01/01/1997
Field of study

Our goal is to identify the features that predict the occurrence and placement of discourse cues in tutorial explanations in order to aid in the automatic generation of explanations. Previous attempts to devise rules for text generation were based on intuition or small numbers of constructed examples. We apply a machine learning program, C4.5, to induce decision trees for cue occurrence and placement from a corpus of data coded for a variety of features previously thought to affect cue usage. Our experiments enable us to identify the features with most predictive power, and show that machine learning can be used to induce decision trees useful for text generation.Comment: 10 pages, 2 Postscript figures, uses aclap.sty, psfig.te

arXiv.org e-Print Archive

CiteSeerX

Crossref

Edinburgh Research Explorer

Review of 'The Outer Limits of Reason' by Noson Yanofsky 403p (2013) (review revised 2019)

Author: Starks Michael
Publication venue
Publication date: 01/01/2019
Field of study

I give a detailed review of 'The Outer Limits of Reason' by Noson Yanofsky from a unified perspective of Wittgenstein and evolutionary psychology. I indicate that the difficulty with such issues as paradox in language and math, incompleteness, undecidability, computability, the brain and the universe as computers etc., all arise from the failure to look carefully at our use of language in the appropriate context and hence the failure to separate issues of scientific fact from issues of how language works. I discuss Wittgenstein's views on incompleteness, paraconsistency and undecidability and the work of Wolpert on the limits to computation. To sum it up: The Universe According to Brooklyn---Good Science, Not So Good Philosophy. Those wishing a comprehensive up to date framework for human behavior from the modern two systems view may consult my book ‘The Logical Structure of Philosophy, Psychology, Mind and Language in Ludwig Wittgenstein and John Searle’ 2nd ed (2019). Those interested in more of my writings may see ‘Talking Monkeys--Philosophy, Psychology, Science, Religion and Politics on a Doomed Planet--Articles and Reviews 2006-2019 3rd ed (2019) and Suicidal Utopian Delusions in the 21st Century 4th ed (2019

PhilPapers

Session 5: Development, Neuroscience and Evolutionary Psychology

Author: Machamer Peter
Quartz Steven
Scarantino Andrea
Sullivan Jackie
Publication venue
Publication date: 01/01/2002
Field of study

Proceedings of the Pittsburgh Workshop in History and Philosophy of Biology, Center for Philosophy of Science, University of Pittsburgh, March 23-24 2001 Session 5: Development, Neuroscience and Evolutionary Psycholog

PhilSci Archive

Determining intended evidence relations in natural language arguments

Author: Carberry S.
Clark H. H.
Cohen R.
Kautz H.
Poole D.
Reiter R.
Sadock J.
Smedley T. J.
Song F.
Young M. A.
Publication venue: 'Wiley'
Publication date: 01/05/1991
Field of study

Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/72555/1/j.1467-8640.1991.tb00386.x.pd

Crossref

Deep Blue Documents at the University of Michigan