51 research outputs found

    Human evaluation of Kea, an automatic keyphrasing system

    This paper describes an evaluation of the Kea automatic keyphrase extraction algorithm. Tools that automatically identify keyphrases are desirable because document keyphrases have numerous applications in digital library systems, but are costly and time-consuming to assign manually. Keyphrase extraction algorithms are usually evaluated by comparison to author-specified keywords, but this methodology has several well-known shortcomings. The results presented in this paper are based on subjective evaluations of the quality and appropriateness of keyphrases by human assessors, and make a number of contributions. First, they validate previous evaluations of Kea that rely on author keywords. Second, they show that Kea's performance is comparable to that of similar systems that have been evaluated by human assessors. Finally, they justify the use of author keyphrases as a performance metric by showing that authors generally choose good keyphrases.
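
    A minimal sketch of the author-keyword comparison this methodology rests on: normalise phrases, then count how many extracted phrases match the author's set. The crude suffix-stripping rule and the sample phrases below are illustrative assumptions, not Kea's actual matching procedure.

    ```python
    # Hedged sketch: compare extracted keyphrases with author keywords.
    # The normalisation below is a deliberately crude stand-in for the
    # stemming a real evaluation would use.

    def normalise(phrase):
        """Lower-case and crudely de-pluralise each word so near-matches count."""
        return tuple(word.rstrip("s") for word in phrase.lower().split())

    def precision_recall(extracted, author_keywords):
        gold = {normalise(p) for p in author_keywords}
        hits = sum(1 for p in extracted if normalise(p) in gold)
        return hits / len(extracted), hits / len(gold)

    extracted = ["keyphrase extraction", "digital library", "author keywords"]
    author = ["keyphrase extraction", "author keyword", "evaluation"]
    p, r = precision_recall(extracted, author)
    print(f"precision={p:.2f} recall={r:.2f}")  # precision=0.67 recall=0.67
    ```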

    Automating iterative tasks with programming by demonstration

    Programming by demonstration is an end-user programming technique that allows people to create programs by showing the computer examples of what they want to do. Users do not need specialised programming skills. Instead, they instruct the computer by demonstrating examples, much as they might show another person how to do the task. Programming by demonstration empowers users to create programs that perform tedious and time-consuming computer chores. However, it is not in widespread use, and is instead confined to research applications that end users never see. This makes it difficult to evaluate programming-by-demonstration tools and techniques. This thesis claims that domain-independent programming by demonstration can be made available in existing applications and used by end users to automate iterative tasks. It is supported by Familiar, a domain-independent, AppleScript-based programming-by-demonstration tool embodying standard machine learning algorithms. Familiar is designed for end users, so it works in the existing applications that they regularly use. The assertion that programming by demonstration can be made available in existing applications is validated by identifying the relevant platform requirements and a range of platforms that meet them. A detailed scrutiny of AppleScript highlights problems with the architecture and with many implementations, and yields a set of guidelines for designing applications that support programming by demonstration. An evaluation shows that end users are capable of using programming by demonstration to automate iterative tasks. However, the subjects tended to prefer other tools, choosing Familiar only when the alternatives were unsuitable or unavailable. Familiar's inferencing is evaluated on an extensive set of examples, highlighting the tasks it can perform and the functionality it requires.
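
    As a toy illustration of the core idea (not of Familiar's actual inferencing, which applies machine learning to recorded AppleScript events), the sketch below generalises an update rule from two demonstrated iterations and extrapolates the remaining ones. The file-renaming task and its parameters are invented.

    ```python
    # Toy programming-by-demonstration loop inference: given two
    # demonstrated iterations, infer how each parameter changes, then
    # extrapolate. Real systems infer over much richer event traces.

    def infer_rules(first, second):
        """Derive a per-parameter update rule from two consecutive examples."""
        rules = []
        for a, b in zip(first, second):
            if isinstance(a, int) and isinstance(b, int):
                rules.append(lambda v, d=b - a: v + d)  # arithmetic progression
            else:
                rules.append(lambda v: v)               # assume the value is constant
        return rules

    def extrapolate(state, rules, n):
        """Apply the inferred rules n more times, yielding each new state."""
        for _ in range(n):
            state = [rule(v) for rule, v in zip(rules, state)]
            yield state

    # Demonstrations: photo1.jpg -> holiday-1.jpg, photo2.jpg -> holiday-2.jpg
    rules = infer_rules([1, "holiday"], [2, "holiday"])
    for index, prefix in extrapolate([2, "holiday"], rules, 3):
        print(f"rename photo{index}.jpg -> {prefix}-{index}.jpg")
    ```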

    Interactive document summarisation

    This paper describes the Interactive Document Summariser (IDS), a dynamic document summarisation system that can help users of digital libraries to access on-line documents more effectively. IDS provides dynamic control over summary characteristics, such as length and topic focus, so that changes made by the user are instantly reflected in an on-screen summary. A range of 'summary-in-context' views support seamless transitions between summaries and their source documents. IDS creates summaries by extracting keyphrases from a document with the Kea system, scoring sentences according to the keyphrases that they contain, and then extracting the highest-scoring sentences. We report an evaluation of IDS summaries in which human assessors identified suitable summary sentences in source documents, against which IDS summaries were judged. We found that IDS summaries were better than baseline summaries, and identify the characteristics of Kea keyphrases that lead to the best summaries.
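
    The pipeline described above is simple enough to sketch: score each sentence by the keyphrases it contains, then keep the top scorers. In IDS the keyphrases and their weights come from Kea; the sample weights, the tokenisation, and the choice to re-emit selected sentences in document order are assumptions here, not IDS internals.

    ```python
    # Sketch of keyphrase-weighted sentence extraction in the spirit of IDS.
    import re

    def summarise(text, keyphrase_scores, n_sentences=2):
        sentences = re.split(r"(?<=[.!?])\s+", text.strip())
        scored = []
        for position, sentence in enumerate(sentences):
            lowered = sentence.lower()
            score = sum(weight for phrase, weight in keyphrase_scores.items()
                        if phrase in lowered)
            scored.append((score, position, sentence))
        top = sorted(scored, reverse=True)[:n_sentences]
        # Emit the chosen sentences in their original document order.
        return " ".join(s for _, _, s in sorted(top, key=lambda t: t[1]))

    keyphrases = {"digital library": 0.9, "summarisation": 0.7}  # assumed Kea output
    doc = ("Digital library users face long documents. "
           "Summarisation helps them decide what to read. "
           "The weather was pleasant that day.")
    print(summarise(doc, keyphrases))
    ```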

    A user evaluation of hierarchical phrase browsing

    Phrase browsing interfaces based on hierarchies of phrases extracted automatically from document collections offer a useful compromise between automatic full-text searching and manually created subject indexes. The literature contains descriptions of such systems that many find compelling and persuasive. However, evaluation studies have been anecdotal, focused on objective measures of the quality of automatically extracted index terms, or restricted to questions of computational efficiency and feasibility. This paper reports an empirical, controlled user study that compares hierarchical phrase browsing with full-text searching over a range of information-seeking tasks. Users found the results located via phrase browsing to be relevant and useful, but preferred keyword searching for certain types of queries. Users' experiences were marred by interface details, including inconsistencies between the phrase browser and the surrounding digital library interface.

    Experiences in deploying metadata analysis tools for institutional repositories

    Current institutional repository software provides few tools to help metadata librarians understand and analyze their collections. In this article, we compare and contrast metadata analysis tools that were developed simultaneously, but independently, at two New Zealand institutions during a period of national investment in research repositories: the Metadata Analysis Tool (MAT) at The University of Waikato, and the Kiwi Research Information Service (KRIS) at the National Library of New Zealand. The tools have many similarities: they are convenient, online, on-demand services that harvest metadata using OAI-PMH; they were developed in response to feedback from repository administrators; and they both help pinpoint specific metadata errors as well as generate summary statistics. They also have significant differences: one is a dedicated tool whereas the other is part of a wider access tool; one gives a holistic view of the metadata whereas the other looks for specific problems; one seeks patterns in the data values whereas the other checks that those values conform to metadata standards. Both tools complement existing Web-based administration tools. We have observed that metadata errors can be discovered and corrected quickly by switching Web browser views from the analysis tool to the repository interface, and back. We summarize the findings from both tools' deployment into a checklist of requirements for metadata analysis tools.
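
    Neither tool's source is shown here, but the shared pattern the article describes (harvest Dublin Core records over OAI-PMH, then flag field-level problems) can be sketched as follows. The repository endpoint is hypothetical and the two checks are merely representative.

    ```python
    # Sketch: harvest oai_dc records via OAI-PMH and report metadata problems.
    import re
    import urllib.request
    import xml.etree.ElementTree as ET

    OAI = "{http://www.openarchives.org/OAI/2.0/}"
    DC = "{http://purl.org/dc/elements/1.1/}"

    def harvest(base_url):
        """Fetch one ListRecords page (resumption tokens omitted for brevity)."""
        url = base_url + "?verb=ListRecords&metadataPrefix=oai_dc"
        with urllib.request.urlopen(url) as response:
            root = ET.parse(response).getroot()
        return root.iter(OAI + "record")

    def check(record):
        identifier = record.find(OAI + "header").findtext(OAI + "identifier")
        problems = []
        if not [e for e in record.iter(DC + "title") if e.text]:
            problems.append("missing dc:title")
        for date in record.iter(DC + "date"):
            if date.text and not re.fullmatch(r"\d{4}(-\d{2}(-\d{2})?)?", date.text):
                problems.append(f"non-ISO dc:date: {date.text!r}")
        return identifier, problems

    for record in harvest("https://repository.example.org/oai"):  # hypothetical endpoint
        identifier, problems = check(record)
        for problem in problems:
            print(identifier, problem)
    ```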

    Beyond equilibrium climate sensitivity

    ISSN: 1752-0908; ISSN: 1752-089

    Developing Practical Automatic Metadata Assignment and Evaluation Tools for Internet Resources

    This paper describes the development of practical automatic metadata assignment tools to support automatic record creation for virtual libraries, metadata repositories and digital libraries, with particular reference to library-standard metadata. The development process is incremental, and depends upon an automatic metadata evaluation tool to objectively measure its progress. The evaluation tool is based on and informed by the metadata created and maintained by librarian experts at the INFOMINE Project, and uses different metrics to evaluate different metadata fields. In this paper, we describe the form and function of common metadata fields, and identify appropriate performance measures for these fields. The automatic metadata assignment tools in the iVia virtual library software are described, and their performance is measured. Finally, we discuss the limitations of automatic metadata evaluation, and cases where we choose to ignore its evidence in favor of human judgment.
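
    A sketch of the per-field idea: when automatic output is compared with expert records, different fields warrant different measures. The two metrics below (normalised exact match for titles, set-based F1 for keywords) are plausible stand-ins rather than the measures the paper adopts.

    ```python
    # Sketch: dispatch a different evaluation metric per metadata field.

    def title_score(auto, expert):
        """Titles are near-verbatim strings, so use normalised exact match."""
        return float(auto.strip().lower() == expert.strip().lower())

    def keyword_score(auto, expert):
        """Keyword fields are sets, so use F1 over the two sets."""
        auto_set = {k.lower() for k in auto}
        expert_set = {k.lower() for k in expert}
        if not auto_set or not expert_set:
            return 0.0
        p = len(auto_set & expert_set) / len(auto_set)
        r = len(auto_set & expert_set) / len(expert_set)
        return 2 * p * r / (p + r) if p + r else 0.0

    auto = {"title": "IVia Virtual Library Software",
            "keywords": ["virtual libraries", "metadata", "crawling"]}
    expert = {"title": "iVia virtual library software",
              "keywords": ["metadata", "virtual libraries"]}
    print("title:", title_score(auto["title"], expert["title"]))            # 1.0
    print("keywords:", keyword_score(auto["keywords"], expert["keywords"]))  # 0.8
    ```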

    An Evaluation of Document Keyphrase Sets

    Keywords and keyphrases have many useful roles as document surrogates and descriptors, but the manual production of keyphrase metadata for large digital library collections is at best expensive and time-consuming, and at worst logistically impossible. Keyphrase extraction algorithms like Kea and Extractor produce a set of phrases associated with a document. Though these sets are often used as a group, keyphrase extraction is usually evaluated by measuring the quality of individual keyphrases. This paper reports an assessment in which human assessors rated entire sets of keyphrases produced by Kea, Extractor and document authors. The results provide further evidence that human assessors rate all three sources highly (with some caveats), but show that the relationship between the quality of the phrases in a set and the quality of the set as a whole is not always simple. Choosing the best individual phrases will not necessarily produce the best set; combinations of lesser phrases may result in better overall quality.

    Predicting Library of Congress Classifications from Library of Congress Subject Headings

    This paper addresses the problem of automatically assigning a Library of Congress Classification (LCC) to a work given its set of Library of Congress Subject Headings (LCSH). The LCC is organized as a tree: the root node of this hierarchy comprises all possible topics, and leaf nodes correspond to the most specialized topic areas defined. We describe a procedure that, given a resource identified by its LCSH, automatically places that resource in the LCC hierarchy. The procedure uses machine learning techniques and training data from a large library catalog to learn a classification model mapping from sets of LCSH to nodes in the LCC tree. We present empirical results for our technique showing its accuracy on an independent collection of 50,000 LCSH/LCC pairs.
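
    For illustration, the learning setup can be reduced to a flat classifier from LCSH sets to top-level LCC classes; the paper's procedure places works at nodes throughout the LCC tree, and the tiny training catalog below is invented.

    ```python
    # Sketch: learn a mapping from sets of LCSH to (top-level) LCC classes.
    from sklearn.naive_bayes import MultinomialNB
    from sklearn.preprocessing import MultiLabelBinarizer

    train_lcsh = [
        {"Machine learning", "Computer algorithms"},
        {"Programming languages (Electronic computers)"},
        {"World history", "Civilization"},
        {"Europe--History"},
    ]
    train_lcc = ["Q", "Q", "D", "D"]  # Q = Science, D = World History

    binariser = MultiLabelBinarizer()        # one binary feature per subject heading
    X = binariser.fit_transform(train_lcsh)
    model = MultinomialNB().fit(X, train_lcc)

    query = binariser.transform([{"Machine learning"}])
    print(model.predict(query)[0])  # -> "Q"
    ```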