221 research outputs found

    Online learning via dynamic reranking for Computer Assisted Translation

    New techniques for online adaptation in computer-assisted translation are explored and compared to previously existing approaches. Under the online adaptation paradigm, the translation system needs to adapt itself to real-world changing scenarios, where training and tuning may take place only once, when the system is set up for the first time. For this purpose, post-edit information, as described by a given quality measure, is used as valuable feedback within a dynamic reranking algorithm. Two possible approaches are presented and evaluated. The first relies on the well-known perceptron algorithm, whereas the second is a novel approach that uses Ridge regression to compute the optimum scaling factors within a state-of-the-art SMT system. Experimental results show that both algorithms are able to improve translation quality by learning from the errors produced by the system on a sentence-by-sentence basis.

    This paper is based upon work supported by the EC (FEDER/FSE) and the Spanish MICINN under projects MIPRCV “Consolider Ingenio 2010” (CSD2007-00018) and iTrans2 (TIN2009-14511). Also supported by the Spanish MITyC under the erudito.com project (TSI-020110-2009-439), by the Generalitat Valenciana under grant Prometeo/2009/014 and scholarship GV/2010/067, and by the UPV under grant 20091027.

    Martínez Gómez, P.; Sanchis Trilles, G.; Casacuberta Nolla, F. (2011). Online learning via dynamic reranking for Computer Assisted Translation. In: Computational Linguistics and Intelligent Text Processing. Springer Verlag (Germany). 6609:93-105. https://doi.org/10.1007/978-3-642-19437-5_8
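
    A minimal sketch of the kind of sentence-by-sentence update described above, assuming each n-best hypothesis comes with a feature vector and a quality score (e.g., a sentence-level metric computed from the post-edit). The function names, learning rate, and regularizer are illustrative and not the paper's exact formulation.

```python
import numpy as np

def rerank(nbest_features, weights):
    """Score every n-best hypothesis with the log-linear model; return the best index."""
    scores = nbest_features @ weights
    return int(np.argmax(scores))

def perceptron_update(weights, nbest_features, quality, lr=0.1):
    """Perceptron-style step: move the scaling factors towards the hypothesis
    preferred by the quality measure and away from the current top hypothesis."""
    best = int(np.argmax(quality))
    current = rerank(nbest_features, weights)
    if current != best:
        weights = weights + lr * (nbest_features[best] - nbest_features[current])
    return weights

def ridge_update(weights, nbest_features, quality, lam=1.0, lr=0.1):
    """Ridge-regression-style step: regress quality differences on feature
    differences (both w.r.t. the best-quality hypothesis), then move the
    scaling factors a small step along the solution."""
    best = int(np.argmax(quality))
    dF = nbest_features - nbest_features[best]
    dq = quality - quality[best]
    d = dF.shape[1]
    delta = np.linalg.solve(dF.T @ dF + lam * np.eye(d), dF.T @ dq)
    return weights + lr * delta

# Toy usage: 5 hypotheses, 4 features, per-hypothesis quality scores.
rng = np.random.default_rng(0)
F = rng.standard_normal((5, 4))
q = rng.random(5)
w = np.zeros(4)
w = perceptron_update(w, F, q)
w = ridge_update(w, F, q)
print(rerank(F, w))
```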

    Classification of protein interaction sentences via gaussian processes

    The increase in the availability of protein interaction studies in textual format, coupled with the demand for easier access to their key results, has led to a need for text mining solutions. In the text processing pipeline, classification is a key step for extracting small sections of relevant text. Consequently, for the task of locating protein-protein interaction sentences, we examine the use of a classifier that has rarely been applied to text: Gaussian processes (GPs). GPs are a non-parametric probabilistic analogue to the more popular support vector machines (SVMs). We find that GPs outperform the SVM and naïve Bayes classifiers on binary sentence data, whilst showing equivalent performance on abstract and multiclass sentence corpora. In addition, the lack of the margin parameter, which requires costly tuning, along with the principled multiclass extensions enabled by the probabilistic framework, makes GPs an appealing alternative worthy of further adoption.
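
    A minimal sketch of GP-based sentence classification under stated assumptions: scikit-learn's GaussianProcessClassifier with an RBF kernel over TF-IDF features. The toy sentences, labels, and kernel choice are illustrative and not taken from the study.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.gaussian_process import GaussianProcessClassifier
from sklearn.gaussian_process.kernels import RBF

# Toy sentences: 1 = describes a protein-protein interaction, 0 = does not.
sentences = [
    "Protein A binds directly to protein B in vitro.",
    "The weather measurements were recorded daily.",
    "Kinase X phosphorylates substrate Y, forming a stable complex.",
    "Samples were stored at -80 degrees Celsius.",
]
labels = [1, 0, 1, 0]

vectorizer = TfidfVectorizer()
X = vectorizer.fit_transform(sentences).toarray()   # GPs need dense features

# RBF kernel; unlike an SVM there is no margin parameter C to tune,
# and predictions come with class probabilities.
# Note: exact GP inference scales cubically with the number of sentences,
# so large corpora typically need sparse approximations.
clf = GaussianProcessClassifier(kernel=1.0 * RBF(length_scale=1.0))
clf.fit(X, labels)

test = vectorizer.transform(["Protein C interacts with protein D."]).toarray()
print(clf.predict_proba(test))
```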

    Profiles and Majority Voting-Based Ensemble Method for Protein Secondary Structure Prediction

    Machine learning techniques have been widely applied to the problem of predicting protein secondary structure from the amino acid sequence, and they have gained substantial success in this research area. Many methods have been used, including k-Nearest Neighbors (k-NNs), Hidden Markov Models (HMMs), Artificial Neural Networks (ANNs) and Support Vector Machines (SVMs), which have attracted attention recently. Today, the main goal remains to improve the prediction quality of the secondary structure elements. Prediction accuracy has been continuously improved over the years, especially through hybrid or ensemble methods and by incorporating evolutionary information in the form of profiles extracted from alignments of multiple homologous sequences. In this paper, we investigate how best to combine k-NNs, ANNs and Multi-class SVMs (M-SVMs) to improve secondary structure prediction of globular proteins. An ensemble method which combines the outputs of two feed-forward ANNs, a k-NN and three M-SVM classifiers has been applied, with the ensemble members combined using two variants of the majority voting rule. A heuristic-based filter has also been applied to refine the prediction. To investigate how much improvement the ensemble method gives over the individual classifiers that make it up, we have experimented with the proposed system on the two widely used benchmark datasets RS126 and CB513, using cross-validation tests and including PSI-BLAST position-specific scoring matrix (PSSM) profiles as inputs. The experimental results reveal that the proposed system yields significant performance gains when compared with the best individual classifier.
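
    A minimal sketch of the majority-voting combination step, assuming each base classifier already emits a per-residue label in {H, E, C}; the tie-breaking rule (fall back to coil) is an illustrative assumption, not necessarily the paper's.

```python
from collections import Counter

def majority_vote(predictions, tie_break="C"):
    """Combine per-residue secondary structure predictions from several classifiers.

    predictions: equal-length strings over {H, E, C}, one per base classifier.
    Returns the consensus string; unresolved ties fall back to `tie_break` (coil).
    """
    length = len(predictions[0])
    consensus = []
    for i in range(length):
        votes = Counter(p[i] for p in predictions)
        top = votes.most_common()
        if len(top) > 1 and top[0][1] == top[1][1]:
            consensus.append(tie_break)       # no clear winner -> coil
        else:
            consensus.append(top[0][0])
    return "".join(consensus)

# Example: three hypothetical base classifiers voting on a 10-residue segment.
preds = ["HHHHEECCCC", "HHHEEECCCC", "HHHHEEECCC"]
print(majority_vote(preds))   # -> "HHHHEECCCC"
```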

    Building multiclass classifiers for remote homology detection and fold recognition

    BACKGROUND: Protein remote homology detection and fold recognition are central problems in computational biology. Supervised learning algorithms based on support vector machines are currently among the most effective methods for solving these problems. However, these methods are primarily used to solve binary classification problems and have not been extensively used for the more general multiclass remote homology prediction and fold recognition problems. RESULTS: We present a comprehensive evaluation of a number of methods for building SVM-based multiclass classification schemes in the context of the SCOP protein classification. These methods include schemes that directly build an SVM-based multiclass model, schemes that employ a second-level learning approach to combine the predictions generated by a set of binary SVM-based classifiers, and schemes that build and combine binary classifiers for various levels of the SCOP hierarchy beyond those defining the target classes. CONCLUSION: Analyzing the performance achieved by the different approaches on four different datasets, we show that most of the proposed multiclass SVM-based classification approaches are quite effective in solving the remote homology prediction and fold recognition problems. The schemes that use predictions from binary models constructed for ancestral categories within the SCOP hierarchy tend not only to achieve lower error rates but also to reduce the number of errors in which a superfamily is assigned to an entirely different fold or a fold is predicted as being from a different SCOP class. Our results also show that the limited size of the training data makes it hard to learn complex second-level models, and that models of moderate complexity lead to consistently better results.
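
    A minimal sketch of one of the scheme families described above (binary one-vs-rest SVMs whose outputs are combined by a second-level learner), assuming scikit-learn's LinearSVC and a synthetic dataset as stand-ins; the features, the SCOP hierarchy handling, and the combiner are illustrative, not the paper's setup.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.svm import LinearSVC

# Synthetic stand-in for protein feature vectors with 5 target classes.
X, y = make_classification(n_samples=300, n_features=50, n_informative=20,
                           n_classes=5, n_clusters_per_class=1, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

classes = np.unique(y_tr)

# First level: one binary SVM per target class (one-vs-rest).
binary_svms = [LinearSVC(C=1.0).fit(X_tr, (y_tr == c).astype(int)) for c in classes]

def decision_matrix(X):
    """Stack the raw decision values of all binary SVMs (one column per class)."""
    return np.column_stack([svm.decision_function(X) for svm in binary_svms])

# Second level: a simple learner that combines the binary outputs into a
# multiclass decision (here a multinomial logistic regression of moderate complexity).
combiner = LogisticRegression(max_iter=1000).fit(decision_matrix(X_tr), y_tr)
print("accuracy:", combiner.score(decision_matrix(X_te), y_te))
```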

    Brain-Computer Interface Based on Generation of Visual Images

    This paper examines the task of recognizing EEG patterns that correspond to performing three mental tasks: relaxation and the imagining of two types of pictures, faces and houses. The experiments were performed using two EEG headsets: BrainProducts ActiCap and Emotiv EPOC. The Emotiv headset is becoming widely used in consumer BCI applications, allowing for large-scale EEG experiments in the future. Since classification accuracy significantly exceeded the level of random classification during the first three days of the experiment with the EPOC headset, a control experiment was performed on the fourth day using the ActiCap. The control experiment showed that utilization of high-quality research equipment can enhance classification accuracy (up to 68% in some subjects) and that the accuracy is independent of the presence of EEG artifacts related to blinking and eye movement. This study also shows that a computationally inexpensive Bayesian classifier based on covariance matrix analysis yields classification accuracy on this problem similar to that of the more sophisticated Multi-class Common Spatial Patterns (MCSP) classifier.
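
    The abstract does not spell out the Bayesian classifier's exact form, so the following is an illustrative approximation only: each class is modelled as a zero-mean multichannel Gaussian with an averaged spatial covariance, and a test epoch is assigned to the class with the highest log-likelihood.

```python
import numpy as np

def class_covariances(epochs, labels):
    """Average spatial covariance per class.

    epochs: array (n_epochs, n_channels, n_samples); labels: array (n_epochs,).
    """
    covs = {}
    for c in np.unique(labels):
        per_epoch = [e @ e.T / e.shape[1] for e in epochs[labels == c]]
        covs[c] = np.mean(per_epoch, axis=0)
    return covs

def classify_epoch(epoch, covs):
    """Pick the class whose zero-mean Gaussian model gives the epoch the
    highest log-likelihood (class-independent constants dropped)."""
    T = epoch.shape[1]
    C = epoch @ epoch.T / T
    best_class, best_ll = None, -np.inf
    for c, sigma in covs.items():
        _, logdet = np.linalg.slogdet(sigma)
        ll = -0.5 * T * (logdet + np.trace(np.linalg.solve(sigma, C)))
        if ll > best_ll:
            best_class, best_ll = c, ll
    return best_class

# Tiny synthetic example: two classes with different channel covariances.
rng = np.random.default_rng(0)
A1, A2 = np.diag([1.0, 0.3, 0.3]), np.diag([0.3, 1.0, 0.3])
train = np.stack([A @ rng.standard_normal((3, 200)) for A in [A1, A2] for _ in range(20)])
y = np.array([0] * 20 + [1] * 20)
covs = class_covariances(train, y)
test_epoch = A1 @ rng.standard_normal((3, 200))
print(classify_epoch(test_epoch, covs))   # expected: 0
```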