    A machine learning approach to server-side anti-spam e-mail filtering

    Spam-detection systems based on traditional methods have several obvious drawbacks: a low detection rate, the need for regular knowledge-base updates, and impersonal filtering rules. Newer intelligent methods for spam detection, which use statistical and machine learning algorithms, address these problems successfully. However, such methods are not yet widespread in spam filtering for enterprise-level mail servers because of their high resource consumption and insufficient accuracy with respect to false-positive errors. The developed solution offers a precise and fast algorithm whose classification quality is better than that of the Naïve Bayes method, currently the most widespread machine learning method for this task. The time-efficiency problem typical of all learning-based spam-filtering methods is solved with a multi-agent architecture, which allows easy system scaling and the construction of a unified corporate spam-detection system on top of heterogeneous enterprise mail systems. A pilot implementation and its experimental evaluation on standard data sets and on real mail flows demonstrated that our approach outperforms existing learning-based and traditional spam-filtering methods, which makes it a promising platform for building enterprise spam-filtering systems.
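The abstract compares its algorithm against Naïve Bayes, the stated machine-learning baseline. For context, a minimal multinomial Naïve Bayes spam classifier with Laplace smoothing (a generic sketch on toy data, not the paper's algorithm or data) looks like this:

```python
import math
from collections import Counter

def train_nb(spam_docs, ham_docs, alpha=1.0):
    """Train a multinomial Naive Bayes model from tokenized messages.

    alpha is the Laplace smoothing constant."""
    spam_counts = Counter(w for d in spam_docs for w in d)
    ham_counts = Counter(w for d in ham_docs for w in d)
    vocab = set(spam_counts) | set(ham_counts)
    n_spam, n_ham, v = sum(spam_counts.values()), sum(ham_counts.values()), len(vocab)
    total = len(spam_docs) + len(ham_docs)
    return {
        "prior": (math.log(len(spam_docs) / total), math.log(len(ham_docs) / total)),
        "spam": {w: math.log((spam_counts[w] + alpha) / (n_spam + alpha * v)) for w in vocab},
        "ham": {w: math.log((ham_counts[w] + alpha) / (n_ham + alpha * v)) for w in vocab},
        # log-probability assigned to words never seen in training
        "unk": (math.log(alpha / (n_spam + alpha * v)), math.log(alpha / (n_ham + alpha * v))),
    }

def classify(model, tokens):
    """Label a tokenized message by comparing class log-posteriors."""
    s = model["prior"][0] + sum(model["spam"].get(w, model["unk"][0]) for w in tokens)
    h = model["prior"][1] + sum(model["ham"].get(w, model["unk"][1]) for w in tokens)
    return "spam" if s > h else "ham"

# Toy training data, purely illustrative.
spam = [["win", "free", "money"], ["free", "offer", "click"]]
ham = [["meeting", "at", "noon"], ["project", "status", "report"]]
model = train_nb(spam, ham)
print(classify(model, ["free", "money"]))       # → spam
print(classify(model, ["project", "meeting"]))  # → ham
```

The per-message cost is linear in message length, but the model must be retrained as mail flows evolve; this retraining and scoring load is the resource-consumption concern the abstract raises for enterprise mail servers.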

    An Efficient Framework of Utilizing the Latent Semantic Analysis in Text Extraction

    The use of latent semantic analysis (LSA) in text mining imposes large space and time requirements. This paper proposes a new text extraction method that provides a framework for employing statistical semantic analysis in text extraction efficiently. The method uses the centrality feature and omits segments of the text that have a high verbatim, statistical, or semantic similarity to previously processed segments. Similarity is identified with a new multi-layer method that computes similarity in three layers: the Jaccard similarity in the first layer, the vector space model in the second, and LSA in the third. The multi-layer scheme restricts the third layer to segments whose similarity the first two layers failed to estimate. The ROUGE tool is used in the evaluation, but because ROUGE does not consider the extract's size, we supplemented it with a new evaluation strategy based on the compression rate and the ratio of sentence intersections between the automatic and the reference extracts. Our comparisons with classical LSA and traditional statistical extraction showed that we reduced the use of the LSA procedure by 52% and obtained a 65% reduction in the original matrix dimensions, while also achieving strong accuracy results. It is concluded that combining the centrality feature with the proposed multi-layer framework yields a significant improvement in both efficiency and accuracy in the field of text extraction.
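The three-layer cascade the abstract describes (cheap Jaccard check first, vector-space cosine second, expensive LSA only when the first two layers are inconclusive) can be sketched as follows. This is a minimal illustration under assumed thresholds (`t1`, `t2`, `t2_lo`, `t3` are hypothetical values, not the paper's), using raw term frequencies and a truncated SVD in place of whatever weighting the authors actually apply:

```python
import numpy as np

def jaccard(a, b):
    """Layer 1: verbatim overlap between token sets."""
    sa, sb = set(a), set(b)
    return len(sa & sb) / len(sa | sb) if sa | sb else 0.0

def tf_vector(tokens, vocab):
    """Raw term-frequency vector over a fixed vocabulary."""
    v = np.zeros(len(vocab))
    for t in tokens:
        if t in vocab:
            v[vocab[t]] += 1.0
    return v

def cosine(u, v):
    d = np.linalg.norm(u) * np.linalg.norm(v)
    return float(u @ v) / d if d else 0.0

def lsa_similarity(tokens_a, tokens_b, corpus, k=2):
    """Layer 3: cosine in a k-dimensional latent space obtained from a
    truncated SVD of the term-by-segment matrix over the whole corpus."""
    vocab = {t: i for i, t in enumerate(sorted({w for seg in corpus for w in seg}))}
    A = np.column_stack([tf_vector(seg, vocab) for seg in corpus])
    U, S, Vt = np.linalg.svd(A, full_matrices=False)
    Uk = U[:, :k]  # term loadings on the k strongest latent topics
    pa, pb = Uk.T @ tf_vector(tokens_a, vocab), Uk.T @ tf_vector(tokens_b, vocab)
    return cosine(pa, pb)

def is_redundant(segment, seen, corpus, t1=0.5, t2=0.6, t2_lo=0.2, t3=0.5):
    """Multi-layer cascade: run the costly LSA layer only when the two
    cheap statistical layers fail to give a clear answer."""
    for prev in seen:
        if jaccard(segment, prev) >= t1:
            return True                      # layer 1: near-verbatim repeat
        vocab = {t: i for i, t in enumerate(sorted(set(segment) | set(prev)))}
        c = cosine(tf_vector(segment, vocab), tf_vector(prev, vocab))
        if c >= t2:
            return True                      # layer 2: clear statistical overlap
        if c <= t2_lo:
            continue                         # clearly dissimilar: skip LSA entirely
        if lsa_similarity(segment, prev, corpus) >= t3:
            return True                      # layer 3: semantic similarity via LSA
    return False

segs = [["the", "cat", "sat"], ["a", "dog", "ran"], ["the", "cat", "sat", "down"]]
print(is_redundant(segs[2], segs[:2], segs))  # → True (near-verbatim repeat of segs[0])
```

Because the SVD is the dominant cost, gating layer 3 behind the two cheap layers is what produces the abstract's reported 52% reduction in LSA invocations.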