32,050 research outputs found
CEAI: CCM based Email Authorship Identification Model
In this paper we present a model for email authorship identification (EAI) by
employing a Cluster-based Classification (CCM) technique. Traditionally,
stylometric features have been successfully employed in various authorship
analysis tasks; we extend the traditional feature-set to include some more
interesting and effective features for email authorship identification (e.g.
the last punctuation mark used in an email, the tendency of an author to use
capitalization at the start of an email, or the punctuation after a greeting or
farewell). We also included Info Gain feature selection based content features.
It is observed that the use of such features in the authorship identification
process has a positive impact on the accuracy of the authorship identification
task. We performed experiments to justify our arguments and compared the
results with other base line models. Experimental results reveal that the
proposed CCM-based email authorship identification model, along with the
proposed feature set, outperforms the state-of-the-art support vector machine
(SVM)-based models, as well as the models proposed by Iqbal et al. [1, 2]. The
proposed model attains an accuracy rate of 94% for 10 authors, 89% for 25
authors, and 81% for 50 authors, respectively on Enron dataset, while 89.5%
accuracy has been achieved on authors' constructed real email dataset. The
results on Enron dataset have been achieved on quite a large number of authors
as compared to the models proposed by Iqbal et al. [1, 2]
Comment on 'Valid molecular dynamics simulations of human hemoglobin require a surprisingly large box size'.
A recent molecular dynamics investigation into the stability of hemoglobin concluded that the unliganded protein is only stable in the T state when a solvent box is used in the simulations that is ten times larger than what is usually employed (El Hage et al., 2018). Here, we express three main concerns about that study. In addition, we find that with an order of magnitude more statistics, the reported box size dependence is not reproducible. Overall, no significant effects on the kinetics or thermodynamics of conformational transitions were observed
Railway freight transport and logistics: Methods for relief, algorithms for verification and proposals for the adjustment of tunnel inner surfaces
In Europe, the attention to efficiency and safety of international railway freight transport has grown in recent years and this has drawn attention to the importance of verifying the clearance between vehicle and lining, mostly when different and variable rolling stock types are expected. This work consists of defining an innovative methodology, with the objective of surveying the tunnel structures, verifying the clearance conditions, and designing a retrofitting work if necessary. The method provides for the use of laser scanner, thermocameras, and ground penetrating radar to survey the geometrical and structural conditions of the tunnel; an algorithm written by the authors permits to verify the clearances. Two different types of works are possible if the inner tunnel surfaces interfere with the profile of the rolling stock passing through: modification of the railroad track or modification of the tunnel intrados by mean milling of its lining. The presented case study demonstrates that the proposed methodology is useful for verifying compatibility between the design vehicle gauge and the existing tunnel intrados, and to investigate the chance to admit rolling stocks from different states. Consequently, the results give the railway management body a chance to perform appropriate measurements in those cases where the minimum clearance requirements are not achieved
- …