Classifying Web Exploits with Topic Modeling

Author: Ruohonen Jukka
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 16/10/2017

This short empirical paper investigates how well topic modeling and database meta-data characteristics can classify web and other proof-of-concept (PoC) exploits for publicly disclosed software vulnerabilities. Using a dataset of over 36 thousand PoC exploits, an accuracy rate of nearly 0.9 is obtained in the empirical experiment. Text mining and topic modeling contribute significantly to this classification performance. In addition to these empirical results, the paper contributes to the research tradition of enhancing software vulnerability information with text mining, and also provides a few scholarly observations about the potential for semi-automatic classification of exploits in the existing tracking infrastructures. Comment: Proceedings of the 2017 28th International Workshop on Database and Expert Systems Applications (DEXA). http://ieeexplore.ieee.org/abstract/document/8049693

arXiv.org e-Print Archive

Crossref

Recommended from our members

Geovisualization of dynamics, movement and change: key issues and developing approaches in visualization research

Author: Andrienko G.
Andrienko N.
Dykes J.
Fabrikant S. I.
Wachowicz M.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008

Data-Driven Decisions and Actions in Today’s Software Development

Author: Alexandru Carol V
Ciurumelea Adelina
Gall Harald
Grano Giovanni
Laaber Christoph
Panichella Sebastiano
Proksch Sebastian
Schermann Gerald
Vassallo Carmine
Zhao Jitong
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018

Today’s software development is all about data: data about the software product itself, about the process and its different stages, about the customers and markets, about the development, the testing, the integration, the deployment, or the runtime aspects in the cloud. We use static and dynamic data of various kinds and quantities to analyze market feedback, feature impact, code quality, architectural design alternatives, or effects of performance optimizations. Development environments are no longer limited to IDEs in a desktop application or the like but span the Internet using live programming environments such as Cloud9 or large-volume repositories such as BitBucket, GitHub, GitLab, or StackOverflow. Software development has become “live” in the cloud, be it the coding, the testing, or the experimentation with different product options on the Internet. The inherent complexity puts a further burden on developers, since they need to stay alert when constantly switching between tasks in different phases. Research has been analyzing the development process, its data and stakeholders, for decades and is working on various tools that can help developers in their daily tasks to improve the quality of their work and their productivity. In this chapter, we critically reflect on the challenges faced by developers in a typical release cycle, identify inherent problems of the individual phases, and present the current state of the research that can help overcome these issues.

Crossref

ZORA

Holistic recommender systems for software engineering

Author: Lanza Michele
Mocci Andrea
Ponzanelli Luca
Publication date: 04/07/2017

The knowledge possessed by developers is often not sufficient to overcome a programming problem. Short of talking to teammates, when available, developers often gather additional knowledge from development artifacts (e.g., project documentation), as well as online resources. The web has become an essential component in the modern developer’s daily life, providing a plethora of information from sources like forums, tutorials, Q&A websites, API documentation, and even video tutorials. Recommender Systems for Software Engineering (RSSE) provide developers with assistance to navigate the information space, automatically suggest useful items, and reduce the time required to locate the needed information. Current RSSEs consider development artifacts as containers of homogeneous information in the form of pure text. However, text is a means to represent heterogeneous information provided by, for example, natural language, source code, interchange formats (e.g., XML, JSON), and stack traces. Interpreting the information from a purely textual point of view misses the intrinsic heterogeneity of the artifacts, thus leading to a reductionist approach. We propose the concept of Holistic Recommender Systems for Software Engineering (H-RSSE), i.e., RSSEs that go beyond the textual interpretation of the information contained in development artifacts. Our thesis is that modeling and aggregating information in a holistic fashion enables novel and advanced analyses of development artifacts. To validate our thesis we developed a framework to extract, model and analyze information contained in development artifacts in a reusable meta-information model. We show how RSSEs benefit from a meta-information model, since it enables customized and novel analyses built on top of our framework. The information can thus be reinterpreted from a holistic point of view, preserving its multi-dimensionality, and opening the path towards the concept of holistic recommender systems for software engineering.

RERO DOC Digital Library

Challenges in Bridging Social Semantics and Formal Semantics on the Web

Author: A Cooper
A Edwards
E Cabrio
G Erétéo
I Mirbel
L Costabello
LC Freeman
O Corby
P Neveu
PM Dung
R Hasan
RN Raghavan
S Angeletou
Publication date: 04/07/2013

This paper describes several results of Wimmics, a research lab whose name stands for: web-instrumented man-machine interactions, communities, and semantics. The approaches introduced here rely on graph-oriented knowledge representation, reasoning and operationalization to model and support actors, actions and interactions in web-based epistemic communities. The research results are applied to support and foster interactions in online communities and manage their resources.

arXiv.org e-Print Archive

Crossref

HAL-UNICE

INRIA a CCSD electronic archive server

HAL Descartes