2,408 research outputs found

An Approach for Fast Fault Detection in Virtual Network

Author: Fang Ruiming
Wang Yan
Publication venue: Faculty of Mechanical Engineering in Slavonski Brod; Faculty of Electrical Engineering, Computer Science and Information Technology Osijek; Faculty of Civil Engineering in Osijek
Publication date: 01/01/2023

The diversity of applications in cloud computing and the dynamic nature of deployment environments mean that virtual machines, containers, and distributed software systems often suffer software failures that prevent them from providing external services normally. Whether detection is performed by cloud management or by the distributed application itself, protocol-based detection methods on the management or control plane take a few seconds to find a fault, protocol-based detection methods on the user plane take hundreds of milliseconds, and most of the time between failure and recovery of a distributed software system is spent detecting the fault. Therefore, timely discovery of faults in virtual machines, containers, and software is the key to subsequent fault diagnosis, isolation, and recovery. In cloud infrastructure, virtual machines and containers are increasingly connected to virtual network elements (a virtual router or virtual switch) through intelligent virtual network cards. This paper studies a fault detection mechanism for virtual machines, containers, and distributed software based on the message-driven mode of virtual network elements. The mechanism exploits the memory shared by the VIRTIO message queue between the front end and back end of the virtual network card that links the virtualized network element to the virtual machine or container it monitors on the same server in the cloud network. When the virtualized network element sends packets to the virtual machine or container, it quickly checks whether the message at the head of the queue of previously sent VIRTIO messages has been received and processed. If that message has not been received and processed within a certain time threshold, the virtual machine, container, or distributed software is considered to have failed.
The method in this paper can significantly improve the fault detection performance of virtual machines, containers, and distributed applications (from the second level to the millisecond level) in scenarios with a large volume of business messages, and provides faster fault detection for the rapid convergence of virtual network traffic, migration of computing nodes, and high availability of distributed applications.
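
The check described in this abstract can be illustrated with a small sketch. This is not the authors' implementation: it is a toy Python model, under the assumption that the sender can see which previously sent messages are still unconsumed. The class name `SharedQueueMonitor`, its methods, and the 5 ms threshold used in the note below are all hypothetical, and no real VIRTIO API is involved.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class SharedQueueMonitor:
    """Toy model of the sender's view of a shared send queue (hypothetical)."""

    threshold_s: float  # assumed timeout before a fault is declared
    # Unconsumed messages as (msg_id, sent_at) pairs, oldest first.
    pending: deque = field(default_factory=deque)

    def mark_consumed(self, msg_id: int) -> None:
        """Simulate the receiver processing every message up to msg_id."""
        while self.pending and self.pending[0][0] <= msg_id:
            self.pending.popleft()

    def send(self, msg_id: int, now: float) -> bool:
        """Check the queue head, then enqueue the new message.

        Returns True if the receiver looks healthy, False if the oldest
        unconsumed message has waited longer than the threshold.
        """
        healthy = True
        if self.pending:
            _, head_sent_at = self.pending[0]
            healthy = (now - head_sent_at) <= self.threshold_s
        self.pending.append((msg_id, now))
        return healthy
```

With `threshold_s=0.005`, sending message 2 at t = 1 ms and message 3 at t = 10 ms without message 2 being consumed makes the second `send` return `False`. The check piggybacks on normal traffic, so in this sketch a fault is only noticed when the next packet is sent, which is consistent with the abstract's focus on scenarios with many business messages.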

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Software Defined Networks based Smart Grid Communication: A Comprehensive Survey

Author: Assi Chadi
Davy Alan
Jennings Brendan
Rehmani Mubashir Husain
Publication date: 01/01/2019

The current power grid is no longer a feasible solution due to ever-increasing user demand for electricity, old infrastructure, and reliability issues, and thus requires transformation into a better grid, a.k.a. the smart grid (SG). The key features that distinguish SG from the conventional electrical power grid are its capability to perform two-way communication, demand-side management, and real-time pricing. Despite all the advantages that SG will bring, there are certain issues specific to the SG communication system. For instance, network management of current SG systems is complex, time-consuming, and done manually. Moreover, the SG communication (SGC) system is built on different vendor-specific devices and protocols. Therefore, current SG systems are not protocol independent, which leads to interoperability issues. Software defined networking (SDN) has been proposed to monitor and manage communication networks globally. This article serves as a comprehensive survey on SDN-based SGC. In this article, we first discuss a taxonomy of the advantages of SDN-based SGC. We then discuss SDN-based SGC architectures, along with case studies. Our article provides an in-depth discussion on routing schemes for SDN-based SGC. We also provide a detailed survey of security and privacy schemes applied to SDN-based SGC. We furthermore present challenges, open issues, and future research directions related to SDN-based SGC.

arXiv.org e-Print Archive

WIT Repository

Management and Service-aware Networking Architectures (MANA) for Future Internet Position Paper: System Functions, Capabilities and Requirements

Author: Abramowicz H.
Brunner M.
Chemouil P.
Galis A.
Pras Aiko
Raz D.
Publication venue: IEEE Communications Society
Publication date: 01/01/2009

Future Internet (FI) research and development threads have recently been gaining momentum all over the world, and the international race to create a new generation Internet is in full swing: GENI, Asia Future Internet, Future Internet Forum Korea, European Union Future Internet Assembly (FIA). This is a position paper identifying the research orientation with a time horizon of 10 years, together with the key challenges for the capabilities in the Management and Service-aware Networking Architectures (MANA) part of the Future Internet (FI), allowing for parallel and federated Internet(s).

UCL Discovery

University of Twente Research Information

Virtual infrastructure approach for SADU implementation

Author: Postolache Florin
Publication venue: Danubius University of Galati, Romania
Publication date: 29/12/2011

This paper proposes a new way of structuring the virtual infrastructure in terms of knowledge acquisition for SADU implementation. Through virtualization, the hardware infrastructure becomes a service, virtual machines become predominant within the infrastructure, and their functioning can be affected by certain malfunctions. SADU supports the human diagnostician by providing possible solutions to the issues raised. A virtualized IT infrastructure consists of layers that virtualize the underlying hardware components. Based on layer membership, we propose a new approach to structuring the virtual infrastructure. This approach has fundamentally changed the architecture at the layer level, increasing the security and availability of resources. Grouping layers by membership or type (hardware, virtual infrastructure, software) around the concept of a stack led to a separation of the intelligent agents' responsibilities for knowledge acquisition and fault diagnosis, allowing a better understanding of the field, reflected in the development of a new ontology.

Danubius University, Romania: Danubius Journals

Framework on Economical Implication and Issues of SADU Implementation

Author: Postolache Florin
Publication venue: Danubius University of Galati, Romania
Publication date: 02/07/2011

Because software plays an increasing role in everyday life, interaction between humans and computers will increase in importance; therefore, the ability to support interactions for efficient re-use of experience is a major challenge for systems in the future. Trace Based Reasoning will have a significant impact on applications sharing experience, when they are based on the web in particular, since traces allow us to imagine several ways of interaction in systems and to combine multiple modes of interaction in a single system. In the conducted study we aimed at developing an Assist System of Human Diagnostician (SADU), meaning that this system will have the human knowledge and then information retrieved by interaction with humans at SADU's request.

Danubius University, Romania: Danubius Journals

Automatic Software Repair: a Bibliography

Author: Monperrus Martin
Publication venue: Association for Computing Machinery (ACM)
Publication date: 06/01/2016

This article presents a survey on automatic software repair. Automatic software repair consists of automatically finding a solution to software bugs without human intervention. This article considers all kinds of repairs. First, it discusses behavioral repair, where test suites, contracts, models, and crashing inputs are taken as the oracle. Second, it discusses state repair, also known as runtime repair or runtime recovery, with techniques such as checkpoint and restart, reconfiguration, and invariant restoration. The uniqueness of this article is that it spans the research communities that contribute to this body of knowledge: software engineering, dependability, operating systems, programming languages, and security. It provides a novel and structured overview of the diversity of bug oracles and repair operators used in the literature.

arXiv.org e-Print Archive

HAL - Lille 3

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

A System Architecture for Real-time Anomaly Detection in Large-scale NFV Systems

Author: Gulenko Anton
Kao Odej
Liu Feng
Schmidt Florian
Wallschläger Marcel
Publication date: 01/01/2016

Virtualization, as a key IT technology, has developed into a predominant model in data centers in recent years. The flexibility regarding scaling out and migration of virtual machines for seamless maintenance has enabled a new level of continuous operation and changed service provisioning significantly. Meanwhile, services from domains striving for the highest possible availability (e.g. the telecommunications domain) are adopting this approach as well and are investing significant effort in the development of Network Function Virtualization (NFV). However, the availability requirements for such infrastructures are much higher than is typical for IT services built upon standard software with off-the-shelf hardware. They require sophisticated methods and mechanisms for fast detection of and recovery from failures. This paper presents a set of methods and an implemented prototype for anomaly detection in cloud-based infrastructures, with a specific focus on the deployment of virtualized network functions. The framework is built upon OpenStack, the current de facto standard for open-source cloud software, and aims at increasing the availability and fault tolerance level by providing an extensive monitoring and analysis pipeline able to detect failures or degraded performance in real time. The indicators for anomalies are created using supervised and unsupervised classification methods, and preliminary experimental measurements showed a high percentage of correctly identified anomaly situations. After a successful failure detection, a set of pre-defined countermeasures is activated in order to mask or repair outages or situations with degraded performance.

DepositOnce