17 research outputs found

    Improving Data Management and Data Movement Efficiency in Hybrid Storage Systems

    Get PDF
    University of Minnesota Ph.D. dissertation.July 2017. Major: Computer Science. Advisor: David Du. 1 computer file (PDF); ix, 116 pages.In the big data era, large volumes of data being continuously generated drive the emergence of high performance large capacity storage systems. To reduce the total cost of ownership, storage systems are built in a more composite way with many different types of emerging storage technologies/devices including Storage Class Memory (SCM), Solid State Drives (SSD), Shingle Magnetic Recording (SMR), Hard Disk Drives (HDD), and even across off-premise cloud storage. To make better utilization of each type of storage, industries have provided multi-tier storage through dynamically placing hot data in the faster tiers and cold data in the slower tiers. Data movement happens between devices on one single device and as well as between devices connected via various networks. Toward improving data management and data movement efficiency in such hybrid storage systems, this work makes the following contributions: To bridge the giant semantic gap between applications and modern storage systems, passing a piece of tiny and useful information (I/O access hints) from upper layers to the block storage layer may greatly improve application performance or ease data management in heterogeneous storage systems. We present and develop a generic and flexible framework, called HintStor, to execute and evaluate various I/O access hints on heterogeneous storage systems with minor modifications to the kernel and applications. The design of HintStor contains a new application/user level interface, a file system plugin and a block storage data manager. With HintStor, storage systems composed of various storage devices can perform pre-devised data placement, space reallocation and data migration polices assisted by the added access hints. Each storage device/technology has its own unique price-performance tradeoffs and idiosyncrasies with respect to workload characteristics they prefer to support. To explore the internal access patterns and thus efficiently place data on storage systems with fully connected (i.e., data can move from one device to any other device instead of moving tier by tier) differential pools (each pool consists of storage devices of a particular type), we propose a chunk-level storage-aware workload analyzer framework, simplified as ChewAnalyzer. With ChewAnalzyer, the storage manager can adequately distribute and move the data chunks across different storage pools. To reduce the duplicate content transferred between local storage devices and devices in remote data centers, an inline Network Redundancy Elimination (NRE) process with Content-Defined Chunking (CDC) policy can obtain a higher Redundancy Elimination (RE) ratio but may suffer from a considerably higher computational requirement than fixed-size chunking. We build an inline NRE appliance which incorporates an improved FPGA based scheme to speed up CDC processing. To efficiently utilize the hardware resources, the whole NRE process is handled by a Virtualized NRE (VNRE) controller. The uniqueness of this VNRE that we developed lies in its ability to exploit the redundancy patterns of different TCP flows and customize the chunking process to achieve a higher RE ratio

    Cyber Security

    Get PDF
    This open access book constitutes the refereed proceedings of the 16th International Annual Conference on Cyber Security, CNCERT 2020, held in Beijing, China, in August 2020. The 17 papers presented were carefully reviewed and selected from 58 submissions. The papers are organized according to the following topical sections: access control; cryptography; denial-of-service attacks; hardware security implementation; intrusion/anomaly detection and malware mitigation; social network security and privacy; systems security

    Introductory Computer Forensics

    Get PDF
    INTERPOL (International Police) built cybercrime programs to keep up with emerging cyber threats, and aims to coordinate and assist international operations for ?ghting crimes involving computers. Although signi?cant international efforts are being made in dealing with cybercrime and cyber-terrorism, ?nding effective, cooperative, and collaborative ways to deal with complicated cases that span multiple jurisdictions has proven dif?cult in practic

    Technologies and Applications for Big Data Value

    Get PDF
    This open access book explores cutting-edge solutions and best practices for big data and data-driven AI applications for the data-driven economy. It provides the reader with a basis for understanding how technical issues can be overcome to offer real-world solutions to major industrial areas. The book starts with an introductory chapter that provides an overview of the book by positioning the following chapters in terms of their contributions to technology frameworks which are key elements of the Big Data Value Public-Private Partnership and the upcoming Partnership on AI, Data and Robotics. The remainder of the book is then arranged in two parts. The first part “Technologies and Methods” contains horizontal contributions of technologies and methods that enable data value chains to be applied in any sector. The second part “Processes and Applications” details experience reports and lessons from using big data and data-driven approaches in processes and applications. Its chapters are co-authored with industry experts and cover domains including health, law, finance, retail, manufacturing, mobility, and smart cities. Contributions emanate from the Big Data Value Public-Private Partnership and the Big Data Value Association, which have acted as the European data community's nucleus to bring together businesses with leading researchers to harness the value of data to benefit society, business, science, and industry. The book is of interest to two primary audiences, first, undergraduate and postgraduate students and researchers in various fields, including big data, data science, data engineering, and machine learning and AI. Second, practitioners and industry experts engaged in data-driven systems, software design and deployment projects who are interested in employing these advanced methods to address real-world problems

    Cyber Security

    Get PDF
    This open access book constitutes the refereed proceedings of the 16th International Annual Conference on Cyber Security, CNCERT 2020, held in Beijing, China, in August 2020. The 17 papers presented were carefully reviewed and selected from 58 submissions. The papers are organized according to the following topical sections: access control; cryptography; denial-of-service attacks; hardware security implementation; intrusion/anomaly detection and malware mitigation; social network security and privacy; systems security

    Technologies and Applications for Big Data Value

    Get PDF
    This open access book explores cutting-edge solutions and best practices for big data and data-driven AI applications for the data-driven economy. It provides the reader with a basis for understanding how technical issues can be overcome to offer real-world solutions to major industrial areas. The book starts with an introductory chapter that provides an overview of the book by positioning the following chapters in terms of their contributions to technology frameworks which are key elements of the Big Data Value Public-Private Partnership and the upcoming Partnership on AI, Data and Robotics. The remainder of the book is then arranged in two parts. The first part “Technologies and Methods” contains horizontal contributions of technologies and methods that enable data value chains to be applied in any sector. The second part “Processes and Applications” details experience reports and lessons from using big data and data-driven approaches in processes and applications. Its chapters are co-authored with industry experts and cover domains including health, law, finance, retail, manufacturing, mobility, and smart cities. Contributions emanate from the Big Data Value Public-Private Partnership and the Big Data Value Association, which have acted as the European data community's nucleus to bring together businesses with leading researchers to harness the value of data to benefit society, business, science, and industry. The book is of interest to two primary audiences, first, undergraduate and postgraduate students and researchers in various fields, including big data, data science, data engineering, and machine learning and AI. Second, practitioners and industry experts engaged in data-driven systems, software design and deployment projects who are interested in employing these advanced methods to address real-world problems

    24th Nordic Conference on Computational Linguistics (NoDaLiDa)

    Get PDF

    Effects of Diversity and Neuropsychological Performance in an NFL Cohort

    Get PDF
    Objective: The aim of this study was to examine the effect of ethnicity on neuropsychological test performance by comparing scores of white and black former NFL athletes on each subtest of the WMS. Participants and Methods: Data was derived from a de-identified database in South Florida consisting of 63 former NFL white (n=28, 44.4%) and black (n=35, 55.6%) athletes (Mage= 50.38; SD= 11.57). Participants completed the following subtests of the WMS: Logical Memory I and II, Verbal Paired Associates I and II, and Visual Reproduction I and II. Results: A One-Way ANOVA yielded significant effect between ethnicity and performance on several subtests from the WMS-IV. Black athletes had significantly lower scores compared to white athletes on Logical Memory II: F(1,61) = 4.667, p= .035, Verbal Paired Associates I: F(1,61) = 4.536, p = .037, Verbal Paired Associates: II F(1,61) = 4.677, p = .034, and Visual Reproduction I: F(1,61) = 6.562, p = .013. Conclusions: Results suggest significant differences exist between white and black athletes on neuropsychological test performance, necessitating the need for proper normative samples for each ethnic group. It is possible the differences found can be explained by the psychometric properties of the assessment and possibility of a non-representative sample for minorities, or simply individual differences. Previous literature has found white individuals to outperform African-Americans on verbal and non-verbal cognitive tasks after controlling for socioeconomic and other demographic variables (Manly & Jacobs, 2002). This highlights the need for future investigators to identify cultural factors and evaluate how ethnicity specifically plays a role on neuropsychological test performance. Notably, differences between ethnic groups can have significant implications when evaluating a sample of former athletes for cognitive impairment, as these results suggest retired NFL minorities may be more impaired compared to retired NFL white athletes

    Distinguishing Performance on Tests of Executive Functions Between Those with Depression and Anxiety

    Get PDF
    Objective: To see if there are differences in executive functions between those diagnosed with Major Depressive Disorder (MDD) and those with Generalized Anxiety Disorder (GAD).Participants and Methods: The data were chosen from a de-identified database at a neuropsychological clinic in South Florida. The sample used was adults diagnosed with MDD (n=75) and GAD (n=71) and who had taken the Halstead Category Test, Trail Making Test, Stroop Test, and the Wisconsin Card Sorting Test. Age (M=32.97, SD=11.75), gender (56.7% female), and race (52.7% White) did not differ between groups. IQ did not differ but education did (MDD=13.41 years, SD=2.45; GAD=15.11 years, SD=2.40), so it was ran as a covariate in the analyses. Six ANCOVAs were run separately with diagnosis being held as the fixed factor and executive function test scores held as dependent variables. Results: The MDD group only performed worse on the Category Test than the GAD group ([1,132]=4.022, p\u3c .05). Even though both WCST scores used were significantly different between the two groups, both analyses failed Levene’s test of Equality of Error Variances, so the data were not interpreted. Conclusions: Due to previous findings that those diagnosed with MDD perform worse on tests of executive function than normal controls (Veiel, 1997), this study wanted to compare executive function performance between those diagnosed with MDD and those with another common psychological disorder. The fact that these two groups only differed on the Category Test shows that there may not be much of a difference in executive function deficits between those with MDD and GAD. That being said, not being able to interpret the scores on the WCST test due to a lack of homogeneity of variance indicates that a larger sample size is needed to compare these two types of patients, as significant differences may be found. The results of this specific study, however, could mean that the Category Test could be used in assisting the diagnosis of a MDD patient
    corecore