94 research outputs found
PLDANet: Reasonable Combination of PCA and LDA Convolutional Networks
Integrating deep learning with traditional machine learning methods is an intriguing research direction. For example, PCANet and LDANet adopt Principal Component Analysis (PCA) and Fisher Linear Discriminant Analysis (LDA), respectively, to learn convolutional kernels. However, it is not reasonable to adopt LDA to learn filter kernels in every convolutional layer, because local features of images from different classes, such as background areas, may be similar. It is therefore meaningful to adopt LDA to learn filter kernels only when all the patches carry information from the whole image. To our knowledge, no existing work studies how to combine PCA and LDA to learn convolutional kernels for the best performance. In this paper, we propose the convolutional coverage theory. Furthermore, we propose the PLDANet model, which adopts PCA and LDA reasonably in different convolutional layers based on the coverage theory. The experimental study shows the effectiveness of the proposed PLDANet model.
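To make the idea of learning convolutional kernels with PCA concrete, here is a minimal sketch of a PCANet-style filter-learning step. It is not the PLDANet model and does not implement the coverage theory; the function names, the random toy images, the patch size, and the filter count are all assumptions chosen for illustration.

```python
import numpy as np

def extract_patches(images, k):
    """Collect every k x k patch (stride 1) from a stack of grayscale images."""
    patches = []
    for img in images:
        h, w = img.shape
        for i in range(h - k + 1):
            for j in range(w - k + 1):
                patches.append(img[i:i + k, j:j + k].ravel())
    return np.asarray(patches, dtype=float)

def learn_pca_filters(images, k, num_filters):
    """Return num_filters kernels of shape (k, k): the leading principal
    directions of the mean-removed patches (hypothetical helper)."""
    patches = extract_patches(images, k)
    # Remove each patch's own mean so filters respond to local structure.
    patches -= patches.mean(axis=1, keepdims=True)
    cov = patches.T @ patches / patches.shape[0]
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1][:num_filters]
    return eigvecs[:, order].T.reshape(num_filters, k, k)

# Toy data (assumption): four random 8 x 8 "images".
rng = np.random.default_rng(0)
images = rng.random((4, 8, 8))
filters = learn_pca_filters(images, k=3, num_filters=4)
print(filters.shape)
```

An LDA-based layer would replace the eigen-decomposition of the patch covariance with one that also uses class labels, which is why the abstract argues it only makes sense once patches cover the whole image.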
A Method for Automatically Generating Join Queries Based on Relations-Attributes Distance Matrix over Data Lakes
Techniques for identifying joinable or unionable tables in data lakes can yield valuable information for data scientists. However, data scientists spend more than half of their working time familiarizing themselves with the metadata and correlations of datasets. Simplifying the use of information in data lakes is crucial for enhancing their utilization. The existing solution, which integrates correlated relations into a single large data table via full disjunction, requires the integration to be updated whenever either data or metadata changes, complicating data maintenance. This paper proposes a method for automatically generating join queries based on the distance matrix of relations and attributes in data lakes. The distance matrix requires updating only when metadata changes, simplifying data maintenance. Experimental results demonstrate that once the distance matrix is generated, the time required to generate the join queries is negligible. Compared to the existing solution, the time cost of executing join queries over correlated tables is nearly identical to that of selection queries over integrated tables. The results of these two queries are also the same, showcasing the effectiveness and efficiency of our method.
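As a rough illustration of what generating a join query from metadata alone can look like, the sketch below swaps in a plain breadth-first search over a small join graph. It is not the paper's distance-matrix method; the table names, the `JOIN_KEYS` mapping, and both helper functions are hypothetical.

```python
from collections import deque

# Hypothetical metadata: the attribute that links each pair of joinable tables.
JOIN_KEYS = {
    ("orders", "customers"): "customer_id",
    ("orders", "items"): "order_id",
    ("items", "products"): "product_id",
}

def neighbors(table):
    """Yield (other_table, join_attribute) for every table joinable with `table`."""
    for (a, b), key in JOIN_KEYS.items():
        if a == table:
            yield b, key
        elif b == table:
            yield a, key

def join_path(source, target):
    """Breadth-first search for the shortest chain of joins from source to target."""
    parent = {source: None}
    queue = deque([source])
    while queue:
        table = queue.popleft()
        if table == target:
            break
        for nxt, key in neighbors(table):
            if nxt not in parent:
                parent[nxt] = (table, key)
                queue.append(nxt)
    if target not in parent:
        return None
    steps, node = [], target
    while parent[node] is not None:
        prev, key = parent[node]
        steps.append((prev, node, key))
        node = prev
    return steps[::-1]

def build_join_query(source, target):
    """Turn the join path into a SQL string (illustrative only)."""
    steps = join_path(source, target)
    if steps is None:
        raise ValueError(f"no join path from {source} to {target}")
    sql = f"SELECT * FROM {source}"
    for prev, nxt, key in steps:
        sql += f" JOIN {nxt} ON {prev}.{key} = {nxt}.{key}"
    return sql

query = build_join_query("customers", "products")
print(query)
```

Because the search touches only metadata, the query can be regenerated cheaply whenever the schema changes, while the underlying data stays in its original tables.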
Query with Assumptions for Probabilistic Relational Databases
Users may have prior knowledge about a probabilistic database. They prefer to query a probabilistic database based on their prior knowledge, which cannot be written as component clauses of conventional SQL queries. A naive approach is to query a new database version, which is generated by transforming the original probabilistic database to satisfy users' prior knowledge; however, it is impractical to generate a different probabilistic database version for each piece of prior knowledge. In this paper, we propose the concept of the query with assumptions, which allows users to describe their prior knowledge with a newly introduced ASSUMPTION clause of SQL. We also propose an approach to obtain the result of a query based on assumption clauses. The experimental studies show that our approach performs better than the naive approach.
Enabling Access Control for Encrypted Multi-Dimensional Data in Cloud Computing through Range Search
With the growing popularity of cloud computing, data owners are increasingly opting to outsource their data to cloud servers due to the numerous benefits it offers. However, this outsourcing raises concerns about data privacy, since the data stored on remote cloud servers is not directly controlled by the owners. Encrypting the data is an effective approach to mitigating these privacy concerns. However, encrypted data lacks distinguishability, which limits support for common operations such as range search and access control. In this paper, we propose a method called RSAC (Range Search Supporting Access Control) for encrypted multi-dimensional data in cloud computing. Our method leverages policy design, bucket embedding, algorithm design, and Ciphertext-Policy Attribute-Based Encryption (CP-ABE) to achieve its objectives. We present extensive experimental results that demonstrate the efficiency of our method and conduct a thorough security analysis to ensure its robustness. Our proposed RSAC method addresses the challenges of range search and access control over encrypted multi-dimensional data, thus contributing to enhancing privacy and security in cloud computing environments.
Formation and Characterization of Lactoferrin-Hyaluronic Acid Conjugates and Their Effects on the Storage Stability of Sesamol Emulsions
The purpose of this study was to fabricate biopolymer conjugates from lactoferrin (LF) and hyaluronic acid (HA) and then to investigate their potential as emulsifiers for forming sesamol-loaded emulsions. Initially, LF-HA covalent conjugates were formed using the carbodiimide coupling method in aqueous solutions at pH = 4.5, and then the nature of the conjugates was investigated using sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), Fourier transform infrared (FTIR) spectroscopy, and fluorescence spectroscopy. The results demonstrated the formation of an amide link between the amine groups of LF and the carboxyl groups of HA. Sesamol emulsions were prepared using the LF-HA conjugates as emulsifiers, and their stability was determined. The conjugates improved both the physical and chemical stability of the emulsions during storage. Optimum stability of the emulsion was obtained at an LF-to-HA molar ratio of 2:1. Our results suggest that LF-HA conjugates may be effective emulsifiers for use in foodstuffs and other applications.
Common genetic variants in ADCY5 and gestational glycemic traits
Two meta-analyses of genome-wide association studies identified two variants at adenylate cyclase 5 (ADCY5) associated with type 2 diabetes mellitus, fasting glucose, and 2-hour glucose in non-pregnant individuals of European descent. The objective of our study was to explore the role of common variants in ADCY5 in gestational glycemic traits, including plasma glucose, insulin values, β cell function, and insulin resistance in the fasted state, as well as plasma glucose 1 hour after a 50-gram glucose challenge test, among Chinese Han women. Homoeostasis model assessment (HOMA) was used to quantify β cell function (HOMA1-β and HOMA2-β) and insulin resistance (HOMA1-IR and HOMA2-IR). Thirty-five single nucleotide polymorphisms (SNPs) in ADCY5 were genotyped in 929 unrelated Chinese Han women with singleton pregnancies. Three SNPs (rs6797915, rs9856662 and rs9875803) displayed evidence for association with plasma glucose 1 hour after a 50-gram glucose challenge test (P = 0.042, 0.018 and 0.018, respectively), one (rs6777397) displayed evidence for association with HOMA1-β (P = 0.014), and one (rs6762009) displayed evidence for association with HOMA1-IR (P = 0.033). These results provide additional insight into the effects of genetic variation within ADCY5 on glucose metabolism, especially during pregnancy and in populations of non-European descent.
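For readers unfamiliar with the HOMA1 indices, the sketch below uses the widely cited closed-form HOMA1 approximations, assuming fasting glucose in mmol/L and fasting insulin in µU/mL. The abstract does not state which formulas or units the study used, so the function names, units, and example values here are assumptions. HOMA2 relies on a computer model rather than a closed-form expression and is not covered.

```python
def homa1_ir(glucose_mmol_l, insulin_uu_ml):
    """HOMA1 insulin resistance index: glucose x insulin / 22.5."""
    return glucose_mmol_l * insulin_uu_ml / 22.5

def homa1_beta(glucose_mmol_l, insulin_uu_ml):
    """HOMA1 beta cell function (%): 20 x insulin / (glucose - 3.5)."""
    if glucose_mmol_l <= 3.5:
        # The approximation is undefined at or below 3.5 mmol/L.
        raise ValueError("fasting glucose must exceed 3.5 mmol/L")
    return 20.0 * insulin_uu_ml / (glucose_mmol_l - 3.5)

# Illustrative values only (not taken from the study).
ir = homa1_ir(4.5, 10.0)
beta = homa1_beta(4.5, 10.0)
print(ir, beta)
```

With these made-up inputs, the insulin resistance index is 2.0 and the β cell function estimate is 200%.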
The Influence of Tone Inventory on ERP without Focal Attention: A Cross-Language Study
This study investigates the effect of tone inventories on brain activities underlying pitch processing without focal attention. We find that the electrophysiological responses to across-category stimuli are larger than those to within-category stimuli when the pitch contours are superimposed on nonspeech stimuli; however, there is no electrophysiological response difference associated with category status in speech stimuli. Moreover, this category effect in nonspeech stimuli is stronger for Cantonese speakers. The results of previous studies and the present study lead us to conclude that brain activities elicited by the same native lexical tone contrasts are modulated by speakers’ language experiences not only in active phonological processing but also in automatic feature detection without focal attention. In contrast to the condition with focal attention, where phonological processing is stronger for speech stimuli, feature detection (of pitch contours in this study) without focal attention, as shaped by language background, is superior in relatively regular stimuli, that is, the nonspeech stimuli. The results suggest that Cantonese listeners outperform Mandarin listeners in automatic detection of pitch features because of the denser Cantonese tone system.
Commonness, rarity, and intraspecific variation in traits and performance in tropical tree seedlings
One of the few rules in ecology is that communities are composed of many rare and few common species. Trait-based investigations of abundance distributions have generally focused on species mean trait values, with mixed success. Here, using large tropical tree seedling datasets in China and Puerto Rico, we take an alternative approach that considers the magnitude of intraspecific variation in traits and growth as it relates to species abundance. We find that common species are less variable in their traits and growth. Common species also occupy core positions within community trait space, indicating that they are finely tuned for the available conditions. Rare species are functionally peripheral and are likely transients struggling for success in the given environment. The work highlights the importance of considering intraspecific variation in trait-based ecology and demonstrates that asymmetry in the magnitude of intraspecific variation among species is critical for understanding how traits are related to abundance.
Expansion of the Tibetan Plateau during the Neogene
The appearance of detritus shed from mountain ranges along the northern margin of the Tibetan Plateau heralds the Cenozoic development of high topography. Current estimates of the age of the basal conglomerate in the Qaidam basin place this event in the Paleocene-Eocene. Here we present new magnetostratigraphy and mammalian biostratigraphy that refine the onset of basin fill to ∼25.5 Myr and reveal that sediment accumulated continuously until ∼4.8 Myr. Sediment provenance implies a sustained source in the East Kunlun Shan throughout this time period. However, the appearance of detritus from the Qilian Shan at ∼12 Myr suggests that the emergence of topography north of the Qaidam occurred during the late Miocene. Our results imply that deformation and mountain building significantly post-date Indo-Asian collision and challenge the suggestion that the extent of the plateau has remained constant through time. Rather, our results require expansion of high topography during the past 25 Myr.
- …