4,479 research outputs found
An improved switching hybrid recommender system using naive Bayes classifier and collaborative filtering
Recommender Systems apply machine learning and data mining techniques for filtering unseen information and can predict whether a user would like a given resource. To date a number of recommendation algorithms have been proposed, where collaborative filtering and content-based filtering are the two most famous and adopted recommendation techniques. Collaborative filtering recommender systems recommend items by identifying other users with similar taste and use their opinions for recommendation; whereas content-based recommender systems recommend items based on the content information of the items. These systems suffer from scalability, data sparsity, over specialization, and cold-start problems resulting in poor quality recommendations and reduced coverage. Hybrid recommender systems combine individual systems to avoid certain aforementioned limitations of these systems. In this paper, we proposed a unique switching hybrid recommendation approach by combining a Naive Bayes classification approach with the collaborative filtering. Experimental results on two different data sets, show that the proposed algorithm is scalable and provide better performance – in terms of accuracy and coverage – than other algorithms while at the same time eliminates some recorded problems with the recommender systems
A Low Dimensional Approximation For Competence In Bacillus Subtilis
The behaviour of a high dimensional stochastic system described by a Chemical
Master Equation (CME) depends on many parameters, rendering explicit simulation
an inefficient method for exploring the properties of such models. Capturing
their behaviour by low-dimensional models makes analysis of system behaviour
tractable. In this paper, we present low dimensional models for the
noise-induced excitable dynamics in Bacillus subtilis, whereby a key protein
ComK, which drives a complex chain of reactions leading to bacterial
competence, gets expressed rapidly in large quantities (competent state) before
subsiding to low levels of expression (vegetative state). These rapid reactions
suggest the application of an adiabatic approximation of the dynamics of the
regulatory model that, however, lead to competence durations that are incorrect
by a factor of 2. We apply a modified version of an iterative functional
procedure that faithfully approximates the time-course of the trajectories in
terms of a 2-dimensional model involving proteins ComK and ComS. Furthermore,
in order to describe the bimodal bivariate marginal probability distribution
obtained from the Gillespie simulations of the CME, we introduce a tunable
multiplicative noise term in a 2-dimensional Langevin model whose stationary
state is described by the time-independent solution of the corresponding
Fokker-Planck equation.Comment: 12 pages, to be published in IEEE/ACM Transactions on Computational
Biology and Bioinformatic
Phase Transitions and Symmetry Breaking in Genetic Algorithms with Crossover
In this paper, we consider the role of the crossover operator in genetic algorithms. Specifically, we study optimisation problems that exhibit many local optima and consider how crossover affects the rate at which the population breaks the symmetry of the problem. As an example of such a problem, we consider the subset sum problem. In so doing, we demonstrate a previously unobserved phenomenon, whereby the genetic algorithm with crossover exhibits a critical mutation rate, at which its performance sharply diverges from that of the genetic algorithm without crossover. At this critical mutation rate, the genetic algorithm with crossover exhibits a rapid increase in population diversity. We calculate the details of this phenomenon on a simple instance of the subset sum problem and show that it is a classic phase transition between ordered and disordered populations. Finally, we show that this critical mutation rate corresponds to the transition between the genetic algorithm accelerating or preventing symmetry breaking and that the critical mutation rate represents an optimum in terms of the balance of exploration and exploitation within the algorithm
Unsupervised clustering approach for network anomaly detection
This paper describes the advantages of using the anomaly detection approach over the misuse detection technique in detecting unknown network intrusions or attacks. It also investigates the performance of various clustering algorithms when applied to anomaly detection. Five different clustering algorithms: k-Means, improved k-Means, k-Medoids, EM clustering and distance-based outlier detection algorithms are used. Our experiment shows that misuse detection techniques, which implemented four different classifiers (naïve Bayes, rule induction, decision tree and nearest neighbour) failed to detect network traffic, which contained a large number of unknown intrusions; where the highest accuracy was only 63.97% and the lowest false positive rate was 17.90%. On the other hand, the anomaly detection module showed promising results where the distance-based outlier detection algorithm outperformed other algorithms with an accuracy of 80.15%. The accuracy for EM clustering was 78.06%, for k-Medoids it was 76.71%, for improved k-Means it was 65.40% and for k-Means it was 57.81%. Unfortunately, our anomaly detection module produces high false positive rate (more than 20%) for all four clustering algorithms. Therefore, our future work will be more focus in reducing the false positive rate and improving the accuracy using more advance machine learning technique
Optimization of renormalization group transformations
SIGLEAvailable from British Library Document Supply Centre- DSC:D82965 / BLDSC - British Library Document Supply CentreGBUnited Kingdo
Ocean harvesting buoy using offshore wind turbines
While wind power has been taking off ocean power has remained stuck in the development stage. There are several design challenges for ocean power. Ocean water is corrosive to many materials due to the high salt concentration. Ocean waves vary in strength significantly from weak to very strong. Most energy harvesting devices require maintenance and having the device in the ocean makes it difficult to maintenance. Lastly the current cost per kWh is significantly higher than other forms of energy generation. A solution to a few of these problems is to piggyback off projects that are already being installed. An offshore wind turbine provides the means to overcome a few of these issues. This design is a mechanically simple device; a buoy that wraps around the base of the turbine that floats on top of the water. As the water rises and falls it will cause the buoy to move up and down. This device will drive a generator that is placed within the wind turbine above the water line to generate electricity. This energy will be carried by the high capacity wires already in place used by the wind turbines. The problems that are solved via this solution are as follows. Installation costs are significantly reduced because there is existing infrastructure. Maintenance becomes easier when the majority of the equipment is above the water and on an existing structure. The majority of the equipment is protected from powerful ocean waves, corrosive water, and invasive sea life. This research is important because we have yet to see a ocean power device that has taken off commercially, meaning the ocean remains a massive untapped energy resource that if it could be used would impact societies positively across the world.
Please click Additional Files below to see the full abstract
Incremental Kernel Mapping Algorithms for Scalable Recommender Systems
Recommender systems apply machine learning techniques for filtering unseen information and can predict whether a user would like a given item. Kernel Mapping Recommender (KMR)system algorithms have been proposed, which offer state-of-the-art performance. One potential drawback of the KMR algorithms is that the training is done in one step and hence they cannot accommodate the incremental update with the arrival of new data making them unsuitable for the dynamic environments. From this line of research, we propose a new heuristic, which can build the model incrementally without retraining the whole model from scratch when new data (item or user) are added to the recommender system dataset. Furthermore, we proposed a novel perceptron type algorithm, which is a fast incremental algorithm for building the model that maintains a good level of accuracy and scales well with the data. We show empirically over two datasets that the proposed algorithms give quite accurate results while providing significant computation savings
- …