1 research outputs found
Recommended from our members
Recommender systems and market approaches for industrial data management
Industrial companies are dealing with an increasing data overload problem in all
aspects of their business: vast amounts of data are generated in and outside each
company. Determining which data is relevant and how to get it to the right users is
becoming increasingly difficult. There are a large number of datasets to be
considered, and an even higher number of combinations of datasets that each user
could be using.
Current techniques to address this data overload problem necessitate detailed
analysis. These techniques have limited scalability due to their manual effort and
their complexity, which makes them unpractical for a large number of datasets.
Search, the alternative used by many users, is limited by the user’s knowledge
about the available data and does not consider the relevance or costs of providing
these datasets.
Recommender systems and so-called market approaches have previously been
used to solve this type of resource allocation problem, as shown for example in
allocation of equipment for production processes in manufacturing or for spare part
supplier selection. They can therefore also be seen as a potential application for
the problem of data overload.
This thesis introduces the so-called RecorDa approach: an architecture using
market approaches and recommender systems on their own or by combining them
into one system. Its purpose is to identify which data is more relevant for a user’s
decision and improve allocation of relevant data to users.
Using a combination of case studies and experiments, this thesis develops and
tests the approach. It further compares RecorDa to search and other mechanisms.
The results indicate that RecorDa can provide significant benefit to users with
easier and more flexible access to relevant datasets compared to other
techniques, such as search in these databases. It is able to provide a fast increase
in precision and recall of relevant datasets while still keeping high novelty and
coverage of a large variety of datasets