4 research outputs found

    Methoden für Trendanalysen im Web zur Unterstützung des Customer Relationship Management

    Get PDF
    Mit dem Einzug des Web 2.0 ins tägliche Leben haben Individuen die Möglichkeit ihre Meinungen und Gefühle in Form von Blogs zu veröffentlichen. Die Analyse der Trends in dieser Blogosphäre kann maßgeblich zur Unterstützung der Kundenrückgewinnung in einem CRM-System eingesetzt werden. In dieser Forschungsarbeit werden bestehende Ansätze zur Trenderkennung im Allgemeinen untersucht und anschließend die Eignung ihrer Applikation auf Weblogs geprüft. Dazu wird ausgehend von bestehenden wissenschaftlichen Arbeiten ein System zur Trendanalyse prototypisch implementiert und die Analyseergebnisse im Anschluss evaluiert

    Techniques for data pattern selection and abstraction

    Get PDF
    This thesis concerns the problem of prototype reduction in instance-based learning. In order to deal with problems such as storage requirements, sensitivity to noise and computational complexity, various algorithms have been presented that condense the number of stored prototypes, while maintaining competent classification accuracy. Instance selection, which recovers a smaller subset of the original training set, is the most widely used technique for instance reduction. But, prototype abstraction that generates new prototypes to replace the initial ones has also gained a lot of interest recently. The major contribution of this work is the proposal of four novel frameworks for performing prototype reduction, the Class Boundary Preserving algorithm (CBP), a hybrid method that uses both selection and generation of prototypes, Instance Seriation for Prototype Abstraction (ISPA), which is an abstraction algorithm, and two selective techniques, Spectral Instance Reduction (SIR) and Direct Weight Optimization (DWO). CBP is a multi-stage method based on a simple heuristic that is very effective in identifying samples close to class borders. Using a noise filter harmful instances are removed, while the powerful heuristic determines the geometrical distribution of patterns around every instance. Together with the concepts of nearest enemy pairs and mean shift clustering this algorithm decides on the final set of retained prototypes. DWO is a selection model whose output set of prototypes is decided by a set of binary weights. These weights are computed according to an objective function composed of the ratio between the nearest friend and nearest enemy of every sample. In order to obtain good quality results DWO is optimized using a genetic algorithm. ISPA is an abstraction technique that employs the concept of data seriation to organize instances in an arrangement that favours merging between them. As a result, a new set of prototypes is created. Results show that CBP, SIR and DWO, the three major algorithms presented in this thesis, are competent and efficient in terms of at least one of the two basic objectives, classification accuracy and condensation ratio. The comparison against other successful condensation algorithms illustrates the competitiveness of the proposed models. The SIR algorithm presents a set of border discriminating features (BDFs) that depicts the local distribution of friends and enemies of all samples. These are then used along with spectral graph theory to partition the training set in to border and internal instances
    corecore