283 research outputs found

    Deep Multi-view Learning to Rank

    Full text link
    We study the problem of learning to rank from multiple information sources. Though multi-view learning and learning to rank have been studied extensively leading to a wide range of applications, multi-view learning to rank as a synergy of both topics has received little attention. The aim of the paper is to propose a composite ranking method while keeping a close correlation with the individual rankings simultaneously. We present a generic framework for multi-view subspace learning to rank (MvSL2R), and two novel solutions are introduced under the framework. The first solution captures information of feature mappings from within each view as well as across views using autoencoder-like networks. Novel feature embedding methods are formulated in the optimization of multi-view unsupervised and discriminant autoencoders. Moreover, we introduce an end-to-end solution to learning towards both the joint ranking objective and the individual rankings. The proposed solution enhances the joint ranking with minimum view-specific ranking loss, so that it can achieve the maximum global view agreements in a single optimization process. The proposed method is evaluated on three different ranking problems, i.e. university ranking, multi-view lingual text ranking and image data ranking, providing superior results compared to related methods.Comment: Published at IEEE TKD

    Training High Quality Spam-detection Models Using Weak Labels

    Get PDF
    To be effective in detecting spam in online content sharing networks, it is necessary that techniques used to detect spam have good precision, high recall, and the ability to adapt to new types of spam. A bottleneck in developing such machine learning techniques is the lack of availability of high quality labeled training data. Human labeling to obtain high quality labeled data is expensive and not scalable. Current approaches such as unsupervised learning or semi-supervised learning can only produce low quality labels. Generally, the present disclosure is directed to a weak supervision approach to train a machine learning model to detect spam content items. Weak labels are generated for content items in training data using various techniques such as rules that encode domain knowledge and/or anomaly detection techniques such as unsupervised machine learning/ clustering or semi-supervised machine learning. The accuracy of the various techniques is estimated based on observed agreements/ disagreements in the weak labels. The weak labels are combined into a single value (e.g., per content item) that is used as a probabilistic training label to train a machine learning model using supervised learning that is noise aware. In the training, a penalty is applied for deviation from the probabilistic label such that the penalty is higher for a label associated with a higher confidence and lower for a label associated with a lower confidence. The model thus trained can be used to detect spam content

    Determining Projection Format for a Video

    Get PDF
    Generally, the present disclosure is directed to determining one or more projection formats for a video. In particular, in some implementations, the systems and methods of the present disclosure can include or otherwise leverage one or more machine-learned models to predict a projection format for a segment of a video based on image data extracted from the video

    Contextual Advertising Based on Content Recognition in a Video

    Get PDF
    Generally, the present disclosure is directed to providing relevant advertisements based on the visual content of a video. In particular, in some implementations, the systems and methods of the present disclosure can include or otherwise leverage one or more machine-learned models to determine a relevant advertisement and/or a relevant time for the relevant advertisement based on image data taken from a video

    Using Imagery to Identify Abandoned Property in Public Spaces

    Get PDF
    Generally, the present disclosure is directed to identifying private property that has been illegally dumped or stored for a prolonged period without use. In particular, in some implementations, the systems and methods of the present disclosure can include or otherwise leverage one or more machine-learned models to predict whether an item has been abandoned in a public right-of-way based on imagery of the item and record of public right-of-way

    Planning Group Meals Based on Preferences of Attendees

    Get PDF
    Generally, the present disclosure is directed to determining an optimal place and time for a meeting based on the preferences of the people attending the meeting. In particular, in some implementations, the systems and methods of the present disclosure can include or otherwise leverage one or more machine-learned models to predict an optimal restaurant and time for a group meal based on personal preferences and/or time availability of members of the group

    Removing Bias from Deep Learning Systems

    Get PDF
    Generally, the present disclosure is directed to training machine learning models, e.g., deep learning models, such that the impact of any implicit bias in the training dataset on the trained model is eliminated or minimized. In particular, in some implementations, the systems and methods of the present disclosure can include or otherwise leverage a generative adversarial configuration with a generator model and a discriminator model such that the resultant trained model (generator) performs its function free of any implicit bias that may be present in the training dataset. The model as trained herein can be any type of machine learning model, e.g., a neural network or other type of model, and can be trained for any suitable purpose

    Detecting Interesting Events in a Home Security Camera System

    Get PDF
    Generally, the present disclosure is directed to a system for predicting whether the subject of a camera needs to be recorded and/or transmitted. In particular, in some implementations, the systems and methods of the present disclosure can include or otherwise leverage one or more machine-learned models to predict whether the view of the camera contains a noteworthy change in semantic meaning based on labels describing the semantic meaning of the view
    • …
    corecore