9 research outputs found

    Content-based Music Similarity with Triplet Networks

    Full text link
    We explore the feasibility of using triplet neural networks to embed songs based on content-based music similarity. Our network is trained using triplets of songs such that two songs by the same artist are embedded closer to one another than to a third song by a different artist. We compare two models that are trained using different ways of picking this third song: at random vs. based on shared genre labels. Our experiments are conducted using songs from the Free Music Archive and use standard audio features. The initial results show that shallow Siamese networks can be used to embed music for a simple artist retrieval task

    A Review of Resume Analysis and Job Description Matching Using Machine Learning

    Get PDF
    In the contemporary job market, the effective matching of resumes to job descriptions is a critical facet of talent acquisition. This research paper provides a comprehensive review of the advancements, methodologies, and challenges associated with leveraging machine learning (ML) and natural language processing (NLP) techniques for resume analysis and job description matching. The study surveys the existing literature, synthesizes key findings, and presents a taxonomy of approaches employed in the field. The paper begins by elucidating the significance of efficient resume-job description matching in enhancing the recruitment process. It then delves into the foundational principles of machine learning as applied to human resource management, emphasizing the role of natural language processing, pattern recognition, and semantic analysis in extracting relevant information from resumes and job descriptions. The review encompasses an in-depth analysis of various machine learning algorithms and models utilized in resume parsing, including but not limited to neural networks, support vector machines (SVM), and ensemble methods. Moreover, the paper investigates the incorporation of deep learning architectures, such as convolutional neural networks and recurrent neural networks, for more nuanced feature extraction and representation. Key challenges and limitations associated with current methodologies are thoroughly examined, addressing issues such as the need for large, diverse datasets for robust training. The paper concludes with a discussion on future research directions and emerging trends in the realm of resume analysis and job description matching. This research contributes to the existing body of knowledge by offering a comprehensive synthesis of the current state of machine learning applications in resume analysis and job description matching, providing valuable insights for researchers, practitioners, and industry professionals seeking to optimize talent acquisition processe

    Generating Natural Language Attacks in a Hard Label Black Box Setting

    Full text link
    We study an important and challenging task of attacking natural language processing models in a hard label black box setting. We propose a decision-based attack strategy that crafts high quality adversarial examples on text classification and entailment tasks. Our proposed attack strategy leverages population-based optimization algorithm to craft plausible and semantically similar adversarial examples by observing only the top label predicted by the target model. At each iteration, the optimization procedure allow word replacements that maximizes the overall semantic similarity between the original and the adversarial text. Further, our approach does not rely on using substitute models or any kind of training data. We demonstrate the efficacy of our proposed approach through extensive experimentation and ablation studies on five state-of-the-art target models across seven benchmark datasets. In comparison to attacks proposed in prior literature, we are able to achieve a higher success rate with lower word perturbation percentage that too in a highly restricted setting.Comment: Accepted at AAAI 2021 (Main Conference

    Decision Support System for Online Recruitment

    Get PDF
    International audienceIn the past, potential candidates for a job offer were in physical locations that could be reached through the major media that were available at the time, often strongly rooted in their local geographic space. Today, digital media replaced those traditional channels, offering advertisers a broader geographic reach. However digital channels are more and more numerous, making it difficult to target candidates on the web. Existing decision support system on e-recruitment in the literature does not identify the desired profile from a job offer (C1), the relevance of a resume (C2) or the changing environment of recruitment (C3). Thereby, the objective of our research is to optimize the e-recruitment process by designing a decision support system capable of targeting potential candidates at a lower cost and that addresses the challenges (C1), (C2) and (C3)

    Online Reciprocal Recommendation with Theoretical Performance Guarantees

    Get PDF
    A reciprocal recommendation problem is one where the goal of learning is not just to predict a user's preference towards a passive item (e.g., a book), but to recommend the targeted user on one side another user from the other side such that a mutual interest between the two exists. The problem thus is sharply different from the more traditional items-to-users recommendation, since a good match requires meeting the preferences of both users. We initiate a rigorous theoretical investigation of the reciprocal recommendation task in a specific framework of sequential learning. We point out general limitations, formulate reasonable assumptions enabling effective learning and, under these assumptions, we design and analyze a computationally efficient algorithm that uncovers mutual likes at a pace comparable to those achieved by a clearvoyant algorithm knowing all user preferences in advance. Finally, we validate our algorithm against synthetic and real-world datasets, showing improved empirical performance over simple baselines

    Metodología para la clasificación de documentos de texto de hojas de vida basado en aprendizaje de máquina

    Get PDF
    El proceso de selección de personal es complejo y requiere una gran cantidad de información y análisis para encontrar a los candidatos adecuados para una posición. Incluye varias etapas, como la revisión de currículums, pruebas psicológicas y verificación de referencias. Sin embargo, el análisis de currículums puede ser un desafío, ya que implica una intervención humana y la gran cantidad de información puede resultar difícil de procesar por computadora. Además, las empresas pueden enfrentar dificultades y costos elevados debido a la complejidad del proceso y la alta demanda en el mercado laboral. Para resolver este problema, se propone la metodología CVNLP (Curriculum Natural Language Processing), que utiliza un conjunto de 725 hojas de vida en formatos PDF, DOCX y DOC para analizar los currículums de manera eficiente y eficaz. La metodología se aplica de manera transversal y ha demostrado su eficacia en la selección de personal. Al reducir los costos y mejorar la eficiencia en el proceso de selección de personal, las empresas pueden centrarse en su núcleo de negocio y facilitar el proceso de selección de personal. En resumen, la metodología CVNLP se presenta como una solución prometedora para mejorar la eficacia y eficiencia en los procesos de selección de personal, especialmente para las PYMEs con recursos limitado

    Local VS. Global Models for Job-Candidate Matching

    Get PDF
    RÉSUMÉ: Avec le développement des technologies de l’information et la croissance continue du marché du recrutement électronique, l’automatisation du processus de sélection pour trouver le meilleur candidat pour un poste a suscité l’intérêt des chercheurs et des ingénieurs en logiciels ce qui a conduit au développement de modèles complexes, d’algorithmes et de techniques qui exploitent le traitement du langage naturel, la similitude sémantique et l’apprentissage automatique. Cette thèse vise à compléter ce travail, en se concentrant sur la façon d’exploiter les données existantes pour améliorer les performances. Nous évaluons la notion de modèles locaux qui sont des modèles personnalisés construits dans des sous-ensembles de données connexes ayant des caractéristiques similaires. Pour l’évaluation, nous la comparons avec les modèles globaux qui sont un modèle complexe unique sans classification préalable. Pour ce faire, nous avons travaillé avec Airudi, une société de ressources humaines Française Canadienne qui nous a fourni des données réelles que nous utilisons pour construire notre cas d’étude où nous répondons aux questions de recherche suivantes : RQ1. Comment les modèles globaux se comparent-ils en performance aux modèles locaux? RQ2. Comment la précision et le rappel fonctionnent-ils sur différents seuils?----------ABSTRACT : Selecting the best candidate for a job position is a challenging topic that has been gaining interest in research and practice. This has led to increasingly more complex models, algorithms and techniques exploiting natural language processing, semantic similarity, and machine learning. This thesis complements this work by taking a step back and focusing on how to better exploit available data in order to further improve model performance. In particular, we empirically evaluate the notion of using “local” models for subsets of the data having similar characteristics (job descriptions) as opposed to using a single, complex “Global Model.” Using job candidate and description data, we found that local models perform better than the global models in terms of precision and recall, with median improvements up to 11.64%. If we substitute the under-performing models with the global model, thus creating a hybrid local model, the difference becomes significant. Our results suggest that local models for job recommendation brings performance advantages in terms of precision and recall over a global model, motivating further research in local models for job recommendation
    corecore