43,309 research outputs found

    Sentiment Analysis using an ensemble of Feature Selection Algorithms

    Get PDF
    To determine the opinion of any person experiencing any services or buying any product, the usage of Sentiment Analysis, a continuous research in the field of text mining, is a common practice. It is a process of using computation to identify and categorize opinions expressed in a piece of text. Individuals post their opinion via reviews, tweets, comments or discussions which is our unstructured information. Sentiment analysis gives a general conclusion of audits which benefit clients, individuals or organizations for decision making. The primary point of this paper is to perform an ensemble approach on feature reduction methods identified with natural language processing and performing the analysis based on the results. An ensemble approach is a process of combining two or more methodologies. The feature reduction methods used are Principal Component Analysis (PCA) for feature extraction and Pearson Chi squared statistical test for feature selection. The fundamental commitment of this paper is to experiment whether combined use of cautious feature determination and existing classification methodologies can yield better accuracy

    Predicting Audio Advertisement Quality

    Full text link
    Online audio advertising is a particular form of advertising used abundantly in online music streaming services. In these platforms, which tend to host tens of thousands of unique audio advertisements (ads), providing high quality ads ensures a better user experience and results in longer user engagement. Therefore, the automatic assessment of these ads is an important step toward audio ads ranking and better audio ads creation. In this paper we propose one way to measure the quality of the audio ads using a proxy metric called Long Click Rate (LCR), which is defined by the amount of time a user engages with the follow-up display ad (that is shown while the audio ad is playing) divided by the impressions. We later focus on predicting the audio ad quality using only acoustic features such as harmony, rhythm, and timbre of the audio, extracted from the raw waveform. We discuss how the characteristics of the sound can be connected to concepts such as the clarity of the audio ad message, its trustworthiness, etc. Finally, we propose a new deep learning model for audio ad quality prediction, which outperforms the other discussed models trained on hand-crafted features. To the best of our knowledge, this is the first large-scale audio ad quality prediction study.Comment: WSDM '18 Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 9 page

    Econometrics meets sentiment : an overview of methodology and applications

    Get PDF
    The advent of massive amounts of textual, audio, and visual data has spurred the development of econometric methodology to transform qualitative sentiment data into quantitative sentiment variables, and to use those variables in an econometric analysis of the relationships between sentiment and other variables. We survey this emerging research field and refer to it as sentometrics, which is a portmanteau of sentiment and econometrics. We provide a synthesis of the relevant methodological approaches, illustrate with empirical results, and discuss useful software

    Cross validation of bi-modal health-related stress assessment

    Get PDF
    This study explores the feasibility of objective and ubiquitous stress assessment. 25 post-traumatic stress disorder patients participated in a controlled storytelling (ST) study and an ecologically valid reliving (RL) study. The two studies were meant to represent an early and a late therapy session, and each consisted of a "happy" and a "stress triggering" part. Two instruments were chosen to assess the stress level of the patients at various point in time during therapy: (i) speech, used as an objective and ubiquitous stress indicator and (ii) the subjective unit of distress (SUD), a clinically validated Likert scale. In total, 13 statistical parameters were derived from each of five speech features: amplitude, zero-crossings, power, high-frequency power, and pitch. To model the emotional state of the patients, 28 parameters were selected from this set by means of a linear regression model and, subsequently, compressed into 11 principal components. The SUD and speech model were cross-validated, using 3 machine learning algorithms. Between 90% (2 SUD levels) and 39% (10 SUD levels) correct classification was achieved. The two sessions could be discriminated in 89% (for ST) and 77% (for RL) of the cases. This report fills a gap between laboratory and clinical studies, and its results emphasize the usefulness of Computer Aided Diagnostics (CAD) for mental health care

    Mining and analysis of audiology data to find significant factors associated with tinnitus masker

    Get PDF
    Objectives: The objective of this research is to find the factors associated with tinnitus masker from the literature, and by using the large amount of audiology data available from a large NHS (National Health Services, UK) hearing aid clinic. The factors evaluated were hearing impairment, age, gender, hearing aid type, mould and clinical comments. Design: The research includes literature survey for factors associated with tinnitus masker, and performs the analysis of audiology data using statistical and data mining techniques. Setting: This research uses a large audiology data but it also faced the problem of limited data for tinnitus. Participants: It uses 1,316 records for tinnitus and other diagnoses, and 10,437 records of clinical comments from a hearing aid clinic. Primary and secondary outcome measures: The research is looking for variables associated with tinnitus masker, and in future, these variables can be combined into a single model to develop a decision support system to predict about tinnitus masker for a patient. Results: The results demonstrated that tinnitus maskers are more likely to be fit to individuals with milder forms of hearing loss, and the factors age, gender, type of hearing aid and mould were all found significantly associated with tinnitus masker. In particular, those patients having Age<=55 years were more likely to wear a tinnitus masker, as well as those with milder forms of hearing loss. ITE (in the ear) hearing aids were also found associated with tinnitus masker. A feedback on the results of association of mould with tinnitus masker from a professional audiologist of a large NHS (National Health Services, UK) was also taken to better understand them. The results were obtained with different accuracy for different techniques. For example, the chi-squared test results were obtained with 95% accuracy, for Support and Confidence only those results were retained which had more than 1% Support and 80% Confidence. Conclusions: The variables audiograms, age, gender, hearing aid type and mould were found associated with the choice of tinnitus masker in the literature and by using statistical and data mining techniques. The further work in this research would lead to the development of a decision support system for tinnitus masker with an explanation that how that decision was obtained
    corecore