22,396 research outputs found

    Corpus Statistics for Measuring Business Process Similarity

    Get PDF
    In a rapidly changing environment, organizations must adapt their business processes continuously. While numerous methods enable enterprises to conceptualize and analyze their organizational structure, the task of business process modeling remains complex and time-consuming. However, by reusing and adapting existing process models, enterprises can reduce the task’s complexity while improving the quality of results. To facilitate the identification of adaptable processes, several techniques of business process similarity (BPS) have been proposed in recent years. Although most approaches produce sound results in controlled evaluations, this paper argues that their applicability is limited when analyzing real-world processes, which do not fully comply with notational labeling specifications. Consequently, we aim to enhance existing BPS techniques by using corpus statistics to account for the explanatory power of words within labels of process models. Results from our evaluation suggest that corpus statistics can improve BPS computations and can positively influence the quality of practical implications

    The Role of Geographic Proximity And Industrial Structure In Metropolitan Area Business Cycles

    Get PDF
    Measurement and prediction of aggregate economic fluctuations at the region, state, and metropolitan area level is a major challenge. As data quality and analytical techniques have improved, the analysis of coincident economic cycle indicators (CEI) has progressed from national to regional to state levels. This paper continues the trend of geographic disaggregation by constructing and analyzing CEI at the MSA level. The theoretical advantage of MSA level indexes is that they reflect labor market areas. Given lack of quarterly economic time series at the MSA level, we construct a new variable, the EPI (export price index). The EPI is an index number constructed to measure changes in the prices of goods produced by major industries located in metropolitan areas. Using non-agricultural employment and the EPI as MSA-specific variables, we are able to estimate following a Stock/Watson type single factor models. We find that, for larger states, with multiple MSAs, there is substantial variation in the amplitude and timing of cycles across MSAs. Further tests group MSAs within states by applying cluster analysis to the state series for the MSAs within a state. The groupings are interesting for two reasons. First, they confirm the differences observed within states. Secondly, and perhaps most important, the groupings of cyclically similar MSAs are not always based on geographic proximity as might be expected. It appears that industrial similarity of the MSA economies is also important for cyclical performance

    Embedding Words and Senses Together via Joint Knowledge-Enhanced Training

    Get PDF
    Word embeddings are widely used in Nat-ural Language Processing, mainly due totheir success in capturing semantic infor-mation from massive corpora. However,their creation process does not allow thedifferent meanings of a word to be auto-matically separated, as it conflates theminto a single vector. We address this issueby proposing a new model which learnsword and sense embeddings jointly. Ourmodel exploits large corpora and knowl-edge from semantic networks in order toproduce a unified vector space of wordand sense embeddings. We evaluate themain features of our approach both qual-itatively and quantitatively in a variety oftasks, highlighting the advantages of theproposed method in comparison to state-of-the-art word- and sense-based models

    Econometrics meets sentiment : an overview of methodology and applications

    Get PDF
    The advent of massive amounts of textual, audio, and visual data has spurred the development of econometric methodology to transform qualitative sentiment data into quantitative sentiment variables, and to use those variables in an econometric analysis of the relationships between sentiment and other variables. We survey this emerging research field and refer to it as sentometrics, which is a portmanteau of sentiment and econometrics. We provide a synthesis of the relevant methodological approaches, illustrate with empirical results, and discuss useful software

    EntiTables: Smart Assistance for Entity-Focused Tables

    Full text link
    Tables are among the most powerful and practical tools for organizing and working with data. Our motivation is to equip spreadsheet programs with smart assistance capabilities. We concentrate on one particular family of tables, namely, tables with an entity focus. We introduce and focus on two specific tasks: populating rows with additional instances (entities) and populating columns with new headings. We develop generative probabilistic models for both tasks. For estimating the components of these models, we consider a knowledge base as well as a large table corpus. Our experimental evaluation simulates the various stages of the user entering content into an actual table. A detailed analysis of the results shows that the models' components are complimentary and that our methods outperform existing approaches from the literature.Comment: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '17), 201

    Digital Stylometry: Linking Profiles Across Social Networks

    Get PDF
    There is an ever growing number of users with accounts on multiple social media and networking sites. Consequently, there is increasing interest in matching user accounts and profiles across different social networks in order to create aggregate profiles of users. In this paper, we present models for Digital Stylometry, which is a method for matching users through stylometry inspired techniques. We experimented with linguistic, temporal, and combined temporal-linguistic models for matching user accounts, using standard and novel techniques. Using publicly available data, our best model, a combined temporal-linguistic one, was able to correctly match the accounts of 31% of 5,612 distinct users across Twitter and Facebook.Comment: SocInfo'15, Beijing, China. In proceedings of the 7th International Conference on Social Informatics (SocInfo 2015). Beijing, Chin
    • …
    corecore