2,906 research outputs found

    Natural language processing

    Beginning with the basic issues of NLP, this chapter aims to chart the major research activities in this area since the last ARIST chapter in 1996 (Haas, 1996), including: (i) natural language text processing systems (text summarization, information extraction, information retrieval, etc.), including domain-specific applications; (ii) natural language interfaces; (iii) NLP in the context of the WWW and digital libraries; and (iv) evaluation of NLP systems.

    ANALYSIS OF LEXICAL COHESION

    This research aims to identify the types of lexical cohesion devices (LCDs) found in the research backgrounds of students’ theses, and how often these LCDs were used, in theses submitted to the English Language Education Study Program, Languages and Arts Education Department, Teacher Training and Education Faculty, Tanjungpura University. This research was a descriptive study. The corpus consisted of ten research backgrounds of undergraduate theses submitted to the English Language Education Study Program. The research focused on reiteration. The data were sorted from the corpus by taking the vignettes that contained LCDs. In the 7,410-word corpus, 101 LCDs were found: 69 repetitions, 25 synonyms, and 7 hyponyms. The most frequently used LCD was repetition, while the least frequently used was hyponymy. Antonymy and metonymy were not found. The research also shows that some students applied various LCDs in their theses while others failed to use them. Keywords: Thesis Writing, Lexical Cohesion, Lexical Cohesion Device

    Sentiment analysis in the stock market based on Twitter data

    In this dissertation, we discuss how Twitter can help detect public sentiment towards companies listed in the stock market, in particular those in the S&P 500 index. Data are collected through a web scraper that gathers tweets from Twitter, using advanced search features based on queries related to the companies under scrutiny. The content of the tweets is classified as positive, neutral or negative sentiment, and the outcome is then compared against stock market prices. To do so, a framework with different Sentiment Analysis (SA) models and Machine Learning (ML) techniques is proposed and implemented. To establish which models are more appropriate for detecting and classifying sentiment, a series of visual representations was created to evaluate and compare results. The results show that an increase in the volume of tweets leads to oscillations in both stock price and trading volume. Furthermore, the data analysis performed for some of the companies in scope shows that using moving averages of sentiment scores makes the analysis clearer and more insightful, which is particularly useful when measuring the strength or weakness of a stock's price. Ultimately, the smoothed sentiment score can be read as a momentum indicator for the stock market.
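The moving-average smoothing mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not state the window length, the score scale, or the exact averaging scheme, so the simple (unweighted) moving average, the 3-point window, and the `daily_sentiment` values below are all assumptions made for illustration only.

```python
from collections import deque


def moving_average(scores, window):
    """Return the simple moving average of `scores` over `window` points.

    Only full windows are reported, so the result has
    len(scores) - window + 1 entries (or none if there are too few scores).
    """
    if window <= 0:
        raise ValueError("window must be positive")
    averages = []
    recent = deque()   # the scores currently inside the window
    total = 0.0        # running sum of the scores in `recent`
    for score in scores:
        recent.append(score)
        total += score
        if len(recent) > window:
            total -= recent.popleft()  # drop the oldest score
        if len(recent) == window:
            averages.append(total / window)
    return averages


# Hypothetical daily sentiment scores (+1 positive, 0 neutral, -1 negative).
daily_sentiment = [1, -1, 0, 1, 1, -1, 1]
smoothed = moving_average(daily_sentiment, window=3)
print(smoothed)
```

A longer window smooths more aggressively but reacts more slowly to changes in sentiment, which is the usual trade-off when such a series is read as a momentum-style indicator.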

    Creation and extension of ontologies for describing communications in the context of organizations

    Thesis submitted to Faculdade de Ciências e Tecnologia of the Universidade Nova de Lisboa, in partial fulfillment of the requirements for the degree of Master in Computer Science. The use of ontologies is now a sufficiently mature and solid field of work to be considered an efficient alternative in knowledge representation. With the growth of the Semantic Web, this alternative can be expected to become even more prominent in the near future. In the context of a collaboration between FCT-UNL and the R&D department of a national software company, a new solution called ECC (Enterprise Communications Center) was developed. This application manages the communications that enter, leave or are made within an organization, and includes intelligent classification of communications and conceptual search techniques over a communications repository. As specificity may be the key to obtaining acceptable results with these processes, ontologies become crucial for representing the existing knowledge about the specific domain of an organization. This work produced a core set of ontologies able to express the general context of the communications made in an organization, and a methodology, based on a series of concrete steps, for extending the ontologies to any business domain. Applying these steps minimizes the conceptualization and setup effort in new organizations and business domains. The adequacy of the chosen core set of ontologies and of the specified methodology is demonstrated in this thesis by applying them to a real case study, which allowed us to work with the different types of sources considered in the methodology and the activities that support its construction and evolution.

    E-mail Filtering System for Nigerian Spam

    This report describes the development of an E-Mail Filtering System designed specifically to filter Nigerian spam. The report consists of an introduction, a literature review, the methodology, and results and discussion. The project was developed through research activities, analysis of findings, and product development. It is motivated by the rapid advancement of Information Technology (IT) systems. Recent growth in the use of email for communication, and the corresponding growth in the volume of email received, have made automatic processing of email desirable. Present-day solutions to stop spam work by analyzing headers and message text, or by classifying the mail based on history. This report gives an introduction to machine learning methods for spam filtering, especially for Nigerian spam. The mail system described relies on spam filters that use "Naive Bayesian Filtering", a probabilistic approach to estimating the degree to which a message is spam.
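As an illustration of the "Naive Bayesian Filtering" idea named in this abstract, the following is a minimal sketch of a word-count Naive Bayes spam scorer. It is not the project's implementation: the whitespace tokenization, the Laplace (add-one) smoothing, the choice to ignore words unseen in training, and the toy training messages are all assumptions introduced here.

```python
import math
from collections import Counter


def train(messages):
    """Count words per label. `messages` is a list of (text, label) pairs,
    where label is 'spam' or 'ham'; both labels must occur at least once."""
    word_counts = {"spam": Counter(), "ham": Counter()}
    doc_counts = Counter()
    for text, label in messages:
        doc_counts[label] += 1
        word_counts[label].update(text.lower().split())
    return word_counts, doc_counts


def spam_probability(text, word_counts, doc_counts):
    """Estimate P(spam | text) under the naive independence assumption."""
    vocab = set(word_counts["spam"]) | set(word_counts["ham"])
    total_docs = sum(doc_counts.values())
    log_scores = {}
    for label in ("spam", "ham"):
        total_words = sum(word_counts[label].values())
        score = math.log(doc_counts[label] / total_docs)  # log prior
        for word in text.lower().split():
            if word not in vocab:
                continue  # assumption: skip words never seen in training
            # Laplace smoothing keeps zero counts from zeroing the product.
            likelihood = (word_counts[label][word] + 1) / (total_words + len(vocab))
            score += math.log(likelihood)
        log_scores[label] = score
    # Convert log scores to a normalized probability in a stable way.
    highest = max(log_scores.values())
    weights = {k: math.exp(v - highest) for k, v in log_scores.items()}
    return weights["spam"] / (weights["spam"] + weights["ham"])


# Hypothetical toy training data, for illustration only.
training = [
    ("urgent transfer money to my bank account", "spam"),
    ("claim your inheritance money urgent reply", "spam"),
    ("meeting tomorrow at noon", "ham"),
    ("please review the attached report", "ham"),
]
word_counts, doc_counts = train(training)
print(spam_probability("urgent money transfer", word_counts, doc_counts))
print(spam_probability("meeting tomorrow", word_counts, doc_counts))
```

Working in log space avoids numerical underflow when many small word probabilities are multiplied, which matters for longer messages.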

    Second language inner voice and identity

    This study investigates the phenomenon of second language (L2, hereafter) inner voice for three Japanese-American English bilinguals who had long-term exposure to the L2 in naturalistic contexts, that is, by living and/or working or studying in the U.S. Two American English learners of L2 Japanese were included in the study as well, although only one of them had naturalistic exposure, the other having traveled to Japan in addition to being married to a Japanese national. Data for the study reveal how and when L2 inner voice is utilized, how it appears to develop, how it leads to shifts in identity toward the L2 languaculture, and how and when this takes place. Moreover, the study distinguishes the functions of L2 inner voice from those of L2 inner speech, although the two were found to co-exist at times, functioning interchangeably. Furthermore, the emergence of the L2 inner voice appears to depend on the prior development of L2 inner speech. Overall, the main function of L2 inner voice proves to be bridging language and cultural gaps between the L1 and L2 languacultures.

    Nodalida 2005 - proceedings of the 15th NODALIDA conference


    Dynamic managerial capabilities and competitive advantage : an empirical analysis of managers from the finance and insurance and real estate sectors

    This thesis empirically investigated dynamic managerial capabilities (DMCs), which are the capacities that managers use to create, extend, and modify resources. The research objectives involved identifying, classifying, and assessing DMCs in generating competitive advantage using resource-based theory (RBT). The overall research aim was to build theory in a critical yet underdeveloped area of the literature. A multi-case study using a phenomenological approach was conducted with managers from five small-to-medium sized enterprises from the finance and insurance and real estate sectors. The managers were interviewed, and described episodes when they reconfigured resources during periods of rapid change (such as the recent financial crisis and recession) in order to compete. A survey questionnaire was also used in which respondents ranked DMCs, and discussed the joint uses of them, including which capabilities were used in developing and operating others. The results of the research showed that managers used specific transformational DMCs in periods of rapid change in order to generate advantage. The DMCs are learning-based (LBDMC) and innovation-based capabilities (IBDMC) and involve participative leadership (PL). They are mutually interdependent and reinforcing, impact on ordinary capabilities, and are evolutionarily fit. They exhibited commonalities, yet are considered idiosyncratic in detail. The results are relevant to the field of strategic management in terms of theory development and practical applicability. The academic contribution exploits a gap in the extant literature, and the research shows how DMCs can be developed, used, and maintained in practice

    Parâmetros relevantes para caracterização de rios montanhosos : revisão

    Mountain rivers occupy a large portion of the Earth's land surface, especially in headwater regions, and have been used for various purposes such as recreation, sporting activities, water supply and hydroelectric power generation. However, the hydrogeomorphic characteristics of mountain rivers are not fully understood. In this context, the present paper aimed to identify relevant parameters for characterizing rivers in these environments, based on a literature review. The review identified which parameters have been used, and how, to characterize mountain rivers in different classifications. The most cited parameters were channel gradient, the ratio of river width to depth, entrenchment ratio, discharge, sediment transport and grain-size distribution. The current state of research on the fluvial geomorphology of mountain rivers in Brazil was also evaluated, confirming a strong need for field surveys as a basis for better understanding and characterizing mountain fluvial dynamics.