2,906 research outputs found
Natural language processing
Beginning with the basic issues of NLP, this chapter aims to chart the major research activities in this area since the last ARIST Chapter in 1996 (Haas, 1996), including: (i) natural language text processing systems - text summarization, information extraction, information retrieval, etc., including domain-specific applications; (ii) natural language interfaces; (iii) NLP in the context of www and digital libraries ; and (iv) evaluation of NLP systems
ANALYSIS OF LEXICAL COHESION
AbstractThis research aims to find out types of Lexical Cohesion Devices (LCD encountered in research background of students’ theses and how often these LCD were used in theses submitted to English Language Education Study Program, Languages and Arts Education Department, Teacher Training and Education Faculty, Tanjungpura University. This research was a descriptive study. The corpus of this research consisted of ten research backgrounds of undergraduate thesis submitted to English Language Education Study Program. This research focused on reiteration. The data were sorted from the corpus by taking the vignettes that contained LCD. Of 7410 words corpus, there were 101 LCD encountered. They are 69 repetitions, 25 synonyms, and 7 hyponyms. The most frequently used LCD was repetition while the least frequently used was hyponymy. On the other hand, antonym and metonymy were not found. Besides, the research also shows that some students applied various LCD in their thesis while some others failed to use them.Keywords: Thesis Writing, Lexical Cohesion, Lexical Cohesion Device
Sentiment analysis in the stock market based on Twitter data
In this dissertation, we discuss how Twitter can help detecting public sentiment towards companies
listed in the stock market, in particular listed in the S&P 500 index (S&P 500). The
collection of data is done through a web scrapper that collects tweets from Twitter, using advanced
search features based on queries related to the companies under scrutiny. The content
of tweets are classified as positive, neutral or negative sentiments and the outcome is then
compared against stock market prices. To do so, it is proposed and implemented a framework
with different Sentiment Analysis (SA) models and Machine Learning (ML) techniques. Also, to
establish which models are more appropriate in detecting and classifying sentiments, a series
of visual representations were created to evaluate and compare results.
As a conclusion, the results obtained show that an increase in the volume of tweets leads to
oscillations in both stock price and trading volume. Furthermore, the data analysis performed
in relation to some companies under scope shows that the use of moving averages of sentiment
scores makes the analysis clearer and more insightful, which is particular useful when measuring
the strength or weakness of the price of a stock. In the end, it can be perceived as a
momentum indicator for the stock market.Nesta dissertação, é analisada a forma como a plataforma Twitter pode ajudar a detectar sentimento
público relativamente a empresas cotadas em bolsa, com foco em empresas que fazem
parte do indÃce americano S&P 500. A obtenção de dados é feita através de um web scrapper, que
recolhe tweets através de funções de pesquisa avançada, baseada em queries associadas às empresas
em análise. O conteúdo dos tweets são classificados como positivos, neutros ou negativos,
sendo os resultados comparados de seguida com os preços das ações. Nesse sentido, é proposta
um arquitectura de trabalho, com a respetiva implementação, que inclui vários modelos de
análise de sentimento e técnicas de Machine Learning. Por outro lado, de modo a estabelecer
quais são os modelos mais adequados para detectar e classificar sentimentos, são criados várias
representações visuais para avaliar e comparar resultados.
Como conclusão, os resultados obtidos mostram que um aumento do número de tweets conduz
a oscilações, quer no preço, quer na quantidade de ações transacionadas. Além disso, a análise
de dados levada a cabo relativamente a algumas empresas em estudo, mostra que a utilização
de médias móveis de resultados de sentimento torna a leitura da análise mais clara e evidente,
o que é bastante útil para medir a força ou fraqueza do preço de determinada ação. Acima de
tudo, tal poderá ser percecionado como um indicador de momento para o mercado de capitais
Creation and extension of ontologies for describing communications in the context of organizations
Thesis submitted to Faculdade de Ciências e Tecnologia of the Universidade Nova de Lisboa, in partial fulfillment of the requirements for the degree of Master in Computer ScienceThe use of ontologies is nowadays a sufficiently mature and solid field of work to be considered an efficient alternative in knowledge representation. With the crescent growth of the Semantic Web, it is expectable that this alternative tends to emerge even more in the near future.
In the context of a collaboration established between FCT-UNL and the R&D department of a national software company, a new solution entitled ECC – Enterprise Communications Center was developed. This application provides a solution to manage the communications that enter, leave or are made within an organization, and includes intelligent classification of communications and conceptual search techniques in a communications repository. As specificity may be the key to obtain acceptable results with these processes, the use of ontologies becomes crucial to represent the existing knowledge about the specific domain of an organization.
This work allowed us to guarantee a core set of ontologies that have the power of expressing the general context of the communications made in an organization, and of a methodology based upon a series of concrete steps that provides an effective capability of extending the ontologies to any business domain. By applying these steps, the minimization of the conceptualization and setup effort in new organizations and business domains is guaranteed.
The adequacy of the core set of ontologies chosen and of the methodology specified is demonstrated in this thesis by its effective application to a real case-study, which allowed us to work with the different types of sources considered in the methodology and the activities that support its construction and evolution
E-mail Filtering System for Nigerian Spam
This project shows about the project details in developing the E-Mail Filtering
System specifically in filtering the Nigerian Spam. The main elements in this report
consist of introduction, literature review, methodology and result and discussion. The
project is developed by focusing on research activities, findings analysis and developing
product. This project is developed based onthe advancement ofInformation Technology
(IT) system today which is recently growing rapidly. Recent growth in the use of email
for communication andthe corresponding growth in the volume of email received have
made automatic processing of email desirable. Present day solutions to stop spam work
by analyzing headers and message text or classifying the mail based on history. This
report gives anintroduction to machine learning methods for spam filtering especially for
Nigerian Spam. Anoverview of this mail system will fall back on SPAM filters that use
"Naive Bayesian Filtering" which is a probabilistic approach to estimate the degree of
SPAM
Second language inner voice and identity
This study investigates the phenomena of second language (L2, hereafter) inner voice for three Japanese-American English bilinguals who had long-term exposure to the L2 in naturalistic contexts, that is, by living and/or working or studying in the U.S. American English learners of L2 Japanese were included in the study as well, although only one of them had naturalistic exposure, the other having traveled to Japan in addition to being married to a Japanese national. Data for the study reveals how and when L2 inner voice is utilized, how it appears to develop, how it leads to shifts in identity toward the L2 languaculture, and how and when this takes place. Moreover, the study distinguishes the functions of L2 inner voice from those of L2 inner speech, although the two were found to co-exist at times, functioning interchangeably. Furthermore, the emergence of the L2 inner voice appears to be dependent on the prior development of L2 inner speech. Overall, the main function of L2 inner voice proves to be a bridging of language and cultural gaps between the L1 and L2 languaculture
Recommended from our members
Research on autonomous English-learning strategies of American culture and language students
This study will serve as a guideline for adult second-language learners who want to strengthen the effectiveness of self-directed learning outcomes or develop their self-directed learning strategies. Through the understanding of learning factors, fossilized language elements, and their own learning strategies, learners will be more confident and motivated to pursue their personal second-language acquisition goals and aspirations. These strategies are designed to help Taiwanese second-language learners
Dynamic managerial capabilities and competitive advantage : an empirical analysis of managers from the finance and insurance and real estate sectors
This thesis empirically investigated dynamic managerial capabilities (DMCs), which are
the capacities that managers use to create, extend, and modify resources. The research
objectives involved identifying, classifying, and assessing DMCs in generating
competitive advantage using resource-based theory (RBT). The overall research aim
was to build theory in a critical yet underdeveloped area of the literature. A multi-case
study using a phenomenological approach was conducted with managers from five
small-to-medium sized enterprises from the finance and insurance and real estate
sectors. The managers were interviewed, and described episodes when they
reconfigured resources during periods of rapid change (such as the recent financial
crisis and recession) in order to compete. A survey questionnaire was also used in which
respondents ranked DMCs, and discussed the joint uses of them, including which
capabilities were used in developing and operating others. The results of the research
showed that managers used specific transformational DMCs in periods of rapid change
in order to generate advantage. The DMCs are learning-based (LBDMC) and
innovation-based capabilities (IBDMC) and involve participative leadership (PL). They
are mutually interdependent and reinforcing, impact on ordinary capabilities, and are
evolutionarily fit. They exhibited commonalities, yet are considered idiosyncratic in
detail. The results are relevant to the field of strategic management in terms of theory
development and practical applicability. The academic contribution exploits a gap in the
extant literature, and the research shows how DMCs can be developed, used, and
maintained in practice
Parâmetros relevantes para caracterização de rios montanhosos : revisão
Mountain rivers are situated in a large portion of the terrestrial surface, especially in headwaters regions, and have been used for various purposes such as recreation, sporting activities, water resources and hydroelectric power generation. However, hydrogeomorphic characteristics of mountain rivers are not fully understood. In this context, the present paper aimed to identify relevant parameters for characterizing rivers in these environments based on bibliographical review. It was identified which parameters have been used and how they have been used to characterize mountain rivers in distinct classifications. The most cited parameters were channel gradient, relation between river width and depth, entrenchment ratio, discharge, sediment transport and grain-size distribution. Also, the current situation related to researches in fluvial geomorphology in mountain rivers in Brazil was evaluated, and the strong need of field survey as basis for the best understanding of mountain fluvial dynamics and characterization was verified.Rios montanhosos estão presentes em uma grande porção dos territórios do planeta, especialmente nas regiões de cabeceiras, e vêm sendo utilizados para diversos fins, tais como recreação e atividades desportivas, mananciais de água e geração de energia hidrelétrica. Entretanto, suas caracterÃsticas hidrogeomorfológicas ainda não são plenamente conhecidas. Neste contexto, o presente trabalho abordou os parâmetros relevantes necessários para caracterização de rios nestes ambientes a partir de revisão bibliográfica, em que se buscou avaliar o modo como os rios estavam sendo caracterizados e quais parâmetros hidrogeomorfológicos estavam sendo analisados em diferentes classificações. Os parâmetros mais comumente utilizados na caracterização de rios montanhosos são a declividade do canal, a relação entre largura e profundidade do rio, o grau de entrincheiramento do canal, a vazão, a carga de sedimentos e a granulometria dos sedimentos. Ainda, avaliou-se o cenário brasileiro no que tange a pesquisa em hidrogeomorfologia fluvial em rios montanhosos, constatando-se a necessidade de realizar mais atividades em campo para melhor entendimento da dinâmica fluvial montanhosa e caracterização fluvial
- …