
    The best of both worlds: highlighting the synergies of combining manual and automatic knowledge organization methods to improve information search and discovery.

    Research suggests organizations across all sectors waste a significant amount of time looking for information and often fail to leverage the information they have. In response, many organizations have deployed some form of enterprise search to improve the 'findability' of information. Debates persist as to whether thesauri and manual indexing or automated machine learning techniques should be used to enhance discovery of information. In addition, the extent to which a knowledge organization system (KOS) enhances discoveries or indeed blinds us to new ones remains a moot point. The oil and gas industry served as a case study, using a representative organization. Drawing on prior research, a theoretical model is presented which aims to overcome the shortcomings of each approach. This synergistic model could help to re-conceptualize the 'manual' versus 'automatic' debate in many enterprises, accommodating a broader range of information needs. This may enable enterprises to develop more effective information and knowledge management strategies and ease the tension between what are often perceived as mutually exclusive competing approaches. Certain aspects of the theoretical model may be transferable to other industries, which is an area for further research.

    Laruelle Qua Stiegler: On Non-Marxism and the Transindividual

    Alexander R. Galloway and Jason R. LaRiviére’s article “Compression in Philosophy” seeks to pose François Laruelle’s engagement with metaphysics against Bernard Stiegler’s epistemological rendering of idealism. Identifying Laruelle as the theorist of genericity, through which mankind and the world are identified through an index of “opacity,” the authors argue that Laruelle does away with all deleterious philosophical “data.” Laruelle’s generic immanence is posed against Stiegler’s process of retention and discretization, as Galloway and LaRiviére argue that Stiegler’s philosophy seeks to reveal an enchanted natural world through the development of noesis. By further developing Laruelle and Stiegler’s Marxian projects, I seek to demonstrate the relation between Stiegler’s artefaction and “compression” while, simultaneously, I also seek to create further bricolage between Laruelle and Stiegler. I also further elaborate on their distinct engagement(s) with Marx, offering the mold of synthesis as an alternative to compression when considering Stiegler’s work on transindividuation. In turn, this paper seeks to survey some of the contemporary theorists drawing from Stiegler (Yuk Hui, Alexander Wilson and Daniel Ross) and Laruelle (Anne-Françoise Schmidt, Gilles Grelet, Ray Brassier, Katerina Kolozova, John Ó Maoilearca and Jonathan Fardy) to examine political discourse regarding the posthuman and non-human, with a particular interest in Kolozova’s unified theory of standard philosophy and Capital.

    N-gram Based Text Categorization Method for Improved Data Mining

    Though Naïve Bayes text classifiers are widely used because of their simplicity and effectiveness, techniques for improving their performance have rarely been studied. Naïve Bayes classifiers, which are widely used for text classification in machine learning, are based on the conditional probability of features belonging to a class, where the features are chosen by feature selection methods. However, their performance is often imperfect because they do not model text well, because of inappropriate feature selection, and because of some disadvantages of Naïve Bayes itself. Sentiment Classification or Text Classification is the act of taking a set of labeled text documents, learning a correlation between a document’s contents and its corresponding labels, and then predicting the labels of a set of unlabeled test documents as well as possible. Text Classification is also sometimes called Text Categorization. Text classification has many applications in natural language processing tasks such as e-mail filtering, intrusion detection systems, news filtering, prediction of user preferences, and organization of documents. The Naïve Bayes model makes strong assumptions about the data: it assumes that words in a document are independent. This assumption is clearly violated in natural language text: there are various types of dependences between words induced by the syntactic, semantic, pragmatic and conversational structure of a text. Also, the particular form of the probabilistic model makes assumptions about the distribution of words in documents that are violated in practice. We address this problem and show that it can be solved by modeling text data differently using N-Grams. N-gram Based Text Categorization is a simple method based on statistical information about the usage of sequences of words. We conducted an experiment to demonstrate that our simple modification is able to improve the performance of Naïve Bayes for text classification significantly.
Keywords: Data Mining, Text Classification, Text Categorization, Naïve Bayes, N-Grams
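The idea of feeding word sequences rather than single words into Naïve Bayes can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `NGramNaiveBayes`, the helper `word_ngrams`, the Laplace smoothing parameter `alpha`, and the four-sentence toy corpus are all assumptions made for illustration.

```python
import math
from collections import Counter, defaultdict


def word_ngrams(text, n_max=2):
    """Return all word n-grams of length 1..n_max as space-joined strings."""
    tokens = text.lower().split()
    grams = []
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            grams.append(" ".join(tokens[i:i + n]))
    return grams


class NGramNaiveBayes:
    """Multinomial Naive Bayes over word n-gram counts (hypothetical helper)."""

    def __init__(self, n_max=2, alpha=1.0):
        self.n_max = n_max    # longest n-gram used as a feature
        self.alpha = alpha    # Laplace smoothing constant (an assumption)

    def fit(self, texts, labels):
        self.class_docs = Counter(labels)           # documents per class
        self.feature_counts = defaultdict(Counter)  # n-gram counts per class
        self.vocab = set()
        for text, label in zip(texts, labels):
            grams = word_ngrams(text, self.n_max)
            self.feature_counts[label].update(grams)
            self.vocab.update(grams)
        self.total_docs = len(labels)
        return self

    def predict(self, text):
        # n-grams never seen in training are ignored
        grams = [g for g in word_ngrams(text, self.n_max) if g in self.vocab]
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_docs.items():
            counts = self.feature_counts[label]
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            score = math.log(doc_count / self.total_docs)  # log prior
            for g in grams:
                score += math.log((counts[g] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy corpus (made up): single words are equally frequent in both classes,
# so only the bigrams "not good" / "not bad" separate them.
train_texts = [
    "the film was good",
    "the film was not bad",
    "the film was bad",
    "the film was not good",
]
train_labels = ["pos", "pos", "neg", "neg"]

model = NGramNaiveBayes(n_max=2).fit(train_texts, train_labels)
print(model.predict("not good"))
print(model.predict("not bad"))
```

In this toy corpus every single word occurs equally often in both classes, so a unigram-only model has nothing to separate them; adding bigrams captures the local dependence between "not" and the word that follows it, which is the kind of word dependence the abstract says the independence assumption ignores.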

    Users’ Continuance Participation in the Online Peer-to-peer Healthcare Community: A Text Mining Approach

    Online peer-to-peer healthcare communities are platforms where dispersed groups of patients and their families query information, seek and offer support, and connect with others. The success of such communities relies on users’ ongoing involvement to generate benefits for both individuals and the communities. This study attempts to understand users’ continuance participation in online peer-to-peer healthcare communities by classifying users’ goals of participation based on user-generated text content. We propose a rule-based classification framework to categorize users’ goals of posting content into four categories: information seeking, experience sharing, information sharing, and social interaction. We formalize and test the relationship between users’ continuance participation and all four posting goals, and find that the first three goals have a significant impact on users’ continuance participation. Our findings can help researchers and practitioners better understand users’ behavior in online peer-to-peer healthcare communities.
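A rule-based classifier of posting goals might look roughly like the sketch below. The abstract does not give the study's rules, so the regular-expression patterns in `GOAL_RULES` and the function name `classify_goals` are purely hypothetical; only the four category names come from the abstract.

```python
import re

# Hypothetical keyword rules; the study's actual rules are not given in the abstract.
GOAL_RULES = [
    ("information seeking",
     [r"\?", r"\bdoes anyone\b", r"\bhow (do|can|long)\b", r"\bany advice\b"]),
    ("experience sharing",
     [r"\bi was diagnosed\b", r"\bmy (doctor|treatment|surgery)\b", r"\bi (had|have been)\b"]),
    ("information sharing",
     [r"https?://", r"\baccording to\b", r"\bstudy (shows|found)\b"]),
    ("social interaction",
     [r"\bthank(s| you)\b", r"\bhugs?\b", r"\bgood luck\b", r"\bwelcome\b"]),
]


def classify_goals(post):
    """Return every goal whose rule set matches the post (a post may match several)."""
    text = post.lower()
    return [goal for goal, patterns in GOAL_RULES
            if any(re.search(p, text) for p in patterns)]


print(classify_goals("Does anyone know how long recovery takes?"))
print(classify_goals("Thank you all, good luck with everything"))
```

Returning a list rather than a single label is a design choice of this sketch: one post can plausibly serve several goals at once, and a downstream analysis could count each goal per user.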

    Complex adaptive systems based data integration: theory and applications

    Data Definition Languages (DDLs) have been created and used to represent data in programming languages and in database dictionaries. This representation includes descriptions in the form of data fields and relations in the form of a hierarchy, with the common exception of relational databases where relations are flat. Network computing created an environment that enables relatively easy and inexpensive exchange of data. What followed was the creation of new DDLs claiming better support for automatic data integration. It is uncertain from the literature whether any real progress has been made toward achieving an ideal state or limit condition of automatic data integration. This research asserts that difficulties in accomplishing integration are indicative of socio-cultural systems in general and are caused by some measurable attributes common in DDLs. This research’s main contributions are: (1) a theory of data integration requirements to fully support automatic data integration from autonomous heterogeneous data sources; (2) the identification of measurable related abstract attributes (Variety, Tension, and Entropy); (3) the development of tools to measure them. The research uses a multi-theoretic lens to define and articulate these attributes and their measurements. The proposed theory is founded on the Law of Requisite Variety, Information Theory, Complex Adaptive Systems (CAS) theory, Sowa’s Meaning Preservation framework and Zipf distributions of words and meanings. Using the theory, the attributes, and their measures, this research proposes a framework for objectively evaluating the suitability of any data definition language with respect to degrees of automatic data integration. This research uses thirteen data structures constructed with various DDLs from the 1960s to date. No DDL examined (and therefore no DDL similar to those examined) is designed to satisfy the Law of Requisite Variety. No DDL examined is designed to support CAS evolutionary processes that could result in fully automated integration of heterogeneous data sources. There is no significant difference in measures of Variety, Tension, and Entropy among the DDLs investigated in this research. A direction to overcome the common limitations discovered in this research is suggested and tested by proposing GlossoMote, a theoretical, mathematically sound description language that satisfies the data integration theory requirements. GlossoMote is not merely a new syntax; it is a drastic departure from existing DDL constructs. The feasibility of the approach is demonstrated with a small-scale experiment and evaluated using the proposed assessment framework and other means. The promising results call for additional research to evaluate the commercial potential of GlossoMote’s approach.
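The abstract does not define how Entropy is measured for a DDL. As a loosely related illustration only, the sketch below computes the standard Shannon entropy from Information Theory over the empirical distribution of some symbols; applying it to field type names, and the example list `field_types`, are assumptions rather than the thesis's method.

```python
import math
from collections import Counter


def shannon_entropy(symbols):
    """Shannon entropy, in bits, of the empirical distribution of `symbols`."""
    counts = Counter(symbols)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Hypothetical example: type names of the fields in one data structure.
field_types = ["string", "string", "int", "date"]
print(shannon_entropy(field_types))
```

A structure whose fields all share one type has zero entropy under this measure, while a uniform spread over many types maximizes it.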

    Understanding Collaborative Sensemaking for System Design — An Investigation of Musicians' Practice

    There is surprisingly little written in information science and technology literature about the design of tools used to support the collaboration of creators. Understanding collaborative sensemaking through the use of language has been traditionally applied to non-work domains, but this method is also well-suited for informing hypotheses about the design of collaborative systems. The presence of ubiquitous, mobile technology and the development of multi-user virtual spaces invite investigation of design which is based on naturalistic, real world, creative group behaviors, including the collaborative work of musicians. This thesis considers the co-construction of new (musical) knowledge by small groups. Co-construction of new knowledge is critical to the definition of an information system because it emphasizes coordination and resource sharing among group members (versus individual members independently doing their own tasks and only coming together to collate their contributions as a final product). This work situates the locus of creativity in the process itself, rather than in the output (the musical result) or the individuals (members of the band). This thesis describes a way to apply quantitative observations to inform qualitative assessment of the characteristics of collaborative sensemaking in groups. Conversational data were obtained from nine face-to-face collaborative composing sessions, involving three separate bands producing 18 hours of recorded interactions. Topical characteristics of the discussion, namely objects, plans, properties and performance, as well as emergent patterns of generative, evaluative, revision, and management conversational acts within the group, were seen as indicative of knowledge construction. The findings report the use of collaborative pathways: iterative cycles of generation, evaluation and revision of temporary solutions used to move the collaboration forward. In addition, bracketing of temporary solutions served to help collaborators reuse content and offload attentional resources. Ambiguity in language, evaluation criteria, goal formation, and group awareness meant that existing knowledge representations were insufficient in making sense of incoming data and necessitated reformulating those representations. Further, strategic use of affective language was found to be instrumental in bridging knowledge gaps. Based on these findings, features of a collaborative system are proposed to help in facilitating sensemaking routines at various stages of a creative task. This research contributes to the theoretical understanding of collaborative sensemaking during non-work, creative activities in order to inform the design of systems for supporting these activities. By studying an environment which forms a potential microcosm of virtual interaction between groups, it provides a framework for understanding and automating collaborative discussion content in terms of the features of dialogue.

    Theory and Applications for Advanced Text Mining

    Due to the growth of computer and web technologies, we can easily collect and store large amounts of text data, and we can expect these data to contain useful knowledge. Text mining techniques have been studied intensively since the late 1990s in order to extract that knowledge. Although many important techniques have been developed, the text mining research field continues to expand to meet the needs arising from various application fields. This book is composed of 9 chapters introducing advanced text mining techniques, ranging from relation extraction to techniques for under-resourced or less-resourced languages. I believe that this book will offer new knowledge in the text mining field and help many readers open up new research fields.

    Management of medical records in support of primary health care services of Diepsloot clinics in Gauteng Province of South Africa

    Text in English with abstracts in English, Afrikaans and isiZulu and keywords in English. The study investigated the management of medical records in the Primary Health Care services (PHCs) of Diepsloot. The study investigated the regulatory framework, records infrastructure, records security, records management staff skills and the filing system. A qualitative design guided by the interpretive paradigm was used to guide the case study. Interviews, focus groups, and observations generated data from 50 participants. The study revealed that the regulatory instruments used to manage records lack implementation and compliance. There was a lack of security measures, a shortage of records management infrastructure and inconsistency in the filing system. There is a low level of skill among the records management staff. The study recommended the implementation of a regulatory policy that will guide and ensure effective governance of records in PHCs. Records should be secure from misuse by unscrupulous individuals. PHC records need to be managed by experienced professionals. The filing system should be easily accessible. Information Science. M. Inf

    The seventeen theoretical constructs of information searching and information retrieval

    In this article, we identify, compare, and contrast theoretical constructs for the fields of information searching and information retrieval to emphasize the uniqueness of and synergy between the fields. Theoretical constructs are the foundational elements that underpin a field's core theories, models, assumptions, methodologies, and evaluation metrics. We provide a framework to compare and contrast the theoretical constructs in the fields of information searching and information retrieval using intellectual perspective and theoretical orientation. The intellectual perspectives are information searching, information retrieval, and cross-cutting; and the theoretical orientations are information, people, and technology. Using this framework, we identify 17 significant constructs in these fields, contrasting the differences and comparing the similarities. We discuss the impact of the interplay among these constructs for moving research forward within both fields. Although there is tension between the fields due to contradictory constructs, an examination shows a trend toward convergence. We discuss the implications for future research within the information searching and information retrieval fields. Peer Reviewed. http://deepblue.lib.umich.edu/bitstream/2027.42/77538/1/21358_ftp.pd