587 research outputs found
Using Linguistic Features to Estimate Suicide Probability of Chinese Microblog Users
If people with high risk of suicide can be identified through social media
like microblog, it is possible to implement an active intervention system to
save their lives. Based on this motivation, the current study administered the
Suicide Probability Scale(SPS) to 1041 weibo users at Sina Weibo, which is a
leading microblog service provider in China. Two NLP (Natural Language
Processing) methods, the Chinese edition of Linguistic Inquiry and Word Count
(LIWC) lexicon and Latent Dirichlet Allocation (LDA), are used to extract
linguistic features from the Sina Weibo data. We trained predicting models by
machine learning algorithm based on these two types of features, to estimate
suicide probability based on linguistic features. The experiment results
indicate that LDA can find topics that relate to suicide probability, and
improve the performance of prediction. Our study adds value in prediction of
suicidal probability of social network users with their behaviors
Timeline Follow-Back Versus Global Self-Reports of Tobacco Smoking: A Comparison of Findings With Non-Daily Smokers
Methods assessing non-daily smoking are of concern because biochemical measures can not verify self-reports beyond 7 days. This study compares two self-reported smoking measures for non-daily smokers. A total of 389 college students, (48% female, 96% white, mean age of 19) smoking between 1 and 29 days out of the past 30, completed computer assessments in three cohorts with the order of administration of the measures counterbalanced. Values from the two measures were highly correlated. Comparisons of Timeline Follow-Back (TLFB) with the global questions for the total sample of non-daily smokers yielded statistically significant differences (p\u3c.001), albeit small, between measures with the TLFB resulting on average in 2.38 more total cigarettes smoked out of the past 30 days, 0.46 less smoking days, and 0.21 more cigarettes smoked per day. Analyses by level of smoking showed that the discordance between the measures differed by frequency of smoking. Global questions of days smoked resulted in frequent reporting in multiples of five days, suggesting digit bias. Overall the two measures of smoking were highly correlated and equally effective for identifying any smoking in a 30-day period among non-daily smokers
Structure of a dimeric photosystem II complex from a cyanobacterium acclimated to far-red light
Photosystem II (PSII) is the water-splitting enzyme central to oxygenic photosynthesis. To drive water oxidation, light is harvested by accessory pigments, mostly chlorophyll (Chl) a molecules, which absorb visible light (400–700 nm). Some cyanobacteria facultatively acclimate to shaded environments by altering their photosynthetic machinery to additionally absorb far-red light (FRL, 700–800 nm), a process termed far-red light photoacclimation or FaRLiP. During far-red light photoacclimation, FRL-PSII is assembled with FRL-specific isoforms of the subunits PsbA, PsbB, PsbC, PsbD, and PsbH, and some Chl-binding sites contain Chls d or f instead of the usual Chl a. The structure of an apo-FRL-PSII monomer lacking the FRL-specific PsbH subunit has previously been determined, but visualization of the dimeric complex has remained elusive. Here, we report the cryo-EM structure of a dimeric FRL–PSII complex. The site assignments for Chls d and f are consistent with those assigned in the previous apo-FRL-PSII monomeric structure. All sites that bind Chl d or Chl f at high occupancy exhibit a FRL-specific interaction of the formyl moiety of the Chl d or Chl f with the protein environment, which in some cases involves a phenylalanine sidechain. The structure retains the FRL-specific PsbH2 subunit, which appears to alter the energetic landscape of FRL-PSII, redirecting energy transfer from the phycobiliprotein complex to a Chl f molecule bound by PsbB2 that acts as a bridge for energy transfer to the electron transfer chain. Collectively, these observations extend our previous understanding of the structure-function relationship that allows PSII to function using lower energy FRL
Building data warehouses in the era of big data: an approach for scalable and flexible big data warehouses
During the last few years, the concept of Big Data Warehousing gained significant attention from the scientific community, highlighting the need to make design changes to the traditional Data Warehouse (DW) due to its limitations, in order to achieve new characteristics relevant in Big Data contexts (e.g., scalability on commodity hardware, real-time performance, and flexible storage). The state-of-the-art in Big Data Warehousing reflects the young age of the concept, as well as ambiguity and the lack of common approaches to build Big Data Warehouses (BDWs). Consequently, an approach to design and implement these complex systems is of major relevance to business analytics researchers and practitioners. In this tutorial, the design and implementation of BDWs is targeted, in order to present a general approach that researchers and practitioners can follow in their Big Data Warehousing projects, exploring several demonstration cases focusing on system design and data modelling examples in areas like smart cities, retail, finance, manufacturing, among others
A meta-analysis of state-of-the-art electoral prediction from Twitter data
Electoral prediction from Twitter data is an appealing research topic. It
seems relatively straightforward and the prevailing view is overly optimistic.
This is problematic because while simple approaches are assumed to be good
enough, core problems are not addressed. Thus, this paper aims to (1) provide a
balanced and critical review of the state of the art; (2) cast light on the
presume predictive power of Twitter data; and (3) depict a roadmap to push
forward the field. Hence, a scheme to characterize Twitter prediction methods
is proposed. It covers every aspect from data collection to performance
evaluation, through data processing and vote inference. Using that scheme,
prior research is analyzed and organized to explain the main approaches taken
up to date but also their weaknesses. This is the first meta-analysis of the
whole body of research regarding electoral prediction from Twitter data. It
reveals that its presumed predictive power regarding electoral prediction has
been rather exaggerated: although social media may provide a glimpse on
electoral outcomes current research does not provide strong evidence to support
it can replace traditional polls. Finally, future lines of research along with
a set of requirements they must fulfill are provided.Comment: 19 pages, 3 table
- …