12 research outputs found
Sentiment Analysis Across Multiple African Languages: A Current Benchmark
Sentiment analysis is a fundamental and valuable task in NLP. However, due to
limitations in data and technological availability, research into sentiment
analysis of African languages has been fragmented and lacking. With the recent
release of the AfriSenti-SemEval Shared Task 12, hosted as a part of The 17th
International Workshop on Semantic Evaluation, an annotated sentiment analysis
of 14 African languages was made available. We benchmarked and compared current
state-of-art transformer models across 12 languages and compared the
performance of training one-model-per-language versus
single-model-all-languages. We also evaluated the performance of standard
multilingual models and their ability to learn and transfer cross-lingual
representation from non-African to African languages. Our results show that
despite work in low resource modeling, more data still produces better models
on a per-language basis. Models explicitly developed for African languages
outperform other models on all tasks. Additionally, no one-model-fits-all
solution exists for a per-language evaluation of the models evaluated.
Moreover, for some languages with a smaller sample size, a larger multilingual
model may perform better than a dedicated per-language model for sentiment
classification.Comment: Accepted to be published as part of SIAIA @ AAAI 202
Evaluating Novel Mask-RCNN Architectures for Ear Mask Segmentation
The human ear is generally universal, collectible, distinct, and permanent.
Ear-based biometric recognition is a niche and recent approach that is being
explored. For any ear-based biometric algorithm to perform well, ear detection
and segmentation need to be accurately performed. While significant work has
been done in existing literature for bounding boxes, a lack of approaches
output a segmentation mask for ears. This paper trains and compares three newer
models to the state-of-the-art MaskRCNN (ResNet 101 +FPN) model across four
different datasets. The Average Precision (AP) scores reported show that the
newer models outperform the state-of-the-art but no one model performs the best
over multiple datasets.Comment: Accepted into ICCBS 202
Brightness data in Rayleigh for OI 630.0 nm (red line) and OI 557.7 (green line) through the night of August 21-22, 2017 obtained using the HiT&MIS from Carbondale, IL
This file contains brightness data in Rayleigh for OI 630.0 nm (red line), OI 557.7 (green line) and NeI 630.5 nm (cloud indicator) through the night of August 21-22, 2017 obtained using the HiT&MIS from Carbondale, IL.
This is a python .npz file that contain numpy arrays with uncertainities, so you would need the uncertainties as well as the numpy modules for python.
On python it can be opened up as follows:
import numpy as np
import uncertainties.unumpy as unp
dload=np.load("<file location>/eclipse_night_final.npz")#brightnesses are saved as numpy array files
int630=dload["int630"] #630 nm red line brightness
int557=dload["int557"] #557.7 nm green line brightness
int6305=dload["int6305"]# Ne I 630.5 nm brightness for cloud activity
time=dload["time"] #capture times (Local times, Central Daylight time)
# uncertainities can be accessed as
b630value=unp.nominal_values(int630) # 630 nm brightnesses values
e630=unp.std_devs(int630)# 630 nm uncertainities.
please email me at [email protected] if you have any questions
Hey, Siri! Why Are You Biased against Women? (Student Abstract)
The intersection of pervasive technology and verbal communication has resulted in the creation of Automatic Speech Recognition Systems (ASRs), which automate the conversion of spontaneous speech into texts. ASR enables human-computer interactions through speech and is rapidly integrated into our daily lives. However, the research studies on current ASR technologies have reported unfulfilled social inclusivity and accentuated biases and stereotypes towards minorities. In this work, we provide a review of examples and evidence to demonstrate preexisting sexist behavior in ASR systems through a systematic review of research literature over the past five years. For each article, we also provide the ASR technology used, highlight specific instances of reported bias, discuss the impact of this bias on the female community, and suggest possible methods of mitigation. We believe this paper will provide insights into the harm that unchecked AI-powered technologies can have on a community by contributing to the growing body of research on this topic and underscoring the need for technological inclusivity for all demographics, especially women
Evaluating Factors Influencing COVID-19 Outcomes across Countries Using Decision Trees (Student Abstract)
While humanity prepares for a post-pandemic world and a return to normality through worldwide vaccination campaigns, each country experienced different levels of impact based on natural, political, regulatory, and socio-economic factors. To prepare for a possible future with COVID-19 and similar outbreaks, it is imperative to understand how each of these factors impacted spread and mortality. We train and tune two decision tree regression models to predict COVID-related cases and deaths using a multitude of features. Our findings suggest that, at the country-level, GDP per capita and comorbidity mortality rate are best predictors for both outcomes. Furthermore, latitude and smoking prevalence are also significantly related to COVID-related spread and mortality
Multi-spectral and multi-instrument observation of TIDs following the Total Solar Eclipse of August 21, 2017
Wave‐like structures in the upper atmospheric nightglow brightness were observed on the night of 22 August 2017, approximately 8 hr following a total solar eclipse. These wave‐like perturbations are signatures of atmospheric gravity waves and associated traveling ionospheric disturbances (TIDs). Observations were made in the red line (OI 630.0 nm) and the green line (OI 557.7 nm) from Carbondale, IL, at 2–10 UTC on 22 August 2017. Based on wavelet analyses, the dominant time period in both the red and green lines was around 1.5 hr. Differential total electron content data obtained from Global Positioning System total electron content measurements at Carbondale, IL, and ionospheric parameters from digisonde measurements at Idaho National Laboratory and Millstone Hill showed a similar dominant time period. Based on these observations and their correlation with geomagnetic indices, the TIDs appear to be associated with geomagnetic disturbances. In addition, by modeling the ionosphere‐thermosphere system's response to the eclipse, it was seen that while the eclipse enhanced the O/N2 ratio and electron density (Ne) at 250 km during our observation period, it did not affect the TIDs. Vertical (7 m/s) and meridional (616 m/s) phase velocities of the TIDs were estimated using cross‐correlation analysis between red and green line brightness profiles and spectral analysis of the differential total electron content keogram, respectively. This provides a method to characterize the three‐dimensional wave properties of TIDs
Laboratory Study of the Cameron Bands and UV Doublet in the Middle Ultraviolet 180-300 nm by Electron Impact upon CO2 with Application to Mars
peer reviewedWe have observed electron impact fluorescence from CO2 to excite the Cameron bands (CBs), CO (a 3Π → X 1Σ+; 180-280 nm), the first-negative group (1NG) bands, CO+ (B 2Σ+ → X 2Σ+; 180-320 nm), the fourth-positive group (4PG) bands, CO (A 1Π → X 1Σ+; 111-280 nm), and the UV doublet, CO2+ ( B 2Σ u + → X 2Π g ; 288.3 and 289.6 nm) in the ultraviolet (UV). This wavelength range matches the spectral region of past and present spacecraft equipped to observe UV dayglow and aurora emissions from the thermospheres (100-300 km) of Mars and Venus. Our large vacuum system apparatus is able to measure the emission cross sections of the strongest optically forbidden UV transitions found in planetary spectra. Based on our cross-sectional measurements, previous CB emission cross-sectional errors exceed a factor of 3. The UV doublet lifetime is perturbed through B 2Σu + − A 2Π u spin-orbit coupling. Forward modeling codes of the Mars dayglow have not been accurate in the mid-UV due to systematic errors in these two emission cross sections. We furnish absolute emission cross sections for several band systems over electron energies 20-100 eV for CO2. We present a CB lifetime, which together with emission cross sections, furnish a set of fundamental physical constants for electron transport codes such as AURIC (Atmospheric Ultraviolet Radiance Integrated Code). AURIC and Trans-Mars are used in the analysis of UV spectra from the Martian dayglow and aurora
A single pseudouridine on rRNA regulates ribosome structure and function in the mammalian parasite Trypanosoma brucei
Abstract Trypanosomes are protozoan parasites that cycle between insect and mammalian hosts and are the causative agent of sleeping sickness. Here, we describe the changes of pseudouridine (Ψ) modification on rRNA in the two life stages of the parasite using four different genome-wide approaches. CRISPR-Cas9 knock-outs of all four snoRNAs guiding Ψ on helix 69 (H69) of the large rRNA subunit were lethal. A single knock-out of a snoRNA guiding Ψ530 on H69 altered the composition of the 80S monosome. These changes specifically affected the translation of only a subset of proteins. This study correlates a single site Ψ modification with changes in ribosomal protein stoichiometry, supported by a high-resolution cryo-EM structure. We propose that alteration in rRNA modifications could generate ribosomes preferentially translating state-beneficial proteins
CRC-113 gene expression signature for predicting prognosis in patients with colorectal cancer
Colorectal cancer (CRC) is the third leading cause of global cancer mortality. Recent studies have proposed several gene signatures to predict CRC prognosis, but none of those have proven reliable for predicting prognosis in clinical practice yet due to poor reproducibility and molecular heterogeneity. Here, we have established a prognostic signature of 113 probe sets (CRC-113) that include potential biomarkers and reflect the biological and clinical characteristics. Robustness and accuracy were significantly validated in external data sets from 19 centers in five countries. In multivariate analysis, CRC-113 gene signature showed a stronger prognostic value for survival and disease recurrence in CRC patients than current clinicopathological risk factors and molecular alterations. We also demonstrated that the CRC-113 gene signature reflected both genetic and epigenetic molecular heterogeneity in CRC patients. Furthermore, incorporation of the CRC-113 gene signature into a clinical context and molecular markers further refined the selection of the CRC patients who might benefit from postoperative chemotherapy. Conclusively, CRC-113 gene signature provides new possibilities for improving prognostic models and personalized therapeutic strategies.11Nsciescopu