Search CORE

176 research outputs found

MT-MAG: Accurate and interpretable machine learning for complete or partial taxonomic assignments of metagenome-assembled genomes

Author: Wanxin Li
Publication venue: 'University of Waterloo'
Publication date: 12/05/2022
Field of study

We propose MT-MAG, a novel machine learning-based software tool for the complete or partial hierarchically-structured taxonomic classification of metagenome-assembled genomes (MAGs). MT-MAG is capable of classifying large and diverse metagenomic datasets: a total of 245.68 Gbp in the training sets, and 9.6 Gbp in the test sets analyzed in this study. MT-MAG is, to the best of our knowledge, the first machine learning method for taxonomic assignment of metagenomic data that offers a “partial classification” option, whereby a classification at a higher taxonomic level is provided for MAGs that cannot be classified to the Species level. MT-MAG outputs complete or partial classification paths, and interpretable numerical classification confidences of its classifications, at all taxonomic ranks. To assess the performance of MT-MAG, we define a “weighted classification accuracy,” with a weighting scheme reflecting the fact that partial classifications at different ranks are not equally informative. For the two benchmarking datasets analyzed (genomes from human gut microbiome species, and bacterial and archaeal genomes assembled from cow rumen metagenomic sequences), MT-MAG achieves an average of 80.13% in weighted classification accuracy. At the Species level, MT-MAG outperforms DeepMicrobes, the only other comparable software tool, by an average of 35.75% in weighted classification accuracy. In addition, MT-MAG is able to completely classify an average of 67.7% of the sequences at the Species level, compared with DeepMicrobes which only classifies 47.45%. Moreover, MT-MAG provides additional information for sequences that it could not classify at the Species level, resulting in the partial or complete classification of 95.15%, of the genomes in the datasets analyzed. Lastly, unlike other taxonomic assignment tools (e.g., GDTB-Tk), MT-MAG is an alignment-free and genetic marker-free tool, able to provide additional bioinformatics analysis to confirm existing or tentative taxonomic assignments

University of Waterloo's Institutional Repository

First Language Loss and Maintenance in Adolescents and Young Adults from Immigrant Backgrounds

Author: Li Wanxin
Publication venue: Scholars Commons @ Laurier
Publication date: 01/01/2022
Field of study

Abstract Language attrition is a documented phenomenon that occurs when individuals progressively lose their first language (Schmid et al., 2007). This is particularly common among individuals who relocate to a country that speaks a foreign language that differs from their first language, as the societal language eventually becomes their dominant language. Deterioration and loss of the first language (L1) may result in consequences such as loss of ethnic and cultural identity, leading to the loss of a link to one’s country of origin (Cho & Krashen, 1998). Thus, the present study examined factors that may contribute to L1 attrition. The present study aimed to assess individuals’ L1 skills in relation to their cultural affiliation with their heritage and/or Canadian backgrounds after the participants emigrated from their home country to a foreign country (Canada). This study also looked at whether participants’ L1 skills are preserved if they are residing in a multigenerational household. Participants were instructed to complete a series of surveys that measured their receptive vocabulary size in English, levels of acculturation to the host culture, and language dominance. Participants were also scheduled for a one-on-one Zoom session to assess their verbal fluency in their L1 and English. Group comparisons based on age of arrival and being born in Canada showed differences in self-reports of L1 and L2 skills, enculturation and acculturation. Group differences were also found for groups based on whether or not participants attended school only in Canada or also in another country. Also group differences were found based on differences in language dominance as measured by the bilingual dominance scale. However, no effect was found for participants who lived in a multi-generational home and those who did not. This exploratory study may provide insight into the field of language development and literacy by showing a comprehensive relationship between L1 loss and acculturation

Wilfrid Laurier University

Electric field enhancement of pool boiling of dielectric fluids on pillar-structured surfaces: A lattice Boltzmann study

Author: Li Qing
Li Wanxin
Tang Shi
Yu Yue
Publication venue: 'AIP Publishing'
Publication date: 24/08/2022
Field of study

In this paper, by using a phase-change lattice Boltzmann (LB) model coupled with an electric field model, we numerically investigate the performance and enhancement mechanism of pool boiling of dielectric fluids on pillar-structured surfaces under an electric field. The numerical investigation reveals that applying an electric field causes both positive and negative influences on the pool boiling of dielectric fluids on pillar-structured surfaces. It is found that, under the action of an electric field, the electric force prevents the bubbles nucleated in the channels from crossing the edges of the pillar tops. On the one hand, such an effect results in the bubble coalescence in the channels and blocks the paths of liquid supply for the channels, which leads to the deterioration of pool boiling in the medium-superheat regime. On the other hand, it prevents the coalescence between the bubbles in the channels and those on the pillar tops, which suppresses the formation of a continuous vapor film and therefore delays the occurrence of boiling crisis. Meanwhile, the electric force can promote the departure of the bubbles on the pillar tops. Accordingly, the critical heat flux (CHF) can be improved. Based on the revealed mechanism, wettability-modified regions are applied to the pillar tops for further enhancing the boiling heat transfer. It is shown that the boiling performance on pillar-structured surfaces can be enhanced synergistically with the CHF being increased by imposing an electric field and the maximum heat transfer coefficient being improved by applying mixed wettability to the pillar-structured surfaces.Comment: 29 pages, 16 figure

arXiv.org e-Print Archive

A Hierarchical and Location-aware Consensus Protocol for IoT-Blockchain Applications

Author: Guo Hao
Li Wanxin
Nejad Mark
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 28/05/2023
Field of study

Blockchain-based IoT systems can manage IoT devices and achieve a high level of data integrity, security, and provenance. However, incorporating existing consensus protocols in many IoT systems limits scalability and leads to high computational cost and consensus latency. In addition, location-centric characteristics of many IoT applications paired with limited storage and computing power of IoT devices bring about more limitations, primarily due to the location-agnostic designs in blockchains. We propose a hierarchical and location-aware consensus protocol (LH-Raft) for IoT-blockchain applications inspired by the original Raft protocol to address these limitations. The proposed LH-Raft protocol forms local consensus candidate groups based on nodes' reputation and distance to elect the leaders in each sub-layer blockchain. It utilizes a threshold signature scheme to reach global consensus and the local and global log replication to maintain consistency for blockchain transactions. To evaluate the performance of LH-Raft, we first conduct an extensive numerical analysis based on the proposed reputation mechanism and the candidate group formation model. We then compare the performance of LH-Raft against the classical Raft protocol from both theoretical and experimental perspectives. We evaluate the proposed threshold signature scheme using Hyperledger Ursa cryptography library to measure various consensus nodes' signing and verification time. Experimental results show that the proposed LH-Raft protocol is scalable for large IoT applications and significantly reduces the communication cost, consensus latency, and agreement time for consensus processing.Comment: Published in IEEE Transactions on Network and Service Management ( Volume: 19, Issue: 3, September 2022). arXiv admin note: text overlap with arXiv:2305.1696

arXiv.org e-Print Archive

Traffic Prediction using Artificial Intelligence: Review of Recent Advances and Emerging Opportunities

Author: Li Wanxin
Meese Collin
Nejad Mark
Shaygan Maryam
Zhao Xiaolong
Publication venue: 'Elsevier BV'
Publication date: 04/06/2023
Field of study

Traffic prediction plays a crucial role in alleviating traffic congestion which represents a critical problem globally, resulting in negative consequences such as lost hours of additional travel time and increased fuel consumption. Integrating emerging technologies into transportation systems provides opportunities for improving traffic prediction significantly and brings about new research problems. In order to lay the foundation for understanding the open research challenges in traffic prediction, this survey aims to provide a comprehensive overview of traffic prediction methodologies. Specifically, we focus on the recent advances and emerging research opportunities in Artificial Intelligence (AI)-based traffic prediction methods, due to their recent success and potential in traffic prediction, with an emphasis on multivariate traffic time series modeling. We first provide a list and explanation of the various data types and resources used in the literature. Next, the essential data preprocessing methods within the traffic prediction context are categorized, and the prediction methods and applications are subsequently summarized. Lastly, we present primary research challenges in traffic prediction and discuss some directions for future research.Comment: Published in Transportation Research Part C: Emerging Technologies (TR_C), Volume 145, 202

arXiv.org e-Print Archive

B^2SFL: A Bi-level Blockchained Architecture for Secure Federated Learning-based Traffic Prediction

Author: Guo Hao
Li Wanxin
Meese Collin
Nejad Mark
Shen Chien-Chung
Publication venue
Publication date: 23/10/2023
Field of study

Federated Learning (FL) is a privacy-preserving machine learning (ML) technology that enables collaborative training and learning of a global ML model based on aggregating distributed local model updates. However, security and privacy guarantees could be compromised due to malicious participants and the centralized FL server. This article proposed a bi-level blockchained architecture for secure federated learning-based traffic prediction. The bottom and top layer blockchain store the local model and global aggregated parameters accordingly, and the distributed homomorphic-encrypted federated averaging (DHFA) scheme addresses the secure computation problems. We propose the partial private key distribution protocol and a partially homomorphic encryption/decryption scheme to achieve the distributed privacy-preserving federated averaging model. We conduct extensive experiments to measure the running time of DHFA operations, quantify the read and write performance of the blockchain network, and elucidate the impacts of varying regional group sizes and model complexities on the resulting prediction accuracy for the online traffic flow prediction task. The results indicate that the proposed system can facilitate secure and decentralized federated learning for real-world traffic prediction tasks.Comment: Paper accepted for publication in IEEE Transactions on Services Computing (TSC

arXiv.org e-Print Archive

BFRT: Blockchained Federated Learning for Real-time Traffic Flow Prediction

Author: Asif Syed Ali
Chen Hang
Li Wanxin
Meese Collin
Nejad Mark
Shen Chien-Chung
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 28/05/2023
Field of study

Accurate real-time traffic flow prediction can be leveraged to relieve traffic congestion and associated negative impacts. The existing centralized deep learning methodologies have demonstrated high prediction accuracy, but suffer from privacy concerns due to the sensitive nature of transportation data. Moreover, the emerging literature on traffic prediction by distributed learning approaches, including federated learning, primarily focuses on offline learning. This paper proposes BFRT, a blockchained federated learning architecture for online traffic flow prediction using real-time data and edge computing. The proposed approach provides privacy for the underlying data, while enabling decentralized model training in real-time at the Internet of Vehicles edge. We federate GRU and LSTM models and conduct extensive experiments with dynamically collected arterial traffic data shards. We prototype the proposed permissioned blockchain network on Hyperledger Fabric and perform extensive tests using virtual machines to simulate the edge nodes. Experimental results outperform the centralized models, highlighting the feasibility of our approach for facilitating privacy-preserving and decentralized real-time traffic flow prediction.Comment: Published in 2022 22nd IEEE International Symposium on Cluster, Cloud and Internet Computing (CCGrid

arXiv.org e-Print Archive