Search CORE

10,884 research outputs found

Personalized Federated Instruction Tuning via Neural Architecture Search

Author: Chen Mingsong
Feng Junxian
Hu Ming
Weng Jiawen
Zhang Pengyu
Zhou Yingbo
Publication venue
Publication date: 26/02/2024
Field of study

Federated Instruction Tuning (FIT) has shown the ability to achieve collaborative model instruction tuning among massive data owners without sharing private data. However, it still faces two key challenges, i.e., data and resource heterogeneity. Due to the varying data distribution and preferences among data owners, FIT cannot adapt to the personalized data of individual owners. Moreover, clients with superior computational abilities are constrained since they need to maintain the same fine-tuning architecture as the weaker clients. To address these issues, we propose a novel Personalized Federated Instruction Tuning (PerFIT) framework based on architecture search. Specifically, PerFIT allows each client to search for a personalized architecture by expanding the trainable parameter space of the global model followed by pruning the parameters to the original state. This procedure allows personalized instruction fine-tuning within expanded parameter spaces, concurrently preserving the same number of trainable parameters. Furthermore, to release the abilities of heterogeneous computational resources and enhance the performance of personalization on local data, we exploit personalized parameter-wise aggregation. The evaluation with multiple LLMs non-IID scenarios demonstrates that compared to the state-of-the-art FIT methods, our approach can achieve up to a 23% decrease in perplexity

arXiv.org e-Print Archive

Engineering Crowdsourced Stream Processing Systems

Author: Carlos Castillo
Crp Henri Tudor
Ioanna Lykourentzou
Muhammad Imran
Yannick Naudet
Publication venue
Publication date: 04/08/2014
Field of study

A crowdsourced stream processing system (CSP) is a system that incorporates crowdsourced tasks in the processing of a data stream. This can be seen as enabling crowdsourcing work to be applied on a sample of large-scale data at high speed, or equivalently, enabling stream processing to employ human intelligence. It also leads to a substantial expansion of the capabilities of data processing systems. Engineering a CSP system requires the combination of human and machine computation elements. From a general systems theory perspective, this means taking into account inherited as well as emerging properties from both these elements. In this paper, we position CSP systems within a broader taxonomy, outline a series of design principles and evaluation metrics, present an extensible framework for their design, and describe several design patterns. We showcase the capabilities of CSP systems by performing a case study that applies our proposed framework to the design and analysis of a real system (AIDR) that classifies social media messages during time-critical crisis events. Results show that compared to a pure stream processing system, AIDR can achieve a higher data classification accuracy, while compared to a pure crowdsourcing solution, the system makes better use of human workers by requiring much less manual work effort

arXiv.org e-Print Archive

CiteSeerX

Combatting the war against machines : an innovative hands-on approach to coding

Author: Chetty Jacqui
Publication venue
Publication date: 01/01/2017
Field of study

Abstract: The 21st century is an era of technological advances that has surpassed previous decades. This is largely due to the level of innovation in the fields of artificial intelligence, robotics and automation. However, learners are often reluctant to choose computer programming (coding) as a subject due to it’s perceived difficulty. Nevertheless, it is also well known that learners that are introduced to computer programming at a young age become the computer science university graduates of tomorrow

University of Johannesburg Institutional Repository

Distributed Learning with Sparse Communications by Identification

Author: Amini Massih-Reza
Grishchenko Dmitry
Iutzeler Franck
Malick Jérôme
Publication venue
Publication date: 25/06/2020
Field of study

In distributed optimization for large-scale learning, a major performance limitation comes from the communications between the different entities. When computations are performed by workers on local data while a coordinator machine coordinates their updates to minimize a global loss, we present an asynchronous optimization algorithm that efficiently reduces the communications between the coordinator and workers. This reduction comes from a random sparsification of the local updates. We show that this algorithm converges linearly in the strongly convex case and also identifies optimal strongly sparse solutions. We further exploit this identification to propose an automatic dimension reduction, aptly sparsifying all exchanges between coordinator and workers.Comment: v2 is a significant improvement over v1 (titled "Asynchronous Distributed Learning with Sparse Communications and Identification") with new algorithms, results, and discussion

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Hal-Diderot

Shortest Path Computation with No Information Leakage

Author: Gasarch W. I.
Goldberg A. V.
Hart P.
Kellaris G.
Köhler E.
Mokbel M. F.
Sion R.
Williams P.
Publication venue
Publication date: 01/01/2012
Field of study

Shortest path computation is one of the most common queries in location-based services (LBSs). Although particularly useful, such queries raise serious privacy concerns. Exposing to a (potentially untrusted) LBS the client's position and her destination may reveal personal information, such as social habits, health condition, shopping preferences, lifestyle choices, etc. The only existing method for privacy-preserving shortest path computation follows the obfuscation paradigm; it prevents the LBS from inferring the source and destination of the query with a probability higher than a threshold. This implies, however, that the LBS still deduces some information (albeit not exact) about the client's location and her destination. In this paper we aim at strong privacy, where the adversary learns nothing about the shortest path query. We achieve this via established private information retrieval techniques, which we treat as black-box building blocks. Experiments on real, large-scale road networks assess the practicality of our schemes.Comment: VLDB201

arXiv.org e-Print Archive

CiteSeerX

Crossref

Institutional Knowledge at Singapore Management University

Far apart, yet close together: Cooperative learning in online education

Author: Ivone Francisca M.
Jacobs George M.
Renandya Willy A.
Publication venue: 'LPPM Unsyiah'
Publication date: 03/09/2020
Field of study

Online education can play a crucial role in increasing access to educational opportunity and in promoting lifelong learning. The Covid-19 pandemic has done even more to raise awareness of the importance of online education. The pandemic has been a Category 5 disruptor of education systems. This article was written to help teachers at all levels of education facilitate cooperation among their students as a key element of online education. While many teachers believe in the benefits of student-student cooperation, and theory and research support this view, many teachers worry that distance learning is already difficult enough without adding the complications of cooperative learning, no matter how beneficial it might be. The article begins by discussing some of the obstacles teachers may encounter as they seek to integrate cooperative learning as part of online education. The main part of the article presents nine lesson plans for language education via cooperative learning in online education settings. While the content of the lessons focuses on language learning, the lessons can be employed in a wide variety of content areas. The article concludes with general suggestions on overcoming the previously mentioned obstacles

Studies in English Language and Education