1,332 research outputs found

    The Dilemma and Path Innovation of Education in Ethnic Minority Areas

    Get PDF
    Due to historical culture, natural geography, economic foundation and other reasons, the development of ethnic education still faces some special difficulties and outstanding problems, and the overall development level is relatively low. Education in some ethnic minority areas faces the real predicament of imbalanced educational resources, great shackles in traditional ideas, and difficulty in promoting education informatization. In order to accelerate the development of education in ethnic minority areas and achieve long-term peace and stability in the country, it is necessary to establish universal and special policies and promote long-term education mechanisms, and innovate the education path in ethnic minority areas in order to solve these dilemmas

    STSyn: Speeding Up Local SGD with Straggler-Tolerant Synchronization

    Full text link
    Synchronous local stochastic gradient descent (local SGD) suffers from some workers being idle and random delays due to slow and straggling workers, as it waits for the workers to complete the same amount of local updates. In this paper, to mitigate stragglers and improve communication efficiency, a novel local SGD strategy, named STSyn, is developed. The key point is to wait for the KK fastest workers, while keeping all the workers computing continually at each synchronization round, and making full use of any effective (completed) local update of each worker regardless of stragglers. An analysis of the average wall-clock time, average number of local updates and average number of uploading workers per round is provided to gauge the performance of STSyn. The convergence of STSyn is also rigorously established even when the objective function is nonconvex. Experimental results show the superiority of the proposed STSyn against state-of-the-art schemes through utilization of the straggler-tolerant technique and additional effective local updates at each worker, and the influence of system parameters is studied. By waiting for faster workers and allowing heterogeneous synchronization with different numbers of local updates across workers, STSyn provides substantial improvements both in time and communication efficiency.Comment: 12 pages, 10 figures, submitted for transaction publicatio

    ABS: Adaptive Bounded Staleness Converges Faster and Communicates Less

    Full text link
    Wall-clock convergence time and communication rounds are critical performance metrics in distributed learning with parameter-server setting. While synchronous methods converge fast but are not robust to stragglers; and asynchronous ones can reduce the wall-clock time per round but suffers from degraded convergence rate due to the staleness of gradients, it is natural to combine the two methods to achieve a balance. In this work, we develop a novel asynchronous strategy that leverages the advantages of both synchronous methods and asynchronous ones, named adaptive bounded staleness (ABS). The key enablers of ABS are two-fold. First, the number of workers that the PS waits for per round for gradient aggregation is adaptively selected to strike a straggling-staleness balance. Second, the workers with relatively high staleness are required to start a new round of computation to alleviate the negative effect of staleness. Simulation results are provided to demonstrate the superiority of ABS over state-of-the-art schemes in terms of wall-clock time and communication rounds

    The Association of Parent-Child Communication With Internet Addiction in Left-Behind Children in China: A Cross-Sectional Study

    Get PDF
    Objective: Internet addiction has emerged as a growing concern worldwide. This study aimed to compare the prevalence of Internet addiction between left-behind children (LBC) and non-left-behind children (non-LBC), and explore the role of paternal and maternal parent-child communication on LBC. Methods: We conducted a cross-sectional survey in rural areas in Anhui, China. The complete data were available from 699 LBC and 740 non-LBC. Multivariable logistic regression was used to examine 1) whether LBC were more likely to develop Internet addiction, and 2) the association between parent-child communication and Internet addiction among LBC. Results: LBC had a higher likelihood to report Internet addiction when compared to non-LBC (OR = 2.03, 95%CI = 1.43–2.88, p \u3c 0.001). Among LBC, parent-child communication (both mother-child and father-child) was protective factor for children’s Internet addiction. The role of mother-child communication played well among male LBC. Conclusions: The lack of parental supervision may lead to Internet addiction. It is highly recommended for migrant parents to improve the quality of communication with their children. Also, gender-matching effects should be considered in the relationship between children’s behavior and parental factors

    DRAG: Divergence-based Adaptive Aggregation in Federated learning on Non-IID Data

    Full text link
    Local stochastic gradient descent (SGD) is a fundamental approach in achieving communication efficiency in Federated Learning (FL) by allowing individual workers to perform local updates. However, the presence of heterogeneous data distributions across working nodes causes each worker to update its local model towards a local optimum, leading to the phenomenon known as ``client-drift" and resulting in slowed convergence. To address this issue, previous works have explored methods that either introduce communication overhead or suffer from unsteady performance. In this work, we introduce a novel metric called ``degree of divergence," quantifying the angle between the local gradient and the global reference direction. Leveraging this metric, we propose the divergence-based adaptive aggregation (DRAG) algorithm, which dynamically ``drags" the received local updates toward the reference direction in each round without requiring extra communication overhead. Furthermore, we establish a rigorous convergence analysis for DRAG, proving its ability to achieve a sublinear convergence rate. Compelling experimental results are presented to illustrate DRAG's superior performance compared to state-of-the-art algorithms in effectively managing the client-drift phenomenon. Additionally, DRAG exhibits remarkable resilience against certain Byzantine attacks. By securely sharing a small sample of the client's data with the FL server, DRAG effectively counters these attacks, as demonstrated through comprehensive experiments
    • …
    corecore