Search CORE

10,703 research outputs found

Convergence Theory of Learning Over-parameterized ResNet: A Full Characterization

Author: Chen Wei
Liu Tie-Yan
Yi Mingyang
Yu Da
Zhang Huishuai
Publication venue
Publication date: 12/07/2019
Field of study

ResNet structure has achieved great empirical success since its debut. Recent work established the convergence of learning over-parameterized ResNet with a scaling factor

\tau=1/L

on the residual branch where

L

is the network depth. However, it is not clear how learning ResNet behaves for other values of

\tau

. In this paper, we fully characterize the convergence theory of gradient descent for learning over-parameterized ResNet with different values of

\tau

. Specifically, with hiding logarithmic factor and constant coefficients, we show that for

\tau\le 1/\sqrt{L}

gradient descent is guaranteed to converge to the global minma, and especially when

\tau\le 1/L

the convergence is irrelevant of the network depth. Conversely, we show that for

\tau>L^{-\frac{1}{2}+c}

, the forward output grows at least with rate

L^c

in expectation and then the learning fails because of gradient explosion for large

L

. This means the bound

\tau\le 1/\sqrt{L}

is sharp for learning ResNet with arbitrary depth. To the best of our knowledge, this is the first work that studies learning ResNet with full range of

\tau

.Comment: 31 page

arXiv.org e-Print Archive

Automatic hypocenter determination with the IPFx method for the 2018 Hualien earthquake sequence

Author: Chen Da-Yi
Yamada Masumi
Publication venue: Springer Nature
Publication date: 16/06/2022
Field of study

The extended integrated particle filter (IPFx) method is an automatic source determination algorithm designed for the Japanese earthquake early warning (EEW) system. The method improved earthquake source determination during active seismicity by incorporating the smart phase association scheme. We applied this method to the 2018 Hualien earthquake sequence and evaluated its performance by comparing it to the manual catalog. We used 1-month continuous waveforms from February 2018 at 170 stations. Owing to the higher noise level, we improved the phase association algorithm to avoid noise contamination. Out of 127 earthquakes with a seismic intensity ≥ 4, 105 were successfully detected in one month, of which 103 had good accuracy with a location error of < 30 km. The detectability of earthquakes decreased immediately following large events. The IPFx method showed good performance in detecting earthquakes with seismic intensity ≥ 4 during the 2018 Hualien earthquake sequence. The method was also applied to the 1-day continuous data on April 18, 2021, and detected 14 earthquakes with a magnitude ~ 2 that were not on the manual catalog. Currently, the Central Weather Bureau in Taiwan uses the effective epicenter method to locate earthquakes for the EEW system. However, source determination for offshore events is difficult as most of the stations are on land. We expect the IPFx method to provide better location estimates for offshore earthquakes and during the period of active seismicity. It also provides an earlier warning as it sends the first message when three stations are triggered. This new method can potentially improve the speed and accuracy of the Taiwanese EEW system

Kyoto University Research Information Repository

The Structuration of Task-oriented Communication in Innovative Virtual Teams

Author: Brown Sue
Chen Yi-Da
Publication venue: AIS Electronic Library (AISeL)
Publication date: 14/12/2012
Field of study

With advanced communication technologies, business managers can globally recruit talented members to form virtual teams and collaborate on innovative projects. While virtual teams enjoy superiority in their composition of talents, they also face more collaborative issues resulting from the diversity of members’ backgrounds and the limitations of communication technologies. Formal task interventions have been suggested to mitigate the severity of these collaborative issues in virtual teams. In this study, we aim to investigate the underlying mechanisms through which task interventions compensate the limitations of communication technologies and facilitate the exchange of individuals’ perspectives. By adopting the lens of structuration theory, we hypothesize that task interventions establish or modify the structural properties of the team-task environment, which in turn shape virtual teams’ communication patterns. This study can provide a better understanding of how virtual teams learn to coordinate their task work more effectively by initiating task interventions

AIS Electronic Library (AISeL)

A hybrid representation based simile component extraction

Author: Cai Yi
Chen Junying
Li Qing
Ren Da
Tao Xiaohui
Zhang Pengfei
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/03/2020
Field of study

Simile, a special type of metaphor, can help people to express their ideas more clearly. Simile component extraction is to extract tenors and vehicles from sentences. This task has a realistic significance since it is useful for building cognitive knowledge base. With the development of deep neural networks, researchers begin to apply neural models to component extraction. Simile components should be in cross-domain. According to our observations, words in cross-domain always have different concepts. Thus, concept is important when identifying whether two words are simile components or not. However, existing models do not integrate concept into their models. It is difficult for these models to identify the concept of a word. What’s more, corpus about simile component extraction is limited. There are a number of rare words or unseen words, and the representations of these words are always not proper enough. Exiting models can hardly extract simile components accurately when there are low-frequency words in sentences. To solve these problems, we propose a hybrid representation-based component extraction (HRCE) model. Each word in HRCE is represented in three different levels: word level, concept level and character level. Concept representations (representations in concept level) can help HRCE to identify the words in cross-domain more accurately. Moreover, with the help of character representations (representations in character levels), HRCE can represent the meaning of a word more properly since words are consisted of characters and these characters can partly represent the meaning of words. We conduct experiments to compare the performance between HRCE and existing models. The experiment results show that HRCE significantly outperforms current models

University of Southern Queensland ePrints

Electrostatic Repulsion of Positively Charged Vesicles and Negatively Charged Objects

Author: Aranda-Espinoza Helim
Chen Yi
Dan Nily
Lubensky TC
Nelson Philip
Ramos Laurence
Weitz DA
Publication venue
Publication date: 01/01/1999
Field of study

A positively charged, mixed bilayer vesicle in the presence of negatively charged surfaces (for example, colloidal particles) can spontaneously partition into an adhesion zone of definite area, and another zone that repels additional negative objects. Although the membrane itself has nonnegative charge in the repulsive zone, negative counterions on the interior of the vesicle spontaneously aggregate there, and present a net negative charge to the exterior. Beyond the fundamental result that oppositely charged objects can repel, our mechanism helps explain recent experiments on surfactant vesicles.Comment: Latex using epsfig and afterpage; pdf available at http://www.physics.upenn.edu/~nelson/Mss/repel.pd

arXiv.org e-Print Archive

CiteSeerX

ScholarlyCommons@Penn