1,423 research outputs found

    Tackling Uncertainties and Errors in the Satellite Monitoring of Forest Cover Change

    This study aims to improve the reliability of automatic forest change detection. Forest change detection is of vital importance for understanding global land cover as well as the carbon cycle. Remote sensing and machine learning have been widely adopted for such studies with increasing success. However, contemporary global studies still suffer from lower-than-satisfactory accuracy and from robustness problems whose causes have remained largely unknown. Global geographical observations are complex as a result of hidden, interweaving geographical processes. Is it possible that some geographical complexities were not anticipated by contemporary machine learning? Could they cause uncertainties and errors when contemporary machine learning theories are applied to remote sensing? This dissertation adopts the philosophy of error elimination. We start by explaining the mathematical origins of possible geographic uncertainties and errors in chapter two. Uncertainties are unavoidable but might be mitigated; errors are hidden but might be found and corrected. In chapter three, experiments are specifically designed to assess whether contemporary machine learning theories can handle these geographic uncertainties and errors. In chapter four, we identify a previously unreported systemic error source: the proportion distribution of classes in the training set. A Bayesian optimal solution is then designed that combines the Support Vector Machine with Maximum Likelihood classification. In chapter five, we demonstrate that this type of error is widespread, not just in classification algorithms but also embedded in the conceptual definition of geographic classes before classification takes place. In chapter six, the sources of errors and uncertainties and their solutions are summarized, with theoretical implications for future studies. The most important finding is that how we design a classification largely predetermines what we eventually get out of it. This applies to many popular contemporary classifiers, including various types of neural networks, decision trees, and support vector machines, and it is one cause of the so-called overfitting problem in contemporary machine learning. We therefore propose that the emphasis of classification work be shifted to the planning stage that precedes the actual classification. Geography should not just be the analysis of collected observations, but also the planning of observation collection. This is where geography, machine learning, and survey statistics meet.
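The class-proportion error source identified in chapter four corresponds to what the machine learning literature calls prior shift: a probabilistic classifier trained under one set of class proportions is miscalibrated when the real-world proportions differ. Below is a minimal sketch of the standard Bayes-rule reweighting; the two-class setup and the prior values are illustrative assumptions, not the dissertation's actual data or method.

```python
import numpy as np

def correct_prior_shift(posteriors, train_priors, true_priors):
    """Reweight classifier posteriors when the training set's class
    proportions differ from the (assumed) real-world class priors."""
    weights = np.asarray(true_priors) / np.asarray(train_priors)
    adjusted = posteriors * weights               # Bayes-rule reweighting
    return adjusted / adjusted.sum(axis=1, keepdims=True)

# Illustrative: 'forest' vs 'non-forest', trained on a 50/50 sample
# of a scene that is actually ~80/20 forest.
p = np.array([[0.55, 0.45]])                      # raw posterior for one pixel
print(correct_prior_shift(p, [0.5, 0.5], [0.8, 0.2]))  # forest posterior rises
```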

    The Taiwanese-American Perspective on Discrimination in English Language Teaching

    This qualitative study examines the perspectives of five Taiwanese-American English teachers on their experiences of discrimination in the English language-teaching field in Taiwan. An extensive body of literature addresses the nativeness paradigm and its effect on the English language-teaching field, but the Taiwanese-American experience of these issues has yet to be explored. The study used Asian Critical Race Theory, Social Identity Theory, and Asian American Racial Identity Theory to analyze the history of English language teaching in Taiwan, the critical studies on native and non-native English language teachers, and the social issues affecting Asian Americans in Taiwan. The study found that all of the participants were aware of hiring discrimination and stereotypes directed against ethnically Asian English teachers, but not all participants believed that these had a negative impact on their social identity or self-identity. This study offers a voice to Taiwanese-American English teachers in the hope of encouraging a more progressive attitude towards the diversity of all English teachers in Taiwan.

    HAQ: Hardware-Aware Automated Quantization with Mixed Precision

    Model quantization is a widely used technique to compress and accelerate deep neural network (DNN) inference. Emerging DNN hardware accelerators have begun to support mixed precision (1-8 bits) to further improve computational efficiency, which raises a great challenge: finding the optimal bitwidth for each layer requires domain experts to explore a vast design space, trading off accuracy, latency, energy, and model size, which is both time-consuming and sub-optimal. Conventional quantization algorithms ignore differences between hardware architectures and quantize all layers in a uniform way. In this paper, we introduce the Hardware-Aware Automated Quantization (HAQ) framework, which leverages reinforcement learning to automatically determine the quantization policy, and we take the hardware accelerator's feedback into the design loop. Rather than relying on proxy signals such as FLOPs and model size, we employ a hardware simulator to generate direct feedback signals (latency and energy) for the RL agent. Compared with conventional methods, our framework is fully automated and can specialize the quantization policy for different neural network architectures and hardware architectures. Our framework reduces latency by 1.4-1.95x and energy consumption by 1.9x with negligible loss of accuracy compared with fixed-bitwidth (8-bit) quantization. It also reveals that the optimal policies on different hardware architectures (i.e., edge and cloud architectures) under different resource constraints (i.e., latency, energy, and model size) are drastically different. We interpret the implications of different quantization policies, which offer insights for both neural network architecture design and hardware architecture design.
    Comment: CVPR 2019. The first three authors contributed equally to this work. Project page: https://hanlab.mit.edu/projects/haq
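To make concrete what a per-layer bitwidth policy controls, the sketch below applies plain symmetric linear quantization to weight tensors at different bitwidths. This is a generic illustration of mixed-precision quantization, not HAQ's actual quantizer or RL search loop; the layer shapes and the bitwidths in the policy are assumed for the example.

```python
import numpy as np

def quantize_linear(w, bits):
    """Symmetric linear quantization of a weight tensor to `bits` bits."""
    qmax = 2 ** (bits - 1) - 1                # e.g. 127 for 8 bits
    scale = np.abs(w).max() / qmax
    q = np.clip(np.round(w / scale), -qmax, qmax)
    return q * scale                          # dequantized ("fake-quantized") weights

# A mixed-precision policy assigns one bitwidth per layer.
layers = {"conv1": np.random.randn(64, 3, 3, 3),
          "fc":    np.random.randn(1000, 512)}
policy = {"conv1": 6, "fc": 4}                # illustrative bitwidths

for name, w in layers.items():
    wq = quantize_linear(w, policy[name])
    print(f"{name}: {policy[name]}-bit, mean abs error {np.abs(w - wq).mean():.4f}")
```

Fewer bits shrink the model and speed up inference but increase quantization error, which is exactly the per-layer trade-off the RL agent navigates under hardware feedback.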

    Two-Field Quintom Models in the w-w' Plane

    The w-w' plane, defined by the equation-of-state parameter w of the dark energy and its derivative w' with respect to the logarithm of the scale factor, is useful for classifying dynamical dark energy models. In this note, we examine the evolving behavior in the w-w' plane of two-field quintom models in which w crosses the w = -1 barrier. We find that these models can be divided into two categories: type A quintom, in which w changes from w > -1 to w < -1, and type B quintom, in which w changes from w < -1 to w > -1 as the universe expands.
    Comment: 5 pages, 2 figures, RevTeX. Accepted for publication as a Brief Report in Physical Review
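For reference, the two axes of the plane and the equation of state of a two-field quintom (one canonical field phi and one phantom field sigma) take the standard forms below. This is the textbook definition of such models, not an excerpt from the paper itself:

```latex
w' \equiv \frac{\mathrm{d}w}{\mathrm{d}\ln a}, \qquad
w = \frac{p}{\rho}
  = \frac{\tfrac{1}{2}\dot{\phi}^{2} - \tfrac{1}{2}\dot{\sigma}^{2} - V(\phi,\sigma)}
         {\tfrac{1}{2}\dot{\phi}^{2} - \tfrac{1}{2}\dot{\sigma}^{2} + V(\phi,\sigma)}
```

In this form w crosses the w = -1 barrier exactly when the two kinetic terms balance, i.e. when \dot{\phi}^{2} = \dot{\sigma}^{2}.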

    GCN-RL Circuit Designer: Transferable Transistor Sizing with Graph Neural Networks and Reinforcement Learning

    Automatic transistor sizing is a challenging problem in circuit design due to the large design space, complex performance trade-offs, and fast technological advancement. Although there has been plenty of work on transistor sizing targeting a single circuit, limited research has been done on transferring knowledge from one circuit to another to reduce re-design overhead. In this paper, we present GCN-RL Circuit Designer, which leverages reinforcement learning (RL) to transfer knowledge between different technology nodes and topologies. Moreover, inspired by the simple fact that a circuit is a graph, we learn on the circuit topology representation with graph convolutional neural networks (GCN). The GCN-RL agent extracts features of the topology graph, whose vertices are transistors and whose edges are wires. Our learning-based optimization consistently achieves the highest Figures of Merit (FoM) on four different circuits compared with conventional black-box optimization methods (Bayesian optimization, evolutionary algorithms), random search, and human expert designs. Experiments on transfer learning between five technology nodes and two circuit topologies demonstrate that RL with transfer learning can achieve much higher FoMs than methods without knowledge transfer. Our transferable optimization method makes transistor sizing and design porting more effective and efficient.
    Comment: Accepted to the 57th Design Automation Conference (DAC 2020); 6 pages, 8 figures
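To make the "circuit is a graph" idea concrete, here is a minimal single GCN layer over a toy transistor graph, using the common normalized-adjacency propagation rule H' = ReLU(Â H W). The graph, feature sizes, and random weights are illustrative assumptions, not the paper's actual model or circuits.

```python
import numpy as np

# Toy circuit graph: 4 transistors (vertices) connected by wires (edges).
edges = [(0, 1), (1, 2), (2, 3), (0, 3)]
n = 4
A = np.zeros((n, n))
for i, j in edges:
    A[i, j] = A[j, i] = 1.0

# Normalized adjacency with self-loops: A_hat = D^{-1/2} (A + I) D^{-1/2}
A_tilde = A + np.eye(n)
d_inv_sqrt = 1.0 / np.sqrt(A_tilde.sum(axis=1))
A_hat = d_inv_sqrt[:, None] * A_tilde * d_inv_sqrt[None, :]

# Per-transistor input features (e.g. device type, W/L) -- assumed here.
H = np.random.randn(n, 8)
W = np.random.randn(8, 16)                    # learnable layer weights

H_next = np.maximum(A_hat @ H @ W, 0.0)       # one GCN layer: ReLU(A_hat H W)
print(H_next.shape)                           # (4, 16) per-transistor embeddings
```

The resulting embeddings summarize each transistor's local topology, which is what lets a sizing policy trained on one topology or technology node transfer to another.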

    A Voice of Our Own: Rethinking the Disabled in the Historical Imagination of Singapore

    Master's thesis (Master of Arts)

    Hardware-Centric AutoML for Mixed-Precision Quantization

    Model quantization is a widely used technique to compress and accelerate deep neural network (DNN) inference. Emerging DNN hardware accelerators have begun to support mixed precision (1-8 bits) to further improve computational efficiency, which raises a great challenge: finding the optimal bitwidth for each layer requires domain experts to explore a vast design space, trading off accuracy, latency, energy, and model size, which is both time-consuming and sub-optimal. Conventional quantization algorithms ignore differences between hardware architectures and quantize all layers in a uniform way. In this paper, we introduce the Hardware-Aware Automated Quantization (HAQ) framework, which leverages reinforcement learning to automatically determine the quantization policy, and we take the hardware accelerator's feedback into the design loop. Rather than relying on proxy signals such as FLOPs and model size, we employ a hardware simulator to generate direct feedback signals (latency and energy) for the RL agent. Compared with conventional methods, our framework is fully automated and can specialize the quantization policy for different neural network architectures and hardware architectures. Our framework reduces latency by 1.4-1.95x and energy consumption by 1.9x with negligible loss of accuracy compared with fixed-bitwidth (8-bit) quantization. It also reveals that the optimal policies on different hardware architectures (i.e., edge and cloud architectures) under different resource constraints (i.e., latency, energy, and model size) are drastically different. We interpret the implications of different quantization policies, which offer insights for both neural network architecture design and hardware architecture design.
    Comment: Journal preprint of arXiv:1811.08886 (IJCV, 2020). The first three authors contributed equally to this work. Project page: https://hanlab.mit.edu/projects/haq
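Since this journal version describes the same RL-driven search as the CVPR paper above, a complementary detail worth illustrating is how a continuous action emitted by the agent becomes a discrete per-layer bitwidth. A common linear rounding rule is sketched below; the exact mapping HAQ uses may differ, so treat this as an assumed illustration.

```python
def action_to_bitwidth(action, b_min=1, b_max=8):
    """Map a continuous RL action in [0, 1] to a discrete bitwidth."""
    action = min(max(action, 0.0), 1.0)       # clamp to the valid range
    return int(round(b_min + action * (b_max - b_min)))

# One action per layer from the agent -> a mixed-precision policy.
actions = [0.0, 0.37, 0.72, 1.0]
print([action_to_bitwidth(a) for a in actions])   # [1, 4, 6, 8]
```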