1,578 research outputs found
Ward's Hierarchical Clustering Method: Clustering Criterion and Agglomerative Algorithm
The Ward error sum of squares hierarchical clustering method has been very
widely used since its first description by Ward in a 1963 publication. It has
also been generalized in various ways. However there are different
interpretations in the literature and there are different implementations of
the Ward agglomerative algorithm in commonly used software systems, including
differing expressions of the agglomerative criterion. Our survey work and case
studies will be useful for all those involved in developing software for data
analysis using Ward's hierarchical clustering method.Comment: 20 pages, 21 citations, 4 figure
Recommended from our members
A study of State and Federal Supreme Court decisions involving expulsion from our public schools for deficiencies in discipline, scholarship and patriotism.
The Waters Edge Apartments: Capital Budgeting In Real Estate Development
The Water’s Edge Apartments case provides intermediate finance students with an opportunity to apply capital budgeting analysis and decision- making techniques to a real estate development situation. The objective is to give students practice in identifying and estimating project cash flows to make a decision regarding the project. Students are encouraged to include basic sensitivity analysis in their recommendation
Mumford dendrograms and discrete p-adic symmetries
In this article, we present an effective encoding of dendrograms by embedding
them into the Bruhat-Tits trees associated to -adic number fields. As an
application, we show how strings over a finite alphabet can be encoded in
cyclotomic extensions of and discuss -adic DNA encoding. The
application leads to fast -adic agglomerative hierarchic algorithms similar
to the ones recently used e.g. by A. Khrennikov and others. From the viewpoint
of -adic geometry, to encode a dendrogram in a -adic field means
to fix a set of -rational punctures on the -adic projective line
. To is associated in a natural way a
subtree inside the Bruhat-Tits tree which recovers , a method first used by
F. Kato in 1999 in the classification of discrete subgroups of
.
Next, we show how the -adic moduli space of
with punctures can be applied to the study of time series of
dendrograms and those symmetries arising from hyperbolic actions on
. In this way, we can associate to certain classes of dynamical
systems a Mumford curve, i.e. a -adic algebraic curve with totally
degenerate reduction modulo .
Finally, we indicate some of our results in the study of general discrete
actions on , and their relation to -adic Hurwitz spaces.Comment: 14 pages, 6 figure
Fast, Linear Time Hierarchical Clustering using the Baire Metric
The Baire metric induces an ultrametric on a dataset and is of linear
computational complexity, contrasted with the standard quadratic time
agglomerative hierarchical clustering algorithm. In this work we evaluate
empirically this new approach to hierarchical clustering. We compare
hierarchical clustering based on the Baire metric with (i) agglomerative
hierarchical clustering, in terms of algorithm properties; (ii) generalized
ultrametrics, in terms of definition; and (iii) fast clustering through k-means
partititioning, in terms of quality of results. For the latter, we carry out an
in depth astronomical study. We apply the Baire distance to spectrometric and
photometric redshifts from the Sloan Digital Sky Survey using, in this work,
about half a million astronomical objects. We want to know how well the (more
costly to determine) spectrometric redshifts can predict the (more easily
obtained) photometric redshifts, i.e. we seek to regress the spectrometric on
the photometric redshifts, and we use clusterwise regression for this.Comment: 27 pages, 6 tables, 10 figure
First Odin sub-mm retrievals in the tropical upper troposphere: ice cloud properties
International audienceMore accurate global measurements of the amount of ice in thicker clouds are needed to validate atmospheric models and sub-mm radiometry can be an important component in this respect. A cloud ice retrieval scheme for the first such instrument in space, Odin-SMR, is presented here. Several advantages of sub-mm observations are shown, such as low influence of particle shape and orientation, and a high dynamic range of the retrievals. In the case of Odin-SMR, only cloud ice above ?12.5 km can be measured. The present retrieval scheme gives a detection threshold of about 4 g/m2 above 12.5 km and does not saturate even for thickest observed clouds (>500 g/m2). The main retrieval uncertainties are the assumed particle size distribution and cloud inhomogeneity effects. The overall retrieval accuracy is estimated to be ~75%. The retrieval error is judged to have large random components and to be significantly lower than this value for averaged results, but high fixed errors can not be excluded. However, a firm lower value can always be provided. Initial results are found to be consistent with similar Aura MLS retrievals, but show important differences to corresponding data from atmospheric models. This first retrieval algorithm is limited to lowermost Odin-SMR tangent altitudes, and further development should improve the detection threshold and the vertical resolution. It should also be possible to decrease the retrieval uncertainty associated with cloud inhomogeneities by detailed analysis of other data sets
Nitric acid in the stratosphere based on Odin observations from 2001 to 2009 – Part 2: High-altitude polar enhancements
The wintertime abundance of nitric acid (HNO<sub>3</sub>) in the polar upper stratosphere displays a strong inter-annual variability, and is known to be strongly influenced by energetic particle precipitation (EPP), primarily by protons during solar proton events (SPEs), but also by precipitating auroral or relativistic electrons. We analyse a multi-year record (August 2001 to April 2009) of middle atmospheric HNO<sub>3</sub> measurements by the Sub-Millimeter Radiometer instrument aboard the Odin satellite, with a focus on the polar upper stratosphere. SMR observations show clear evidence of two different types of polar high-altitude HNO<sub>3</sub> enhancements linked to EPP. In the first type, referred to as direct enhancements by analogy with the EPP/NO<sub>x</sub> direct effect, enhanced HNO<sub>3</sub> mixing ratios are observed for a short period (1 week) after a SPE, upwards of a level typically in the mid-stratosphere. In a second type, referred to as indirect enhancements by analogy with the EPP/NO<sub>x</sub> indirect effect, the descent of mesospheric air triggers a stronger and longer-lasting enhancement. Each of the three major SPEs that occurred during the Northern Hemisphere autumn or winter, in November 2001, October–November 2003 and January 2005, are observed to lead to both direct and indirect HNO<sub>3</sub> enhancements. On the other hand, indirect enhancements occur recurrently in winter, are stronger in the Southern Hemisphere, and are influenced by EPP at higher altitudes
Comparison of satellite limb-sounding humidity climatologies of the uppermost tropical troposphere
International audienceHumidity climatologies of the tropical uppermost troposphere from satellite limb emission measurements have been compared. Four instruments are considered; UARS-MLS, Odin-SMR, and Aura-MLS operating in the microwave region, and MIPAS in the IR region. A reference for the comparison is obtained by MOZAIC in-situ measurements. The upper tropospheric humidity products were compared on basis of their empirical probability density functions and seasonally averaged horizontal fields at two altitude layers, 12 and 15 km. The probability density functions of the microwave datasets were found to be in very good agreement with each other, and are also consistent with MOZAIC. The average seasonal humidities differ with less than 10%RHi between the instruments, indicating that stated measurement accuracies of 20?30% are conservative estimates. The systematic uncertainty in Odin-SMR data due to cloud correction was also independently estimated to be 10%RHi. MIPAS humidity profiles were found to suffer from cloud contamination, with only 30% of the measurements reaching into the upper troposphere, but under clear-sky conditions there is a good agreement between MIPAS, Odin-SMR and Aura-MLS. Odin-SMR and the two MLS datasets can be treated as independent, being based on different underlying spectroscopy and technology. The good agreement between the microwave limb-sounders, and MOZAIC, is therefore an important step towards understanding the upper tropospheric humidity. The found accuracy of 10%RHi is approaching the level required to validate climate modelling of the upper troposphere humidity. The comparison of microwave and IR also stresses that microwave limb-sounding is necessary for a complete view of the upper troposphere
Comparison of satellite limb-sounding humidity climatologies of the uppermost tropical troposphere
International audienceHumidity climatologies of the tropical uppermost troposphere from satellite limb emission measurements have been compared. Four instruments are considered; UARS-MLS, Odin-SMR, and Aura-MLS operating in the microwave region, and MIPAS in the infrared region. A reference for the comparison is obtained by MOZAIC in-situ measurements. The upper tropospheric humidity products were compared on basis of their empirical probability density functions and seasonally averaged horizontal fields at two altitude layers, 12 and 15 km. The probability density functions of the microwave datasets were found to be in very good agreement with each other, and were also consistent with MOZAIC. The average seasonal humidities differ with less than 10%RHi between the instruments, indicating that stated measurement accuracies of 20?30% are conservative estimates. The systematic uncertainty in Odin-SMR data due to cloud correction was also independently estimated to be 10%RHi. MIPAS humidity profiles were found to suffer from cloud contamination, with only 30% of the measurements reaching into the upper troposphere, but under clear-sky conditions there is a good agreement between MIPAS, Odin-SMR and Aura-MLS. Odin-SMR and the two MLS datasets can be treated as independent, being based on different underlying spectroscopy and technology. The good agreement between the microwave limb-sounders, and MOZAIC, is therefore an important step towards understanding the upper tropospheric humidity. The found accuracy of 10%RHi is approaching the level required to validate climate modelling of the upper troposphere humidity. The comparison of microwave and infrared also stresses that microwave limb-sounding is necessary for a complete view of the upper troposphere
- …