75 research outputs found

    High-Dimensional Feature Selection by Feature-Wise Kernelized Lasso

    Full text link
    The goal of supervised feature selection is to find a subset of input features that are responsible for predicting output values. The least absolute shrinkage and selection operator (Lasso) allows computationally efficient feature selection based on linear dependency between input features and output values. In this paper, we consider a feature-wise kernelized Lasso for capturing non-linear input-output dependency. We first show that, with particular choices of kernel functions, non-redundant features with strong statistical dependence on output values can be found in terms of kernel-based independence measures. We then show that the globally optimal solution can be efficiently computed; this makes the approach scalable to high-dimensional problems. The effectiveness of the proposed method is demonstrated through feature selection experiments with thousands of features.Comment: 18 page

    Thermal constraints on the reionisation of hydrogen by population-II stellar sources

    Get PDF
    Measurements of the intergalactic medium (IGM) temperature provide a potentially powerful constraint on the reionisation history due to the thermal imprint left by the photo-ionisation of neutral hydrogen. However, until recently IGM temperature measurements were limited to redshifts 2 < z < 4.8, restricting the ability of these data to probe the reionisation history at z > 6. In this work, we use recent measurements of the IGM temperature in the near-zones of seven quasars at z ~ 5.8 - 6.4, combined with a semi-numerical model for inhomogeneous reionisation, to establish new constraints on the redshift at which hydrogen reionisation completed. We calibrate the model to reproduce observational constraints on the electron scattering optical depth and the HI photo-ionisation rate, and compute the resulting spatially inhomogeneous temperature distribution at z ~ 6 for a variety of reionisation scenarios. Under standard assumptions for the ionising spectra of population-II sources, the near-zone temperature measurements constrain the redshift by which hydrogen reionisation was complete to be z > 7.9 (6.5) at 68 (95) per cent confidence. We conclude that future temperature measurements around other high redshift quasars will significantly increase the power of this technique, enabling these results to be tightened and generalised.Comment: 15 pages, 8 figures, accepted for publication in MNRA

    Improved measurements of the intergalactic medium temperature around quasars: possible evidence for the initial stages of He II reionization at z ≃ 6

    Get PDF
    We present measurements of the intergalactic medium (IGM) temperature within ∼5 proper Mpc of seven luminous quasars at z ≃ 6. The constraints are obtained from the Doppler widths of Lyα absorption lines in the quasar near zones and build upon our previous measurement for the z = 6.02 quasar SDSS J0818+1722. The expanded data set, combined with an improved treatment of systematic uncertainties, yields an average temperature at the mean density of log(T_0/K) = 4.21±^(0.03)_(0.03)(±^(0.06)_(0.07)) at 68 (95) per cent confidence for a flat prior distribution over 3.2 ≤ log (T_0/K) ≤ 4.8. In comparison, temperatures measured from the general IGM at z ≃ 5 are ∼0.3 dex cooler, implying an additional source of heating around these quasars which is not yet present in the general IGM at slightly lower redshift. This heating is most likely due to the recent reionization of He ii in vicinity of these quasars, which have hard and non-thermal ionizing spectra. The elevated temperatures may therefore represent evidence for the earliest stages of He ii reionization in the most biased regions of the high-redshift Universe. The temperature as a function of distance from the quasars is consistent with being constant, log (T_0/K) ≃ 4.2, with no evidence for a line-of-sight thermal proximity effect. However, the limited extent of the quasar near zones prevents the detection of He III regions larger than ∼5 proper Mpc. Under the assumption that the quasars have reionized the He II in their vicinity, we infer that the data are consistent with an average optically bright phase of duration in excess of 10^(6.5) yr. These measurements represent the highest redshift IGM temperature constraints to date, and thus provide a valuable data set for confronting models of H I reionization

    The impact of temperature fluctuations on the large-scale clustering of the Lyα forest

    Get PDF
    We develop a semi-analytic method for assessing the impact of the large-scale IGM temperature fluctuations expected following He ii reionization on three-dimensional clustering measurements of the Lyα forest. Our methodology builds upon the existing large volume, mock Lyα forest survey simulations presented by Greig et al. by including a prescription for a spatially inhomogeneous ionizing background, temperature fluctuations induced by patchy He ii photoheating and the clustering of quasars. This approach enables us to achieve a dynamic range within our semi-analytic model substantially larger than currently feasible with computationally expensive, fully numerical simulations. The results agree well with existing numerical simulations, with large-scale temperature fluctuations introducing a scale-dependent increase in the spherically averaged 3D Lyα forest power spectrum of up to 20–30 per cent at wavenumbers k ∼ 0.02 Mpc− 1. Although these large-scale thermal fluctuations will not substantially impact upon the recovery of the baryon acoustic oscillation scale from existing and forthcoming dark energy spectroscopic surveys, any complete forward modelling of the broad-band term in the Lyα correlation function will none the less require their inclusion

    The effect of intergalactic helium on hydrogen reionisation: implications for the sources of ionising photons at z > 6

    Get PDF
    We investigate the effect of helium on hydrogen reionisation using a hydrodynamical simulation combined with the cosmological radiative transfer code CRASH. The simulations are run in a 35.12/h comoving Mpc box using a variety of assumptions for the amplitude and power-law extreme-UV (EUV) spectral index, alpha, of the ionising emissivity. We use an empirically motivated prescription for ionising sources which ensures all of the models are consistent with constraints on the Thomson scattering optical depth and the hydrogen photo-ionisation rate at z=6. The inclusion of helium slightly delays reionisation due to the small number of ionising photons which reionise neutral helium instead of hydrogen. However, helium has a significant impact on the thermal state of the IGM. Models with alpha=3 produce IGM temperatures at the mean density at z=6 which are about 20 % higher compared to models without helium photo-heating. Harder EUV indices produce even larger IGM temperature boosts. A comparison to recent observational estimates of the IGM temperature at z=5 - 6 suggests that hydrogen reionisation was primarily driven by pop-II stellar sources with a soft EUV index, alpha<3. We also find that faint, as yet undetected galaxies, characterised by a luminosity function with a steepening faint-end slope and an increasing Lyman continuum escape fraction (fesc=0.5), are required to reproduce the ionising emissivity used in our simulations at z>6. Finally, we note there is some tension between recent observational constraints which indicate the IGM is > 10% neutral by volume z=7, and estimates of the ionising emissivity at z=6 which indicate only between 1 and 3 ionising photons are emitted per hydrogen atom over a Hubble time. This tension may be alleviated by either a lower neutral fraction at z=7 or an IGM which still remains a few % neutral by volume at z=6.Comment: 19 pages, 12 figures; MNRAS in pres

    Addressing preference heterogeneity in public health policy by combining Cluster Analysis and Multi-Criteria Decision Analysis: Proof of Method.

    Get PDF
    The use of subgroups based on biological-clinical and socio-demographic variables to deal with population heterogeneity is well-established in public policy. The use of subgroups based on preferences is rare, except when religion based, and controversial. If it were decided to treat subgroup preferences as valid determinants of public policy, a transparent analytical procedure is needed. In this proof of method study we show how public preferences could be incorporated into policy decisions in a way that respects both the multi-criterial nature of those decisions, and the heterogeneity of the population in relation to the importance assigned to relevant criteria. It involves combining Cluster Analysis (CA), to generate the subgroup sets of preferences, with Multi-Criteria Decision Analysis (MCDA), to provide the policy framework into which the clustered preferences are entered. We employ three techniques of CA to demonstrate that not only do different techniques produce different clusters, but that choosing among techniques (as well as developing the MCDA structure) is an important task to be undertaken in implementing the approach outlined in any specific policy context. Data for the illustrative, not substantive, application are from a Randomized Controlled Trial of online decision aids for Australian men aged 40-69 years considering Prostate-specific Antigen testing for prostate cancer. We show that such analyses can provide policy-makers with insights into the criterion-specific needs of different subgroups. Implementing CA and MCDA in combination to assist in the development of policies on important health and community issues such as drug coverage, reimbursement, and screening programs, poses major challenges -conceptual, methodological, ethical-political, and practical - but most are exposed by the techniques, not created by them

    SDSS-IV MaNGA: uncovering the angular momentum content of central and satellite early-type galaxies

    Get PDF
    We study 379 central and 159 satellite early-type galaxies with two-dimensional kinematics from the integral field survey Mapping Nearby Galaxies at APO (MaNGA) to determine how their angular momentum content depends on stellar and halo mass. Using the Yang et al. (2007) group catalog, we identify central and satellite galaxies in groups with halo masses in the range 1012:5 h-1 M_ 1011 h-2 M_ tend to have very little rotation, while nearly all galaxies at lower mass show some net rotation. The ~ 30% of high-mass galaxies that have significant rotation do not stand out in other galaxy properties except for a higher incidence of ionized gas emission. Our data are consistent with recent simulation results suggesting that major merging and gas accretion have more impact on the rotational support of lower-mass galaxies. When carefully matching the stellar mass distributions, we find no residual differences in angular momentum content between satellite and central galaxies at the 20% level. Similarly, at fixed mass, galaxies have consistent rotation properties across a wide range of halo mass. However, we find that errors in classification of centrals and satellites with group finders systematically lowers differences between satellite and central galaxies at a level that is comparable to current measurement uncertainties. To improve constraints, the impact of group finding methods will have to be forward modeled via mock catalogs
    • …
    corecore