272 research outputs found

    High FIB4 index is an independent risk factor of diabetic kidney disease in type 2 diabetes

    Get PDF
    博士(医学)福島県立医科大

    Policy-Adaptive Estimator Selection for Off-Policy Evaluation

    Full text link
    Off-policy evaluation (OPE) aims to accurately evaluate the performance of counterfactual policies using only offline logged data. Although many estimators have been developed, there is no single estimator that dominates the others, because the estimators' accuracy can vary greatly depending on a given OPE task such as the evaluation policy, number of actions, and noise level. Thus, the data-driven estimator selection problem is becoming increasingly important and can have a significant impact on the accuracy of OPE. However, identifying the most accurate estimator using only the logged data is quite challenging because the ground-truth estimation accuracy of estimators is generally unavailable. This paper studies this challenging problem of estimator selection for OPE for the first time. In particular, we enable an estimator selection that is adaptive to a given OPE task, by appropriately subsampling available logged data and constructing pseudo policies useful for the underlying estimator selection task. Comprehensive experiments on both synthetic and real-world company data demonstrate that the proposed procedure substantially improves the estimator selection compared to a non-adaptive heuristic.Comment: accepted at AAAI'2

    Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

    Full text link
    Ranking interfaces are everywhere in online platforms. There is thus an ever growing interest in their Off-Policy Evaluation (OPE), aiming towards an accurate performance evaluation of ranking policies using logged data. A de-facto approach for OPE is Inverse Propensity Scoring (IPS), which provides an unbiased and consistent value estimate. However, it becomes extremely inaccurate in the ranking setup due to its high variance under large action spaces. To deal with this problem, previous studies assume either independent or cascade user behavior, resulting in some ranking versions of IPS. While these estimators are somewhat effective in reducing the variance, all existing estimators apply a single universal assumption to every user, causing excessive bias and variance. Therefore, this work explores a far more general formulation where user behavior is diverse and can vary depending on the user context. We show that the resulting estimator, which we call Adaptive IPS (AIPS), can be unbiased under any complex user behavior. Moreover, AIPS achieves the minimum variance among all unbiased estimators based on IPS. We further develop a procedure to identify the appropriate user behavior model to minimize the mean squared error (MSE) of AIPS in a data-driven fashion. Extensive experiments demonstrate that the empirical accuracy improvement can be significant, enabling effective OPE of ranking systems even under diverse user behavior.Comment: KDD2023 Research trac

    Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation

    Full text link
    Off-Policy Evaluation (OPE) aims to assess the effectiveness of counterfactual policies using only offline logged data and is often used to identify the top-k promising policies for deployment in online A/B tests. Existing evaluation metrics for OPE estimators primarily focus on the "accuracy" of OPE or that of downstream policy selection, neglecting risk-return tradeoff in the subsequent online policy deployment. To address this issue, we draw inspiration from portfolio evaluation in finance and develop a new metric, called SharpeRatio@k, which measures the risk-return tradeoff of policy portfolios formed by an OPE estimator under varying online evaluation budgets (k). We validate our metric in two example scenarios, demonstrating its ability to effectively distinguish between low-risk and high-risk estimators and to accurately identify the most efficient one. Efficiency of an estimator is characterized by its capability to form the most advantageous policy portfolios, maximizing returns while minimizing risks during online deployment, a nuance that existing metrics typically overlook. To facilitate a quick, accurate, and consistent evaluation of OPE via SharpeRatio@k, we have also integrated this metric into an open-source software, SCOPE-RL (https://github.com/hakuhodo-technologies/scope-rl). Employing SharpeRatio@k and SCOPE-RL, we conduct comprehensive benchmarking experiments on various estimators and RL tasks, focusing on their risk-return tradeoff. These experiments offer several interesting directions and suggestions for future OPE research.Comment: ICLR202

    SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation

    Full text link
    This paper introduces SCOPE-RL, a comprehensive open-source Python software designed for offline reinforcement learning (offline RL), off-policy evaluation (OPE), and selection (OPS). Unlike most existing libraries that focus solely on either policy learning or evaluation, SCOPE-RL seamlessly integrates these two key aspects, facilitating flexible and complete implementations of both offline RL and OPE processes. SCOPE-RL put particular emphasis on its OPE modules, offering a range of OPE estimators and robust evaluation-of-OPE protocols. This approach enables more in-depth and reliable OPE compared to other packages. For instance, SCOPE-RL enhances OPE by estimating the entire reward distribution under a policy rather than its mere point-wise expected value. Additionally, SCOPE-RL provides a more thorough evaluation-of-OPE by presenting the risk-return tradeoff in OPE results, extending beyond mere accuracy evaluations in existing OPE literature. SCOPE-RL is designed with user accessibility in mind. Its user-friendly APIs, comprehensive documentation, and a variety of easy-to-follow examples assist researchers and practitioners in efficiently implementing and experimenting with various offline RL methods and OPE estimators, tailored to their specific problem contexts. The documentation of SCOPE-RL is available at https://scope-rl.readthedocs.io/en/latest/.Comment: preprint, open-source software: https://github.com/hakuhodo-technologies/scope-r

    In Vitro and In Vivo Evaluation of Starfish Bone-Derived -Tricalcium Phosphate as a Bone Substitute Material

    Get PDF
    We evaluated starfish-derived -tricalcium phosphate (Sf-TCP) obtained by phosphatization of starfish-bone-derived porous calcium carbonate as a potential bone substitute material. The Sf-TCP had a communicating pore structure with a pore size of approximately 10 m. Although the porosity of Sf-TCP was similar to that of Cerasorb M (CM)a commercially available -TCP bone fillerthe specific surface area was roughly three times larger than that of CM. Observation by scanning electron microscopy showed that pores communicated to the inside of the Sf-TCP. Cell growth tests showed that Sf-TCP improved cell proliferation compared with CM. Cells grown on Sf-TCP showed stretched filopodia and adhered; cells migrated both to the surface and into pores. In vivo, vigorous tissue invasion into pores was observed in Sf-TCP, and more fibrous tissue was observed for Sf-TCP than CM. Moreover, capillary formation into pores was observed for Sf-TCP. Thus, Sf-TCP showed excellent biocompatibility in vitro and more vigorous bone formation in vivo, indicating the possible applications of this material as a bone substitute. In addition, our findings suggested that mimicking the microstructure derived from whole organisms may facilitate the development of superior artificial bone.ArticleMATERIALS. 12(11):1881 (2019)journal articl

    Evaluation of MC3T3-E1 Cell Osteogenesis in Different Cell Culture Media

    Get PDF
    Many biomaterials have been evaluated using cultured cells. In particular, osteoblast-like cells are often used to evaluate the osteocompatibility, hard-tissue-regeneration, osteoconductive, and osteoinductive characteristics of biomaterials. However, the evaluation of biomaterial osteogenesis-inducing capacity using osteoblast-like cells is not standardized; instead, it is performed under laboratory-specific culture conditions with different culture media. However, the effect of different media conditions on bone formation has not been investigated. Here, we aimed to evaluate the osteogenesis of MC3T3-E1 cells, one of the most commonly used osteoblast-like cell lines for osteogenesis evaluation, and assayed cell proliferation, alkaline phosphatase activity, expression of osteoblast markers, and calcification under varying culture media conditions. Furthermore, the various media conditions were tested in uncoated plates and plates coated with collagen type I and poly-L-lysine, highly biocompatible molecules commonly used as pseudobiomaterials. We found that the type of base medium, the presence or absence of vitamin C, and the freshness of the medium may affect biomaterial regeneration. We posit that an in vitro model that recapitulates in vivo bone formation should be established before evaluating biomaterials.ArticleInternational Journal of Molecular Sciences. 22(14):7752 (2021)journal articl

    Cellular Responses of Human Lymphatic Endothelial Cells to Carbon Nanomaterials

    Get PDF
    One of the greatest challenges to overcome in the pursuit of the medical application of carbon nanomaterials (CNMs) is safety. Particularly, when considering the use of CNMs in drug delivery systems (DDSs), evaluation of safety at the accumulation site is an essential step. In this study, we evaluated the toxicity of carbon nanohorns (CNHs), which are potential DDSs, using human lymph node endothelial cells that have been reported to accumulate CNMs, as a comparison to fibrous, multi-walled carbon nanotubes (MWCNTs) and particulate carbon black (CB). The effect of different surface characteristics was also evaluated using two types of CNHs (untreated and oxidized). In the fibrous MWCNT, cell growth suppression, as well as expression of inflammatory cytokine genes was observed, as in previous reports. In contrast, no significant toxicity was observed for particulate CB and CNHs, which was different from the report of CB cytotoxicity in vascular endothelial cells. These results show that (1) lymph endothelial cells need to be tested separately from other endothelial cells for safety evaluation of nanomaterials, and (2) the potential of CNHs as DDSs.ArticleNANOMATERIALS. 10(7):1374 (2020)journal articl

    Genotype prevalence and age distribution of human papillomavirus from infection to cervical cancer in Japanese women: A systematic review and meta-analysis

    Get PDF
    Background: National HPV vaccination coverage in Japan is less than one percent of the eligible population and cervical cancer incidence and mortality are increasing. This systematic review and meta-analysis aimed to provide a comprehensive estimate of HPV genotype prevalence for Japan. Methods: English and Japanese databases were searched to March 2021 for research reporting HPV genotypes in cytology and histology samples from Japanese women. Summary estimates were calculated by disease stage from cytology only assessment – Normal, ASCUS, LSIL, HSIL and from histological assessment – CIN1, CIN2, CIN3/AIS, ICC (ICC-SCC, and ICC-ADC), and other. A random-effects meta–analysis was used to calculate summary prevalence estimates of any-HPV, high-risk (HR) and low-risk (LR) vaccine types, and vaccine genotypes (bivalent, quadrivalent, or nonavalent). This study was registered with PROSPERO: CRD42018117596. Results: A total of 57759 women with normal cytology, 1766 ASCUS, 3764 LSIL, 2017 HSIL, 3130 CIN1, 1219 CIN2, 869 CIN3/AIS, and 4306 ICC (which included 1032 ICC-SCC, and 638 ICC-ADC) were tested for HPV. The summary estimate of any-HPV genotype in women with normal cytology was 15·6% (95% CI: 12·3–19·4) and in invasive cervical cancer (ICC) was 85·6% (80·7–89·8). The prevalence of HR-HPV was 86·0% (95% CI: 73·9–94·9) for cytological cases of HSIL, 76·9% (52·1–94·7) for histological cases of CIN3/AIS, and 75·7% (68·0–82·6) for ICC. In women with ICC, the summary prevalence of bivalent vaccine genotypes was 58·5% (95% CI: 52·1–64·9), for quadrivalent genotypes was 58·6% (52·2–64·9) and for nonavalent genotypes was 71·5% (64·9–77·6), and of ICC cases that were HPV positive over 90% of infections are nonavalent vaccine preventable. There was considerable heterogeneity in all HPV summary estimates and for ICC, this heterogeneity was not explained by variability in study design, sample type, HPV assay type, or HPV DNA detection method, although studies published in the 1990s had lower prevalence estimates of any-HPV and HR HPV genotypes. Interpretations: HPV prevalence is high among Japanese women. The nonavalent vaccine is likely to have the greatest impact on reducing cervical cancer incidence and mortality in Japan
    corecore