76 research outputs found

    Zoom-SVD: Fast and Memory Efficient Method for Extracting Key Patterns in an Arbitrary Time Range

    Full text link
    Given multiple time series data, how can we efficiently find latent patterns in an arbitrary time range? Singular value decomposition (SVD) is a crucial tool to discover hidden factors in multiple time series data, and has been used in many data mining applications including dimensionality reduction, principal component analysis, recommender systems, etc. Along with its static version, incremental SVD has been used to deal with multiple semi infinite time series data and to identify patterns of the data. However, existing SVD methods for the multiple time series data analysis do not provide functionality for detecting patterns of data in an arbitrary time range: standard SVD requires data for all intervals corresponding to a time range query, and incremental SVD does not consider an arbitrary time range. In this paper, we propose Zoom-SVD, a fast and memory efficient method for finding latent factors of time series data in an arbitrary time range. Zoom-SVD incrementally compresses multiple time series data block by block to reduce the space cost in storage phase, and efficiently computes singular value decomposition (SVD) for a given time range query in query phase by carefully stitching stored SVD results. Through extensive experiments, we demonstrate that Zoom-SVD is up to 15x faster, and requires 15x less space than existing methods. Our case study shows that Zoom-SVD is useful for capturing past time ranges whose patterns are similar to a query time range.Comment: 10 pages, 2018 ACM Conference on Information and Knowledge Management (CIKM 2018

    Improved retrievals of aerosol optical depth and fine mode fraction from GOCI geostationary satellite data using machine learning over East Asia

    Get PDF
    Aerosol Optical Depth (AOD) and Fine Mode Fraction (FMF) are important information for air quality research. Both are mainly obtained from satellite data based on a radiative transfer model, which requires heavy computation and has uncertainties. We proposed machine learning-based models to estimate AOD and FMF directly from Geostationary Ocean Color Imager (GOCI) reflectances over East Asia. Hourly AOD and FMF were estimated for 00-07 UTC at a spatial resolution of 6 km using the GOCI reflectances, their channel differences (with 30-day minimum reflectance), solar and satellite viewing geometry, meteorological data, geographical information, and the Day Of the Year (DOY) as input features. Light Gradient Boosting Machine (LightGBM) and Random Forest (RF) machine learning approaches were applied and evaluated using random, spatial, and temporal 10-fold cross-validation with ground-based observation data. LightGBM (R-2 = 0.89-0.93 and RMSE = 0.071-0.091 for AOD and R-2 = 0.67-0.81 and RMSE = 0.079-0.105 for FMF) and RF (R-2 = 0.88-0.92 and RMSE = 0.080-0.095 for AOD and R-2 = 0.59-0.76 and RMSE = 0.092-0.118 for FMF) agreed well with the in-situ data. The machine learning models showed much smaller errors when compared to GOCI-based Yonsei aerosol retrieval and the Moderate Resolution Imaging Spectroradiometer Dark Target and Deep Blue algorithms. The Shapley Additive exPlanations values (SHAP)-based feature importance result revealed that the 412 nm band (i. e., ch01) contributed most in both AOD and FMF retrievals. Relative humidity and air temperature were also identified as important factors especially for FMF, which suggests that considering meteorological conditions helps improve AOD and FMF estimation. Besides, spatial distribution of AOD and FMF showed that using the channel difference features to indirectly consider surface reflectance was very helpful for AOD retrieval on bright surfaces

    Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback

    Full text link
    Code editing is an essential step towards reliable program synthesis to automatically correct critical errors generated from code LLMs. Recent studies have demonstrated that closed-source LLMs (i.e., ChatGPT and GPT-4) are capable of generating corrective feedback to edit erroneous inputs. However, it remains challenging for open-source code LLMs to generate feedback for code editing, since these models tend to adhere to the superficial formats of feedback and provide feedback with misleading information. Hence, the focus of our work is to leverage open-source code LLMs to generate helpful feedback with correct guidance for code editing. To this end, we present Coffee, a collected dataset specifically designed for code fixing with feedback. Using this dataset, we construct CoffeePots, a framework for COde Fixing with FEEdback via Preference-Optimized Tuning and Selection. The proposed framework aims to automatically generate helpful feedback for code editing while minimizing the potential risk of superficial feedback. The combination of Coffee and CoffeePots marks a significant advancement, achieving state-of-the-art performance on HumanEvalFix benchmark. Codes and model checkpoints are publicly available at https://github.com/Lune-Blue/COFFEE.Comment: Work in progres

    Early dynamic behavior of lactate in predicting continuous renal replacement therapy after surgery for acute type A aortic dissection

    Get PDF
    BackgroundIt has been well known that hyperlactatemia is an independent risk factor for postoperative mortality in patients who received acute type A aortic dissection (ATAAD) surgery. Some patients may require the assistance of continuous renal replacement therapy (CRRT) for acute postoperative renal deficiency and often associate with increased mortality rate. This study aimed to examine the association between the early dynamic change of lactate levels and postoperative CRRT in ATAAD patients who received surgical repairment.MethodsThis retrospective study included 503 patients who received ATAAD surgeries. Serum lactate levels were measured before operation and at 0, 1, 3, 6, 12, 24 h post intensive care unit (ICU) admission. We examined the association between dynamic changes of lactate and CRRT.ResultsAmong all patients, 19.9% (100 patients) required CRRT. Our data showed that the lactate levels were higher in the CRRT group at all timepoints compared to the non-CRRT group. In a multivariate model, lactate levels at 12 h post ICU admission [odds ratio (OR), 1.362; p = 0.007] was identified as an independent predictor for requiring CRRT. Unsurprisingly, 30-day mortality in the CRRT group (41%) was 8.2 times higher than in the non-CRRT group (5%). To better understand the associations between CRRT and lactate levels, patients in the CRRT group were further stratified into the non-survivor group (n = 41) and survivor group (n = 59) based on the 30-day mortality. Elevated lactate levels measured upon ICU admission (OR, 1.284; p = 0.001) and decreased 24 h lactate clearance (OR, 0.237; p = 0.039) were independent risk factors for 30-day mortality in patients who received CRRT. The area under the curve to predict requirement for CRRT at 6 and 12 h post CICU admission were 0.714 and 0.722, respectively, corresponding to lactate cut-off levels of 4.15 and 2.45 mmol/L.ConclusionThe CRRT is commonly required in patients who received ATAAD surgery and often associated with worse mortality. Early dynamic changes of lactate levels can be used to predict the requirement of postoperative CRRT

    ??????????????? ????????? ???????????? ??????????????? ?????? ???????????? ????????? ?????? ??? ????????? ????????? ?????? ????????? ??????

    Get PDF
    As part of the next-generation Compact Advanced Satellite 500 (CAS500) project, CAS500-4 is scheduled to be launched in 2025 focusing on the remote sensing of agriculture and forestry. To obtain quantitative information on vegetation from satellite images, it is necessary to acquire surface reflectance through atmospheric correction. Thus, it is essential to develop an atmospheric correction method suitable for CAS500-4. Since the absorption and scattering characteristics in the atmosphere vary depending on the wavelength, it is needed to analyze the sensitivity of atmospheric correction parameters such as aerosol optical depth (AOD) and water vapor (WV) considering the wavelengths of CAS500-4. In addition, as CAS500-4 has only five channels (blue, green, red, red edge, and near-infrared), making it difficult to directly calculate key parameters for atmospheric correction, external parameter data should be used. Therefore, this study performed a sensitivity analysis of the key parameters (AOD, WV, and O3) using the simulated images based on Sentinel-2 satellite data, which has similar wavelength specifications to CAS500-4, and examined the possibility of using the products of GEOKOMPSAT-2A (GK2A) as atmospheric parameters. The sensitivity analysis showed that AOD was the most important parameter with greater sensitivity in visible channels than in the near-infrared region. In particular, since AOD change of 20% causes about a 100% error rate in the blue channel surface reflectance in forests, a highly reliable AOD is needed to obtain accurate surface reflectance. The atmospherically corrected surface reflectance based on the GK2A AOD and WV was compared with the Sentinel-2 L2A reflectance data through the separability index of the known land cover pixels. The result showed that two corrected surface reflectance had similar Seperability index (SI) values, the atmospheric corrected surface reflectance based on the GK2A AOD showed higher SI than the Sentinel-2 L2A reflectance data in short-wavelength channels. Thus, it is judged that the parameters provided by GK2A can be fully utilized for atmospheric correction of the CAS500-4. The research findings will provide a basis for atmospheric correction of the CAS500-4 in the future. ?? This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited

    Small Fermi pockets intertwined with charge stripes and pair density wave order in a kagome superconductor

    Full text link
    The kagome superconductor family AV3Sb5 (A=Cs, K, Rb) emerged as an exciting platform to study exotic Fermi surface instabilities. Here we use spectroscopic-imaging scanning tunneling microscopy (SI-STM) and angle-resolved photoemission spectroscopy (ARPES) to reveal how the surprising cascade of higher and lower-dimensional density waves in CsV3Sb5 is intimately tied to a set of small reconstructed Fermi pockets. ARPES measurements visualize the formation of these pockets generated by a 3D charge density wave transition. The pockets are connected by dispersive q* wave vectors observed in Fourier transforms of STM differential conductance maps. As the additional 1D charge order emerges at a lower temperature, q* wave vectors become substantially renormalized, signaling further reconstruction of the Fermi pockets. Remarkably, in the superconducting state, the superconducting gap modulations give rise to an in-plane Cooper pair-density-wave at the same q* wave vectors. Our work demonstrates the intrinsic origin of the charge-stripes and the pair-density-wave in CsV3Sb5 and their relationship to the Fermi pockets. These experiments uncover a unique scenario of how Fermi pockets generated by a parent charge density wave state can provide a favorable platform for the emergence of additional density waves

    Predictive biomarkers for 5-fluorouracil and oxaliplatin-based chemotherapy in gastric cancers via profiling of patient-derived xenografts.

    Get PDF
    Gastric cancer (GC) is commonly treated by chemotherapy using 5-fluorouracil (5-FU) derivatives and platinum combination, but predictive biomarker remains lacking. We develop patient-derived xenografts (PDXs) from 31 GC patients and treat with a combination of 5-FU and oxaliplatin, to determine biomarkers associated with responsiveness. When the PDXs are defined as either responders or non-responders according to tumor volume change after treatment, the responsiveness of PDXs is significantly consistent with the respective clinical outcomes of the patients. An integrative genomic and transcriptomic analysis of PDXs reveals that pathways associated with cell-to-cell and cell-to-extracellular matrix interactions enriched among the non-responders in both cancer cells and the tumor microenvironment (TME). We develop a 30-gene prediction model to determine the responsiveness to 5-FU and oxaliplatin-based chemotherapy and confirm the significant poor survival outcomes among cases classified as non-responder-like in three independent GC cohorts. Our study may inform clinical decision-making when designing treatment strategies

    Disease-specific induced pluripotent stem cells: a platform for human disease modeling and drug discovery

    Get PDF
    The generation of disease-specific induced pluripotent stem cell (iPSC) lines from patients with incurable diseases is a promising approach for studying disease mechanisms and drug screening. Such innovation enables to obtain autologous cell sources in regenerative medicine. Herein, we report the generation and characterization of iPSCs from fibroblasts of patients with sporadic or familial diseases, including Parkinson's disease (PD), Alzheimer's disease (AD), juvenile-onset, type I diabetes mellitus (JDM), and Duchenne type muscular dystrophy (DMD), as well as from normal human fibroblasts (WT). As an example to modeling disease using disease-specific iPSCs, we also discuss the previously established childhood cerebral adrenoleukodystrophy (CCALD)- and adrenomyeloneuropathy (AMN)-iPSCs by our group. Through DNA fingerprinting analysis, the origins of generated disease-specific iPSC lines were identified. Each iPSC line exhibited an intense alkaline phosphatase activity, expression of pluripotent markers, and the potential to differentiate into all three embryonic germ layers: the ectoderm, endoderm, and mesoderm. Expression of endogenous pluripotent markers and downregulation of retrovirus-delivered transgenes [OCT4 (POU5F1), SOX2, KLF4, and c-MYC] were observed in the generated iPSCs. Collectively, our results demonstrated that disease-specific iPSC lines characteristically resembled hESC lines. Furthermore, we were able to differentiate PD-iPSCs, one of the disease-specific-iPSC lines we generated, into dopaminergic (DA) neurons, the cell type mostly affected by PD. These PD-specific DA neurons along with other examples of cell models derived from disease-specific iPSCs would provide a powerful platform for examining the pathophysiology of relevant diseases at the cellular and molecular levels and for developing new drugs and therapeutic regimens

    Time-Resolved in Situ Visualization of the Structural Response of Zeolites During Catalysis

    Get PDF
    Zeolites are three-dimensional aluminosilicates having unique properties from the size and connectivity of their sub-nanometer pores, the Si/Al ratio of the anionic framework, and the charge-balancing cations. The inhomogeneous distribution of the cations affects their catalytic performances because it influences the intra-crystalline diffusion rates of the reactants and products. However, the structural deformation regarding inhomogeneous active regions during the catalysis is not yet observed by conventional analytical tools. Here we employ in situ X-ray free electron laser-based time-resolved coherent X-ray diffraction imaging to investigate the internal deformations originating from the inhomogeneous Cu ion distributions in Cu-exchanged ZSM-5 zeolite crystals during the deoxygenation of nitrogen oxides with propene. We show that the interactions between the reactants and the active sites lead to an unusual strain distribution, confirmed by density functional theory simulations. These observations provide insights into the role of structural inhomogeneity in zeolites during catalysis and will assist the future design of zeolites for their applications
    corecore