84 research outputs found

    SheetCopilot: Bringing Software Productivity to the Next Level through Large Language Models

    Full text link
    Computer end users have spent billions of hours completing daily tasks like tabular data processing and project timeline scheduling. Most of these tasks are repetitive and error-prone, yet most end users lack the skill to automate these burdensome works. With the advent of large language models (LLMs), directing software with natural language user requests become a reachable goal. In this work, we propose a SheetCopilot agent that takes natural language task and control spreadsheet to fulfill the requirements. We propose a set of atomic actions as an abstraction of spreadsheet software functionalities. We further design a state machine-based task planning framework for LLMs to robustly interact with spreadsheets. We curate a representative dataset containing 221 spreadsheet control tasks and establish a fully automated evaluation pipeline for rigorously benchmarking the ability of LLMs in software control tasks. Our SheetCopilot correctly completes 44.3\% of tasks for a single generation, outperforming the strong code generation baseline by a wide margin. Our project page:https://sheetcopilot.github.io/.Comment: Accepted to NeurIPS 202

    Prediction model of cervical lymph node metastasis based on clinicopathological characteristics of papillary thyroid carcinoma: a dual-center retrospective study

    Get PDF
    BackgroundThe overall prevalence of papillary thyroid carcinoma (PTC) patients is expanding along with an ongoing increase in thyroid cancer incidence. Patients with PTC who have lymph node metastases have a poor prognosis and a high death rate. There is an urgent need for indicators that can predict lymph node metastasis (LNM) before surgery as current imaging techniques, such as ultrasonography, do not have sufficient sensitivity to detect LNM. To predict independent risk factors for Central lymph node metastasis (CLNM) or Lateral lymph node metastasis (LLNM), we therefore developed two nomograms based on CLNM and LLNM, separately.MethodsIn two centers, the Second Affiliated Hospital of Nanchang University and Yichun People’s Hospital, we retrospectively analyzed clinicopathological characteristics of PTC patients. We utilized multivariate analysis to screen for variables that might be suspiciously related to CLNM or LLNM. Furthermore, we developed nomograms to graphically depict the independent risk valuables connected to lymph node metastasis in PTC patients.ResultUltimately, 6068 PTC patients in all were included in the research. Six factors, including age<45, male, mETE, TSH>1.418, tumor size>4cm, and location (multicentric and lobe), were observed to be related to CLNM. Age<45, male, mETE (minimal extrathyroidal extension), multifocality, TSH≥2.910, CLNM positive, and tumor size>4cm were regarded as related risk factors for LLNM. The two nomograms developed subsequently proved to have good predictive power with 0.706 and 0.818 and demonstrated good clinical guidance functionality with clinical decision curves and impact curves.ConclusionBased on the successful establishment of this dual-institution-based visual nomogram model, we found that some clinical features are highly correlated with cervical lymph node metastasis, including CLNM and LLNM, which will better help clinicians make individualized clinical decisions for more effectively rationalizing managing PTC patients

    Pan-Cancer Analysis of lncRNA Regulation Supports Their Targeting of Cancer Genes in Each Tumor Context

    Get PDF
    Long noncoding RNAs (lncRNAs) are commonly dys-regulated in tumors, but only a handful are known toplay pathophysiological roles in cancer. We inferredlncRNAs that dysregulate cancer pathways, onco-genes, and tumor suppressors (cancer genes) bymodeling their effects on the activity of transcriptionfactors, RNA-binding proteins, and microRNAs in5,185 TCGA tumors and 1,019 ENCODE assays.Our predictions included hundreds of candidateonco- and tumor-suppressor lncRNAs (cancerlncRNAs) whose somatic alterations account for thedysregulation of dozens of cancer genes and path-ways in each of 14 tumor contexts. To demonstrateproof of concept, we showed that perturbations tar-geting OIP5-AS1 (an inferred tumor suppressor) andTUG1 and WT1-AS (inferred onco-lncRNAs) dysre-gulated cancer genes and altered proliferation ofbreast and gynecologic cancer cells. Our analysis in-dicates that, although most lncRNAs are dysregu-lated in a tumor-specific manner, some, includingOIP5-AS1, TUG1, NEAT1, MEG3, and TSIX, synergis-tically dysregulate cancer pathways in multiple tumorcontexts

    Pan-cancer Alterations of the MYC Oncogene and Its Proximal Network across the Cancer Genome Atlas

    Get PDF
    Although theMYConcogene has been implicated incancer, a systematic assessment of alterations ofMYC, related transcription factors, and co-regulatoryproteins, forming the proximal MYC network (PMN),across human cancers is lacking. Using computa-tional approaches, we define genomic and proteo-mic features associated with MYC and the PMNacross the 33 cancers of The Cancer Genome Atlas.Pan-cancer, 28% of all samples had at least one ofthe MYC paralogs amplified. In contrast, the MYCantagonists MGA and MNT were the most frequentlymutated or deleted members, proposing a roleas tumor suppressors.MYCalterations were mutu-ally exclusive withPIK3CA,PTEN,APC,orBRAFalterations, suggesting that MYC is a distinct onco-genic driver. Expression analysis revealed MYC-associated pathways in tumor subtypes, such asimmune response and growth factor signaling; chro-matin, translation, and DNA replication/repair wereconserved pan-cancer. This analysis reveals insightsinto MYC biology and is a reference for biomarkersand therapeutics for cancers with alterations ofMYC or the PMN

    Genomic, Pathway Network, and Immunologic Features Distinguishing Squamous Carcinomas

    Get PDF
    This integrated, multiplatform PanCancer Atlas study co-mapped and identified distinguishing molecular features of squamous cell carcinomas (SCCs) from five sites associated with smokin

    Spatial Organization and Molecular Correlation of Tumor-Infiltrating Lymphocytes Using Deep Learning on Pathology Images

    Get PDF
    Beyond sample curation and basic pathologic characterization, the digitized H&E-stained images of TCGA samples remain underutilized. To highlight this resource, we present mappings of tumorinfiltrating lymphocytes (TILs) based on H&E images from 13 TCGA tumor types. These TIL maps are derived through computational staining using a convolutional neural network trained to classify patches of images. Affinity propagation revealed local spatial structure in TIL patterns and correlation with overall survival. TIL map structural patterns were grouped using standard histopathological parameters. These patterns are enriched in particular T cell subpopulations derived from molecular measures. TIL densities and spatial structure were differentially enriched among tumor types, immune subtypes, and tumor molecular subtypes, implying that spatial infiltrate state could reflect particular tumor cell aberration states. Obtaining spatial lymphocytic patterns linked to the rich genomic characterization of TCGA samples demonstrates one use for the TCGA image archives with insights into the tumor-immune microenvironment

    Sciences for The 2.5-meter Wide Field Survey Telescope (WFST)

    Full text link
    The Wide Field Survey Telescope (WFST) is a dedicated photometric survey facility under construction jointly by the University of Science and Technology of China and Purple Mountain Observatory. It is equipped with a primary mirror of 2.5m in diameter, an active optical system, and a mosaic CCD camera of 0.73 Gpix on the main focus plane to achieve high-quality imaging over a field of view of 6.5 square degrees. The installation of WFST in the Lenghu observing site is planned to happen in the summer of 2023, and the operation is scheduled to commence within three months afterward. WFST will scan the northern sky in four optical bands (u, g, r, and i) at cadences from hourly/daily to semi-weekly in the deep high-cadence survey (DHS) and the wide field survey (WFS) programs, respectively. WFS reaches a depth of 22.27, 23.32, 22.84, and 22.31 in AB magnitudes in a nominal 30-second exposure in the four bands during a photometric night, respectively, enabling us to search tremendous amount of transients in the low-z universe and systematically investigate the variability of Galactic and extragalactic objects. Intranight 90s exposures as deep as 23 and 24 mag in u and g bands via DHS provide a unique opportunity to facilitate explorations of energetic transients in demand for high sensitivity, including the electromagnetic counterparts of gravitational-wave events detected by the second/third-generation GW detectors, supernovae within a few hours of their explosions, tidal disruption events and luminous fast optical transients even beyond a redshift of 1. Meanwhile, the final 6-year co-added images, anticipated to reach g about 25.5 mag in WFS or even deeper by 1.5 mag in DHS, will be of significant value to general Galactic and extragalactic sciences. The highly uniform legacy surveys of WFST will also serve as an indispensable complement to those of LSST which monitors the southern sky.Comment: 46 pages, submitted to SCMP

    Integrated Genomic Analysis of the Ubiquitin Pathway across Cancer Types

    Get PDF
    Protein ubiquitination is a dynamic and reversibleprocess of adding single ubiquitin molecules orvarious ubiquitin chains to target proteins. Here,using multidimensional omic data of 9,125 tumorsamples across 33 cancer types from The CancerGenome Atlas, we perform comprehensive molecu-lar characterization of 929 ubiquitin-related genesand 95 deubiquitinase genes. Among them, we sys-tematically identify top somatic driver candidates,including mutatedFBXW7with cancer-type-specificpatterns and amplifiedMDM2showing a mutuallyexclusive pattern withBRAFmutations. Ubiquitinpathway genes tend to be upregulated in cancermediated by diverse mechanisms. By integratingpan-cancer multiomic data, we identify a group oftumor samples that exhibit worse prognosis. Thesesamples are consistently associated with the upre-gulation of cell-cycle and DNA repair pathways, char-acterized by mutatedTP53,MYC/TERTamplifica-tion, andAPC/PTENdeletion. Our analysishighlights the importance of the ubiquitin pathwayin cancer development and lays a foundation fordeveloping relevant therapeutic strategies

    The Cancer Genome Atlas Comprehensive Molecular Characterization of Renal Cell Carcinoma

    Get PDF

    Two ultraviolet radiation datasets that cover China

    Get PDF
    Ultraviolet (UV) radiation has significant effects on ecosystems, environments, and human health, as well as atmospheric processes and climate change. Two ultraviolet radiation datasets are described in this paper. One contains hourly observations of UV radiation measured at 40 Chinese Ecosystem Research Network stations from 2005 to 2015. CUV3 broadband radiometers were used to observe the UV radiation, with an accuracy of 5%, which meets the World Meteorology Organization's measurement standards. The extremum method was used to control the quality of the measured datasets. The other dataset contains daily cumulative UV radiation estimates that were calculated using an all-sky estimation model combined with a hybrid model. The reconstructed daily UV radiation data span from 1961 to 2014. The mean absolute bias error and root-mean-square error are smaller than 30% at most stations, and most of the mean bias error values are negative, which indicates underestimation of the UV radiation intensity. These datasets can improve our basic knowledge of the spatial and temporal variations in UV radiation. Additionally, these datasets can be used in studies of potential ozone formation and atmospheric oxidation, as well as simulations of ecological processes
    • …
    corecore