92 research outputs found
Impact of Large Language Models on Generating Software Specifications
Software specifications are essential for ensuring the reliability of
software systems. Existing specification extraction approaches, however, suffer
from limited generalizability and require manual efforts. We study the
effectiveness of Large Language Models (LLMs) in generating software
specifications from software documentation, utilizing Few-Shot Learning (FSL)
to enable LLMs to generalize from a small number of examples. We compare the
performance of LLMs with FSL to that of state-of-the-art specification
extraction techniques and study the impact of prompt construction strategies on
LLM performance. In addition, we conduct a comprehensive analysis of their
symptoms and root causes of the failures to understand the pros and cons of
LLMs and existing approaches. We also compare 11 LLMs' performance, cost, and
response time for generating software specifications. Our findings include that
(1) the best performing LLM outperforms existing approaches by 9.1--13.7% with
a few similar examples, (2) the two dominant root causes combined (ineffective
prompts and missing domain knowledge) result in 57--60% of LLM failures, and
(3) most of the 11 LLMs achieve better or comparable performance compared to
traditional techniques. Our study offers valuable insights for future research
to improve specification generation
Recommended from our members
Accelerated microwave curing of fibre-reinforced thermoset polymer composites for structural applications: A review of scientific challenges
Accelerated curing of high performance fibre-reinforced polymer (FRP) composites via microwave heating or radiation, which can significantly reduce cure time and increase energy efficiency, has several major challenges (e.g. uneven depth of radiation penetration, reinforcing fibre shielding, uneven curing, introduction of hot spots etc). This article reviews the current scientific challenges with microwave curing of FRP composites considering the underlying physics of microwave radiation absorption in thermoset-matrix composites. The fundamental principles behind efficient accelerated curing of composites using microwave radiation heating are reviewed and presented, especially focusing on the relation between penetration depth, microwave frequency, dielectric properties and cure degree. Based on this review, major factors influencing microwave curing of thermoset-matrix composites are identified, and recommendations for efficient cure cycle design are provided
A hQCD model and its phase diagram in Einstein-Maxwell-Dilaton system
By use of the potential reconstruction approach we obtain a series of
asymptotically AdS (aAdS) black hole solutions in an Einstein-Maxwell-Dilaton
(EMD) system. Basing on the solutions of the system, we reconstruct a
semi-analytical holographic QCD (hQCD) model with a quadratic term in warped
factor. We discuss some aspects of the hQCD model, in particular we calculate
the free energy of two static color sources (a heavy quark-antiquark pair)
which is an important order parameter to describe confinement/deconfinement
phase transition. The behavior of the free energy with respect to temperature
and chemical potential is studied. We find that in the hQCD model the
deconfinement phase transition can be realized and a critical point occurs. The
resulting phase diagram in the temperature-chemical potential plane is
in quite good agreement with the one from recent lattice results and effective
models of QCD.Comment: 27 pages, 11 figures, regular paper, add some comments and
references, accepted by JHE
A Search for Technosignatures Around 11,680 Stars with the Green Bank Telescope at 1.15-1.73 GHz
We conducted a search for narrowband radio signals over four observing
sessions in 2020-2023 with the L-band receiver (1.15-1.73 GHz) of the 100 m
diameter Green Bank Telescope. We pointed the telescope in the directions of 62
TESS Objects of Interest, capturing radio emissions from a total of ~11,680
stars and planetary systems in the ~9 arcminute beam of the telescope. All
detections were either automatically rejected or visually inspected and
confirmed to be of anthropogenic nature. In this work, we also quantified the
end-to-end efficiency of radio SETI pipelines with a signal injection and
recovery analysis. The UCLA SETI pipeline recovers 94.0% of the injected
signals over the usable frequency range of the receiver and 98.7% of the
injections when regions of dense RFI are excluded. In another pipeline that
uses incoherent sums of 51 consecutive spectra, the recovery rate is ~15 times
smaller at ~6%. The pipeline efficiency affects calculations of transmitter
prevalence and SETI search volume. Accordingly, we developed an improved Drake
Figure of Merit and a formalism to place upper limits on transmitter prevalence
that take the pipeline efficiency and transmitter duty cycle into account.
Based on our observations, we can state at the 95% confidence level that fewer
than 6.6% of stars within 100 pc host a transmitter that is detectable in our
search (EIRP > 1e13 W). For stars within 20,000 ly, the fraction of stars with
detectable transmitters (EIRP > 5e16 W) is at most 3e-4. Finally, we showed
that the UCLA SETI pipeline natively detects the signals detected with AI
techniques by Ma et al. (2023).Comment: 22 pages, 9 figures, submitted to AJ, revise
Broad and strong memory CD4(+)and CD8(+)T cells induced by SARS-CoV-2 in UK convalescent individuals following COVID-19
The development of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) vaccines and therapeutics will depend on understanding viral immunity. We studied T cell memory in 42 patients following recovery from COVID-19 (28 with mild disease and 14 with severe disease) and 16 unexposed donors, using interferon-γ-based assays with peptides spanning SARS-CoV-2 except ORF1. The breadth and magnitude of T cell responses were significantly higher in severe as compared with mild cases. Total and spike-specific T cell responses correlated with spike-specific antibody responses. We identified 41 peptides containing CD4+ and/or CD8+ epitopes, including six immunodominant regions. Six optimized CD8+ epitopes were defined, with peptide–MHC pentamer-positive cells displaying the central and effector memory phenotype. In mild cases, higher proportions of SARS-CoV-2-specific CD8+ T cells were observed. The identification of T cell responses associated with milder disease will support an understanding of protective immunity and highlights the potential of including non-spike proteins within future COVID-19 vaccine design
A saturated map of common genetic variants associated with human height
Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40-50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes(1). Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel(2)) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10-20% (14-24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries.A large genome-wide association study of more than 5 million individuals reveals that 12,111 single-nucleotide polymorphisms account for nearly all the heritability of height attributable to common genetic variants
A saturated map of common genetic variants associated with human height.
Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40-50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10-20% (14-24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries
- …