192 research outputs found
Silico-centric Theory of Mind
Theory of Mind (ToM) refers to the ability to attribute mental states, such
as beliefs, desires, intentions, and knowledge, to oneself and others, and to
understand that these mental states can differ from one's own and from reality.
We investigate ToM in environments with multiple, distinct, independent AI
agents, each possessing unique internal states, information, and objectives.
Inspired by human false-belief experiments, we present an AI ('focal AI') with
a scenario where its clone undergoes a human-centric ToM assessment. We prompt
the focal AI to assess whether its clone would benefit from additional
instructions. Concurrently, we give its clones the ToM assessment, both with
and without the instructions, thereby engaging the focal AI in higher-order
counterfactual reasoning akin to human mentalizing--with respect to humans in
one test and to other AI in another. We uncover a discrepancy: Contemporary AI
demonstrates near-perfect accuracy on human-centric ToM assessments. Since
information embedded in one AI is identically embedded in its clone, additional
instructions are redundant. Yet, we observe AI crafting elaborate instructions
for their clones, erroneously anticipating a need for assistance. An
independent referee AI agrees with these unsupported expectations. Neither the
focal AI nor the referee demonstrates ToM in our 'silico-centric' test
Efficient LLM Inference on CPUs
Large language models (LLMs) have demonstrated remarkable performance and
tremendous potential across a wide range of tasks. However, deploying these
models has been challenging due to the astronomical amount of model parameters,
which requires a demand for large memory capacity and high memory bandwidth. In
this paper, we propose an effective approach that can make the deployment of
LLMs more efficiently. We support an automatic INT4 weight-only quantization
flow and design a special LLM runtime with highly-optimized kernels to
accelerate the LLM inference on CPUs. We demonstrate the general applicability
of our approach on popular LLMs including Llama2, Llama, GPT-NeoX, and showcase
the extreme inference efficiency on CPUs. The code is publicly available at:
https://github.com/intel/intel-extension-for-transformers.Comment: NeurIPS'2023 on Efficient Natural Language and Speech Processin
Affect as a Decision-Making System of the Present
article is based on the first author’s doctoral dissertation completed under the second author’s direction at Columbia University. The authors thank the other members of the dissertation committee—Eric Johnson, Leonard Lee, Tom Meyvis, and Elke Weber—for their very useful input at various stages of this project. They also thank Jiewen Hong, Seshan Ramaswami, and Anne-Laure Sellier for their helpful comments, and the various members of the Research o
Towards Large-scale Single-shot Millimeter-wave Imaging for Low-cost Security Inspection
Millimeter-wave (MMW) imaging is emerging as a promising technique for safe
security inspection. It achieves a delicate balance between imaging resolution,
penetrability and human safety, resulting in higher resolution compared to
low-frequency microwave, stronger penetrability compared to visible light, and
stronger safety compared to X ray. Despite of recent advance in the last
decades, the high cost of requisite large-scale antenna array hinders
widespread adoption of MMW imaging in practice. To tackle this challenge, we
report a large-scale single-shot MMW imaging framework using sparse antenna
array, achieving low-cost but high-fidelity security inspection under an
interpretable learning scheme. We first collected extensive full-sampled MMW
echoes to study the statistical ranking of each element in the large-scale
array. These elements are then sampled based on the ranking, building the
experimentally optimal sparse sampling strategy that reduces the cost of
antenna array by up to one order of magnitude. Additionally, we derived an
untrained interpretable learning scheme, which realizes robust and accurate
image reconstruction from sparsely sampled echoes. Last, we developed a neural
network for automatic object detection, and experimentally demonstrated
successful detection of concealed centimeter-sized targets using 10% sparse
array, whereas all the other contemporary approaches failed at the same sample
sampling ratio. The performance of the reported technique presents higher than
50% superiority over the existing MMW imaging schemes on various metrics
including precision, recall, and mAP50. With such strong detection ability and
order-of-magnitude cost reduction, we anticipate that this technique provides a
practical way for large-scale single-shot MMW imaging, and could advocate its
further practical applications
Effective Quantization for Diffusion Models on CPUs
Diffusion models have gained popularity for generating images from textual
descriptions. Nonetheless, the substantial need for computational resources
continues to present a noteworthy challenge, contributing to time-consuming
processes. Quantization, a technique employed to compress deep learning models
for enhanced efficiency, presents challenges when applied to diffusion models.
These models are notably more sensitive to quantization compared to other model
types, potentially resulting in a degradation of image quality. In this paper,
we introduce a novel approach to quantize the diffusion models by leveraging
both quantization-aware training and distillation. Our results show the
quantized models can maintain the high image quality while demonstrating the
inference efficiency on CPUs. The code is publicly available at:
https://github.com/intel/intel-extension-for-transformers
Identification of novel host-oriented targets for Human Immunodeficiency Virus type 1 using Random Homozygous Gene Perturbation
<p>Abstract</p> <p>Background</p> <p>Human Immunodeficiency Virus (HIV) is a global threat to public health. Current therapies that directly target the virus often are rendered ineffective due to the emergence of drug-resistant viral variants. An emerging concept to combat drug resistance is the idea of targeting host mechanisms that are essential for the propagation of the virus, but have a minimal cellular effect.</p> <p>Results</p> <p>Herein, using Random Homozygous Gene Perturbation (RHGP), we have identified cellular targets that allow human MT4 cells to survive otherwise lethal infection by a wild type HIV-1<sub>NL4-3</sub>. These gene targets were validated by the reversibility of the RHGP technology, which confirmed that the RHGP itself was responsible for the resistance to HIV-1 infection. We further confirmed by siRNA knockdowns that the RHGP-identified cellular pathways are responsible for resistance to infection by either CXCR4 or CCR5 tropic HIV-1 variants. We also demonstrated that cell clones with these gene targets disrupted by RHGP were not permissible to the replication of a drug resistant HIV-1 mutant.</p> <p>Conclusion</p> <p>These studies demonstrate the power of RHGP to identify novel host targets that are essential for the viral life cycle but which can be safely perturbed without overt cytotoxicity. These findings suggest opportunities for the future development of host-oriented therapeutics with the broad spectrum potential for safe and effective inhibition of HIV infection.</p
MUSiC : a model-unspecific search for new physics in proton-proton collisions at root s=13TeV
Results of the Model Unspecific Search in CMS (MUSiC), using proton-proton collision data recorded at the LHC at a centre-of-mass energy of 13 TeV, corresponding to an integrated luminosity of 35.9 fb(-1), are presented. The MUSiC analysis searches for anomalies that could be signatures of physics beyond the standard model. The analysis is based on the comparison of observed data with the standard model prediction, as determined from simulation, in several hundred final states and multiple kinematic distributions. Events containing at least one electron or muon are classified based on their final state topology, and an automated search algorithm surveys the observed data for deviations from the prediction. The sensitivity of the search is validated using multiple methods. No significant deviations from the predictions have been observed. For a wide range of final state topologies, agreement is found between the data and the standard model simulation. This analysis complements dedicated search analyses by significantly expanding the range of final states covered using a model independent approach with the largest data set to date to probe phase space regions beyond the reach of previous general searches.Peer reviewe
- …