35 research outputs found
Conference 2014 speaker series: an interview with Jonathan Stray
Ahead of the Polis Annual Journalism Conference on Friday March 28th, we are interviewing some of our speakers. Jonathan Stray is the man behind Overview, a project from the Associated Press that aims to help journalists find stories in large quantities of documents. The tool uses keyword searches to automatically sort documents according to topic, making patterns and trends far easier to spot. Jonathan is a fellow at the Tow Center for Digital Journalism at Columbia University, teaching and researching computational journalism. He developed Overview with a Knight News Challenge grant
Summon a Demon and Bind it: A Grounded Theory of LLM Red Teaming in the Wild
Engaging in the deliberate generation of abnormal outputs from large language
models (LLMs) by attacking them is a novel human activity. This paper presents
a thorough exposition of how and why people perform such attacks. Using a
formal qualitative methodology, we interviewed dozens of practitioners from a
broad range of backgrounds, all contributors to this novel work of attempting
to cause LLMs to fail. We relate and connect this activity between its
practitioners' motivations and goals; the strategies and techniques they
deploy; and the crucial role the community plays. As a result, this paper
presents a grounded theory of how and why people attack large language models:
LLM red teaming in the wild
Magneto-optical trapping in a near-surface borehole
Borehole gravity sensing can be used in a number of applications to measure
features around a well including rock-type change mapping and determination of
reservoir porosity. Quantum technology gravity sensors based on atom
interferometry have the ability to offer increased survey speeds and reduced
need for calibration. While surface sensors have been demonstrated in real
world environments, significant improvements in robustness and reductions to
radial size, weight, and power consumption are required for such devices to be
deployed in boreholes. To realise the first step towards the deployment of cold
atom-based sensors down boreholes, we demonstrate a borehole-deployable
magneto-optical trap, the core package of many cold atom-based systems. The
enclosure containing the magneto-optical trap itself had an outer radius of
() mm at its widest point and a length of () mm. This system
was used to generate atom clouds at 1 m intervals in a 14 cm wide, 50 m deep
borehole, to simulate an in-borehole gravity surveys are performed. During the
survey the system generated on average clouds of (3.0
Rb atoms with the standard deviation in atom number across the survey
observed to be as low as
Magneto-optical trapping in a near-suface borehole
Borehole gravity sensing can be used in a number of applications to measure features around a well, including rock-type change mapping and determination of reservoir porosity. Quantum technology gravity sensors, based on atom interferometry, have the ability to offer increased survey speeds and reduced need for calibration. While surface sensors have been demonstrated in real world environments, significant improvements in robustness and reductions to radial size, weight, and power consumption are required for such devices to be deployed in boreholes. To realise the first step towards the deployment of cold atom-based sensors down boreholes, we demonstrate a borehole-deployable magneto-optical trap, the core package of many cold atom-based systems. The enclosure containing the magneto-optical trap itself had an outer radius of (60 ± 0.1) mm at its widest point and a length of (890 ± 5) mm. This system was used to generate atom clouds at 1 m intervals in a 14 cm wide, 50 m deep borehole, to simulate how in-borehole gravity surveys are performed. During the survey, the system generated, on average, clouds of (3.0 ± 0.1) à 105 87Rb atoms with the standard deviation in atom number across the survey observed to be as low as 8.9 à 104
Histone Deacetylase Inhibitor Romidepsin Induces HIV Expression in CD4 T Cells from Patients on Suppressive Antiretroviral Therapy at Concentrations Achieved by Clinical Dosing
Persistent latent reservoir of replication-competent proviruses in memory CD4 T cells is a major obstacle to curing HIV infection. Pharmacological activation of HIV expression in latently infected cells is being explored as one of the strategies to deplete the latent HIV reservoir. In this study, we characterized the ability of romidepsin (RMD), a histone deacetylase inhibitor approved for the treatment of T-cell lymphomas, to activate the expression of latent HIV. In an in vitro T-cell model of HIV latency, RMD was the most potent inducer of HIV (EC50 = 4.5 nM) compared with vorinostat (VOR; EC50 = 3,950 nM) and other histone deacetylase (HDAC) inhibitors in clinical development including panobinostat (PNB; EC50 = 10 nM). The HIV induction potencies of RMD, VOR, and PNB paralleled their inhibitory activities against multiple human HDAC isoenzymes. In both resting and memory CD4 T cells isolated from HIV-infected patients on suppressive combination antiretroviral therapy (cART), a 4-hour exposure to 40 nM RMD induced a mean 6-fold increase in intracellular HIV RNA levels, whereas a 24-hour treatment with 1 ÎŒM VOR resulted in 2- to 3-fold increases. RMD-induced intracellular HIV RNA expression persisted for 48 hours and correlated with sustained inhibition of cell-associated HDAC activity. By comparison, the induction of HIV RNA by VOR and PNB was transient and diminished after 24 hours. RMD also increased levels of extracellular HIV RNA and virions from both memory and resting CD4 T-cell cultures. The activation of HIV expression was observed at RMD concentrations below the drug plasma levels achieved by doses used in patients treated for T-cell lymphomas. In conclusion, RMD induces HIV expression ex vivo at concentrations that can be achieved clinically, indicating that the drug may reactivate latent HIV in patients on suppressive cART
The AI Learns to Lie to Please You: Preventing Biased Feedback Loops in Machine-Assisted Intelligence Analysis
Researchers are starting to design AI-powered systems to automatically select and summarize the reports most relevant to each analyst, which raises the issue of bias in the information presented. This article focuses on the selection of relevant reports without an explicit query, a task known as recommendation. Drawing on previous work documenting the existence of human-machine feedback loops in recommender systems, this article reviews potential biases and mitigations in the context of intelligence analysis. Such loops can arise when behavioral âengagementâ signals such as clicks or user ratings are used to infer the value of displayed information. Even worse, there can be feedback loops in the collection of intelligence information because users may also be responsible for tasking collection. Avoiding misalignment feedback loops requires an alternate, ongoing, non-engagement signal of information quality. Existing evaluation scales for intelligence product quality and rigor, such as the IC Rating Scale, could provide ground-truth feedback. This sparse data can be used in two ways: for human supervision of average performance and to build models that predict human survey ratings for use at recommendation time. Both techniques are widely used today by social media platforms. Open problems include the design of an ideal human evaluation method, the cost of skilled human labor, and the sparsity of the resulting data
Designing recommender systems to depolarize
Polarization is implicated in the erosion of democracy and the progression to violence, which makes the polarization properties of large algorithmic content selection systems (recommender systems) a matter of concern for peace and security. While algorithm-driven social media do not seem to be a primary driver of polarization at the country level, they could be a useful intervention point in polarized societies. This paper examines algorithmic depolarization interventions aimed at transforming conflict: not suppressing or eliminating conflict, but making it more constructive. Algorithmic intervention is considered at three stages: what content is available (moderation), how content is selected and personalized (ranking), and content presentation and controls (user interface). Empirical studies of online conflict suggest that not only could the exposure-diversity intervention proposed as an antidote to âfilter bubblesâ be improved: under some conditions, it can even worsen polarization. Using civility metrics in conjunction with diversity in content selection may be more effective. However, diversity-based interventions have not been tested at scale, and may not work in the diverse and dynamic contexts of real platforms. Instead, intervening in platform polarization dynamics will likely require continuous monitoring of polarization metrics, such as the widely used âfeeling thermometerâ. These metrics can be used to evaluate product features, and can potentially be engineered as algorithmic objectives. While using any metric as an optimization target may have harmful consequences, to prevent optimization processes from creating conflict as a side effect it may prove necessary to include polarization measures in the objective function of recommender algorithms
What the Victorians learned: perspectives on nineteenth-century schoolbooks
No abstract available