318 research outputs found
Multi-Channel Pyramid Person Matching Network for Person Re-Identification
In this work, we present a Multi-Channel deep convolutional Pyramid Person
Matching Network (MC-PPMN) based on the combination of the semantic-components
and the color-texture distributions to address the problem of person
re-identification. In particular, we learn separate deep representations for
semantic-components and color-texture distributions from two person images and
then employ pyramid person matching network (PPMN) to obtain correspondence
representations. These correspondence representations are fused to perform the
re-identification task. Further, the proposed framework is optimized via a
unified end-to-end deep learning scheme. Extensive experiments on several
benchmark datasets demonstrate the effectiveness of our approach against the
state-of-the-art literature, especially on the rank-1 recognition rate.Comment: 9 pages, 5 figures, 7 tables and accepted by the 32nd AAAI Conference
on Artificial Intelligenc
Liquid metal flows drive by gas bubbles in a static magnetic field
This thesis presents an experimental study which investigates the behaviour of gas bubbles rising in a liquid metal and the related bubble-driven flow under the influence of external DC magnetic fields. The experimental configuration considered here concerns a cylindrical container filled with the eutectic alloy GaInSn. Argon gas bubbles are injected through a single orifice located at the container bottom in the centre of the circular cross-section. A homogeneous magnetic field was generated by a Helmholtz configuration of a pair of water-cooled copper coils. The magnetic field has been imposed either in vertical direction parallel to the main bubble motion or in horizontal direction, respectively. A vertical magnetic field stabilizes and damps the liquid metal flow effectively. The temporal variations of the fluid velocity with time become smaller with increasing magnetic induction. The velocity magnitudes are decreased, and the velocity distributions along the magnetic field lines are smoothed. The flow field keeps the axisymmetric distribution. A horizontal magnetic field destabilizes and enhances the flow within a range of moderate Hartmann numbers (100 < Ha < 400). The flow becomes non-axisymmetric due to the non-isotropic influence of the magnetic field. In the meridional plane parallel to the field lines, the flow changes its direction from a downward to an upward motion. Enhanced downward flows were observed in the meridional plane perpendicular to the field lines. The liquid velocity in both planes shows strong, periodic oscillations. The fluid motion is dominated by large-scale structures elongated along the magnetic field lines over the entire chord lengths of the circular cross-section
Self-organization of photoionized plasmas via kinetic instabilities
Self-organization in an unmagnetized collisionless plasma (in this paper)
refers to formation of transient coherent structures such as collective
oscillations (electrostatic waves) or magnetic fields resulting from so-called
kinetic effects in the plasma. This topical review provides a comprehensive
analysis of the self-organization of strong-field photoionized, non-equilibrium
plasmas through kinetic instabilities. The authors propose and demonstrate a
novel experimental platform that enables the formation of dense plasmas with
known highly anisotropic and nonthermal electron velocity distribution
functions on a timescale on the order of an inverse electron plasma frequency.
We then show that such plasmas are highly susceptible to a hierarchy of kinetic
instabilities, including two-stream, current filamentation and Weibel, that
convert a fraction of the electron kinetic energy into electric and/or magnetic
energy stored in self-organized structures. The electrostatic waves so produced
are measured using a collective light (Thomson) scattering technique with
femtosecond resolution as the kinetic instabilities aided by collisions
eventually thermalize the plasma electrons. In addition, we describe a novel
experimental technique that has made it possible to map the temporal evolution
of the wavenumber spectrum of the thermal Weibel instability with picosecond
resolution, which leads to the formation of quasi-static coherent magnetic
fields with different topologies in photoionized plasmas. Finally, the paper
summarizes the important results and discusses future directions on this topic
Research on solitons interactions' in one-dimensional indium chains on Si(111) surfaces
Solitons have garnered significant attention across various fields, yet a
contentious debate persists regarding the precise structure of solitons on
indium chains. Currently, multiple forms of solitons in one-dimensional atomic
chains have been reported. STM provides an effective means to study the precise
atomic structure of solitons, particularly their dynamics and interactions.
However, limited research has been conducted on soliton interactions and
soliton-chain interactions, despite their profound impact on relative soliton
motions and the overall physical properties of the system. In this work, we
characterized the structures of the soliton dimer and trimer, observed the
displacements induced by the soliton entity and statisticized the dynamic
behaviors of soliton dimers over time evolution or temperature. To reveal the
soliton mechanism, we further utilized STM to investigate the CDWs between two
solitons when two monomers were encountered. Additionally, we achieved the
manipulation of the monomer on the indium chain by the STM tip. Our work serves
as an important approach to elucidate interactions in correlated electronic
systems and advance the development of potential topological soliton computers
Towards Greener LLMs: Bringing Energy-Efficiency to the Forefront of LLM Inference
With the ubiquitous use of modern large language models (LLMs) across
industries, the inference serving for these models is ever expanding. Given the
high compute and memory requirements of modern LLMs, more and more
top-of-the-line GPUs are being deployed to serve these models. Energy
availability has come to the forefront as the biggest challenge for data center
expansion to serve these models. In this paper, we present the trade-offs
brought up by making energy efficiency the primary goal of LLM serving under
performance SLOs. We show that depending on the inputs, the model, and the
service-level agreements, there are several knobs available to the LLM
inference provider to use for being energy efficient. We characterize the
impact of these knobs on the latency, throughput, as well as the energy. By
exploring these trade-offs, we offer valuable insights into optimizing energy
usage without compromising on performance, thereby paving the way for
sustainable and cost-effective LLM deployment in data center environments.Comment: 6 pages, 15 figure
POLCA: Power Oversubscription in LLM Cloud Providers
Recent innovation in large language models (LLMs), and their myriad use-cases
have rapidly driven up the compute capacity demand for datacenter GPUs. Several
cloud providers and other enterprises have made substantial plans of growth in
their datacenters to support these new workloads. One of the key bottleneck
resources in datacenters is power, and given the increasing model sizes of
LLMs, they are becoming increasingly power intensive. In this paper, we show
that there is a significant opportunity to oversubscribe power in LLM clusters.
Power oversubscription improves the power efficiency of these datacenters,
allowing more deployable servers per datacenter, and reduces the deployment
time, since building new datacenters is slow.
We extensively characterize the power consumption patterns of a variety of
LLMs and their configurations. We identify the differences between the
inference and training power consumption patterns. Based on our analysis of
these LLMs, we claim that the average and peak power utilization in LLM
clusters for inference should not be very high. Our deductions align with the
data from production LLM clusters, revealing that inference workloads offer
substantial headroom for power oversubscription. However, the stringent set of
telemetry and controls that GPUs offer in a virtualized environment, makes it
challenging to have a reliable and robust power oversubscription mechanism.
We propose POLCA, our framework for power oversubscription that is robust,
reliable, and readily deployable for GPU clusters. Using open-source models to
replicate the power patterns observed in production, we simulate POLCA and
demonstrate that we can deploy 30% more servers in the same GPU cluster for
inference, with minimal performance los
- …