813 research outputs found
Wall-mediated self-diffusion in slit and cylindrical pores
Analytical and numerical simulation studies are performed on the diffusion of simple fluids in both thin slits and long cylindrical pores. In the region of large Knudsen numbers, where the wall-particle collisions outnumber the intermolecular collisions, we obtain analytical results for the self-diffusion coefficients for both slit and cylindrical pore shapes. The results show anomalous behavior of the mean square displacement and the velocity autocorrelation for the case of slits, unlike the case of cylindrical pores which shows standard Fick's law. Molecular dynamics simulations confirm the analytical results. We further study the wall-mediated diffusion behavior conducted by a Smoluchowski thermal wall and compare with our analytical results obtained from the stochastic thermal wall model proposed by Mon and Percus
Low-velocity anisotropic Dirac fermions on the side surface of topological insulators
We report anisotropic Dirac-cone surface bands on a side-surface geometry of
the topological insulator BiSe revealed by first-principles
density-functional calculations. We find that the electron velocity in the
side-surface Dirac cone is anisotropically reduced from that in the
(111)-surface Dirac cone, and the velocity is not in parallel with the wave
vector {\bf k} except for {\bf k} in high-symmetry directions. The size of the
electron spin depends on the direction of {\bf k} due to anisotropic variation
of the noncollinearity of the electron state. Low-energy effective Hamiltonian
is proposed for side-surface Dirac fermions, and its implications are presented
including refractive transport phenomena occurring at the edges of tological
insulators where different surfaces meet.Comment: 4 pages, 2 columns, 4 figure
INSTA-BNN: Binary Neural Network with INSTAnce-aware Threshold
Binary Neural Networks (BNNs) have emerged as a promising solution for
reducing the memory footprint and compute costs of deep neural networks. BNNs,
on the other hand, suffer from information loss because binary activations are
limited to only two values, resulting in reduced accuracy. To improve the
accuracy, previous studies have attempted to control the distribution of binary
activation by manually shifting the threshold of the activation function or
making the shift amount trainable. During the process, they usually depended on
statistical information computed from a batch. We argue that using statistical
data from a batch fails to capture the crucial information for each input
instance in BNN computations, and the differences between statistical
information computed from each instance need to be considered when determining
the binary activation threshold of each instance. Based on the concept, we
propose the Binary Neural Network with INSTAnce-aware threshold (INSTA-BNN),
which decides the activation threshold value considering the difference between
statistical data computed from a batch and each instance. The proposed
INSTA-BNN outperforms the baseline by 2.5% and 2.3% on the ImageNet
classification task with comparable computing cost, achieving 68.0% and 71.7%
top-1 accuracy on ResNet-18 and MobileNetV1 based models, respectively.Comment: 19 pages, 7 figures; excluded axessibility packag
OWQ: Lessons learned from activation outliers for weight quantization in large language models
Large language models (LLMs) with hundreds of billions of parameters show
impressive results across various language tasks using simple prompt tuning and
few-shot examples, without the need for task-specific fine-tuning. However,
their enormous size requires multiple server-grade GPUs even for inference,
creating a significant cost barrier. To address this limitation, we introduce a
novel post-training quantization method for weights with minimal quality
degradation. While activation outliers are known to be problematic in
activation quantization, our theoretical analysis suggests that we can identify
factors contributing to weight quantization errors by considering activation
outliers. We propose an innovative PTQ scheme called outlier-aware weight
quantization (OWQ), which identifies vulnerable weights and allocates
high-precision to them. Our extensive experiments demonstrate that the 3.01-bit
models produced by OWQ exhibit comparable quality to the 4-bit models generated
by OPTQ
Thermodynamics of d-dimensional hard sphere fluids confined to micropores
We derive an analytical expression of the second virial coefficient of d-dimensional hard sphere fluids confined to slit pores by applying Speedy and Reiss’ interpretation of cavity space. We confirm that this coefficient is identical to the one obtained from the Mayer cluster expansion up to second order with respect to fugacity. The key step of both approaches is to evaluate either the surface area or the volume of the d-dimensional exclusion sphere confined to a slit pore. We, further, present an analytical form of thermodynamic functions such as entropy and pressure tensor as a function of the size of the slit pore. Molecular dynamics simulations are performed for d = 2 and d = 3, and the results are compared with analytically obtained equations of state. They agree satisfactorily in the low density regime, and, for given density, the agreement of the results becomes excellent as the width of the slit pore gets smaller, because the higher order virial coefficients become unimportant
- …