Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages
We present Samanantar, the largest publicly available parallel corpora
collection for Indic languages. The collection contains a total of 49.7 million
sentence pairs between English and 11 Indic languages (from two language
families). Specifically, we compile 12.4 million sentence pairs from existing,
publicly-available parallel corpora, and additionally mine 37.4 million
sentence pairs from the web, resulting in a 4x increase. We mine the parallel
sentences from the web by combining many corpora, tools, and methods: (a)
web-crawled monolingual corpora, (b) document OCR for extracting sentences from
scanned documents, (c) multilingual representation models for aligning
sentences, and (d) approximate nearest neighbor search for searching in a large
collection of sentences. Human evaluation of samples from the newly mined
corpora validates the high quality of the parallel sentences across 11
languages. Further, we extract 83.4 million sentence pairs between all 55 Indic
language pairs from the English-centric parallel corpus using English as the
pivot language. We trained multilingual NMT models spanning all these languages
on Samanantar, which outperform existing models and baselines on publicly
available benchmarks, such as FLORES, establishing the utility of Samanantar.
Our data and models are available publicly at
https://indicnlp.ai4bharat.org/samanantar/ and we hope they will help advance
research in NMT and multilingual NLP for Indic languages.
Comment: Accepted to the Transactions of the Association for Computational
Linguistics (TACL
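
The English-pivot step described in the Samanantar abstract can be illustrated with a minimal sketch. It assumes that two Indic sentences are paired whenever they are translations of the exact same English sentence; the function name `pivot_pairs`, the exact-match rule, and the toy placeholder data are illustrative assumptions, not the authors' pipeline.

```python
import math
from collections import defaultdict
from itertools import combinations


def pivot_pairs(en_centric):
    """Build non-English sentence pairs by pivoting through English.

    en_centric maps a language code to a list of (english, translation)
    pairs. Exact string matching on the English side is an assumption
    made for this sketch.
    """
    # Index every translation by its English sentence.
    by_english = defaultdict(dict)
    for lang, pairs in en_centric.items():
        for english, translation in pairs:
            by_english[english][lang] = translation

    # Pair up all languages that share the same English sentence.
    result = defaultdict(list)
    for translations in by_english.values():
        for a, b in combinations(sorted(translations), 2):
            result[(a, b)].append((translations[a], translations[b]))
    return dict(result)


# Toy data with placeholder strings standing in for real sentences.
toy = {
    "hi": [("Thank you.", "hi_thanks"), ("Good morning.", "hi_morning")],
    "ta": [("Thank you.", "ta_thanks")],
    "bn": [("Thank you.", "bn_thanks"), ("Good morning.", "bn_morning")],
}
pairs = pivot_pairs(toy)

# 11 languages give C(11, 2) unordered language pairs, matching the 55 quoted.
n_language_pairs = math.comb(11, 2)
```

Here `pairs[("hi", "ta")]` holds the single pair that shares "Thank you." as its pivot, and `n_language_pairs` reproduces the count of 55 Indic language pairs stated in the abstract.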
The Effect of Iron Oxide Magnetic Nanoparticles on Smooth Muscle Cells
Recently, magnetic nanoparticles of iron oxide (Fe3O4, γ-Fe2O3) have found an increasing number of applications in biomedicine, but questions have been raised about their potential impact on the environment and human health. In this work, three types of magnetic nanoparticles (DMSA-Fe2O3, APTS-Fe2O3, and GLU-Fe2O3) with the same crystal structure, magnetic properties, and size distribution were designed, prepared, and characterized by transmission electron microscopy, powder X-ray diffraction, zeta potential analysis, vibrating sample magnetometry, and Fourier transform infrared spectroscopy. We then investigated the effect of these three types of nanoparticles on smooth muscle cells (SMCs). Cellular uptake of the nanoparticles by SMCs depends on the dose, the incubation time, and the surface properties. Thin-section TEM images show that DMSA-Fe2O3 is incorporated into the lysosomes of SMCs. The magnetic nanoparticles have no inflammatory effect, but they decrease the viability of SMCs. Metabolism and other effects will be the subject of further studies.
Azimuthal anisotropy and correlations at large transverse momenta in $p+p$ and Au+Au collisions at $\sqrt{s_{NN}}$ = 200 GeV
Results on high transverse momentum charged particle emission with respect to
the reaction plane are presented for Au+Au collisions at $\sqrt{s_{NN}}$ =
200 GeV. Two- and four-particle correlation results are presented as well as a
comparison of azimuthal correlations in Au+Au collisions to those in $p+p$ at
the same energy. Elliptic anisotropy, $v_2$, is found to reach its maximum at
$p_t \sim 3$ GeV/c, then decrease slowly and remain significant up to
$p_t \approx 7$ -- 10 GeV/c. Stronger suppression is found in the back-to-back
high-$p_t$ particle correlations for particles emitted out-of-plane compared to
those emitted in-plane. The centrality dependence of $v_2$ at intermediate
$p_t$ is compared to simple models based on jet quenching.
Comment: 4 figures. Published version as PRL 93, 252301 (2004
$B_s^0-\bar{B}_s^0$ mixing in a family non-universal $Z^\prime$ model revisited
Motivated by the very recent measurements performed at the LHCb and the
Tevatron of the $B_s^0-\bar{B}_s^0$ mixing, in this paper we revisit it in a
family non-universal $Z^\prime$ model, to check if a simultaneous explanation
for all the mixing observables, especially for the like-sign dimuon charge
asymmetry observed by the D0 collaboration, could be made in such a specific
model. In the first scenario where the $Z^\prime$ boson contributes only to the
off-diagonal element $M_{12}^s$, it is found that, once the combined
constraints from $\Delta M_s$, $\phi_s$ and $\Delta\Gamma_s$ are imposed, the
model could not explain the measured flavour-specific CP asymmetry $a_{fs}^s$,
at least within its $1\sigma$ ranges. In the second scenario where the NP
contributes also to the absorptive part $\Gamma_{12}^s$ via tree-level
$Z^\prime$-induced $b \to c\bar{c}s$ operators, we find that, with the
constraints from $\Delta M_s$, $\phi_s$ and the indirect CP asymmetry in
$\bar{B}_d \to J/\psi K_S$ taken into account, the present measured $1\sigma$
experimental ranges for $a_{fs}^s$ could not be reproduced either. Thus, such a
specific $Z^\prime$ model with our specific assumptions could not
simultaneously reconcile all the present data on $B_s^0-\bar{B}_s^0$ mixing.
Future improved measurements from the LHCb and the proposed superB experiments,
especially of the flavour-specific CP asymmetries, are expected to shed light
on the issue.
Comment: 30 pages, 6 figures, 1 table, pdflatex; accepted by JHE
Azimuthal anisotropy in Au+Au collisions at sqrtsNN = 200 GeV
The results from the STAR Collaboration on directed flow (v_1), elliptic flow
(v_2), and the fourth harmonic (v_4) in the anisotropic azimuthal distribution
of particles from Au+Au collisions at sqrtsNN = 200 GeV are summarized and
compared with results from other experiments and theoretical models. Results
for identified particles are presented and fit with a Blast Wave model.
Different anisotropic flow analysis methods are compared and nonflow effects
are extracted from the data. For v_2, scaling with the number of constituent
quarks and parton coalescence is discussed. For v_4, scaling with v_2^2 and
quark coalescence is discussed.
Comment: 26 pages. As accepted by Phys. Rev. C. Text rearranged, figures
modified, but data the same. However, in Fig. 35 the hydro calculations are
corrected in this version. The data tables are available at
http://www.star.bnl.gov/central/publications/ by searching for "flow" and
then this pape
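
The harmonics v_1, v_2, and v_4 named in the abstract above are, in the standard convention, the Fourier coefficients of the azimuthal distribution relative to the reaction plane angle Psi: dN/dphi is proportional to 1 + 2 * sum_n v_n cos(n (phi - Psi)), so that v_n = <cos(n (phi - Psi))>. The following minimal sketch recovers the coefficients from such a distribution on a uniform grid. The input values in `v_true` are illustrative numbers chosen for this example, not STAR measurements, and the reaction plane angle is assumed known, whereas a real analysis must estimate it and correct for its resolution.

```python
import math


def azimuthal_density(phi, v, psi=0.0):
    """dN/dphi up to normalisation: 1 + 2 * sum_n v_n * cos(n * (phi - psi))."""
    return 1.0 + 2.0 * sum(vn * math.cos(n * (phi - psi)) for n, vn in v.items())


def flow_coefficient(n, phis, weights, psi=0.0):
    """Weighted average <cos(n * (phi - psi))> over the sampled distribution."""
    numerator = sum(w * math.cos(n * (phi - psi)) for phi, w in zip(phis, weights))
    return numerator / sum(weights)


# Illustrative input harmonics (assumed values, not data).
v_true = {1: 0.01, 2: 0.06, 4: 0.004}

# Uniform grid over one full turn; the weights play the role of particle counts.
n_bins = 3600
phis = [2.0 * math.pi * k / n_bins for k in range(n_bins)]
weights = [azimuthal_density(phi, v_true) for phi in phis]

v_est = {n: flow_coefficient(n, phis, weights) for n in v_true}
```

On a uniform grid the cosine harmonics are orthogonal, so `v_est` reproduces `v_true` to floating-point precision. With finite event samples the same average carries statistical fluctuations and nonflow contributions, which is why the abstract compares several analysis methods.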
Rapidity and Centrality Dependence of Proton and Anti-proton Production from Au+Au Collisions at sqrt(sNN) = 130 GeV
We report on the rapidity and centrality dependence of proton and anti-proton
transverse mass distributions from Au+Au collisions at sqrt(sNN) = 130 GeV as
measured by the STAR experiment at RHIC. Our results are from the rapidity and
transverse momentum range of |y| < 0.5 and 0.35 < p_t < 1.00 GeV/c. For both protons
and anti-protons, transverse mass distributions become more convex from
peripheral to central collisions demonstrating characteristics of collective
expansion. The measured rapidity distributions and the mean transverse momenta
versus rapidity are flat within |y|<0.5. Comparisons of our data with results
from model calculations indicate that in order to obtain a consistent picture
of the proton (anti-proton) yields and transverse mass distributions the
possibility of pre-hadronic collective expansion may have to be taken into
account.
Comment: 4 pages, 3 figures, 1 table, submitted to PR
Effective Rheology of Bubbles Moving in a Capillary Tube
We calculate the average volumetric flux versus pressure drop of bubbles
moving in a single capillary tube with varying diameter, finding a square-root
relation by mapping the flow equations onto those of a driven overdamped
pendulum. The calculation is based on a derivation of the equation of motion of
a bubble train from considering the capillary forces and the entropy production
associated with the viscous flow. We also calculate the configurational
probability of the positions of the bubbles.
Comment: 4 pages, 1 figur
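
A short worked derivation shows how a square-root law arises for a driven overdamped pendulum, the system the abstract above maps the bubble flow onto. The dimensionless form below is an assumed normalisation, and identifying the drive F with the scaled pressure drop and the mean angular velocity with the scaled flux is an illustrative reading of the abstract, not the paper's own notation.

```latex
% Overdamped pendulum under constant drive F (dimensionless, assumed form):
\[
  \dot{\theta} = F - \sin\theta , \qquad F > 1 .
\]
% Time needed for one full rotation:
\[
  T = \int_0^{2\pi} \frac{d\theta}{F - \sin\theta}
    = \frac{2\pi}{\sqrt{F^2 - 1}} .
\]
% Time-averaged angular velocity, with its behaviour just above threshold:
\[
  \langle \dot{\theta} \rangle = \frac{2\pi}{T} = \sqrt{F^2 - 1}
  \approx \sqrt{2}\,\sqrt{F - 1} \qquad (F \to 1^{+}) .
\]
```

For F at or below 1 the pendulum settles at a fixed angle and the average velocity vanishes, so the response grows as the square root of the excess drive above the threshold.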
Planar and van der Waals heterostructures for vertical tunnelling single electron transistors
Despite the rich choice of two-dimensional materials now available, heterostructures, both vertical (van der Waals) and in-plane, offer unprecedented control over the properties and functionalities of the resulting structures. Thus, planar heterostructures allow p-n junctions between different two-dimensional semiconductors and graphene nanoribbons with well-defined edges, and vertical heterostructures have resulted in the observation of superconductivity in purely carbon-based systems and the realisation of vertical tunnelling transistors. Here we demonstrate the simultaneous use of in-plane and van der Waals heterostructures to build vertical single electron tunnelling transistors. We grow graphene quantum dots inside a matrix of hexagonal boron nitride, which allows a dramatic reduction in the number of localised states along the perimeter of the quantum dots. The use of hexagonal boron nitride tunnel barriers as contacts to the graphene quantum dots makes our transistors reproducible and independent of the localised states, opening up even greater flexibility for the design of future devices.