137 research outputs found
QARV: Quantization-Aware ResNet VAE for Lossy Image Compression
This paper addresses the problem of lossy image compression, a fundamental
problem in image processing and information theory that is involved in many
real-world applications. We start by reviewing the framework of variational
autoencoders (VAEs), a powerful class of generative probabilistic models that
has a deep connection to lossy compression. Based on VAEs, we develop a novel
scheme for lossy image compression, which we name quantization-aware ResNet VAE
(QARV). Our method incorporates a hierarchical VAE architecture integrated with
test-time quantization and quantization-aware training, without which efficient
entropy coding would not be possible. In addition, we design the neural network
architecture of QARV specifically for fast decoding and propose an adaptive
normalization operation for variable-rate compression. Extensive experiments
are conducted, and results show that QARV achieves variable-rate compression,
high-speed decoding, and a better rate-distortion performance than existing
baseline methods. The code of our method is publicly accessible at
https://github.com/duanzhiihao/lossy-vaeComment: Technical repor
Rec4Ad: A Free Lunch to Mitigate Sample Selection Bias for Ads CTR Prediction in Taobao
Click-Through Rate (CTR) prediction serves as a fundamental component in
online advertising. A common practice is to train a CTR model on advertisement
(ad) impressions with user feedback. Since ad impressions are purposely
selected by the model itself, their distribution differs from the inference
distribution and thus exhibits sample selection bias (SSB) that affects model
performance. Existing studies on SSB mainly employ sample re-weighting
techniques which suffer from high variance and poor model calibration. Another
line of work relies on costly uniform data that is inadequate to train
industrial models. Thus mitigating SSB in industrial models with a
uniform-data-free framework is worth exploring. Fortunately, many platforms
display mixed results of organic items (i.e., recommendations) and sponsored
items (i.e., ads) to users, where impressions of ads and recommendations are
selected by different systems but share the same user decision rationales.
Based on the above characteristics, we propose to leverage recommendations
samples as a free lunch to mitigate SSB for ads CTR model (Rec4Ad). After
elaborating data augmentation, Rec4Ad learns disentangled representations with
alignment and decorrelation modules for enhancement. When deployed in Taobao
display advertising system, Rec4Ad achieves substantial gains in key business
metrics, with a lift of up to +6.6\% CTR and +2.9\% RPM
Grid investment capability prediction based on path analysis and BP neural network
With the more complex investment environment of China’s power grid, the accurate prediction of the investment ability of power grid enterprises has become an important prerequisite for managers to make precise investment decisions. This paper first selects the factors affecting the investment capacity of the power grid from the internal and external environment, and establishes the index system of the factors affecting the investment capacity. Secondly, the path analysis is used to deeply explore the interaction relationship and influence degree of each index and investment capacity. Finally, the maximum investment capacity of the power network can be predicted based on the BP neural network prediction model. The results show that the BP neural network prediction model can achieve higher prediction accuracy when predicting the power grid investment capability
A multicenter study of fetal chromosomal abnormalities in Chinese women of advanced maternal age
AbstractObjectiveThis study aimed to determine the rates of different fetal chromosomal abnormalities among women of advanced maternal age in China and to discuss the possible misdiagnosis risks of newer molecular techniques, for selection of appropriate prenatal screening and diagnostic technologies.Materials and MethodsSecond trimester amniocentesis and fetal karyotype results of 46,258 women were retrospectively reviewed. All women were ≥ 35 years old with singleton pregnancies. The rates of clinically significant chromosomal abnormalities (CSCAs), incidence of chromosomal abnormalities, and correlations with age were determined.ResultsFrom 2001 to 2010, the proportion of women of advanced maternal age undergoing prenatal diagnosis increased from 20% to 46%. The mean age was 37.4 years (range, 35–46 years). A total of 708 cases of CSCAs, with a rate of 1.53% were found. Trisomy 21 was the most common single chromosome abnormality and accounted for 55.9% of all CSCAs with an incidence of 0.86%. Trisomy 13, trisomy 18, and trisomy 21, the most common chromosome autosomal aneuploidies, accounted for 73.6% of all CSCAs, with a rate of 1.13%. As a group, the most common chromosomal aneuploidies (13/18/21/X/Y) accounted for 93.9% of all abnormalities, with a rate of 1.44%. The incidence of trisomy 21, trisomy 13/18/21 as a group, and 13/18/21/X/Y as a group was significantly greater in women aged 39 years and older (p < 0.001), but was not different between women aged 35 years, 36 years, 37 years, and 38 years.ConclusionThese findings may assist in genetic counseling of advanced maternal age pregnant women, and provide a basis for the selection of prenatal screening and diagnostic technologies
COPR: Consistency-Oriented Pre-Ranking for Online Advertising
Cascading architecture has been widely adopted in large-scale advertising
systems to balance efficiency and effectiveness. In this architecture, the
pre-ranking model is expected to be a lightweight approximation of the ranking
model, which handles more candidates with strict latency requirements. Due to
the gap in model capacity, the pre-ranking and ranking models usually generate
inconsistent ranked results, thus hurting the overall system effectiveness. The
paradigm of score alignment is proposed to regularize their raw scores to be
consistent. However, it suffers from inevitable alignment errors and error
amplification by bids when applied in online advertising. To this end, we
introduce a consistency-oriented pre-ranking framework for online advertising,
which employs a chunk-based sampling module and a plug-and-play rank alignment
module to explicitly optimize consistency of ECPM-ranked results. A -based weighting mechanism is adopted to better distinguish the importance
of inter-chunk samples in optimization. Both online and offline experiments
have validated the superiority of our framework. When deployed in Taobao
display advertising system, it achieves an improvement of up to +12.3\% CTR and
+5.6\% RPM
Robust Representation Learning for Unified Online Top-K Recommendation
In large-scale industrial e-commerce, the efficiency of an online
recommendation system is crucial in delivering highly relevant item/content
advertising that caters to diverse business scenarios. However, most existing
studies focus solely on item advertising, neglecting the significance of
content advertising. This oversight results in inconsistencies within the
multi-entity structure and unfair retrieval. Furthermore, the challenge of
retrieving top-k advertisements from multi-entity advertisements across
different domains adds to the complexity. Recent research proves that
user-entity behaviors within different domains exhibit characteristics of
differentiation and homogeneity. Therefore, the multi-domain matching models
typically rely on the hybrid-experts framework with domain-invariant and
domain-specific representations. Unfortunately, most approaches primarily focus
on optimizing the combination mode of different experts, failing to address the
inherent difficulty in optimizing the expert modules themselves. The existence
of redundant information across different domains introduces interference and
competition among experts, while the distinct learning objectives of each
domain lead to varying optimization challenges among experts. To tackle these
issues, we propose robust representation learning for the unified online top-k
recommendation. Our approach constructs unified modeling in entity space to
ensure data fairness. The robust representation learning employs domain
adversarial learning and multi-view wasserstein distribution learning to learn
robust representations. Moreover, the proposed method balances conflicting
objectives through the homoscedastic uncertainty weights and orthogonality
constraints. Various experiments validate the effectiveness and rationality of
our proposed method, which has been successfully deployed online to serve real
business scenarios.Comment: 14 pages, 6 figures, submitted to ICD
Shifts in Soil Microbial Community Composition, Function, and Co-occurrence Network of Phragmites australis in the Yellow River Delta
Soil microorganisms play vital roles in regulating biogeochemical processes. The composition and function of soil microbial community have been well studied, but little is known about the responses of bacterial and fungal communities to different habitats of the same plant, especially the inter-kingdom co-occurrence pattern including bacteria and fungi. Herein, we used high-throughput sequencing to investigate the bacterial and fungal communities of five Phragmites australis habitats in the Yellow River Delta and constructed their inter-kingdom interaction network by network analysis. The results showed that richness did not differ significantly among habitats for either the bacterial or fungal communities. The distribution of soil bacterial community was significantly affected by soil physicochemical properties, whereas that of the fungal community was not. The main functions of the bacterial and fungal communities were to participate in the degradation of organic matter and element cycling, both of which were significantly affected by soil physicochemical properties. Network analysis revealed that bacteria and fungi participated in the formation of networks through positive interactions; the role of intra-kingdom interactions were more important than inter-kingdom interactions. In addition, rare species acted as keystones played a critical role in maintaining the network structure, while NO3−−N likely played an important role in maintaining the network topological properties. Our findings provided insights into the inter-kingdom microbial co-occurrence network and response of the soil microbial community composition and function to different P. australis habitats in coastal wetlands, which will deepen our insights into microbial community assembly in coastal wetlands
- …