65 research outputs found
A Conceptual Framework for Smart City International Standards
Smart cities construction has been a global focus during the past ten years. It contributes to the achievement of the sustainability development goals (for economy, society, and environment) by leveraging information and communication technologies (ICTs). International organizations (such as ISO, IEC, and ITU-T) have developed standards to encapsulate precise and state-of-the-art knowledge regarding research, practice and policy. However, thousands of such standards have not been fully used due to the lack of generally agreed vocabularies or frameworks. In this article, a conceptual framework named ‘ALL’ is proposed. Some initial evaluations on the proposed framework have been performed. The result shows that the framework could help people observe, organize and use such standards more efficiently. Some preliminary conversations with governments prove the potential usefulness of the framework in practice
Self adaptive global-local feature enhancement for radiology report generation
Automated radiology report generation aims at automatically generating a
detailed description of medical images, which can greatly alleviate the
workload of radiologists and provide better medical services to remote areas.
Most existing works pay attention to the holistic impression of medical images,
failing to utilize important anatomy information. However, in actual clinical
practice, radiologists usually locate important anatomical structures, and then
look for signs of abnormalities in certain structures and reason the underlying
disease. In this paper, we propose a novel framework AGFNet to dynamically fuse
the global and anatomy region feature to generate multi-grained radiology
report. Firstly, we extract important anatomy region features and global
features of input Chest X-ray (CXR). Then, with the region features and the
global features as input, our proposed self-adaptive fusion gate module could
dynamically fuse multi-granularity information. Finally, the captioning
generator generates the radiology reports through multi-granularity features.
Experiment results illustrate that our model achieved the state-of-the-art
performance on two benchmark datasets including the IU X-Ray and MIMIC-CXR.
Further analyses also prove that our model is able to leverage the
multi-grained information from radiology images and texts so as to help
generate more accurate reports
CAMP:Co-Attention Memory Networks for Diagnosis Prediction in Healthcare
Diagnosis prediction, which aims to predict future health information of patients from historical electronic health records (EHRs), is a core research task in personalized healthcare. Although some RNN-based methods have been proposed to model sequential EHR data, these methods have two major issues. First, they cannot capture fine-grained progression patterns of patient health conditions. Second, they do not consider the mutual effect between important context (e.g., patient demographics) and historical diagnosis. To tackle these challenges, we propose a model called Co-Attention Memory networks for diagnosis Prediction (CAMP), which tightly integrates historical records, fine-grained patient conditions, and demographics with a three-way interaction architecture built on co-attention. Our model augments RNNs with a memory network to enrich the representation capacity. The memory network enables analysis of fine-grained patient conditions by explicitly incorporating a taxonomy of diseases into an array of memory slots. We instantiate the READ/WRITE operations of the memory network so that the memory cooperates effectively with the patient demographics through co-attention mechanism. Experiments on real-world datasets demonstrate that CAMP consistently performs better than state-of-the-art methods
Rec4Ad: A Free Lunch to Mitigate Sample Selection Bias for Ads CTR Prediction in Taobao
Click-Through Rate (CTR) prediction serves as a fundamental component in
online advertising. A common practice is to train a CTR model on advertisement
(ad) impressions with user feedback. Since ad impressions are purposely
selected by the model itself, their distribution differs from the inference
distribution and thus exhibits sample selection bias (SSB) that affects model
performance. Existing studies on SSB mainly employ sample re-weighting
techniques which suffer from high variance and poor model calibration. Another
line of work relies on costly uniform data that is inadequate to train
industrial models. Thus mitigating SSB in industrial models with a
uniform-data-free framework is worth exploring. Fortunately, many platforms
display mixed results of organic items (i.e., recommendations) and sponsored
items (i.e., ads) to users, where impressions of ads and recommendations are
selected by different systems but share the same user decision rationales.
Based on the above characteristics, we propose to leverage recommendations
samples as a free lunch to mitigate SSB for ads CTR model (Rec4Ad). After
elaborating data augmentation, Rec4Ad learns disentangled representations with
alignment and decorrelation modules for enhancement. When deployed in Taobao
display advertising system, Rec4Ad achieves substantial gains in key business
metrics, with a lift of up to +6.6\% CTR and +2.9\% RPM
Joint Optimization of Ranking and Calibration with Contextualized Hybrid Model
Despite the development of ranking optimization techniques, pointwise loss
remains the dominating approach for click-through rate prediction. It can be
attributed to the calibration ability of the pointwise loss since the
prediction can be viewed as the click probability. In practice, a CTR
prediction model is also commonly assessed with the ranking ability. To
optimize the ranking ability, ranking loss (e.g., pairwise or listwise loss)
can be adopted as they usually achieve better rankings than pointwise loss.
Previous studies have experimented with a direct combination of the two losses
to obtain the benefit from both losses and observed an improved performance.
However, previous studies break the meaning of output logit as the
click-through rate, which may lead to sub-optimal solutions. To address this
issue, we propose an approach that can Jointly optimize the Ranking and
Calibration abilities (JRC for short). JRC improves the ranking ability by
contrasting the logit value for the sample with different labels and constrains
the predicted probability to be a function of the logit subtraction. We further
show that JRC consolidates the interpretation of logits, where the logits model
the joint distribution. With such an interpretation, we prove that JRC
approximately optimizes the contextualized hybrid discriminative-generative
objective. Experiments on public and industrial datasets and online A/B testing
show that our approach improves both ranking and calibration abilities. Since
May 2022, JRC has been deployed on the display advertising platform of Alibaba
and has obtained significant performance improvements.Comment: Accepted at KDD 202
COPR: Consistency-Oriented Pre-Ranking for Online Advertising
Cascading architecture has been widely adopted in large-scale advertising
systems to balance efficiency and effectiveness. In this architecture, the
pre-ranking model is expected to be a lightweight approximation of the ranking
model, which handles more candidates with strict latency requirements. Due to
the gap in model capacity, the pre-ranking and ranking models usually generate
inconsistent ranked results, thus hurting the overall system effectiveness. The
paradigm of score alignment is proposed to regularize their raw scores to be
consistent. However, it suffers from inevitable alignment errors and error
amplification by bids when applied in online advertising. To this end, we
introduce a consistency-oriented pre-ranking framework for online advertising,
which employs a chunk-based sampling module and a plug-and-play rank alignment
module to explicitly optimize consistency of ECPM-ranked results. A -based weighting mechanism is adopted to better distinguish the importance
of inter-chunk samples in optimization. Both online and offline experiments
have validated the superiority of our framework. When deployed in Taobao
display advertising system, it achieves an improvement of up to +12.3\% CTR and
+5.6\% RPM
Fstl1 Antagonizes BMP Signaling and Regulates Ureter Development
Bone morphogenetic protein (BMP) signaling pathway plays important roles in urinary tract development although the detailed regulation of its activity in this process remains unclear. Here we report that follistatin-like 1 (Fstl1), encoding a secreted extracellular glycoprotein, is expressed in developing ureter and antagonizes BMP signaling activity. Mouse embryos carrying disrupted Fstl1 gene displayed prominent hydroureter arising from proximal segment and ureterovesical junction defects. These defects were associated with significant reduction in ureteric epithelial cell proliferation at E15.5 and E16.5 as well as absence of subepithelial ureteral mesenchymal cells in the urinary tract at E16.5 and E18.5. At the molecular level, increased BMP signaling was found in Fstl1 deficient ureters, indicated by elevated pSmad1/5/8 activity. In vitro study also indicated that Fstl1 can directly bind to ALK6 which is specifically expressed in ureteric epithelial cells in developing ureter. Furthermore, Sonic hedgehog (SHH) signaling, which is crucial for differentiation of ureteral subepithelial cell proliferation, was also impaired in Fstl1-/- ureter. Altogether, our data suggest that Fstl1 is essential in maintaining normal ureter development by antagonizing BMP signaling
The Genomes of Oryza sativa: A History of Duplications
We report improved whole-genome shotgun sequences for the genomes of indica and japonica rice, both with multimegabase contiguity, or almost 1,000-fold improvement over the drafts of 2002. Tested against a nonredundant collection of 19,079 full-length cDNAs, 97.7% of the genes are aligned, without fragmentation, to the mapped super-scaffolds of one or the other genome. We introduce a gene identification procedure for plants that does not rely on similarity to known genes to remove erroneous predictions resulting from transposable elements. Using the available EST data to adjust for residual errors in the predictions, the estimated gene count is at least 38,000–40,000. Only 2%–3% of the genes are unique to any one subspecies, comparable to the amount of sequence that might still be missing. Despite this lack of variation in gene content, there is enormous variation in the intergenic regions. At least a quarter of the two sequences could not be aligned, and where they could be aligned, single nucleotide polymorphism (SNP) rates varied from as little as 3.0 SNP/kb in the coding regions to 27.6 SNP/kb in the transposable elements. A more inclusive new approach for analyzing duplication history is introduced here. It reveals an ancient whole-genome duplication, a recent segmental duplication on Chromosomes 11 and 12, and massive ongoing individual gene duplications. We find 18 distinct pairs of duplicated segments that cover 65.7% of the genome; 17 of these pairs date back to a common time before the divergence of the grasses. More important, ongoing individual gene duplications provide a never-ending source of raw material for gene genesis and are major contributors to the differences between members of the grass family
- …