61 research outputs found
Query-dominant User Interest Network for Large-Scale Search Ranking
Historical behaviors have shown great effect and potential in various
prediction tasks, including recommendation and information retrieval. The
overall historical behaviors are various but noisy while search behaviors are
always sparse. Most existing approaches in personalized search ranking adopt
the sparse search behaviors to learn representation with bottleneck, which do
not sufficiently exploit the crucial long-term interest. In fact, there is no
doubt that user long-term interest is various but noisy for instant search, and
how to exploit it well still remains an open problem.
To tackle this problem, in this work, we propose a novel model named
Query-dominant user Interest Network (QIN), including two cascade units to
filter the raw user behaviors and reweigh the behavior subsequences.
Specifically, we propose a relevance search unit (RSU), which aims to search a
subsequence relevant to the query first and then search the sub-subsequences
relevant to the target item. These items are then fed into an attention unit
called Fused Attention Unit (FAU). It should be able to calculate attention
scores from the ID field and attribute field separately, and then adaptively
fuse the item embedding and content embedding based on the user engagement of
past period. Extensive experiments and ablation studies on real-world datasets
demonstrate the superiority of our model over state-of-the-art methods. The QIN
now has been successfully deployed on Kuaishou search, an online video search
platform, and obtained 7.6% improvement on CTR.Comment: 10 page
Responses of secondary wind dispersal to environmental characteristics and diaspore morphology of seven Calligonum species
Secondary diaspore dispersal by wind, that is, wind‐driven movement along the ground surface (GS), is important for the structure and dynamics of plant populations and communities. However, how wind velocity (WV), GS, and diaspore morphology influence diaspore secondary dispersal by wind are unclear. We used a wind tunnel and video camera to measure the threshold of WV (TWV) and diaspore velocities (DV) of secondary diaspore dispersal. Diaspores of seven Calligonum species with different appendages (wings, bristles, membranous balloon, and wings + thorns) were used to determine the TWV and DV under variable wind speed (4, 6, 8, and 10 m s‐1) and four GSs (cement, sand, loam, and gravel). GS and diaspore morphological traits explained 37.1 and 18% of diaspore TWV, respectively. Meanwhile, WV, GS, and diaspore morphological traits explained 62.4, 13.6, and 3.2% of DV, respectively. An increasing trend was shown for TWV, and a decreasing trend was shown for DV in the order of cement, sand, loam, and gravel surfaces. Spherical and light diaspores had low TWV and high DV, whereas winged and heavy diaspores had high TWV and low DV. Our results indicated that adaptive features of diaspore appendages might be the result of selection for primary dispersal or secondary dispersal. The mechanism of diaspore secondary dispersal is important for understanding the recovery of degraded sand dunes and providing theoretical support for restoration practices
Relationship between seed morphological traits and wind dispersal trajectory
The structure and dynamics of plant populations and communities are largely influenced by seed dispersal. How the wind dispersal trajectory of seeds shifts with differences in seed morphology remains unknown. We used a wind tunnel and video camera to track the dispersal trajectory of seven species of Calligonum whose seeds have different kinds of appendages and other morphological traits, using variable wind speeds and release heights to determine the relationship between seed morphological traits and wind dispersal trajectory. Concave-, straight-line-, horizontal-projectile- and projectile-shaped trajectories were found. Dispersal trajectories such as the horizontal projectile (HP) and projectile (P) tended to have a long dispersal distance. Straight line (SL) and concave curve (CC) trajectories tended to have a short dispersal distance. Seeds with bristles and large mass tended to have SL and CC trajectories, those with wings or balloon and small mass tended to have HP and P trajectories. Wind speed tended to have a stronger influence on the dispersal trajectory of light and low-wing-loading seeds, and release height tended to have a stronger influence on the dispersal trajectory of heavy and high-wing-loading seeds. Thus, seed wind dispersal trajectory is not only determined by seed morphological characteristics but also by environmental factors such as wind speed and release height
Cancer LncRNA Census reveals evidence for deep functional conservation of long noncoding RNAs in tumorigenesis.
Long non-coding RNAs (lncRNAs) are a growing focus of cancer genomics studies, creating the need for a resource of lncRNAs with validated cancer roles. Furthermore, it remains debated whether mutated lncRNAs can drive tumorigenesis, and whether such functions could be conserved during evolution. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, we introduce the Cancer LncRNA Census (CLC), a compilation of 122 GENCODE lncRNAs with causal roles in cancer phenotypes. In contrast to existing databases, CLC requires strong functional or genetic evidence. CLC genes are enriched amongst driver genes predicted from somatic mutations, and display characteristic genomic features. Strikingly, CLC genes are enriched for driver mutations from unbiased, genome-wide transposon-mutagenesis screens in mice. We identified 10 tumour-causing mutations in orthologues of 8 lncRNAs, including LINC-PINT and NEAT1, but not MALAT1. Thus CLC represents a dataset of high-confidence cancer lncRNAs. Mutagenesis maps are a novel means for identifying deeply-conserved roles of lncRNAs in tumorigenesis
Recommended from our members
Analyses of non-coding somatic drivers in 2,658 cancer whole genomes.
The discovery of drivers of cancer has traditionally focused on protein-coding genes1-4. Here we present analyses of driver point mutations and structural variants in non-coding regions across 2,658 genomes from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium5 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). For point mutations, we developed a statistically rigorous strategy for combining significance levels from multiple methods of driver discovery that overcomes the limitations of individual methods. For structural variants, we present two methods of driver discovery, and identify regions that are significantly affected by recurrent breakpoints and recurrent somatic juxtapositions. Our analyses confirm previously reported drivers6,7, raise doubts about others and identify novel candidates, including point mutations in the 5' region of TP53, in the 3' untranslated regions of NFKBIZ and TOB1, focal deletions in BRD4 and rearrangements in the loci of AKR1C genes. We show that although point mutations and structural variants that drive cancer are less frequent in non-coding genes and regulatory sequences than in protein-coding genes, additional examples of these drivers will be found as more cancer genomes become available
Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples
Funder: NCI U24CA211006Abstract: The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts
Precise Point Positioning with Almost Fully Deployed BDS-3, BDS-2, GPS, GLONASS, Galileo and QZSS Using Precise Products from Different Analysis Centers
The space segment of all the five satellite systems capable of providing precise position services, namely BeiDou Navigation Satellite System (BDS) (including BDS-3 and BDS-2), Global Positioning System (GPS), GLObal NAvigation Satellite System (GLONASS), Galileo and Quasi-Zenith Satellite System (QZSS), has almost been fully deployed at present, and the number of available satellites is approximately 136. Currently, the precise satellite orbit and clock products from the analysis centers European Space Agency (ESA), GeoForschungsZentrum Potsdam (GFZ) and Wuhan University (WHU) can support all five satellite systems. Thus, it is necessary to investigate the positioning performance of a five-system integrated precise point positioning (PPP) (i.e., GRECJ-PPP) using the precise products from different analysis centers under the current constellation status. It should be noted that this study only focuses on the long-term performance of PPP based on daily observations. The static GRECJ-PPP can provide a convergence time of 5.9–6.9/2.6–3.1/6.3–7.1 min and a positioning accuracy of 0.2–0.3/0.2–0.3/1.0–1.1 cm in east/north/up directions, respectively, while the corresponding kinematic statistics are 6.8–8.6/3.3–4.0/7.8–8.1 min and 1.0–1.1/0.8/2.5–2.6 cm in three directions, respectively. For completeness, although the real-time precise products from the analysis center Centre National d’Etudes Spatiales (CNES) do not incorporate QZSS satellites, the performance of real-time PPP with the other four satellite systems (i.e., GREC-PPP) is also analyzed. The real-time GREC-PPP can achieve a static convergence time of 8.7/5.2/11.2 min, a static positioning accuracy of 0.6/0.8/1.3 cm, a kinematic convergence time of 11.5/6.9/13.0 min, and a kinematic positioning accuracy of 1.7/1.6/3.6 cm in the three directions, respectively. For comparison, the results of single-system and dual-system PPP are also provided. In addition, the consistency of the precise products from different analysis centers is characterized
Recommended from our members
Finishing of Additively Manufactured Metal Parts by Abrasive Flow Machining
Surface finishing is still a crucial challenge in metal Additive Manufacturing (AM) as the
as-built surface roughness is difficult to fulfill service requirements, due to staircase effect,
balling effect inherent to AM. Abrasive flow machining (AFM) is a non-conventional finishing
technique that offers better accuracy and efficiency for parts with difficult-to-access structures,
and the application of AFM to finishing metal parts of AM process is discussed in this paper. The
aluminum and titanium grilles by selective laser melting are taken to explore the finishing effect
of outer and inner surfaces. The AFM process parameters of abrasive grits sizes, abrasive media
viscosity, and tooling designs are optimized to implement effective material removal from the
outer and inner surfaces. The results show that the AM grille parts with non-trial internal
structures can be finished efficiently and consistently by AFM.Mechanical Engineerin
Auditory Cryptography Security Algorithm With Audio Shelters
AbstractIn this paper, auditory cryptography security algorithm with audio shelters is proposed. The meaningful audio watermarking is pretreated to high-fidelity binary audio, and the binary audio is encrypted to n cryptographic audios by (k, n) threshold scheme. Less than k of the cryptographic audios give no information, only synchronized playing k or more than k of the audios the original can be heard directly. The n cryptographic audios are embedded in the corresponding n shelter audios which are pretreated by high-dimensional matrix transformation. Experiments show that the proposed algorithm has strong practicability, high security and robustness in enduring common attacks
- …