17 research outputs found

    Vertical Semi-Federated Learning for Efficient Online Advertising

    As an emerging secure learning paradigm in leveraging cross-silo private data, vertical federated learning (VFL) is expected to improve advertising models by enabling the joint learning of complementary user attributes privately owned by the advertiser and the publisher. However, 1) its applicable scope is restricted to overlapped samples and 2) real-time federated serving poses a high system challenge, both of which have limited its application to advertising systems. In this paper, we advocate a new learning setting, Semi-VFL (Vertical Semi-Federated Learning), as a lightweight solution that utilizes all available data (both the overlapped and non-overlapped data) and is free from federated serving. Semi-VFL is expected to perform better than single-party models and maintain a low inference cost. To implement a good solution for Semi-VFL, it is notably important to i) alleviate the absence of the passive party's features and ii) adapt to the whole sample space. Thus, we propose a carefully designed joint privileged learning framework (JPL) as an efficient implementation of Semi-VFL. Specifically, we build an inference-efficient single-party student model that is applicable to the whole sample space while maintaining the advantage of the federated feature extension. Novel feature imitation and ranking consistency restriction methods are proposed to extract cross-party feature correlations and maintain cross-sample-space consistency for both the overlapped and non-overlapped data. We conducted extensive experiments on real-world advertising datasets. The results show that our method achieves the best performance over baseline methods and validate its effectiveness in maintaining cross-view feature correlation.

    Achieving Lightweight Federated Advertising with Self-Supervised Split Distillation

    As an emerging secure learning paradigm in leveraging cross-agency private data, vertical federated learning (VFL) is expected to improve advertising models by enabling the joint learning of complementary user attributes privately owned by the advertiser and the publisher. However, there are two key challenges in applying it to advertising systems: a) the limited scale of labeled overlapping samples, and b) the high cost of real-time cross-agency serving. In this paper, we propose a semi-supervised split distillation framework VFed-SSD to alleviate the two limitations. We identify that: i) there are massive unlabeled overlapped data available in advertising systems, and ii) we can keep a balance between model performance and inference cost by decomposing the federated model. Specifically, we develop a self-supervised task Matched Pair Detection (MPD) to exploit the vertically partitioned unlabeled data and propose the Split Knowledge Distillation (SplitKD) schema to avoid cross-agency serving. Empirical studies on three industrial datasets exhibit the effectiveness of our methods, with the median AUC over all datasets improved by 0.86% and 2.6% in the local deployment mode and the federated deployment mode respectively. Overall, our framework provides an efficient federation-enhanced solution for real-time display advertising with minimal deploying cost and significant performance lift. Comment: Accepted to the Trustworthy Federated Learning workshop of IJCAI2022 (FL-IJCAI22). 6 pages, 3 figures, 3 tables. Old title: Semi-Supervised Cross-Silo Advertising with Partial Knowledge Transfe

    The Effect of Food Deprivation on Foraging Behavior and Digestive and Metabolic Capacities of the Chinese Mitten Crab, Eriocheir sinensis

    Food deprivation is a common stress in crustaceans that can affect their behavior and physiology. In this study, a video recording analysis system was used to compare the predation rate and behavior of the Chinese mitten crab, Eriocheir sinensis, foraging on the freshwater snail, Bellamya quadrata, under different degrees of food deprivation. The activities of lactate dehydrogenase, α-amylase, pepsin, and lipase in the hepatopancreas of crabs were determined after food deprivation for 0, 3, 9, and 15 days. The results showed that the predation and encounter rates of E. sinensis increased and then decreased with an increase in food deprivation time. The proportion of stationary time of E. sinensis initially decreased and then increased, whereas the proportion of searching and handling time increased initially and then decreased. There was a significant difference in the probability of capture upon encountering; however, food deprivation did not significantly affect the probability of consumption upon capture. Under food deprivation, the proportion of E. sinensis crushing tactics after nine-day food deprivation was significantly lower than that in the first nine days. Lactate dehydrogenase activity in the E. sinensis hepatopancreas after food deprivation was significantly higher than that in those not subjected to food deprivation, and the activities of α-amylase and pepsin in the hepatopancreas were consistent with the predation rate trend, showing an initial increase followed by a decrease. In conclusion, different degrees of food deprivation significantly affected the predation cycle of E. sinensis on B. quadrata. These results lay a solid foundation for further studies on the foraging behavioral ecology of E. sinensis and provide important information for improving E. sinensis production

    Single-Cell RNA-Seq Reveals Cellular Heterogeneity of Pluripotency Transition and X Chromosome Dynamics during Early Mouse Development

    Summary: Following implantation, the epiblast (EPI) cells transit from the naive to primed pluripotency, accompanied by dynamic changes in X chromosome activity in females. To investigate the molecular attributes of this process, we performed single-cell RNA-seq analysis of 1,724 cells of E5.25, E5.5, E6.25, and E6.5 mouse embryos. We identified three cellular states in the EPI cells that capture the transition along the pluripotency continuum and the acquisition of primitive streak propensity. The transition of three EPI states was driven by inductive signaling activity emanating from the visceral endoderm (VE). In the EPI of female embryos, X chromosome reactivation (XCR) was initiated prior to the completion of imprinted X chromosome inactivation (XCI), and the ensuing random XCI was highly asynchronous. Moreover, imprinted paternal XCI proceeded faster in the VE than the extraembryonic ectoderm. Our study has provided a detailed molecular roadmap of the emergent lineage commitment before gastrulation and characterized X chromosome dynamics during early mouse development. Cheng et al. present a molecular roadmap at single-cell and allelic resolution that highlights the developmental process of epiblast cells transiting through pluripotency states and acquiring the primitive streak propensity ahead of gastrulation. In the epiblast of female embryos, the paternal X chromosome is reactivated before the completion of imprinted inactivation. Keywords: pluripotency transition, epiblast heterogeneity, pregastrula development, X chromosome inactivation dynamic

    Organellar Genome Assembly Methods and Comparative Analysis of Horticultural Plants

    Although organellar genomes (including chloroplast and mitochondrial genomes) are smaller than nuclear genomes in size and gene number, organellar genomes are very important for the investigation of plant evolution and molecular ecology mechanisms. Few studies have focused on the organellar genomes of horticultural plants. Approximately 1193 chloroplast genomes and 199 mitochondrial genomes of land plants are available in the National Center for Biotechnology Information (NCBI), of which only 39 are from horticultural plants. In this paper, we report an innovative and efficient method for high-quality horticultural organellar genome assembly from next-generation sequencing (NGS) data. Sequencing reads were first assembled by Newbler, Amos, and Minimus software with default parameters. The remaining gaps were then filled through BLASTN search and PCR. The complete DNA sequence was corrected based on Illumina sequencing data using BWA (Burrows–Wheeler Alignment tool) software. The advantage of this approach is that there is no need to isolate organellar DNA from total DNA during sample preparation. Using this procedure, the complete mitochondrial and chloroplast genomes of an ornamental plant, Salix suchowensis, and a fruit tree, Ziziphus jujuba, were identified. This study shows that horticultural plants have similar mitochondrial and chloroplast sequence organization to other seed plants. Most horticultural plants demonstrate a slight bias toward A+T rich features in the mitochondrial genome. In addition, a phylogenetic analysis of 39 horticultural plants based on 15 protein-coding genes showed that some mitochondrial genes are horizontally transferred from chloroplast DNA. Our study will provide an important reference for organellar genome assembly in other horticultural plants. Furthermore, phylogenetic analysis of the organellar genomes of horticultural plants could accurately clarify the unanticipated relationships among these plants

    A Deoximation Method for Deprotection of Ketones and Aldehydes Using a Graphene-Oxide-Based Co-catalysts System

    The deoximation of a wide range of ketoximes and aldoximes to their corresponding carbonyl compounds in high yields has been achieved using graphene oxide (GO) and sodium nitrite (NaNO2) as highly efficient catalysts and air as the green oxidant under mild conditions. The deprotection mechanism and the recyclability of the catalyst were examined in detailed experiments. The carboxylic acid groups on the GO were essential for high catalytic activity.

    TaWRKY68 responses to biotic stresses are revealed by the orthologous genes from major cereals

    WRKY transcription factors have been extensively characterized in the past 20 years, but in wheat, studies on WRKY genes and their function are lagging behind many other species. To explore the function of wheat WRKY genes, we identified a TaWRKY68 gene from a common wheat cultivar. It encodes a protein comprising 313 amino acids which harbors 19 conserved motifs or active sites. Gene expression patterns were determined by analyzing microarray data of TaWRKY68 in wheat and of orthologous genes from maize, rice and barley using Genevestigator. TaWRKY68 orthologs were identified and clustered using DELTA-BLAST and COBALT programs available at NCBI. The results showed that these genes, which are expressed in all tissues tested, had relatively higher levels in the roots and were up-regulated in response to biotic stresses. Bioinformatics results were confirmed by RT-PCR experiments using wheat plants infected by Agrobacterium tumefaciens and Blumeria graminis, or treated with Deoxynivalenol, a Fusarium graminearum-induced mycotoxin in wheat or barley. In summary, TaWRKY68 functions differ during plant developmental stages, and the gene might act as a hub gene in wheat responses to various biotic stresses. It was also found that including data from major cereal genes in the bioinformatics analysis gave more accurate and comprehensive predictions of wheat gene functions.

    GEsture: An Online Hand-Drawing Tool for Gene Expression Pattern Search

    Gene expression profiling data provide useful information for the investigation of biological function and process. However, identifying a specific expression pattern from extensive time series gene expression data is not an easy task. Clustering, a popular method, is often used to group genes with similar expression; however, genes with a ‘desirable’ or ‘user-defined’ pattern cannot be efficiently detected by clustering methods. To address these limitations, we developed an online tool called GEsture. Users can draw or graph a curve using a mouse instead of inputting abstract parameters of clustering methods. Taking a gene expression curve as input, GEsture searches time series datasets for genes showing similar, opposite and time-delay expression patterns. We presented three examples that illustrate the capacity of GEsture in gene hunting while following users’ requirements. GEsture also provides visualization tools (such as expression pattern figure, heat map and correlation network) to display the search results. The output may provide useful information for researchers to understand the targets, function and biological processes of the involved genes.
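
The three pattern types named in the GEsture abstract (similar, opposite, time-delay) can be illustrated with a minimal sketch. The abstract does not say how GEsture scores a match, so the use of Pearson correlation here is an assumption, and the names `pearson`, `classify_pattern`, `max_lag` and `threshold` are hypothetical, chosen only for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    norm_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    norm_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if norm_x == 0 or norm_y == 0:
        return 0.0
    return cov / (norm_x * norm_y)


def classify_pattern(query, profile, max_lag=2, threshold=0.9):
    """Label a gene profile relative to a drawn query curve.

    Assumption: a match is a correlation at or beyond `threshold`.
    Returns "similar", "opposite", "time-delay (lag k)", or None.
    """
    r = pearson(query, profile)
    if r >= threshold:
        return "similar"
    if r <= -threshold:
        return "opposite"
    # Time delay: the profile follows the query shape, shifted k steps later.
    for lag in range(1, max_lag + 1):
        if len(query) - lag < 3:  # too few overlapping points to compare
            break
        if pearson(query[:-lag], profile[lag:]) >= threshold:
            return f"time-delay (lag {lag})"
    return None


# Hypothetical six-point time series: a single expression peak.
query = [0, 1, 4, 1, 0, 0]
print(classify_pattern(query, [1, 3, 9, 3, 1, 1]))
print(classify_pattern(query, [4, 3, 0, 3, 4, 4]))
print(classify_pattern(query, [0, 0, 1, 4, 1, 0]))
```

A correlation-based score ignores absolute expression level and scale, which suits a hand-drawn curve that conveys shape rather than magnitude; a real tool may use a different similarity measure.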