15 research outputs found

    Deep Task-specific Bottom Representation Network for Multi-Task Recommendation

    Full text link
    Neural-based multi-task learning (MTL) has gained significant improvement, and it has been successfully applied to recommendation system (RS). Recent deep MTL methods for RS (e.g. MMoE, PLE) focus on designing soft gating-based parameter-sharing networks that implicitly learn a generalized representation for each task. However, MTL methods may suffer from performance degeneration when dealing with conflicting tasks, as negative transfer effects can occur on the task-shared bottom representation. This can result in a reduced capacity for MTL methods to capture task-specific characteristics, ultimately impeding their effectiveness and hindering the ability to generalize well on all tasks. In this paper, we focus on the bottom representation learning of MTL in RS and propose the Deep Task-specific Bottom Representation Network (DTRN) to alleviate the negative transfer problem. DTRN obtains task-specific bottom representation explicitly by making each task have its own representation learning network in the bottom representation modeling stage. Specifically, it extracts the user's interests from multiple types of behavior sequences for each task through the parameter-efficient hypernetwork. To further obtain the dedicated representation for each task, DTRN refines the representation of each feature by employing a SENet-like network for each task. The two proposed modules can achieve the purpose of getting task-specific bottom representation to relieve tasks' mutual interference. Moreover, the proposed DTRN is flexible to combine with existing MTL methods. Experiments on one public dataset and one industrial dataset demonstrate the effectiveness of the proposed DTRN.Comment: CIKM'2

    Efficient Optimal Selection for Composited Advertising Creatives with Tree Structure

    Full text link
    Ad creatives are one of the prominent mediums for online e-commerce advertisements. Ad creatives with enjoyable visual appearance may increase the click-through rate (CTR) of products. Ad creatives are typically handcrafted by advertisers and then delivered to the advertising platforms for advertisement. In recent years, advertising platforms are capable of instantly compositing ad creatives with arbitrarily designated elements of each ingredient, so advertisers are only required to provide basic materials. While facilitating the advertisers, a great number of potential ad creatives can be composited, making it difficult to accurately estimate CTR for them given limited real-time feedback. To this end, we propose an Adaptive and Efficient ad creative Selection (AES) framework based on a tree structure. The tree structure on compositing ingredients enables dynamic programming for efficient ad creative selection on the basis of CTR. Due to limited feedback, the CTR estimator is usually of high variance. Exploration techniques based on Thompson sampling are widely used for reducing variances of the CTR estimator, alleviating feedback sparsity. Based on the tree structure, Thompson sampling is adapted with dynamic programming, leading to efficient exploration for potential ad creatives with the largest CTR. We finally evaluate the proposed algorithm on the synthetic dataset and the real-world dataset. The results show that our approach can outperform competing baselines in terms of convergence rate and overall CTR

    Continual Learning in Predictive Autoscaling

    Full text link
    Predictive Autoscaling is used to forecast the workloads of servers and prepare the resources in advance to ensure service level objectives (SLOs) in dynamic cloud environments. However, in practice, its prediction task often suffers from performance degradation under abnormal traffics caused by external events (such as sales promotional activities and applications re-configurations), for which a common solution is to re-train the model with data of a long historical period, but at the expense of high computational and storage costs. To better address this problem, we propose a replay-based continual learning method, i.e., Density-based Memory Selection and Hint-based Network Learning Model (DMSHM), using only a small part of the historical log to achieve accurate predictions. First, we discover the phenomenon of sample overlap when applying replay-based continual learning in prediction tasks. In order to surmount this challenge and effectively integrate new sample distribution, we propose a density-based sample selection strategy that utilizes kernel density estimation to calculate sample density as a reference to compute sample weight, and employs weight sampling to construct a new memory set. Then we implement hint-based network learning based on hint representation to optimize the parameters. Finally, we conduct experiments on public and industrial datasets to demonstrate that our proposed method outperforms state-of-the-art continual learning methods in terms of memory capacity and prediction accuracy. Furthermore, we demonstrate remarkable practicability of DMSHM in real industrial applications

    Prompt-augmented Temporal Point Process for Streaming Event Sequence

    Full text link
    Neural Temporal Point Processes (TPPs) are the prevalent paradigm for modeling continuous-time event sequences, such as user activities on the web and financial transactions. In real-world applications, event data is typically received in a \emph{streaming} manner, where the distribution of patterns may shift over time. Additionally, \emph{privacy and memory constraints} are commonly observed in practical scenarios, further compounding the challenges. Therefore, the continuous monitoring of a TPP to learn the streaming event sequence is an important yet under-explored problem. Our work paper addresses this challenge by adopting Continual Learning (CL), which makes the model capable of continuously learning a sequence of tasks without catastrophic forgetting under realistic constraints. Correspondingly, we propose a simple yet effective framework, PromptTPP\footnote{Our code is available at {\small \url{ https://github.com/yanyanSann/PromptTPP}}}, by integrating the base TPP with a continuous-time retrieval prompt pool. The prompts, small learnable parameters, are stored in a memory space and jointly optimized with the base TPP, ensuring that the model learns event streams sequentially without buffering past examples or task-specific attributes. We present a novel and realistic experimental setup for modeling event streams, where PromptTPP consistently achieves state-of-the-art performance across three real user behavior datasets.Comment: NeurIPS 2023 camera ready versio

    Glycyrrhizin Treatment Facilitates Extinction of Conditioned Fear Responses After a Single Prolonged Stress Exposure in Rats

    No full text
    Background/Aims: Impaired fear memory extinction is widely considered a key mechanism of post-traumatic stress disorder (PTSD). Recent studies have suggested that neuroinflammation after a single prolonged stress (SPS) exposure may play a critical role in the impaired fear memory extinction. Studies have shown that high mobility group box chromosomal protein 1 (HMGB-1) is critically involved in neuroinflammation. However, the role of HMGB-1 underlying the development of impairment of fear memory extinction is still not known. Methods: Thus, we examined the levels of HMGB-1 in the basolateral amygdala (BLA) following SPS using Western blot and evaluated the levels of microglia and astrocytes activation in the BLA after SPS using immunohistochemical staining. We then examined the effects of pre-SPS intra-BLA administration of glycyrrhizin, an HMGB1 inhibitor, or LPS-RS, a competitive TLR4 antagonist, on subsequent post-SPS fear extinction. Results: We found that SPS treatment prolonged the extinction of contextual fear memory after the SPS. The impairment of SPS-induced extinction of contextual fear memory was associated with increased HMGB1 and Toll-like receptor 4 (TLR4) levels in the BLA. Additionally, the impairment of SPS-induced extinction of contextual fear memory was associated with increased activation of microglia and astrocyte in the BLA. Intra-BLA administrations of glycyrrhizin (HMGB-1 inhibitor) or LPS-RS (TLR4 antagonist) can prevent the development of SPS-induced fear extinction impairment. Conclusion: Taken together, these results suggested that SPS treatment may not only produce short term effects on the HMGB1/TLR4-mediated pro-inflammation, but alter the response of microglia and astrocytes to the exposure to fear associated contextual stimuli

    Mechanism of Secondary Breakage in the Overlying Strata during Repetitious Mining of an Ultrathick Coal Seam in Design Stage

    No full text
    When designing the mining of an ultrathick coal seam, the laws governing movement in the overlying strata during mining are a fundamental issue based on which several problems are addressed, including determining the mining method and the roadway arrangement, controlling the surrounding strata, and selecting the devices. The present paper considers possible problems related to strata overlying a large mining space subjected to repeated disturbances during the mining of an ultrathick coal seam, including repeatedly broken strata and the existence or inexistence of the structure. The BM coal seam in the No. 2 coal mine of the Dajing mining area in the East Junggar coalfield is studied. Physical simulations are performed on the movements of the overlying strata during slicing mining of the ultrathick coal seam, revealing the new feature of “break-joint stability-instability-secondary breakage” in the overlying strata. Mechanical models are constructed of the secondary breakage of the overlying strata blocks under both static and impact loading, and mechanical criteria are proposed for such breakage. Based on the research findings, methods for controlling the surrounding strata during slicing mining of an ultrathick coal seam are proposed, including increasing the mining rate and designing reasonable heights for the slicing mining

    An Empirical Study of Carbon Emission Calculation in the Production and Construction Phase of A Prefabricated Office Building from Zhejiang, China

    No full text
    This study analyzes an office building located in Hangzhou, Zhejiang region, with a high assembly rate of 96.8%. Based on whole-process records and first-hand factory data, using an original method, we empirically investigate the carbon emissions associated to the assembly production and construction phase by comparing the results collected in the field with the calculation results for the simulated non-prefabricated building. The calculation results show that the production and construction stage of the prefabricated office building is characterized by a large reduction in carbon emissions, where the total measured carbon emissions of the subject building were 2265.73 tCO2e, which is 22 kgCO2e/m2 less than that under the non-prefabricated method. In the future development of China’s construction industry, taking Zhejiang Province as an example, the implementation of prefabricated office buildings with a PEC structure system can effectively reduce carbon emissions, which can help China to achieve the carbon peak as soon as possible
    corecore