418 research outputs found

    Interleukin-16

    Get PDF

    ChineseWebText: Large-scale High-quality Chinese Web Text Extracted with Effective Evaluation Model

    Full text link
    During the development of large language models (LLMs), the scale and quality of the pre-training data play a crucial role in shaping LLMs' capabilities. To accelerate the research of LLMs, several large-scale datasets, such as C4 [1], Pile [2], RefinedWeb [3] and WanJuan [4], have been released to the public. However, most of the released corpus focus mainly on English, and there is still lack of complete tool-chain for extracting clean texts from web data. Furthermore, fine-grained information of the corpus, e.g. the quality of each text, is missing. To address these challenges, we propose in this paper a new complete tool-chain EvalWeb to extract Chinese clean texts from noisy web data. First, similar to previous work, manually crafted rules are employed to discard explicit noisy texts from the raw crawled web contents. Second, a well-designed evaluation model is leveraged to assess the remaining relatively clean data, and each text is assigned a specific quality score. Finally, we can easily utilize an appropriate threshold to select the high-quality pre-training data for Chinese. Using our proposed approach, we release the largest and latest large-scale high-quality Chinese web text ChineseWebText, which consists of 1.42 TB and each text is associated with a quality score, facilitating the LLM researchers to choose the data according to the desired quality thresholds. We also release a much cleaner subset of 600 GB Chinese data with the quality exceeding 90%

    Prevalence of Mycobacterium bovis in deer in mainland China: a systematic review and meta-analysis

    Get PDF
    BackgroundDeer tuberculosis is a chronic zoonotic infectious disease, despite the existence of socio-economic and zoonotic risk factors, but at present, there has been no systematic review of deer tuberculosis prevalence in mainland China. The aim of this meta-analysis was to estimate the overall prevalence of deer TB in mainland China and to assess possible associations between potential risk factors and the prevalence of deer tuberculosis.MethodologyThis study was searched in six databases in Chinese and English, respectively (1981 to December 2023). Four authors independently reviewed the titles and abstracts of all retrieved articles to establish the inclusion exclusion criteria. Using the meta-analysis package estimated the combined effects. Cochran’s Q-statistic was used to analyze heterogeneity. Funnel plots (symmetry) and used the Egger’s test identifying publication bias. Trim-and-fill analysis methods were used for validation and sensitivity analysis. we also performed subgroup and meta-regression analyses.ResultsIn this study, we obtained 4,400 studies, 20 cross-sectional studies were screened and conducted a systematic review and meta-analysis. Results show: The overall prevalence of tuberculosis in deer in mainland China was 16.1% (95% confidence interval (CI):10.5 24.6; (Deer tuberculosis infected 5,367 out of 22,215 deer in mainland China) 5,367/22215; 1981 to 2023). The prevalence in Central China was the highest 17.5% (95% CI:14.0–21.9; 63/362), and among provinces, the prevalence in Heilongjiang was the highest at 26.5% (95% CI:13.2–53.0; 1557/4291). Elaphurus davidianus was the most commonly infected species, with a prevalence of 35.3% (95% CI:18.5–67.2; 6/17). We also assessed the association between geographic risk factors and the incidence of deer tuberculosis.ConclusionDeer tuberculosis is still present in some areas of China. Assessing the association between risk factors and the prevalence of deer tuberculosis showed that reasonable and scientific-based breeding methods, a suitable breeding environment, and rapid and accurate detection methods could effectively reduce the prevalence of deer tuberculosis. In addition, in the management and operation of the breeding base, improving the scientific feed nutrition standards and establishing comprehensive standards for disease prevention, immunization, quarantine, treatment, and disinfection according to the breeding varieties and scale, are suggested as ways to reduce the prevalence of deer tuberculosis

    Artesunate induces oncosis-like cell death in vitro and has antitumor activity against pancreatic cancer xenografts in vivo

    Get PDF
    Pancreatic cancer is highly resistant to the currently available chemotherapeutic agents. Less than 5% of patients diagnosed with this disease could survive beyond 5 years. Thus, there is an urgent need for the development of novel, efficacious drugs that can treat pancreatic cancer. Herein we report the identification of artesunate (ART), a derivative of artemisinin, as a potent and selective antitumor agent against human pancreatic cancer cells in vitro and in vivo. ART exhibits selective cytotoxic activity against Panc-1, BxPC-3 and CFPAC-1 pancreatic cancer cells with IC50 values that are 2.3- to 24-fold less than that of the normal human hepatic cells (HL-7702). The pan caspase inhibitor zVAD-fmk did not inhibit the cytotoxic activity of ART. Electron microscopy of ART-treated cells revealed severe cytoplasmic swelling and vacuolization, swollen and internally disorganized mitochondria, dilation (but not fragmentation) of the nuclei without chromatin condensation, and cell lysis, yielding a morphotype that is typical of oncosis. The ART-treated cells exhibited a loss of mitochondrial membrane potential (ΔΨm) and ART-induced cell death was inhibited in the presence of the reactive oxygen species (ROS) scavenger N-acetyl-cysteine (NAC). Importantly, ART produced a dose-dependent tumor regression in an in vivo pancreatic cancer xenografts model. The in vivo antitumor activity of ART was similar to that of gemcitabine. Taken together, our study suggests that ART exhibits antitumor activity against human pancreatic cancer via a novel form of oncosis-like cell death, and that ART should be considered a potential therapeutic candidate for treating pancreatic cancer

    High Diversity of Tick-associated Microbiota from Five Tick Species in Yunnan, China

    Get PDF
    Ticks are obligate blood-sucking vectors for multiple zoonotic diseases. In this study, tick samples were collected from Yunnan Province, China, which is well-known as the “Global Biodiversity Hotspot” in the world. This study aimed to clarify the microbial populations, including pathogens, associated with ticks and to identify the diversity of tick-borne microbiota in this region. The 16S rRNA full-length sequencing from pooled tick DNA samples and PCR amplification of pathogenic genera from individual samples were performed to understand tick-associated microbiota in this region. A total of 191 adult ticks of 5 tick species were included and revealed 11 phyla and 126 genera bacteria, including pathogenic Anaplasma , Ehrlichia , Candidatus Neoehrlichia, Rickettsia , Borrelia , and Babesia . Further identification suggested that Rickettsia sp. YN01 was a variant strain of Rickettsia spp. IG-1, but Rickettsia sp. YN02 and Rickettsia sp. YN03, were potentially two new SFGR species. This study revealed the complexity of ecological interactions between host and microbe and provided insight for the biological control of ticks. A high microbial diversity in ticks from Yunnan was identified, and more investigation should be undertaken to elucidate the pathogenicity in the area

    Genomewide association study of leprosy.

    Get PDF
    BACKGROUND: The narrow host range of Mycobacterium leprae and the fact that it is refractory to growth in culture has limited research on and the biologic understanding of leprosy. Host genetic factors are thought to influence susceptibility to infection as well as disease progression. METHODS: We performed a two-stage genomewide association study by genotyping 706 patients and 1225 controls using the Human610-Quad BeadChip (Illumina). We then tested three independent replication sets for an association between the presence of leprosy and 93 single-nucleotide polymorphisms (SNPs) that were most strongly associated with the disease in the genomewide association study. Together, these replication sets comprised 3254 patients and 5955 controls. We also carried out tests of heterogeneity of the associations (or lack thereof) between these 93 SNPs and disease, stratified according to clinical subtype (multibacillary vs. paucibacillary). RESULTS: We observed a significant association (P<1.00x10(-10)) between SNPs in the genes CCDC122, C13orf31, NOD2, TNFSF15, HLA-DR, and RIPK2 and a trend toward an association (P=5.10x10(-5)) with a SNP in LRRK2. The associations between the SNPs in C13orf31, LRRK2, NOD2, and RIPK2 and multibacillary leprosy were stronger than the associations between these SNPs and paucibacillary leprosy. CONCLUSIONS: Variants of genes in the NOD2-mediated signaling pathway (which regulates the innate immune response) are associated with susceptibility to infection with M. leprae

    Synthesis and applications of porous non-silica metal oxide submicrospheres

    Get PDF
    © 2016 Royal Society of Chemistry. Nowadays the development of submicroscale products of specific size and morphology that feature a high surface area to volume ratio, well-developed and accessible porosity for adsorbates and reactants, and are non-toxic, biocompatible, thermally stable and suitable as synergetic supports for precious metal catalysts is of great importance for many advanced applications. Complex porous non-silica metal oxide submicrospheres constitute an important class of materials that fulfill all these qualities and in addition, they are relatively easy to synthesize. This review presents a comprehensive appraisal of the methods used for the synthesis of a wide range of porous non-silica metal oxide particles of spherical morphology such as porous solid spheres, core-shell and yolk-shell particles as well as single-shell and multi-shell particles. In particular, hydrothermal and low temperature solution precipitation methods, which both include various structure developing strategies such as hard templating, soft templating, hydrolysis, or those taking advantage of Ostwald ripening and the Kirkendall effect, are reviewed. In addition, a critical assessment of the effects of different experimental parameters such as reaction time, reaction temperature, calcination, pH and the type of reactants and solvents on the structure of the final products is presented. Finally, the practical usefulness of complex porous non-silica metal oxide submicrospheres in sensing, catalysis, biomedical, environmental and energy-related applications is presented

    Graphene-Based Nanocomposites for Energy Storage

    Get PDF
    Since the first report of using micromechanical cleavage method to produce graphene sheets in 2004, graphene/graphene-based nanocomposites have attracted wide attention both for fundamental aspects as well as applications in advanced energy storage and conversion systems. In comparison to other materials, graphene-based nanostructured materials have unique 2D structure, high electronic mobility, exceptional electronic and thermal conductivities, excellent optical transmittance, good mechanical strength, and ultrahigh surface area. Therefore, they are considered as attractive materials for hydrogen (H2) storage and high-performance electrochemical energy storage devices, such as supercapacitors, rechargeable lithium (Li)-ion batteries, Li–sulfur batteries, Li–air batteries, sodium (Na)-ion batteries, Na–air batteries, zinc (Zn)–air batteries, and vanadium redox flow batteries (VRFB), etc., as they can improve the efficiency, capacity, gravimetric energy/power densities, and cycle life of these energy storage devices. In this article, recent progress reported on the synthesis and fabrication of graphene nanocomposite materials for applications in these aforementioned various energy storage systems is reviewed. Importantly, the prospects and future challenges in both scalable manufacturing and more energy storage-related applications are discussed
    corecore