39 research outputs found

    A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge

    Full text link
    A vector database is used to store high-dimensional data that cannot be characterized by traditional DBMS. Although there are not many articles describing existing or introducing new vector database architectures, the approximate nearest neighbor search problem behind vector databases has been studied for a long time, and considerable related algorithmic articles can be found in the literature. This article attempts to comprehensively review relevant algorithms to provide a general understanding of this booming research area. The basis of our framework categorises these studies by the approach of solving ANNS problem, respectively hash-based, tree-based, graph-based and quantization-based approaches. Then we present an overview of existing challenges for vector databases. Lastly, we sketch how vector databases can be combined with large language models and provide new possibilities

    The Clinical Relevance of Serum NDKA, NMDA, PARK7, and UFDP Levels with Phlegm-Heat Syndrome and Treatment Efficacy Evaluation of Traditional Chinese Medicine in Acute Ischemic Stroke

    Get PDF
    According to the methods of Patient-Reported Outcome (PRO) based on the patient reports internationally and referring to U.S. Food and Drug Administration (FDA) guide, some scholars developed this PRO of stroke which is consistent with China’s national conditions, and using it the feel of stroke patients was introduced into the clinical efficacy evaluation system of stoke. “Ischemic Stroke TCM Syndrome Factor Diagnostic Scale (ISTSFDS)” and “Ischemic Stroke TCM Syndrome Factor Evaluation Scale (ISTSFES)” were by “Major State Basic Research Development Program of China (973 Program) (number 2003CB517102).” ISTSFDS can help to classify and diagnose the CM syndrome reasonably and objectively with application of syndrome factors. Six syndrome factors, internal-wind syndrome, internal-fire syndrome, phlegm-dampness syndrome, blood-stasis syndrome, qi-deficiency syndrome, and yin-deficiency syndrome, were included in ISTSFDS and ISTSFES. TCM syndrome factor was considered to be present if the score was greater than or equal to 10 according to ISTSFDS. In our study, patients with phlegm-heat syndrome were recruited, who met the diagnosis of both “phlegm-dampness” and “internal-fire” according to ISTSFDS. ISTSFES was used to assess the syndrome severity; in our study it was used to assess the severity of phlegm-heat syndrome (phlegm-heat syndrome scores = phlegm-dampness syndrome scores + internal-fire syndrome scores)

    An adaptive forecasting method for the aggregated load with pattern matching

    Get PDF
    Electrical load forecasting plays a vital role in the operation of power system. In this paper, a novel adaptive short-term load forecasting method for the aggregated load is built. The proposed method consists of two stages: load forecast model preparation stage and adaptive load forecast model selection stage. In the first stage, based on historical load data of all consumers, the typical monthly load patterns are firstly identified in an optimal fashion with the aid of the cosine similarity. Then, for each identified monthly load pattern, a stacking ensemble learning method is proposed to train the load forecasting model. In the second stage, according to the similarity between individual load data of the latest month and the identified monthly load pattern, all the consumers are firstly classified into different groups where each group corresponds to a particular load pattern. Then, for each group, the corresponding trained load forecasting model is employed for short-term load forecast and the final forecast of the aggregated load is calculated as a simple aggregation of the produced load forecast for each group of consumers. Case studies conducted on open dataset show that, compared with the single forecasting model, the proposed adaptive load forecasting method can effectively improve the load forecasting accuracy

    Development of Omni InDel and supporting database for maize

    Get PDF
    Insertions–deletions (InDels) are the second most abundant molecular marker in the genome and have been widely used in molecular biology research along with simple sequence repeats (SSR) and single-nucleotide polymorphisms (SNP). However, InDel variant mining and marker development usually focuses on a single type of dimorphic InDel, which does not reflect the overall InDel diversity across the genome. Here, we developed Omni InDels for maize, soybean, and rice based on sequencing data and genome assembly that included InDel variants with base lengths from 1 bp to several Mb, and we conducted a detailed classification of Omni InDels. Moreover, we screened a set of InDels that are easily detected and typed (Perfect InDels) from the Omni InDels, verified the site authenticity using 3,587 germplasm resources from 11 groups, and analyzed the germplasm resources. Furthermore, we developed a Multi-InDel set based on the Omni InDels; each Multi-InDel contains multiple InDels, which greatly increases site polymorphism, they can be detected in multiple platforms such as fluorescent capillary electrophoresis and sequencing. Finally, we developed an online database website to make Omni InDels easy to use and share and developed a visual browsing function called “Variant viewer” for all Omni InDel sites to better display the variant distribution

    Policies of improving developers’ willingness to implement prefabricated building: a case study from China

    No full text
    Multiple policies have been formulated to promote the development of prefabricated building (PB). However, ineffective policies increase the financial burden of the governments and hinder PB development. This study aims to identify effective policies and develop a practical policy framework to encourage developers to implement PB. Text analysis is to identify the policies related to PBs from the numerous samples. A survey is to verify the effective policies and explore their effects on the developers’ willingness to implement PBs. The findings suggest that the current policy system of PBs is complete but uneven, focusing on environmental policies. The most cost-effective tools are land limitations, tax incentives, and financial support. Besides, policies are interrelated in affecting developers’ willingness to implement PBs. It provides a reference for policy evaluation in application scenarios, which expands the literature research on the policy. Meanwhile, it provides a guide for policymakers to evaluate the effectiveness of policies and formulate practical policy frameworks to promote PB development
    corecore