19 research outputs found
A suitable method for alpine wetland delineation: An example for the headwater area of the yellow river, Tibetan Plateau
Alpine wetlands are one of the most important ecosystems in the Three Rivers Source Area, China, which plays an important role in regulating the regional hydrological cycle and carbon cycle. Accordingly, Wetland area and its distribution are of great significance for wetland management and scientific research. In our study, a new wetland classification model which based on geomorphological types and combine object-oriented and decision tree classification model (ODTC), and used a new wetland classification system to accurately extract the wetland distributed in the Headwater Area of the Yellow River (HAYR) of the Qinghai-Tibet Plateau (QTP), China. The object-oriented method was first used to segment the image into several areas according to similarity in Pixels and Textures, and then the wetland was extracted through a decision tree constructed based on geomorphological types. The wetland extracted by the model was compared with that by other seven commonly methods, such as support vector machine (SVM) and random forest (RF), and it proved the accuracy was improved by 10%–20%. The overall classification accuracy rate was 98.9%. According to our results, the HAYR’s wetland area is 3142.3 km2, accounting for 16.1% of the study area. Marsh wetlands and flood wetlands accounted for 37.7% and 16.7% respectively. A three-dimensional map of the area showed that alpine wetlands in the research region are distributed around lakes, piedmont groundwater overflow belts, and inter-mountain catchment basin. This phenomenon demonstrates that hydrogeological circumstances influence alpine wetlands’ genesis and evolution. This work provides a new approach to investigating alpine wetlands
Named Entity Recognition in Power Marketing Domain Based on Whole Word Masking and Dual Feature Extraction
With the aim of solving the current problems of low utilization of entity features, multiple meanings of a word, and poor recognition of specialized terms in the Chinese power marketing domain named entity recognition (PMDNER), this study proposes a Chinese power marketing named entity recognition method based on whole word masking and joint extraction of dual features. Firstly, word vectorization of the electricity text data is performed using the RoBERTa pre-training model; then, it is fed into the constructed dual feature extraction neural network (DFENN) to acquire the local and global features of text in a parallel manner and fuse them. The output of the RoBERTa layer is used as the auxiliary classification layer, the output of the DFENN layer is used as the master classification layer, and the output of the two layers is dynamically combined through the attention mechanism to weight the outputs of the two layers so as to fuse new features, which are input into the conditional random field (CRF) layer to obtain the most reasonable label sequence. A focal loss function is used in the training process to alleviate the problem of uneven sample distribution. The experimental results show that the method achieved an F1 value of 88.58% on the constructed named entity recognition dataset in the power marketing domain, which is a significant improvement in performance compared with the existing methods
Named Entity Recognition for Few-Shot Power Dispatch Based on Multi-Task
In view of the fact that entity nested and professional terms are difficult to identify in the field of power dispatch, a multi-task-based few-shot named entity recognition model (FSPD-NER) for power dispatch is proposed. The model consists of four modules: feature enhancement, seed, expansion, and implication. Firstly, the masking strategy of the encoder is improved by adopting whole-word masking, using a RoBERTa (Robustly Optimized BERT Pretraining Approach) encoder as the embedding layer to obtain the text feature representation, and an IDCNN (Iterated Dilated CNN) module to enhance the feature. Then the text is cut into one Chinese character and two Chinese characters as a seed set, the score for each seed is calculated, and if the score is greater than the threshold value ω, they are passed to the expansion module as candidate seeds; next, the candidate seeds need to be expanded left and right according to offset γ to obtain the candidate entities; finally, to construct text implication pairs, the input text is used as a premise sentence, the candidate entity is connected with predefined label templates as hypothesis sentences, and the implication pairs are passed to the RoBERTa encoder for the classification task. The focus loss function is used to alleviate label imbalance during training. The experimental results of the model on the power dispatch dataset show that the precision, recall, and F1 scores of the recognition results in 20-shot samples are 63.39%, 61.97%, and 62.67%, respectively, which is a significant performance improvement compared to existing methods
Named Entity Identification in the Power Dispatch Domain Based on RoBERTa-Attention-FL Model
Named entity identification is an important step in building a knowledge graph of the grid domain, which contains a certain number of nested entities. To address the issue of nested entities in the Chinese power dispatching domain’s named entity recognition, we propose a RoBERTa-Attention-FL model. This model effectively recognizes nested entities using the span representation annotation method. We extract the output values from RoBERTa’s middle 4–10 layers, obtain syntactic information from the Transformer Encoder layers via the multi-head self-attention mechanism, and integrate it with deep semantic information output from RoBERTa’s last layer. During training, we use Focal Loss to mitigate the sample imbalance problem. To evaluate the model’s performance, we construct named entity recognition datasets for flat and nested entities in the power dispatching domain annotated with actual power operation data, and conduct experiments. The results indicate that compared to the baseline model, the RoBERTa-Attention-FL model significantly improves recognition performance, increasing the F1-score by 4.28% to 90.35%, with an accuracy rate of 92.53% and a recall rate of 88.12%
Fast Sleep Stage Classification Using Cascaded Support Vector Machines with Single-Channel EEG Signals
Long-term sleep stage monitoring is very important for the diagnosis and treatment of insomnia. With the development of wearable electroencephalogram (EEG) devices, we developed a fast and accurate sleep stage classification method in this study with single-channel EEG signals for practical applications. The original sleep recordings were collected from the Sleep-EDF database. The wavelet threshold denoising (WTD) method and wavelet packet transformation (WPT) method were applied as signal preprocessing to extract six kinds of characteristic waves. With a comprehensive feature system including time, frequency, and nonlinear dynamics, we obtained the sleep stage classification results with different Support Vector Machine (SVM) models. We proposed a novel classification method based on cascaded SVM models with various features extracted from denoised EEG signals. To enhance the accuracy and generalization performance of this method, nonlinear dynamics features were taken into consideration. With nonlinear dynamics features included, the average classification accuracy was up to 88.11% using this method. In addition, with cascaded SVM models, the classification accuracy of the non-rapid eye movement sleep stage 1 (N1) was enhanced from 41.5% to 55.65% compared with the single SVM model, and the overall classification time for each epoch was less than 1.7 s. Moreover, we demonstrated that it was possible to apply this method for long-term sleep stage monitor applications
Numerical Investigations of Static and Dynamic Characteristics of a Novel Staggered Labyrinth Seal with Semi-Elliptical Structure
In order to optimize sealing performance, a novel labyrinth seal with semi-elliptical teeth (SET) structure is proposed in this paper, which includes semi-elliptical teeth and a series of cavities. The simulation results calculated by the numerical methods are compared with the experimental and theoretical results, and static and dynamic characteristics of the novel SET structure are further investigated. The numerical simulations of labyrinth seals with the SET structure demonstrate high accuracy and reliability, with a maximum relative error of less than 6% as compared to experimental results, underscoring the validity of the model. Notably, leakage rates are directly influenced by pressure drop and axial offset, with optimal sealing achieved at zero axial displacement. The direct damping coefficient increases as the pressure drop increases while the other dynamic coefficients decrease. Additionally, the stability results show that the novel SET structure exhibits higher stability for positive axial offsets. The novel model and corresponding results can provide a meaningful reference for the study of sealing structure and coupled vibration in the field of fluid machinery
Increasing annual streamflow and groundwater storage in response to climate warming in the Yangtze River source region
Climate warming has been driving hydrological changes across the globe, especially in high latitude and altitude regions. Long-term (1962–2012) streamflow records and permafrost data in the Yangtze River source region were selected to analyze streamflow variations and groundwater storage in response to climate warming. Results of Mann–Kendall test and Morlet wavelet analysis show that the anomalies of both annual streamflow and winter baseflow are near the year 2010, and their main period scales are 37 years and 34 years, respectively. The annual streamflow and the annual baseflow increased significantly, as assessed by the recursive digital filtering baseflow separation. Results of Pearson correlation coefficient indicate that the rising air temperature is the primary cause for the increased streamflow instead of precipitation and evaporation. By using the top temperature of permafrost model, the total permafrost area has decreased by 8200 km ^2 during the past 50 years, which causes groundwater storage to increase by about 1.62 km ^3 per year due to climate warming. More space has been made available to store the increasing meltwater during the permafrost thawing. Permafrost thawing and increasing temperature are the direct and indirect causes of the increasing groundwater storage. The results of the cumulative anomaly method and Pearson correlation coefficients show that permafrost thawing has a greater impact than increasing temperature on the increase of groundwater storage. Permafrost thawing due to climate warming show compound effects on groundwater storage–discharge mechanism, and significantly affects the mechanisms of streamflow generation and variation
E3W—A Combined Model Based on GreedySoup Weighting Strategy for Chinese Agricultural News Classification
With the continuous development of the internet and big data, modernization and informatization are rapidly being realized in the agricultural field. In this line, the volume of agricultural news is also increasing. This explosion of agricultural news has made accurate access to agricultural news difficult, and the spread of news about some agricultural technologies has slowed down, resulting in certain hindrance to the development of agriculture. To address this problem, we apply NLP to agricultural news texts to classify the agricultural news, in order to ultimately improve the efficiency of agricultural news dissemination. We propose a classification model based on ERNIE + DPCNN, ERNIE, EGC, and Word2Vec + TextCNN as sub-models for Chinese short-agriculture text classification (E3W), utilizing the GreedySoup weighting strategy and multi-model combination; specifically, E3W consists of four sub-models, the output of which is processed using the GreedySoup weighting strategy. In the E3W model, we divide the classification process into two steps: in the first step, the text is passed through the four independent sub-models to obtain an initial classification result given by each sub-model; in the second step, the model considers the relationship between the initial classification result and the sub-models, and assigns weights to this initial classification result. The final category with the highest weight is used as the output of E3W. To fully evaluate the effectiveness of the E3W model, the accuracy, precision, recall, and F1-score are used as evaluation metrics in this paper. We conduct multiple sets of comparative experiments on a self-constructed agricultural data set, comparing E3W and its sub-models, as well as performing ablation experiments. The results demonstrate that the E3W model can improve the average accuracy by 1.02%, the average precision by 1.62%, the average recall by 1.21%, and the average F1-score by 1.02%. Overall, E3W can achieve state-of-the-art performance in Chinese agricultural news classification
Identifying Alpine Lakes in the Eastern Himalayas Using Deep Learning
Alpine lakes, which include glacial and nonglacial lakes, are widely distributed in high mountain areas and are sensitive to climate and environmental changes. Remote sensing is an effective tool for identifying alpine lakes over large regions, but in the case of small lakes, the complex terrain and extreme weather make their accurate identification extremely challenging. This paper presents an automated method for alpine lake identification developed by leveraging deep learning algorithms and multi-source high-resolution satellite data. The method is able to detect the outlines and types of alpine lakes from high-resolution optical and Synthetic Aperture Radar (SAR) satellite data. In this study, a total of 4584 alpine lakes (including 2795 glacial lakes) were identified in the Eastern Himalayas from Sentinel-1 and Sentinel-2 data acquired during 2016–2020. The average area of the lakes was 0.038 km2, and the average elevation was 4974 m. High accuracy was reported for the dataset for both segmentation (mean Intersection Over Union (MIoU) > 72%) and classification (Overall Accuracy, User’s and Producer’s Accuracies, and F1-Score are all higher than 85%). A higher accuracy was found for the combination of optical and SAR data than relying on single-sourced data, for which the MIoU increased by at least 12%, suggesting that the combination of optical and SAR data is critical for improving the identification of alpine lakes. The deep learning-based method demonstrated a significant improvement over traditional spectral extraction methods
Multi-objective integrated optimization of geothermal heating system with energy storage using digital twin technology
Heat energy storage technology plays a significant role in energy systems, and the various technological solutions brought about by digitalization are especially valuable in the field of energy storage. This article proposes an innovative model based on digital twin technology to solve the supply–demand mismatch problem in geothermal heating systems. This model achieves multi-objective optimization of comprehensive cost, geothermal energy utilization rate, and carbon emission by constructing a heat storage geothermal heating system. Digital twin technology integrates data and information models of public buildings and facilitates their sharing and transmission throughout the entire lifecycle of the geothermal heating system. Initially, the proposed method employs a machine learning-based approach to accurately predict heating demand. Subsequently, the operation of the heat storage water tank and heat pump units is optimized to resolve difficulties in matching energy supply and demand. Finally, the method takes full advantage of time-of-use electricity pricing policies to reduce costs. The data utilized were collected from an office building in China over a period of six months. Experimental results demonstrate that: (1) In terms of predicting heating demand, the improved neural network proposed in this study achieved a prediction accuracy of 98%, which is a 10% improvement over comparative algorithms. Additionally, the experimental comparison of four types of errors showed that the machine learning method proposed had smaller errors across the board. (2) The method realized collaborative multi-objective optimization, and in five scenarios, the comprehensive performance index increased by up to 38.03% compared to the benchmark system. This indicates that intelligent technology is an effective means of enhancing the energy sustainability of geothermal heating systems, and the use of geothermal energy as a clean energy source effectively addresses issues related to the storage, utilization, management, and energy conservation of buildings