51 research outputs found

    Multimodal Co-learning: A Domain Adaptation Method for Building Extraction from Optical Remote Sensing Imagery

    Get PDF
    In this paper, we aim to improve the transfer learning ability of 2D convolutional neural networks (CNNs) for building extraction from optical imagery and digital surface models (DSMs) using a 2D-3D co-learning framework. Unlabeled target domain data are incorporated as unlabeled training data pairs to optimize the training procedure. Our framework adaptively transfers unsupervised mutual information between the 2D and 3D modality (i.e., DSM-derived point clouds) during the training phase via a soft connection, utilizing a predefined loss function. Experimental results from a spaceborne-to-airborne cross-domain case demonstrate that the framework we present can quantitatively and qualitatively improve the testing results for building extraction from single-modality optical images

    A Co-learning Method to Utilize Optical Images and Photogrammetric Point Clouds for Building Extraction

    Get PDF
    Although deep learning techniques have brought unprecedented accuracy to automatic building extraction, several main issues still constitute an obstacle to effective and practical applications. The industry is eager for higher accuracy and more flexible data usage. In this paper, we present a co-learning framework applicable to building extraction from optical images and photogrammetric point clouds, which can take the advantage of 2D/3D multimodality data. Instead of direct information fusion, our co-learning framework adaptively exploits knowledge from another modality during the training phase with a soft connection, via a predefined loss function. Compared to conventional data fusion, this method is more flexible, as it is not mandatory to provide multimodality data in the test phase. We propose two types of co-learning: a standard version and an enhanced version, depending on whether unlabeled training data are employed. Experimental results from two data sets show that the methods we present can enhance the performance of both image and point cloud networks in few-shot tasks, as well as image networks when applying fully labeled training data sets

    Motif-aware temporal GCN for fraud detection in signed cryptocurrency trust networks

    Full text link
    Graph convolutional networks (GCNs) is a class of artificial neural networks for processing data that can be represented as graphs. Since financial transactions can naturally be constructed as graphs, GCNs are widely applied in the financial industry, especially for financial fraud detection. In this paper, we focus on fraud detection on cryptocurrency truct networks. In the literature, most works focus on static networks. Whereas in this study, we consider the evolving nature of cryptocurrency networks, and use local structural as well as the balance theory to guide the training process. More specifically, we compute motif matrices to capture the local topological information, then use them in the GCN aggregation process. The generated embedding at each snapshot is a weighted average of embeddings within a time window, where the weights are learnable parameters. Since the trust networks is signed on each edge, balance theory is used to guide the training process. Experimental results on bitcoin-alpha and bitcoin-otc datasets show that the proposed model outperforms those in the literature

    Exploring Cross-city Semantic Segmentation of ALS Point Clouds

    Get PDF
    Deep learning models achieve excellent semantic segmentation results for airborne laser scanning (ALS) point clouds, if sufficient training data are provided. Increasing amounts of annotated data are becoming publicly available thanks to contributors from all over the world. However, models trained on a specific dataset typically exhibit poor performance on other datasets. I.e., there are significant domain shifts, as data captured in different environments or by distinct sensors have different distributions. In this work, we study this domain shift and potential strategies to mitigate it, using two popular ALS datasets: the ISPRS Vaihingen benchmark from Germany and the LASDU benchmark from China. We compare different training strategies for cross-city ALS point cloud semantic segmentation. In our experiments, we analyse three factors that may lead to domain shift and affect the learning: point cloud density, LiDAR intensity, and the role of data augmentation. Moreover, we evaluate a well-known standard method of domain adaptation, deep CORAL (Sun and Saenko, 2016). In our experiments, adapting the point cloud density and appropriate data augmentation both help to reduce the domain gap and improve segmentation accuracy. On the contrary, intensity features can bring an improvement within a dataset, but deteriorate the generalisation across datasets. Deep CORAL does not further improve the accuracy over the simple adaptation of density and data augmentation, although it can mitigate the impact of improperly chosen point density, intensity features, and further dataset biases like lack of diversity

    Natural Language Interfaces for Tabular Data Querying and Visualization: A Survey

    Full text link
    The emergence of natural language processing has revolutionized the way users interact with tabular data, enabling a shift from traditional query languages and manual plotting to more intuitive, language-based interfaces. The rise of large language models (LLMs) such as ChatGPT and its successors has further advanced this field, opening new avenues for natural language processing techniques. This survey presents a comprehensive overview of natural language interfaces for tabular data querying and visualization, which allow users to interact with data using natural language queries. We introduce the fundamental concepts and techniques underlying these interfaces with a particular emphasis on semantic parsing, the key technology facilitating the translation from natural language to SQL queries or data visualization commands. We then delve into the recent advancements in Text-to-SQL and Text-to-Vis problems from the perspectives of datasets, methodologies, metrics, and system designs. This includes a deep dive into the influence of LLMs, highlighting their strengths, limitations, and potential for future improvements. Through this survey, we aim to provide a roadmap for researchers and practitioners interested in developing and applying natural language interfaces for data interaction in the era of large language models.Comment: 20 pages, 4 figures, 5 tables. Submitted to IEEE TKD

    Interactions between depositional regime and climate proxies in the northern South China Sea since the Last Glacial Maximum

    Get PDF
    Sedimentary deposits from the northern South China Sea (SCS) can provide important constraints on past changes in ocean currents and the East Asian summer monsoon (EASM) in this region. However, the interpretation of such records spanning the last deglaciation is complicated because sea-level change may also have influenced the depositional processes and patterns. Here, we present new records of grain size, clay mineralogy, and magnetic mineralogy spanning the past 24 kyr from both shallow and deep-water sediment cores in the northern SCS. Our multi-proxy comparison among multiple cores helps constrain the influence of sea-level change, providing confidence in interpreting the regional climate-forced signals. After accounting for the influence of sea-level change, we find that these multi-proxy records reflect a combination of changes in (a) the strength of the North Pacific Intermediate Water inflow, (b) the EASM strength, and (c) the Kuroshio Current extent. Overall, this study provides new insights into the roles of varying terrestrial weathering and oceanographic processes in controlling the depositional record on the northern SCS margin in response to climate and sea-level fluctuations

    Traffic Optimization for Signalized Corridors (TOSCo) Phase 2 Project

    Get PDF
    DTFH6114H00002This report provides a roadmap to details contained in four TOSCo Phase 2 detailed reports that focus on specific aspects of the four key technical objectives undertaken in the TOSCo Phase 2 Project. The four key technical objectives that the project team focused their efforts on in the project were: 1. Implement TOSCo vehicle algorithm in vehicles 2. Implement TOSCo infrastructure algorithm in infrastructure components 3. Verify and refine TOSCo system in a closed course setting then exporting it onto an actual corridor 4. Assess TOSCo functional safety and performance using real-world observations and actual on-road dat
    corecore