2 research outputs found

    WikiGoldSK: Annotated Dataset, Baselines and Few-Shot Learning Experiments for Slovak Named Entity Recognition

    Full text link
    Named Entity Recognition (NER) is a fundamental NLP tasks with a wide range of practical applications. The performance of state-of-the-art NER methods depends on high quality manually anotated datasets which still do not exist for some languages. In this work we aim to remedy this situation in Slovak by introducing WikiGoldSK, the first sizable human labelled Slovak NER dataset. We benchmark it by evaluating state-of-the-art multilingual Pretrained Language Models and comparing it to the existing silver-standard Slovak NER dataset. We also conduct few-shot experiments and show that training on a sliver-standard dataset yields better results. To enable future work that can be based on Slovak NER, we release the dataset, code, as well as the trained models publicly under permissible licensing terms at https://github.com/NaiveNeuron/WikiGoldSK.Comment: BSNLP 2023 Workshop at EACL 202

    Passenger Occupancy Estimation in Vehicles: A Review of Current Methods and Research Challenges

    No full text
    Passenger detection and occupancy estimation are vital tasks in many fields. The existing literature emphasises that the increasing demand for such systems will continue to grow. This paper reviews the existing literature specializing in the field of transportation safety and efficiency concerning occupancy estimation in vehicles and passenger detection at public transport stations. A comparison between different approaches to passenger estimation is presented. Discussion on the advantages and disadvantages is highlighted. Hence, this paper provides an analysis of 146 papers on the current state of the field. This review paper concludes that invasive methods provide high accuracy with relatively cheap implementation, while noninvasive systems do not violate passenger privacy but lack state-of-the-art accuracy. Future work will include a systematic literature review and a comparative analysis of systems considering the existing window tinting and solar windshields heavily blocking certain parts of the electromagnetic spectrum. Moreover, future work will investigate the critical challenges of noninvasive passenger estimation in different types of vehicles: trucks, buses, or even motorcycles
    corecore