2 research outputs found
WikiGoldSK: Annotated Dataset, Baselines and Few-Shot Learning Experiments for Slovak Named Entity Recognition
Named Entity Recognition (NER) is a fundamental NLP tasks with a wide range
of practical applications. The performance of state-of-the-art NER methods
depends on high quality manually anotated datasets which still do not exist for
some languages. In this work we aim to remedy this situation in Slovak by
introducing WikiGoldSK, the first sizable human labelled Slovak NER dataset. We
benchmark it by evaluating state-of-the-art multilingual Pretrained Language
Models and comparing it to the existing silver-standard Slovak NER dataset. We
also conduct few-shot experiments and show that training on a sliver-standard
dataset yields better results. To enable future work that can be based on
Slovak NER, we release the dataset, code, as well as the trained models
publicly under permissible licensing terms at
https://github.com/NaiveNeuron/WikiGoldSK.Comment: BSNLP 2023 Workshop at EACL 202
Passenger Occupancy Estimation in Vehicles: A Review of Current Methods and Research Challenges
Passenger detection and occupancy estimation are vital tasks in many fields. The existing literature emphasises that the increasing demand for such systems will continue to grow. This paper reviews the existing literature specializing in the field of transportation safety and efficiency concerning occupancy estimation in vehicles and passenger detection at public transport stations. A comparison between different approaches to passenger estimation is presented. Discussion on the advantages and disadvantages is highlighted. Hence, this paper provides an analysis of 146 papers on the current state of the field. This review paper concludes that invasive methods provide high accuracy with relatively cheap implementation, while noninvasive systems do not violate passenger privacy but lack state-of-the-art accuracy. Future work will include a systematic literature review and a comparative analysis of systems considering the existing window tinting and solar windshields heavily blocking certain parts of the electromagnetic spectrum. Moreover, future work will investigate the critical challenges of noninvasive passenger estimation in different types of vehicles: trucks, buses, or even motorcycles