Search CORE

686 research outputs found

Unconstrained Scene Text and Video Text Recognition for Arabic Script

Author: Jain Mohit
Jawahar C. V.
Mathew Minesh
Publication venue
Publication date: 07/11/2017
Field of study

Building robust recognizers for Arabic has always been challenging. We demonstrate the effectiveness of an end-to-end trainable CNN-RNN hybrid architecture in recognizing Arabic text in videos and natural scenes. We outperform previous state-of-the-art on two publicly available video text datasets - ALIF and ACTIV. For the scene text recognition task, we introduce a new Arabic scene text dataset and establish baseline results. For scripts like Arabic, a major challenge in developing robust recognizers is the lack of large quantity of annotated data. We overcome this by synthesising millions of Arabic text images from a large vocabulary of Arabic words and phrases. Our implementation is built on top of the model introduced here [37] which is proven quite effective for English scene text recognition. The model follows a segmentation-free, sequence to sequence transcription approach. The network transcribes a sequence of convolutional features from the input image to a sequence of target labels. This does away with the need for segmenting input image into constituent characters/glyphs, which is often difficult for Arabic script. Further, the ability of RNNs to model contextual dependencies yields superior recognition results.Comment: 5 page

arXiv.org e-Print Archive

Crossref

Hybrid Approach for Food Recognition Using Various Filters

Author: jadon swati
SHARMA ANKUSH
SHARMA POOJA
Publication venue: 'Revista Mexicana de Biodiversidad'
Publication date: 05/01/2022
Field of study

Food image recognition system has various applications now a day. In this paper we have used machine learning supervised approach and Support Vector Machine to classify different food images. SVM has being classified to detect and recognize food images with least modification. By applying various filters like texture filter, segmentation method, clustering and SVM approach we have achieved more accuracy then other machine learning approaches with manually extract features. Sustenance is an indivisible piece of people groups lives. we tend to apply an convolution neural network(CNN) to the undertakings of analyst work and perceiving sustenance pictures. Be clarification for the wide decent variety of styles of nourishment, picture acknowledgment of sustenance things is typically unpleasantly difficulties. Nevertheless, profound learning has been demonstrated starting late to be a genuinely extreme picture acknowledgment framework, and CNN could be a dynamic approach to manage profound learning. CNN showed on a very basic level higher precision than did old-fashioned help vector-machine-based courses with carefully assembled decisions. For sustenance picture disclosure, CNN likewise demonstrated fundamentally count higher precision than a standard technique. Generally higher precision than standard techniques.Keywords: CNN, texture filter, k-mean clustering, segmentatio

International Journal of Advanced Computer Technology

Recommended from our members

Wearables, smartphones, and artificial intelligence for digital phenotyping and health

Author: Clifton EAD
Mascolo C
Perez-Pozuelo I
Spathis D
Publication venue: Digital Health
Publication date: 01/01/2021
Field of study

Ubiquitous progress in wearable sensing and mobile computing technologies, alongside growing diversity in sensor modalities, has created new pathways for the collection of health and well-being data outside of laboratory settings, in a longitudinal fashion. Wearable and mobile devices have the potential to provide low-cost, objective measures of physical activity, clinically relevant data for patient assessment, and scalable behavior monitoring in large populations. These data can be used in both interventional and observational studies to derive insights regarding the links between behavior, health. and disease, as well as to advance the personalization and effectiveness of commercial wellness applications. Today, over 400,000 participants have had their behavior tracked prospectively using accelerometers for epidemiological studies across the globe. Traditionally, epidemiologists and clinicians have relied upon self-report measures of physical activity and sleep which, while valuable in the absence of alternatives, are subject to bias and often provide partial, incomplete information Physical behavior data extracted from wearable devices are being used to derive sensor-assessed, objective measures of physical behaviors, overcoming the limitations of self-report with the aim of relating these to clinical endpoints and eventually applying the findings to preventive and predictive medicine. Moreover, the application of artificial intelligence (AI), sensor fusion, and signal processing to wearable sensor data has led to improved human activity recognition and behavioral phenotyping. Here, we review the state of the art in wearable and mobile sensing technology in epidemiology and clinical medicine and discuss how AI is changing the field

Apollo (Cambridge)

International Conference on Mechatronics, System Engineering and Robotics & Information System and Engineering

Author: University for Business and Technology - UBT
Publication venue: UBT Knowledge Center
Publication date: 26/10/2019
Field of study

UBT Annual International Conference is the 8th international interdisciplinary peer reviewed conference which publishes works of the scientists as well as practitioners in the area where UBT is active in Education, Research and Development. The UBT aims to implement an integrated strategy to establish itself as an internationally competitive, research-intensive university, committed to the transfer of knowledge and the provision of a world-class education to the most talented students from all background. The main perspective of the conference is to connect the scientists and practitioners from different disciplines in the same place and make them be aware of the recent advancements in different research fields, and provide them with a unique forum to share their experiences. It is also the place to support the new academic staff for doing research and publish their work in international standard level. This conference consists of sub conferences in different fields like: – Computer Science and Communication Engineering– Management, Business and Economics– Mechatronics, System Engineering and Robotics– Energy Efficiency Engineering– Information Systems and Security– Architecture – Spatial Planning– Civil Engineering , Infrastructure and Environment– Law– Political Science– Journalism , Media and Communication– Food Science and Technology– Pharmaceutical and Natural Sciences– Design– Psychology– Education and Development– Fashion– Music– Art and Digital Media– Dentistry– Applied Medicine– Nursing This conference is the major scientific event of the UBT. It is organizing annually and always in cooperation with the partner universities from the region and Europe. We have to thank all Authors, partners, sponsors and also the conference organizing team making this event a real international scientific event. Edmond Hajrizi, President of UBT UBT – Higher Education Institutio

University of Business and Technology in Kosovo: UBT Knowledge Center Collections

Facial Expression Recognition of Instructor Using Deep Features and Extreme Learning Machine

Author: Haroon Yousaf Muhammad
Jamil Afshan
Khalid Bhatti Yusra
Nida Nudrat
Velastin Carroza Sergio Alejandro
Viriri Serestina
Publication venue: 'Hindawi Limited'
Publication date: 03/05/2021
Field of study

Classroom communication involves teacher’s behavior and student’s responses. Extensive research has been done on the analysis of student’s facial expressions, but the impact of instructor’s facial expressions is yet an unexplored area of research. Facial expression recognition has the potential to predict the impact of teacher’s emotions in a classroom environment. Intelligent assessment of instructor behavior during lecture delivery not only might improve the learning environment but also could save time and resources utilized in manual assessment strategies. To address the issue of manual assessment, we propose an instructor’s facial expression recognition approach within a classroom using a feedforward learning model. First, the face is detected from the acquired lecture videos and key frames are selected, discarding all the redundant frames for effective high-level feature extraction. Then, deep features are extracted using multiple convolution neural networks along with parameter tuning which are then fed to a classifier. For fast learning and good generalization of the algorithm, a regularized extreme learning machine (RELM) classifier is employed which classifies five different expressions of the instructor within the classroom. Experiments are conducted on a newly created instructor’s facial expression dataset in classroom environments plus three benchmark facial datasets, i.e., Cohn–Kanade, the Japanese Female Facial Expression (JAFFE) dataset, and the Facial Expression Recognition 2013 (FER2013) dataset. Furthermore, the proposed method is compared with state-of-the-art techniques, traditional classifiers, and convolutional neural models. Experimentation results indicate significant performance gain on parameters such as accuracy, F1-score, and recall

Universidad Carlos III de Madrid e-Archivo

Cognition-Based Networks: A New Perspective on Network Optimization Using Learning and Distributed Intelligence

Author: DE FILIPPO DE GRAZIA Michele
Testolin Alberto
Zanella Andrea
Zorzi Marco
Zorzi Michele
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2015
Field of study

IEEE Access Volume 3, 2015, Article number 7217798, Pages 1512-1530 Open Access Cognition-based networks: A new perspective on network optimization using learning and distributed intelligence (Article) Zorzi, M.a , Zanella, A.a, Testolin, A.b, De Filippo De Grazia, M.b, Zorzi, M.bc a Department of Information Engineering, University of Padua, Padua, Italy b Department of General Psychology, University of Padua, Padua, Italy c IRCCS San Camillo Foundation, Venice-Lido, Italy View additional affiliations View references (107) Abstract In response to the new challenges in the design and operation of communication networks, and taking inspiration from how living beings deal with complexity and scalability, in this paper we introduce an innovative system concept called COgnition-BAsed NETworkS (COBANETS). The proposed approach develops around the systematic application of advanced machine learning techniques and, in particular, unsupervised deep learning and probabilistic generative models for system-wide learning, modeling, optimization, and data representation. Moreover, in COBANETS, we propose to combine this learning architecture with the emerging network virtualization paradigms, which make it possible to actuate automatic optimization and reconfiguration strategies at the system level, thus fully unleashing the potential of the learning approach. Compared with the past and current research efforts in this area, the technical approach outlined in this paper is deeply interdisciplinary and more comprehensive, calling for the synergic combination of expertise of computer scientists, communications and networking engineers, and cognitive scientists, with the ultimate aim of breaking new ground through a profound rethinking of how the modern understanding of cognition can be used in the management and optimization of telecommunication network

Archivio istituzionale della ricerca - Università di Padova