Search CORE

6 research outputs found

Mavericks at NADI 2023 Shared Task: Unravelling Regional Nuances through Dialect Identification using Transformer-based Approach

Author: Deshpande Kshitij
Deshpande Vedant
Mangalvedhekar Sudeep
Murumkar Ravindra
Patwardhan Yash
Publication venue
Publication date: 30/11/2023
Field of study

In this paper, we present our approach for the "Nuanced Arabic Dialect Identification (NADI) Shared Task 2023". We highlight our methodology for subtask 1 which deals with country-level dialect identification. Recognizing dialects plays an instrumental role in enhancing the performance of various downstream NLP tasks such as speech recognition and translation. The task uses the Twitter dataset (TWT-2023) that encompasses 18 dialects for the multi-class classification problem. Numerous transformer-based models, pre-trained on Arabic language, are employed for identifying country-level dialects. We fine-tune these state-of-the-art models on the provided dataset. The ensembling method is leveraged to yield improved performance of the system. We achieved an F1-score of 76.65 (11th rank on the leaderboard) on the test dataset.Comment: 5 pages, 1 figure, accepted at the NADI ArabicNLP Workshop, EMNLP 202

arXiv.org e-Print Archive

Mavericks at ArAIEval Shared Task: Towards a Safer Digital Space -- Transformer Ensemble Models Tackling Deception and Persuasion

Author: Deshpande Kshitij
Deshpande Vedant
Mangalvedhekar Sudeep
Murumkar Ravindra
Patwardhan Yash
Publication venue
Publication date: 30/11/2023
Field of study

In this paper, we highlight our approach for the "Arabic AI Tasks Evaluation (ArAiEval) Shared Task 2023". We present our approaches for task 1-A and task 2-A of the shared task which focus on persuasion technique detection and disinformation detection respectively. Detection of persuasion techniques and disinformation has become imperative to avoid distortion of authentic information. The tasks use multigenre snippets of tweets and news articles for the given binary classification problem. We experiment with several transformer-based models that were pre-trained on the Arabic language. We fine-tune these state-of-the-art models on the provided dataset. Ensembling is employed to enhance the performance of the systems. We achieved a micro F1-score of 0.742 on task 1-A (8th rank on the leaderboard) and 0.901 on task 2-A (7th rank on the leaderboard) respectively.Comment: 6 pages, 1 figure, accepted at the ArAIEval ArabicNLP workshop, EMNLP conference 202

arXiv.org e-Print Archive

CoNIC Challenge: Pushing the Frontiers of Nuclear Detection, Segmentation, Classification and Counting

Author: Ahn Heeyoung
Aviles-Rivero Angelica I.
Azzuni Hussam
Bashir Raja Muhammad Saad
Baumann Elias
Blache Marie-Claire
Böhland Moritz
Campilho Aurélio
Cardoso Jaime S.
Cheng Jijun
Chien Hsiang-Chin
Costa Pedro
Dawood Muhammad
Deshpande Srijay
Devika R. G.
Dubey Yash
Dumbhare Pranay
Fang Zijie
Graham Simon
Han Chu
Hirsch Peter
Hong Chenyang
Hong Yiyu
Hrishikesh P. S.
Huang Banban
Jahanifar Mostafa
Jain Ayushi
Jamthikar Ankush
Jiji C. V.
Jung Hyun
Kainmueller Dagmar
Kasai Satoshi
Kim Soo-Hyung
Kondo Satoshi
Kwak Jin Tae
Lee Chia-Yen
Lee Taebum
Li Jiachen
Lin Chunhui
Lin Hong-Kun
Lin Zhifan
Liu Lihao
Liu Shuolin
Liu Zaiyi
Löffler Katharina
Mao Lijian
Meda Yughender
Miao Tianyi
Mikut Ralf
Minhas Fayyaz
Mishra Prakash
Neumann Oliver
Nunes João D.
Pan Xipeng
Phuse Vedant
Piégu Benoît
Puthussery Densen
Rajpoot Nasir M.
Raza Shan E. Ahmed
Reischl Markus
Ridzuan Muhammad
Rumberger Josef Lorenz
Scherr Tim
Schilling Marcel P.
Schmidt Uwe
Schönlieb Carola-Bibiane
Shephard Adam
Snead David
Talsania Dhairya
The CoNIC Challenge Consortium
Vernay Bertrand
Vo Vi Thi-Tuong
Vu Quoc Dang
Vuong Trinh Thi Le
Wang Ching-Ping
Wang Chixin
Wang Xiyue
Weigert Martin
Wu Min
Xiang Jinxi
Xu Min
Yang Sen
Yaqub Mohammad
Ying Weiqin
Zhang Jun
Zhang Liukun
Zhang Wenhua
Zhang Ye
Zhang Yongbing
Ziaei Dorsa
Publication venue
Publication date: 14/03/2023
Field of study

Nuclear detection, segmentation and morphometric profiling are essential in helping us further understand the relationship between histology and patient outcome. To drive innovation in this area, we setup a community-wide challenge using the largest available dataset of its kind to assess nuclear segmentation and cellular composition. Our challenge, named CoNIC, stimulated the development of reproducible algorithms for cellular recognition with real-time result inspection on public leaderboards. We conducted an extensive post-challenge analysis based on the top-performing models using 1,658 whole-slide images of colon tissue. With around 700 million detected nuclei per model, associated features were used for dysplasia grading and survival analysis, where we demonstrated that the challenge's improvement over the previous state-of-the-art led to significant boosts in downstream performance. Our findings also suggest that eosinophils and neutrophils play an important role in the tumour microevironment. We release challenge models and WSI-level results to foster the development of further methods for biomarker discovery

arXiv.org e-Print Archive

KITopen

A Robotic Process Automation for Stock Selection Process and Price Prediction Model using Machine Learning Techniques

Author: Bhutada Pritesh
Deshpande Prof. Leena
Gampawar Vedant
Jadkar Vinayak
Khandate Mayur
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 31/07/2022
Field of study

Among these last few years, we have seen a tremendous increase in the participation in financial markets as well as there are more robotic process automation jobs emerging in recent years. We can clearly see the scope and increased requirement in both these domains. In the stock market, predicting the stock prices/direction and making profits is the main goal whereas in rpa, tasks which are done on a regular basis are converted into automated or semi-automated form. In this paper we have tried to apply both things into the picture such as developing a price prediction model using machine learning techniques and automating the stock selecting process through technical screeners depending on user requirements. Stacked LSTM and Bi-directional LSTM ML techniques are used and for automation part powerful rpa tool Automation Anywhere has been used. Factors such as evaluation metrics and graph plots are compared for models and advantages, and disadvantages are discussed for using systems with RPA and without RPA practices. Price prediction plots have been analyzed for stocks of different sectors with highest market capitalization and results/analysis and inferences have been stated.     &nbsp

International Journal on Recent and Innovation Trends in Computing and Communication

Equilibria: Janunary 2023

Author: Agarwal Megha
Babbar Ria
Barve Pia
Biswas Anshuja
Bonnjha
Danane Rajesh
Debroy Bibek
Deshpande Vedant
Deswal Samiksha
Doshi Yashvi
Feroz Zahra
Garg Sonakshi
Gokhale Institute of Politics and Economics (GIPE) Pune (India)
Jha Shachi
Kajale Jayanti
Khare Ashwin
Mahajan Devaanshi
Mahanta Dikshita
Mahindrakar Pranautee
Meshram Pranjal
Mohandas Vandana
Naikwade Daulat
Panwar Arpit
Poddar Deboparna
Radkar Anjali
Ranade Ajit
Reddy Kalluru Siva
Salian Srishti
Sankar Arvind M.
Santhanakrishnan Deepika
Seth Shagun
Sharma Manan
Sharma Osheen
Surana Samrudha
Tali Tuhina
Thomas Aliza
Publication venue: Gokhale Institute of Politics and Economics (GIPE), Pune (India)
Publication date: 01/01/2023
Field of study

DSpace@GIPE