InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis
In this paper, we present InstructABSA, Aspect Based Sentiment Analysis
(ABSA) using the instruction learning paradigm for all ABSA subtasks: Aspect
Term Extraction (ATE), Aspect Term Sentiment Classification (ATSC), and Joint
Task modeling. Our method introduces positive, negative, and neutral examples
to each training sample, and instruction tunes the model (Tk-Instruct) for each
ABSA subtask, yielding significant performance improvements. Experimental
results on the SemEval 2014, 15, and 16 datasets demonstrate that InstructABSA
outperforms the previous state-of-the-art (SOTA) approaches on all three ABSA
subtasks (ATE, ATSC, and Joint Task) by a significant margin, while also
outperforming models 7x its size. In particular, InstructABSA surpasses the
SOTA on the Rest14 ATE subtask by 7.31% points and on the Lapt14 Joint Task by
8.63% points, and it also improves on the SOTA for the Rest15 ATSC subtask.
Our results also suggest a strong generalization ability to new domains across
all three subtasks.
Comment: 4 pages, 2 figures, 5 tables, 5 appendix pages
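
The instruction format sketched in this abstract is straightforward to
reproduce. Below is a minimal, hypothetical illustration of how an ATSC prompt
with one positive, one negative, and one neutral demonstration might be
assembled; the definition wording and example sentences are assumptions, not
the paper's exact templates, and the resulting string would be fed to a
Tk-Instruct-style seq2seq model for tuning.

    # Minimal sketch of an InstructABSA-style ATSC prompt. The wording below
    # is illustrative only; the paper's actual definitions and examples differ.
    DEFINITION = (
        "Definition: Given a sentence and an aspect term, classify the "
        "sentiment towards the aspect as positive, negative, or neutral.\n"
    )

    EXAMPLES = (  # hypothetical demonstrations, one per label
        "Example: sentence: The battery life is amazing. aspect: battery "
        "life. output: positive\n"
        "Example: sentence: The screen cracked within a week. aspect: "
        "screen. output: negative\n"
        "Example: sentence: It comes in a plain cardboard box. aspect: box. "
        "output: neutral\n"
    )

    def build_atsc_prompt(sentence: str, aspect: str) -> str:
        """Assemble the instruction-tuning input for one training sample."""
        return (
            DEFINITION + EXAMPLES
            + f"Now complete the following: sentence: {sentence} "
            + f"aspect: {aspect}. output:"
        )

    print(build_atsc_prompt("The pasta was bland.", "pasta"))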
Instruction Tuned Models are Quick Learners
Instruction tuning of language models has demonstrated the ability to enhance
model generalization to unseen tasks via in-context learning using a few
examples. However, typical supervised learning still requires a plethora of
downstream training data for finetuning. Often in real-world situations, there
is only a limited amount of data available for finetuning, falling somewhere
between the few-shot inference and fully supervised finetuning regimes. In
this work, we demonstrate
the sample efficiency of instruction-tuned models across various tasks by
estimating the minimal downstream training data required by them to perform
transfer learning and match the performance of state-of-the-art (SOTA)
supervised models. We conduct experiments on 119 tasks from
Super-NaturalInstructions (SuperNI) in both the single-task learning (STL) and
multi-task learning (MTL) settings. Our findings reveal that, in the STL
setting, instruction-tuned models equipped with 25% of the downstream training
data surpass
the SOTA performance on the downstream tasks. In the MTL setting, an
instruction-tuned model trained on only 6% of the downstream training data
achieves SOTA, while using 100% of the training data results in a 3.69% points
improvement (ROUGE-L 74.68) over the previous SOTA. We conduct an analysis on
T5 vs Tk-Instruct by developing several baselines to demonstrate that
instruction tuning aids in increasing both sample efficiency and transfer
learning. Additionally, we observe a consistent ~4% performance increase in
both settings when pre-finetuning is performed with instructions. Finally, we
conduct a categorical study and find that contrary to previous results, tasks
in the question rewriting and title generation categories suffer from
instruction tuning.
Comment: 9 pages, 5 figures, 19 tables (including appendix), 12 pages of
appendix
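
To make the data-budget protocol concrete, here is a minimal sketch of how a
fixed-fraction downstream subsample might be drawn before finetuning; the
instance fields and the loader are illustrative assumptions, not the paper's
actual experimental harness.

    import random

    def subsample(train_set, fraction, seed=42):
        """Draw a fixed fraction of a task's training instances.

        A sketch of the sample-efficiency protocol only: the paper reports
        that ~6% suffices to match SOTA in the MTL setting and ~25% in the
        STL setting.
        """
        rng = random.Random(seed)  # fixed seed for reproducible budgets
        k = max(1, round(len(train_set) * fraction))
        return rng.sample(train_set, k)

    # Hypothetical SuperNI-style instances (instruction + input + target).
    train_set = [
        {"instruction": "Rewrite the question.", "input": f"q{i}",
         "target": f"r{i}"}
        for i in range(1000)
    ]

    mtl_budget = subsample(train_set, 0.06)  # 6% budget from the MTL result
    stl_budget = subsample(train_set, 0.25)  # 25% budget from the STL result
    print(len(mtl_budget), len(stl_budget))  # -> 60 250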
TarGEN: Targeted Data Generation with Large Language Models
The rapid advancement of large language models (LLMs) has sparked interest in
data synthesis techniques, aiming to generate diverse and high-quality
synthetic datasets. However, these synthetic datasets often suffer from a lack
of diversity and added noise. In this paper, we present TarGEN, a multi-step
prompting strategy for generating high-quality synthetic datasets utilizing an
LLM. An advantage of TarGEN is its seedless nature; it does not require
specific task instances, broadening its applicability beyond task replication.
We augment TarGEN with a method known as self-correction, empowering LLMs to
rectify inaccurately labeled instances during dataset creation, ensuring
reliable labels. To assess our technique's effectiveness, we emulate 8 tasks
from the SuperGLUE benchmark and finetune various language models, including
encoder-only, encoder-decoder, and decoder-only models on both synthetic and
original training sets. Evaluation on the original test set reveals that models
trained on datasets generated by TarGEN perform approximately 1-2% points
better than those trained on the original datasets (82.84% on synthetic vs.
81.12% on original data with Flan-T5). When incorporating instruction tuning,
the performance increases to 84.54% on synthetic vs. 81.49% on original data
with Flan-T5. A
comprehensive analysis of the synthetic dataset compared to the original
dataset reveals that the synthetic dataset demonstrates similar or higher
levels of dataset complexity and diversity. Furthermore, the synthetic dataset
displays a bias level that aligns closely with the original dataset. Finally,
when pre-finetuned on our synthetic SuperGLUE dataset, T5-3B yields impressive
results on the OpenLLM leaderboard, surpassing the model trained on the
Self-Instruct dataset by 4.14% points. We hope that TarGEN can be helpful for
high-quality data generation and for reducing the human effort required to
create complex benchmarks.
Comment: 10 pages, 6 tables, 5 figures, 5 pages of references, 17 pages of
appendix
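
A rough sketch of how a seedless, self-correcting generation loop of this kind
could look is given below; call_llm is a hypothetical stand-in for any
text-completion API, and the prompt wording is an assumption rather than
TarGEN's actual multi-step templates.

    from typing import Callable

    def seedless_generate(task_description: str, label_space: list,
                          n: int, call_llm: Callable[[str], str]) -> list:
        """Sketch of a TarGEN-like pipeline: synthesize labeled instances
        from a task description alone (no seed instances), then run a
        self-correction pass to rectify inaccurately labeled instances."""
        dataset = []
        for i in range(n):
            # Step 1: generate an instance targeting a cycled label.
            label = label_space[i % len(label_space)]
            text = call_llm(
                f"Task: {task_description}\n"
                f"Write one new input whose correct label is '{label}'."
            )
            # Step 2: self-correction; keep the verified label if valid.
            verified = call_llm(
                f"Task: {task_description}\nInput: {text}\n"
                f"Proposed label: {label}. Reply with the correct label "
                f"from {label_space}."
            ).strip()
            dataset.append({"input": text,
                            "label": verified if verified in label_space
                            else label})
        return dataset

    # Dummy stand-in LLM so the sketch runs without an API key.
    data = seedless_generate("Classify movie reviews.",
                             ["positive", "negative"],
                             n=4, call_llm=lambda prompt: "positive")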
