MAILEX: Email Event and Argument Extraction
In this work, we present the first dataset, MAILEX, for performing event
extraction from conversational email threads. To this end, we first proposed a
new taxonomy covering 10 event types and 76 arguments in the email domain. Our
final dataset includes 4K emails annotated with 9K event instances.
To understand the task challenges, we conducted a series of experiments
comparing two commonly seen lines of approaches for event extraction, i.e.,
sequence labeling and generative end-to-end extraction (including few-shot
GPT-3.5). Our results showed that the task of email event extraction is far
from being addressed, due to challenges such as extracting
non-continuous, shared trigger spans, extracting non-named entity arguments,
and modeling the email conversational history. Our work thus calls for further
investigation of this domain-specific event extraction task in the
future. The source code and dataset can be obtained from
https://github.com/salokr/Email-Event-Extraction