7 research outputs found

Human-like summaries from heterogeneous and time-windowed software development artefacts

Author: A Nenkova, G Erkan, J-M Torres-Moreno, N Nazar, P Verma, PB Baxendale, S Rastkar, S Sohangir
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020

First Online: 02 September 2020

Automatic text summarisation has drawn considerable interest in the area of software engineering. It is challenging to summarise the activities related to a software project, (1) because of the volume and heterogeneity of involved software artefacts, and (2) because it is unclear what information a developer seeks in such a multi-document summary. We present the first framework for summarising multi-document software artefacts containing heterogeneous data within a given time frame. To produce human-like summaries, we employ a range of iterative heuristics to minimise the cosine-similarity between texts and high-dimensional feature vectors. A first study shows that users find the automatically generated summaries the most useful when they are generated using word similarity and based on the eight most relevant software artefacts.

Mahfouth Alghamdi, Christoph Treude, Markus Wagne
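The abstract above relies on cosine similarity between texts and feature vectors. As a minimal sketch of that measure only (not the paper's heuristics or feature set), the following compares short texts as bag-of-words term-frequency vectors. The function names and the sample strings are illustrative assumptions, not taken from the paper.

```python
import math
from collections import Counter


def term_vector(text):
    """Bag-of-words term-frequency vector (lowercased, whitespace-split)."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-frequency vectors."""
    # Only terms present in both vectors contribute to the dot product.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical artefact texts, invented for illustration.
commit = term_vector("fix null pointer in parser")
issue = term_vector("parser crashes with null pointer")
unrelated = term_vector("update readme badges")

print(cosine_similarity(commit, issue))
print(cosine_similarity(commit, unrelated))
```

Texts that share vocabulary score closer to 1, and texts with no shared terms score 0.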

arXiv.org e-Print Archive

Crossref

Adelaide Research & Scholarship

Big data framework for finding patterns in multi-market trading data

Author: B Chowdhry, B Fang, CW Holden, I Aldridge, J Han, Jiawei Han, M Mythili, M Zaharia, PN Tan, Rakesh Agrawal, RC Agarwal, S Sohangir, T Preis, X Wu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018

In the United States, multimarket trading is becoming very popular among investors, professionals and high-frequency traders. This research focuses on 13 exchanges and applies a data mining algorithm, an unsupervised machine learning technique, to discover the relationships between stock exchanges. In this work, we used an association rule (FP-growth) algorithm to find trading patterns across exchanges. Thirty days of NYSE Trade and Quote (TAQ) data were used for these experiments. We implemented a big data framework of Spark clusters on top of Hadoop to conduct the experiment. The rules and correlations found in this work seem promising and can be used by investors and traders to make a decision
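To make the idea of association rules concrete, here is a minimal sketch that derives pairwise rules with support and confidence by brute-force counting. It is not FP-growth and does not use Spark; it only illustrates what such rules express. The function name, thresholds, and the sample transactions are invented for illustration and are not data from the paper.

```python
from collections import Counter
from itertools import combinations


def pair_rules(transactions, min_support, min_confidence):
    """Return (lhs, rhs, support, confidence) rules over item pairs."""
    n = len(transactions)
    item_counts = Counter()
    pair_counts = Counter()
    for t in transactions:
        items = sorted(set(t))  # sort so each pair has one canonical order
        item_counts.update(items)
        pair_counts.update(combinations(items, 2))

    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n  # share of transactions containing both items
        if support < min_support:
            continue
        for lhs, rhs in ((a, b), (b, a)):
            # confidence: of the transactions with lhs, how many also have rhs
            confidence = count / item_counts[lhs]
            if confidence >= min_confidence:
                rules.append((lhs, rhs, support, confidence))
    return sorted(rules)


# Hypothetical transactions: exchanges on which a stock traded in one window.
windows = [
    {"NYSE", "NASDAQ", "BATS"},
    {"NYSE", "NASDAQ"},
    {"NYSE", "BATS"},
    {"NASDAQ", "BATS"},
    {"NYSE", "NASDAQ"},
]

rules = pair_rules(windows, min_support=0.5, min_confidence=0.7)
for lhs, rhs, support, confidence in rules:
    print(f"{lhs} -> {rhs}: support={support:.2f}, confidence={confidence:.2f}")
```

FP-growth reaches the same kind of frequent itemsets without enumerating every candidate pair, which matters at the data volumes the abstract describes.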

University of Memphis Digital Commons

Crossref

Cross-domain similarity assessment for workflow improvement to handle Big Data challenge in workflow management

Author: A Fiannaca, A Polyvyanyy, A Schoknecht, CC Aggarwal, CW Tsai, D Fahland, D Garijo, D Loreti, E Maguire, F Durante, FE Tosta, G Janssenswillen, H Bunke, I Vanderfeesten, J Goecks, J Gubbi, J Mendling, J Schmidhuber, J Starlinger, K Wolstencroft, L Zeng, M Reichert, R Bergmann, R Conforti, R Dijkman, S Sohangir, SJ Pan, T Koohi-Var, W Medhata
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Data science: developing theoretical contributions in information systems via text analytics

Author: A Abbas, A Agarwal, A Elragal, A Hevner, A Mavragani, A Rai, AH Van de Ven, AJ Hey, AS Lee, BG Glaser, D Antons, D Newman, DA Whetten, DM Blei, G Bell, G George, G Neff, G Walsham, H Asri, HK Klein, J Bollen, J Bughin, Jiahui Mo, JP Gibbs, K Goswami, KM Eisenhardt, M Alvesson, M Andrejevic, M Frické, M Saar-Tsechansky, M Sein, MB Miles, MD Myers, ME Roberts, N Berente, N Trifunovic, NR Hassan, O Müller, P Lenca, PB Goes, R Agarwal, R Dong, R Dubin, R Kitchin, RK Yin, S Debortoli, S Gregor, S Kelling, S Sohangir, SB Bacharach, T Geva, Tomonari Masada, V Dhar, V Grover, WC Booth, WJ Orlikowski
Publication venue: 'Springer Science and Business Media LLC'

Crossref