
1,643 research outputs found

How Algorithmic Confounding in Recommendation Systems Increases Homogeneity and Decreases Utility

Author: Anderson C.
Bennett J.
Bottou L.
Chander A
Dan-Dan Z.
Jolliffe I.
Lee D. D.
Salakhutdinov R.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/09/2018

Recommendation systems are ubiquitous and impact many domains; they have the potential to influence product consumption, individuals' perceptions of the world, and life-altering decisions. These systems are often evaluated or trained with data from users already exposed to algorithmic recommendations; this creates a pernicious feedback loop. Using simulations, we demonstrate how using data confounded in this way homogenizes user behavior without increasing utility.

arXiv.org e-Print Archive

Princeton University Open Access Repository

Crossref

CHORUS Deliverable 2.1: State of the Art on Multimedia Search Engines

Author: Boujemaa Nozha
Compañó Ramón
Dosch Christoph
Geurts Joost
Karlgren Jussi
King Paul
Kompatsiaris Yiannis
Köhler Joachim
Le Moine Jean-Yves
Ortgies Robert
Point Jean-Charles
Rotenberg Boris
Rudström Åsa
Sebe Nicu
Publication venue: Chorus Project Consortium
Publication date: 01/01/2007

Based on information provided by European projects and national initiatives related to multimedia search, as well as by domain experts who participated in the CHORUS Think-Tanks and workshops, this document reports on the state of the art in multimedia content search from a technical and a socio-economic perspective. The technical perspective includes an up-to-date view of content-based indexing and retrieval technologies, multimedia search in the context of mobile devices and peer-to-peer networks, and an overview of current evaluation and benchmark initiatives to measure the performance of multimedia search engines. From a socio-economic perspective, we inventory the impact and legal consequences of these technical advances and point out future directions of research.

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

Software institutes' Online Digital Archive

Web Mining for Social Network Analysis: A Review, Direction and Future Vision

Author: Yumnam Jayanta Singh Verma Gulhati Manjula
Publication venue: Assam Don Bosco University
Publication date: 01/03/2016

Although the web is rich in data, gathering this data and making sense of it is extremely difficult because of its unorganised nature. Existing data mining techniques can therefore be applied to extract information from web data. The knowledge thus extracted can also be used for the analysis of social networks and online communities. This paper gives a brief insight into Web Mining and Link Analysis as used in Social Network Analysis and presents algorithms such as HITS, PAGERANK, SALSA, PHITS, CLEVER and INDEGREE, which give a measure for identifying online communities over social networks. The most common of these algorithms are PageRank and HITS. PageRank measures the importance of a page efficiently, and in less time, using inlinks, while HITS uses both inlinks and outlinks to measure the importance of a web page and is sensitive to the user query. Various extensions to these algorithms also exist to refine query-based search results. This opens many doors for future research to find undiscovered knowledge about existing online communities over various social networks. Keywords: Web Structure Mining, Link Analysis, Link Mining, Online Community Minin
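
The contrast the abstract draws between PageRank and HITS can be illustrated with a minimal sketch. This is not code from the reviewed paper: the function names, the damping factor of 0.85, the fixed iteration count and the toy link graph are all assumptions chosen for illustration. In practice HITS is usually run on a query-specific subgraph, which is where its sensitivity to the user query comes from; here both algorithms run on the same small graph.

```python
def _all_nodes(links):
    """Collect every page that appears as a source or a target."""
    return sorted(set(links) | {v for targets in links.values() for v in targets})


def pagerank(links, damping=0.85, iterations=100):
    """Power-iteration PageRank. links maps a page to the pages it links to."""
    nodes = _all_nodes(links)
    n = len(nodes)
    rank = {u: 1.0 / n for u in nodes}
    for _ in range(iterations):
        # Rank held by pages without outlinks is spread evenly over all pages.
        dangling = sum(rank[u] for u in nodes if not links.get(u))
        new_rank = {u: (1.0 - damping) / n + damping * dangling / n for u in nodes}
        for u in nodes:
            targets = links.get(u, [])
            for v in targets:
                new_rank[v] += damping * rank[u] / len(targets)
        rank = new_rank
    return rank


def hits(links, iterations=100):
    """HITS: authority scores come from inlinks, hub scores from outlinks."""
    nodes = _all_nodes(links)
    hub = {u: 1.0 for u in nodes}
    auth = {u: 1.0 for u in nodes}
    for _ in range(iterations):
        # Authority of a page: sum of the hub scores of pages linking to it.
        auth = {u: 0.0 for u in nodes}
        for u in nodes:
            for v in links.get(u, []):
                auth[v] += hub[u]
        norm = sum(a * a for a in auth.values()) ** 0.5 or 1.0
        auth = {u: a / norm for u, a in auth.items()}
        # Hub score of a page: sum of the authority scores of pages it links to.
        hub = {u: sum(auth[v] for v in links.get(u, [])) for u in nodes}
        norm = sum(h * h for h in hub.values()) ** 0.5 or 1.0
        hub = {u: h / norm for u, h in hub.items()}
    return hub, auth


# Hypothetical toy web graph: page -> pages it links to.
toy_links = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(toy_links)
hubs, authorities = hits(toy_links)
print(max(ranks, key=ranks.get), max(authorities, key=authorities.get), max(hubs, key=hubs.get))
```

On this toy graph the page with the most inlinks comes out on top for both PageRank and HITS authority, while the HITS hub score singles out the page with the most outlinks, a distinction PageRank does not make.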

Assam Don Bosco University Journals

Social Data Mining to Improve Bioinspired Intelligent Systems

Author: Alberto Herná
Alberto Ochoa
Alexander Gelbukh
Arnulfo Castro
Arturo Herná
Halina Iztebegovič
Saú
Publication venue: 'IntechOpen'
Publication date: 01/01/2008

IntechOpen

CiteSeerX

CONTEST: a Controllable Test Matrix Toolbox for MATLAB

Author: Alan Taylor
Desmond J. Higham
Erdös P.
Fagiolo G.
Lovász L.
Mangan S.
Milgram S.
Newman M. E. J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2008

Large, sparse networks that describe complex interactions are a common feature across a number of disciplines, giving rise to many challenging matrix computational tasks. Several random graph models have been proposed that capture key properties of real-life networks. These models provide realistic, parametrized matrices for testing linear system and eigenvalue solvers. CONTEST (CONtrollable TEST matrices) is a random network toolbox for MATLAB that implements nine models. The models produce unweighted directed or undirected graphs; that is, symmetric or unsymmetric matrices with elements equal to zero or one. They have one or more parameters that affect features such as sparsity and characteristic pathlength, and all can be of arbitrary dimension. Utility functions are supplied for rewiring, adding extra shortcuts and subsampling in order to create further classes of networks. Other utilities convert the adjacency matrices into real-valued coefficient matrices for naturally arising computational tasks that reduce to sparse linear system and eigenvalue problems.
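
To make "symmetric matrices with elements equal to zero or one" concrete, here is a minimal sketch of one random graph model. The abstract does not say which nine models the toolbox implements, so the choice of the Erdős–Rényi G(n, p) model is an assumption, as are the function name and parameters; this is Python for illustration and does not reproduce the toolbox's MATLAB interface.

```python
import random


def gnp_adjacency(n, p, seed=None):
    """Adjacency matrix of an undirected G(n, p) random graph.

    Each of the n*(n-1)/2 possible edges is included independently with
    probability p, so p controls the sparsity of the resulting matrix.
    """
    rng = random.Random(seed)
    matrix = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < p:
                # Undirected graph: set both entries to keep the matrix symmetric.
                matrix[i][j] = 1
                matrix[j][i] = 1
    return matrix


# Hypothetical usage: a small, fairly sparse test matrix.
for row in gnp_adjacency(6, 0.3, seed=42):
    print(row)
```

A directed variant would draw the (i, j) and (j, i) entries independently, which gives the unsymmetric case the abstract mentions.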

CiteSeerX

Crossref

University of Strathclyde Institutional Repository

Edinburgh Research Explorer

Creating Capsule Wardrobes from Fashion Images

Author: Grauman Kristen
Hsiao Wei-Lin
Publication date: 14/04/2018

We propose to automatically create capsule wardrobes. Given an inventory of candidate garments and accessories, the algorithm must assemble a minimal set of items that provides maximal mix-and-match outfits. We pose the task as a subset selection problem. To permit efficient subset selection over the space of all outfit combinations, we develop submodular objective functions capturing the key ingredients of visual compatibility, versatility, and user-specific preference. Since adding garments to a capsule only expands its possible outfits, we devise an iterative approach to allow near-optimal submodular function maximization. Finally, we present an unsupervised approach to learn visual compatibility from "in the wild" full body outfit photos; the compatibility metric translates well to cleaner catalog photos and improves over existing methods. Our results on thousands of pieces from popular fashion websites show that automatic capsule creation has potential to mimic skilled fashionistas in assembling flexible wardrobes, while being significantly more scalable. Comment: Accepted to CVPR 201
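
The idea of near-optimal submodular maximization can be sketched with the standard greedy algorithm on a toy coverage objective. This is not the authors' iterative method or their compatibility and versatility objectives: the garment names, the style tags and the "number of distinct tags covered" objective below are hypothetical stand-ins. For a monotone submodular objective under a size limit, greedy selection is known to reach at least a (1 - 1/e) fraction of the optimum.

```python
def greedy_max_coverage(candidates, budget):
    """Pick up to `budget` items, each time taking the largest marginal gain.

    candidates maps an item name to the set of tags it covers. The objective,
    the number of distinct tags covered, is monotone and submodular.
    """
    remaining = dict(candidates)
    chosen, covered = [], set()
    for _ in range(budget):
        # Marginal gain of an item: tags it adds beyond what is already covered.
        best = max(remaining, key=lambda name: len(remaining[name] - covered), default=None)
        if best is None or not (remaining[best] - covered):
            break  # nothing left, or no item adds anything new
        chosen.append(best)
        covered |= remaining.pop(best)
    return chosen, covered


# Hypothetical inventory: garment -> occasions it is suitable for.
inventory = {
    "white shirt": {"casual", "office", "formal"},
    "jeans": {"casual", "weekend"},
    "blazer": {"office", "formal"},
    "sneakers": {"weekend", "sport"},
}
capsule, occasions = greedy_max_coverage(inventory, budget=2)
print(capsule, sorted(occasions))
```

The diminishing marginal gain is visible in the second step: jeans would add only one new tag once the shirt is chosen, so the greedy rule prefers the item that adds two.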

arXiv.org e-Print Archive

Crossref

Computational Methods for Protein Identification from Mass Spectrometry Data

Author: A Frank
A Ganapathy
A Keller
A Taylor
AI Nesvizhskii
AJ Liska
AJ Mackey
B Habermann
B Ma
BC Searle
BE Boyes
C Robertson
CA Hastings
D Fenyo
DA Stead
DB Weatherly
DC Chamrad
DF Hochstrasser
DJ Pappin
DL Tabb
DN Perkins
ED Salin
F Levander
H Wang
HI Field
HJ Joshi
I Beer
I Rogers
I Shadforth
J Arthur
J Eriksson
J Magnin
J Peng
J Razumovskaya
J Reinders
J Samuelsson
JA Bons
JE Elias
JG Rohrbough
JJ Thomson
JK Eng
JL Joss
JM Hogan
Johanna McEntyre
Jonathan W Arthur
JR Yates III
JWH Wong
K Biemann
KA Resing
KR Coombes
L Huang
Leo McHugh
M Kempka
M Mann
M Tuloup
MA Baldwin
MJ Noga
ML Nielsen
MR Wilkins
NL Anderson
P Hernandez
R Apweiler
R Ullmer
RE Moorea
RJ Arnold
RM Day
S Carr
S Gay
S Orchard
SB Vardeman
SF Altschul
SJ Cordwell
V Bafna
V Dancik
W Zhang
WJ Henzel
WR Pearson
Y Chen
Y Han
Z Zhang
Publication venue: Public Library of Science
Publication date: 01/02/2008

Protein identification using mass spectrometry is an indispensable computational tool in the life sciences. A dramatic increase in the use of proteomic strategies to understand the biology of living systems generates an ongoing need for more effective, efficient, and accurate computational methods for protein identification. A wide range of computational methods, each with various implementations, are available to complement different proteomic approaches. A solid knowledge of the range of algorithms available and, more critically, the accuracy and effectiveness of these techniques is essential to ensure that as many of the proteins as possible, within any particular experiment, are correctly identified. Here, we undertake a systematic review of the currently available methods and algorithms for interpreting, managing, and analyzing biological data associated with protein identification. We summarize the advances in computational solutions as they have responded to corresponding advances in mass spectrometry hardware. The evolution of scoring algorithms and metrics for automated protein identification is also discussed, with a focus on the relative performance of different techniques. We also consider the relative advantages and limitations of different techniques in particular biological contexts. Finally, we present our perspective on future developments in the area of computational protein identification by considering the most recent literature on new and promising approaches to the problem, as well as identifying areas yet to be explored and the potential application of methods from other areas of computational biology.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central