1,569 research outputs found
Evaluating Throwing Ability in Baseball
We present a quantitative analysis of throwing ability for major league
outfielders and catchers. We use detailed game event data to tabulate success
and failure events in outfielder and catcher throwing opportunities. We
attribute a run contribution to each success or failure which are tabulated for
each player in each season. We use four seasons of data to estimate the overall
throwing ability of each player using a Bayesian hierarchical model. This model
allows us to shrink individual player estimates towards an overall population
mean depending on the number of opportunities for each player. We use the
posterior distribution of player abilities from this model to identify players
with significant positive and negative throwing contributions.Comment: Accepted for publication in the Journal of Quantitative Analysis in
Sport
A Hierarchical Dirichlet Process Model with Multiple Levels of Clustering for Human EEG Seizure Modeling
Driven by the multi-level structure of human intracranial
electroencephalogram (iEEG) recordings of epileptic seizures, we introduce a
new variant of a hierarchical Dirichlet Process---the multi-level clustering
hierarchical Dirichlet Process (MLC-HDP)---that simultaneously clusters
datasets on multiple levels. Our seizure dataset contains brain activity
recorded in typically more than a hundred individual channels for each seizure
of each patient. The MLC-HDP model clusters over channels-types, seizure-types,
and patient-types simultaneously. We describe this model and its implementation
in detail. We also present the results of a simulation study comparing the
MLC-HDP to a similar model, the Nested Dirichlet Process and finally
demonstrate the MLC-HDP's use in modeling seizures across multiple patients. We
find the MLC-HDP's clustering to be comparable to independent human physician
clusterings. To our knowledge, the MLC-HDP model is the first in the epilepsy
literature capable of clustering seizures within and between patients.Comment: ICML201
Changes in the Distribution of Income Volatility
Recent research has documented a significant rise in the volatility (e.g.,
expected squared change) of individual incomes in the U.S. since the 1970s.
Existing measures of this trend abstract from individual heterogeneity,
effectively estimating an increase in average volatility. We decompose this
increase in average volatility and find that it is far from representative of
the experience of most people: there has been no systematic rise in volatility
for the vast majority of individuals. The rise in average volatility has been
driven almost entirely by a sharp rise in the income volatility of those
expected to have the most volatile incomes, identified ex-ante by large income
changes in the past. We document that the self-employed and those who
self-identify as risk-tolerant are much more likely to have such volatile
incomes; these groups have experienced much larger increases in income
volatility than the population at large. These results color the policy
implications one might draw from the rise in average volatility. While the
basic results are apparent from PSID summary statistics, providing a complete
characterization of the dynamics of the volatility distribution is a
methodological challenge. We resolve these difficulties with a Markovian
hierarchical Dirichlet process that builds on work from the non-parametric
Bayesian statistics literature
Estimating an NBA player's impact on his team's chances of winning
Traditional NBA player evaluation metrics are based on scoring differential
or some pace-adjusted linear combination of box score statistics like points,
rebounds, assists, etc. These measures treat performances with the outcome of
the game still in question (e.g. tie score with five minutes left) in exactly
the same way as they treat performances with the outcome virtually decided
(e.g. when one team leads by 30 points with one minute left). Because they
ignore the context in which players perform, these measures can result in
misleading estimates of how players help their teams win. We instead use a win
probability framework for evaluating the impact NBA players have on their
teams' chances of winning. We propose a Bayesian linear regression model to
estimate an individual player's impact, after controlling for the other players
on the court. We introduce several posterior summaries to derive rank-orderings
of players within their team and across the league. This allows us to identify
highly paid players with low impact relative to their teammates, as well as
players whose high impact is not captured by existing metrics.Comment: To appear in the Journal of Quantitative Analysis of Spor
Hierarchical Bayesian Modeling of Hitting Performance in Baseball
We have developed a sophisticated statistical model for predicting the
hitting performance of Major League baseball players. The Bayesian paradigm
provides a principled method for balancing past performance with crucial
covariates, such as player age and position. We share information across time
and across players by using mixture distributions to control shrinkage for
improved accuracy. We compare the performance of our model to current
sabermetric methods on a held-out season (2006), and discuss both successes and
limitations
Slaps, Swords and Cheating!
Program listing performers and works performe
- …