978 research outputs found
Recommended from our members
Essays in Basketball Analytics
With the increasing popularity and competition in professional basketball in the past decade, data driven decision has emerged as a big competitive edge. The advent of high frequency player tracking data from SportVU has enabled a rigorous analysis of player abilities and interactions that was not possible before. The tracking data records two-dimensional x-y coordinates of 10 players on the court as well as the x-y-z coordinates of the ball at a resolution of 25 frames per second, yielding over 1 billion space-time observations over the course of a full season. This dissertation offers a collection of spatio-temporal models and player evaluation metrics that provide insight into the player interactions and their performance, hence allowing the teams to make better decisions.
Conventional approaches to simulate matches have ignored that in basketball the dynamics of ball movement is very sensitive to the lineups on the court and unique identities of players on both offense and defense sides. In chapter 2, we propose the simulation infrastructure that can bridge the gap between player identity and team level network. We model the progression of a basketball match using a probabilistic graphical model. We model every touch event in a game as a sequence of transitions between discrete states. We treat the progression of a match as a graph, where each node represents the network structure of players on the court, their actions, events, etc., and edges denote possible moves in the game flow. Our results show that either changes in the team lineup or changes in the opponent team lineup significantly affects the dynamics of a match progression. Evaluation on the match data for the 2013-16 NBA season suggests that the graphical model approach is appropriate for modeling a basketball match.
NBA teams value players who can ``stretch'' the floor, i.e. create space on the court by drawing their defender(s) closer to themselves. Clearly, this ability to attract defenders varies across players, and furthermore, this effect may also vary by the court location of the offensive player, and whether or not the player is the ball handler. For instance, a ball-handler near the basket attracts a defender more when compared to a non ball-handler at the 3 point line. This has a significant effect on the defensive assignment. This is particularly important because defensive assignment has become the cornerstone of all tracking data based player evaluation models. In chapter 3, we propose a new model to learn player and court location specific offensive attraction. We show that offensive players indeed have varying ability to attract the defender in different parts of the court. Using this metric, teams can evaluate players to construct a roster or lineup which maximizes spacing. We also improve upon the existing defensive matchup inference algorithm for SportVU data.
While the ultimate goal of the offense is to shoot the ball, the strategy lies in creating good shot opportunities. Offensive play event detection has been a topic of research interest. Current research in this area have used a supervised learning approach to detect and classify such events. We took an unsupervised learning approach to detect these events. This has two inherent benefits: first, there is no need for pretagged data to learn identifying these events which is a lobor intensive and error prone task; second, an unsupervised approach allows us to detect events that has not been tagged yet i.e. novel events. We use a HMM based approach to detect these events at any point in the time during a possession by specifying the functional form of the prior distribution on the player movement data. We test our framework on detecting ball screen, post up, and drive. However, it can be easily extended to events like isolation or a new event that has certain distinct defensive matchup or player movement feature compared to a non event. This is the topic for chapter 4.
Accurate estimation of the offensive and the defensive abilities of players in the NBA plays a crucial role in player selection and ranking. A typical approach to estimate players' defensive and offensive abilities is to learn the defensive assignment for each shot and then use a random effects model to estimate the offensive and defensive abilities for each player. The scalar estimate from the random effects model can then be used to rank player. In this approach, a shot has a binary outcome, either it is made or it is a miss. This approach is not able to take advantage of the “quality” of the shot trajectory. In chapter 5, we propose a new method for ranking players that infers the quality of a shot trajectory using a deep recurrent neural network, and then uses this quality measure in a random effects model to rank players taking defensive matchup into account. We show that the quality information significantly improves the player ranking. We also show that including the quality of shots increases the separation between the learned random effect coefficients, and thus, allows for a better differentiation of player abilities. Further, we show that we are able to infer changes in the player's ability on a game-by-game basis when using a trajectory based model. A shot based model does not have enough information to detect changes in player's ability on a game-by-game basis.
A good defensive player prevents its opponent from making a shot, attempting a good shot, making an easy pass, or scoring events, eventually leading to wasted shot clock time. The salient feature here is that a good defender prevents events. Consequently, event driven metrics, such as box scores, cannot measure defensive abilities. Conventional wisdom in basketball is that ``pesky'' defenders continuously maintain a close distance to the ball handler. A closely guarded offensive player is less likely to take or make a shot, less likely to pass, and more likely to lose the ball. In chapter 6, we introduce Defensive Efficiency Rating (DER), a new statistic that measures the defensive effectiveness of a player. DER is the effective distance a defender maintains with the ball handler during an interaction where we control for the identity and wingspan of the the defender, the shot efficiency of the ball handler, and the zone on the court. DER allows us to quantify the quality of defensive interaction without being limited by the occurrence of discrete and infrequent events like shots and rebounds. We show that the ranking from this statistic naturally picks out defenders known to perform well in particular zones
A Method for Using Player Tracking Data in Basketball to Learn Player Skills and Predict Team Performance
Player tracking data represents a revolutionary new data source for basketball analysis, in which essentially every aspect of a player’s performance is tracked and can be analyzed numerically. We suggest a way by which this data set, when coupled with a network-style model of the offense that relates players’ skills to the team’s success at running different plays, can be used to automatically learn players’ skills and predict the performance of untested 5-man lineups in a way that accounts for the interaction between players’ respective skill sets. After developing a general analysis procedure, we present as an example a specific implementation of our method using a simplified network model. While player tracking data is not yet available in the public domain, we evaluate our model using simulated data and show that player skills can be accurately inferred by a simple statistical inference scheme. Finally, we use the model to analyze games from the 2011 playoff series between the Memphis Grizzlies and the Oklahoma City Thunder and we show that, even with a very limited data set, the model can consistently describe a player’s interactions with a given lineup based only on his performance with a different lineup
Application of Artificial Intelligence in Basketball Sport
Basketball is among the most popular sports in the world, and its related industries have also produced huge economic benefits. In recent years, the application of artificial intelligence (AI) technology in basketball has attracted a large amount of attention. We conducted a comprehensive review of the application research of AI in basketball through literature retrieval. Current research focuses on the AI analysis of basketball team and player performance, prediction of competition results, analysis and prediction of shooting, AI coaching system, intelligent training machine and arena, and sports injury prevention. Most studies have shown that AI technology can improve the training level of basketball players, help coaches formulate suitable game strategies, prevent sports injuries, and improve the enjoyment of games. At the same time, it is also found that the number and level of published papers are relatively limited. We believe that the application of AI in basketball is still in its infancy. We call on relevant industries to increase their research investment in this area, and promote the improvement of the level of basketball, making the game increasingly exciting as its worldwide popularity continues to increase
PRISTUP RUDARENJU PODATAKA ZA ANALIZU POSLOVNE VRIJEDNOSTI U KOŠARCI
With the rapidly increasing volume of data, novel methods and technologies for their analysis, and
opportunities to support decision-making processes emerge in the domain of sports analytics. This,
in particular, applies to analysing athletes\u27 performance and calculating related business added-value
in major sports leagues such as the National Basketball Association (NBA). Specifically, the financial
success of a team/franchise depends not only on the results of the games but also on the success of
attracting marketable individuals who bring higher business value. In that regard, this paper aims to
demonstrate the potential and importance of data mining methods to uncover the factors influencing the
decisions related to the player selection based on individual results, physical characteristics, and
professional contract salaries in the NBA. For the study, 22 datasets were integrated into three large
datasets. The data covers the period from 1946 (when the league was founded) to 2017. Data mining models
were developed in RapidMiner, enabling correlation, cluster and regression analysis. Change in the factors
affecting the selection of new players in recent years was uncovered, while the classification revealed,
for example, that more than 50% of players have below-average coefficients of efficiency and individual
result contribution. An artificial neural network algorithm was used to identify discrepancies for
players with high-salary contracts as many do not meet high-performance standards. The study demonstrates
how classification and prediction models can serve sports analysts and managers in making decisions
related to future professional contracts and predict future salaries for active players, among other
contributions.Povećanjem obujma generiranih podataka, te pojavom metoda i dostupnih tehnologija za njihovu analizu,
otvaraju se nove prilike za podršku procesima donošenja odluka u domeni sportske analitike. Navedeno se
posebno odnosi i na analizu performansi sportaša i izračun dodane poslovne vrijednosti koju pojedinci
nose u velikim sportskim natjecanjima kao što je National Basketball Association (NBA) liga. Financijski
uspjeh klubova tako ne ovisi isključivo o sportskom rezultatu nego i o uspješnosti privlačenja marketinški
isplativih kadrova koji sa sobom nose veću poslovnu vrijednost. U tom kontekstu, cilj ovog rada je
demonstrirati potencijal i važnost metoda rudarenja podataka kako bi se istražili čimbenici koji utječu
na donošenje odluka o selekciji igraćih kadrova temeljem ostvarenih individualnih rezultata, fizičkih
karakteristika i pratećih iznosa profesionalnih ugovora. Za potrebe istraživanja kreirana su 3 velika
seta podataka, uređena i integrirana temeljem 22 različita seta podataka iz otvorenih izvora. Podaci
obuhvaćaju razdoblje od osnivanja lige 1946. godine do 2017. godine. Prilikom provođenja analize u
alatu RapidMiner razvijeni su modeli za korelacijsku, klaster i regresijska analizu. Prepoznata je
promjena ključnih čimbenika koji su utjecali na odabir novih igrača posljednjih godina, dok je
klasifikacijska analiza otkrila da, na primjer, više od 50% igrača ima ispodprosječne koeficijente
učinkovitosti i individualni doprinos rezultatu. Izrađen je i model s umjetnim neuronskim mrežama za
utvrđivanje odstupanja kod igrača s ugovorima s visokim plaćama od kojih mnogi ne udovoljavaju visokim
standardima performansi. Među ostalim doprinosima, ova studija prikazuje kako modele klasifikacije i
predviđanja mogu razvijati sportski analitičari i menadžeri pri donošenju odluka povezanih s formiranjem
profesionalnih ugovora i predviđanju budućih plaća aktivnih igrača
- …