10 research outputs found

    Functional Data Analysis in Electronic Commerce Research

    Full text link
    This paper describes opportunities and challenges of using functional data analysis (FDA) for the exploration and analysis of data originating from electronic commerce (eCommerce). We discuss the special data structures that arise in the online environment and why FDA is a natural approach for representing and analyzing such data. The paper reviews several FDA methods and motivates their usefulness in eCommerce research by providing a glimpse into new domain insights that they allow. We argue that the wedding of eCommerce with FDA leads to innovations both in statistical methodology, due to the challenges and complications that arise in eCommerce data, and in online research, by being able to ask (and subsequently answer) new research questions that classical statistical methods are not able to address, and also by expanding on research questions beyond the ones traditionally asked in the offline environment. We describe several applications originating from online transactions which are new to the statistics literature, and point out statistical challenges accompanied by some solutions. We also discuss some promising future directions for joint research efforts between researchers in eCommerce and statistics.Comment: Published at http://dx.doi.org/10.1214/088342306000000132 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Smoothing sparse and unevenly sampled curves using semiparametric mixed models: An application to online auctions

    Get PDF
    Functional data analysis can be challenging when the functional objects are sampled only very sparsely and unevenly. Most approaches rely on smoothing to recover the underlying functional object from the data which can be difficult if the data is irregularly distributed. In this paper we present a new approach that can overcome this challenge. The approach is based on the ideas of mixed models. Specifically, we propose a semiparametric mixed model with boosting to recover the functional object. While the model can handle sparse and unevenly distributed data, it also results in conceptually more meaningful functional objects. In particular, we motivate our method within the framework of eBay's online auctions. Online auctions produce monotonic increasing price curves that are often correlated across two auctions. The semiparametric mixed model accounts for this correlation in a parsimonious way. It also estimates the underlying increasing trend from the data without imposing model-constraints. Our application shows that the resulting functional objects are conceptually more appealing. Moreover, when used to forecast the outcome of an online auction, our approach also results in more accurate price predictions compared to standard approaches. We illustrate our model on a set of 183 closed auctions for Palm M515 personal digital assistants

    Interactive Data and Information Visualization: Unpacking its Characteristics and Influencing Aspects on Decision-making

    Get PDF
    Background: Interactive data and information visualization (IDIV) enhances information presentations by providing users with multiple visual representations, active controls, and analytics. Users have greater control over IDIV presentations than standard presentations and as such IDIV becomes a more popular and relevant means of supporting data analytics (DA), as well as augmenting human intellect. Thus, IDIV enables provision of information in a format better suited to users’ decision-making. Method: Synthesizing past literature, we unpack IDIV characteristics and their influence on decision-making. This study adopts a narrative review method. Our conceptualization of IDIV and the proposed decision-making model are derived from a substantial body of literature from within the information systems (IS) and psychology disciplines. Results: We propose an IS centered model of IDIV enhanced decision-making incorporating four bases of decision-making (i.e., predictors, moderators, mediators, and outcomes). IDIV is specifically characterized by rich features compared with standard information presentations, therefore, formulating the model is critical to understanding how IDIV affects decision processes, perceptual evaluations, and decision outcomes and quality. Conclusions: This decision-making model could provide a meaningful frame of reference for further IDIV research and greater specificity in IS theorizing. Overall, we contribute to the systematic description and explanation of IDIV and discuss a potential research agenda for future IDIV research into IS. Available at: https://aisel.aisnet.org/pajais/vol11/iss4/4

    Online Auctions: Theoretical and Empirical Investigations

    Get PDF
    This dissertation, which consists of three essays, studies online auctions both theoretically and empirically. The first essay studies a special online auction format used by eBay, “Buy-It- Now” (BIN) auctions, in which bidders are allowed to buy the item at a fixed BIN price set by the seller and end the auction immediately. I construct a two-stage model in which the BIN price is only available to one group of bidders. I find that bidders cutoff is lower in this model, which means, bidders are more likely to accept the BIN option, compared with the models assuming all bidders are offered the BIN. The results explain the high frequency of bidders accepting BIN price, and may also help explain the popularity of temporary BIN auctions in online auction sites, such as eBay, where BIN option is only offered to early bidders. In the second essay, I study how bidders’ risk attitude and time preference affect their behavior in Buy-It-Now auctions. I consider two cases, when both bidders enter the auction at the same time (homogenous bidders) thus BIN option is offered to both of them, and when two bidders enter the auction at two different stages (heterogenous bidders) thus the BIN option is only offered to the early bidder. Bidders’ optimal strategies are derived explicitly in both cases. In particular, given bidders’ risk attitude and time preference, the cutoff valuation, such that a bidder will accept BIN if his valuation is higher than the cutoff valuation and reject it otherwise, is calculated. I find that the cutoff valuation in the case of heterogenous bidders is lower than that in the case of homogenous bidders. The third essay focuses on the empirical modeling of the price processes of online auctions. I generalize the monotone series estimator to model the pooled price processes. Then I apply the model and the estimator to eBay auction data of a palm PDA. The results are shown to capture closely the overall pattern of observed price dynamics. In particular, early bidding, mid-auction draught, and sniping are well approximated by the estimated price curve

    Acoustic data optimisation for seabed mapping with visual and computational data mining

    Get PDF
    Oceans cover 70% of Earth’s surface but little is known about their waters. While the echosounders, often used for exploration of our oceans, have developed at a tremendous rate since the WWII, the methods used to analyse and interpret the data still remain the same. These methods are inefficient, time consuming, and often costly in dealing with the large data that modern echosounders produce. This PhD project will examine the complexity of the de facto seabed mapping technique by exploring and analysing acoustic data with a combination of data mining and visual analytic methods. First we test the redundancy issues in multibeam echosounder (MBES) data by using the component plane visualisation of a Self Organising Map (SOM). A total of 16 visual groups were identified among the 132 statistical data descriptors. The optimised MBES dataset had 35 attributes from 16 visual groups and represented a 73% reduction in data dimensionality. A combined Principal Component Analysis (PCA) + k-means was used to cluster both the datasets. The cluster results were visually compared as well as internally validated using four different internal validation methods. Next we tested two novel approaches in singlebeam echosounder (SBES) data processing and clustering – using visual exploration for outlier detection and direct clustering of time series echo returns. Visual exploration identified further outliers the automatic procedure was not able to find. The SBES data were then clustered directly. The internal validation indices suggested the optimal number of clusters to be three. This is consistent with the assumption that the SBES time series represented the subsurface classes of the seabed. Next the SBES data were joined with the corresponding MBES data based on identification of the closest locations between MBES and SBES. Two algorithms, PCA + k-means and fuzzy c-means were tested and results visualised. From visual comparison, the cluster boundary appeared to have better definitions when compared to the clustered MBES data only. The results seem to indicate that adding SBES did in fact improve the boundary definitions. Next the cluster results from the analysis chapters were validated against ground truth data using a confusion matrix and kappa coefficients. For MBES, the classes derived from optimised data yielded better accuracy compared to that of the original data. For SBES, direct clustering was able to provide a relatively reliable overview of the underlying classes in survey area. The combined MBES + SBES data provided by far the best accuracy for mapping with almost a 10% increase in overall accuracy compared to that of the original MBES data. The results proved to be promising in optimising the acoustic data and improving the quality of seabed mapping. Furthermore, these approaches have the potential of significant time and cost saving in the seabed mapping process. Finally some future directions are recommended for the findings of this research project with the consideration that this could contribute to further development of seabed mapping problems at mapping agencies worldwide

    Mixed models based on likelihood boosting

    Get PDF
    corecore