3,449,940 research outputs found

    Teaching Stats for Data Science

    Get PDF
    “Data science” is a useful catchword for methods and concepts original to the field of statistics, but typically being applied to large, multivariate, observational records. Such datasets call for techniques not often part of an introduction to statistics: modeling, consideration of covariates, sophisticated visualization, and causal reasoning. This article re-imagines introductory statistics as an introduction to data science and proposes a sequence of 10 blocks that together compose a suitable course for extracting information from contemporary data. Recent extensions to the mosaic packages for R together with tools from the “tidyverse” provide a concise and readable notation for wrangling, visualization, model-building, and model interpretation: the fundamental computational tasks of data science

    Models for Paired Comparison Data: A Review with Emphasis on Dependent Data

    Get PDF
    Thurstonian and Bradley-Terry models are the most commonly applied models in the analysis of paired comparison data. Since their introduction, numerous developments have been proposed in different areas. This paper provides an updated overview of these extensions, including how to account for object- and subject-specific covariates and how to deal with ordinal paired comparison data. Special emphasis is given to models for dependent comparisons. Although these models are more realistic, their use is complicated by numerical difficulties. We therefore concentrate on implementation issues. In particular, a pairwise likelihood approach is explored for models for dependent paired comparison data, and a simulation study is carried out to compare the performance of maximum pairwise likelihood with other limited information estimation methods. The methodology is illustrated throughout using a real data set about university paired comparisons performed by students.Comment: Published in at http://dx.doi.org/10.1214/12-STS396 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Point Spread Functions in Identification of Astronomical Objects from Poisson Noised Image

    Get PDF
    This article deals with modeling of astronomical objects, which is one of the most fundamental topics in astronomical science. Introduction part is focused on problem description and used methods. Point Spread Function Modeling part deals with description of basic models used in astronomical photometry and further on introduction of more sophisticated models such as combinations of interference, turbulence, focusing, etc. This paper also contains a~way of objective function definition based on the knowledge of Poisson distributed noise, which is included in astronomical data. The proposed methods are further applied to real astronomical data

    Applying project-based learning to teach software analytics and best practices in data science

    Get PDF
    Due to recent industry needs, synergies between data science and software engineering are starting to be present in data science and engineering academic programs. Two synergies are: applying data science to manage the quality of the software (software analytics) and applying software engineering best practices in data science projects to ensure quality attributes such as maintainability and reproducibility. The lack of these synergies on academic programs have been argued to be an educational problem. Hence, it becomes necessary to explore how to teach software analytics and software engineering best practices in data science programs. In this context, we provide hands-on for conducting laboratories applying project-based learning in order to teach software analytics and software engineering best practices to data science students. We aim at improving the software engineering skills of data science students in order to produce software of higher quality by software analytics. We focus in two skills: following a process and software engineering best practices. We apply project-based learning as main teaching methodology to reach the intended outcomes. This teaching experience shows the introduction of project-based learning in a laboratory, where students applied data science and best software engineering practices to analyze and detect improvements in software quality. We carried out a case study in two academic semesters with 63 data science bachelor students. The students found the synergies of the project positive for their learning. In the project, they highlighted both utility of using a CRISP-DM data mining process and best software engineering practices like a software project structure convention applied to a data science project.This paper was partly funded by a teaching innovation project of ICE@UPC-BarcelonaTech (entitled ‘‘Audiovisual and digital material for data engineering, a teaching innovation project with open science’’), and the ‘‘Beatriz Galindo’’ Spanish Program BEA-GAL18/00064.Peer ReviewedPostprint (published version

    CS/MTH 316/516: Numerical Methods for Digital Computers

    Get PDF
    Introduction to numerical methods used in the sciences. Methods of interpolation, data smoothing, functional approximation, numerical differentiation and integration. Solution techniques for linear and nonlinear equations. Discussion of sources of error in numerical methods. Applications of interest to engineering, science, and applied mathematics students are an integral part of the course. Special topics presented as schedule permits

    CS/MTH 316/516: Survey of Numerical Methods for Computational Science

    Get PDF
    Introduction to numerical methods used in the sciences and engineering. Included will be methods for interpolation, data smoothing, integration, differentiation, and solution of systems of linear and nonlinear equations. Discussion of sources of error in numerical methods. Applications to science, engineering and applied mathematics are an integral part of the course. Special topics presented as schedule permits. Four hours lecture

    A new kind of first year physics prac

    Get PDF
    In 2008 the first year units of the restructured SC01 applied science program were introduced. One of these new units is SCB110 Science Concepts and Global Systems, which covers a very broad range of subjects such as the history of philosophy, geology, physics, climate change etc. A practical exercise to compliment the physics component of the course was required. The purpose of this practical assignment was to introduce students to some fundamental aspects of experimental science that included topics such as hypothesis testing, statistics, experimental error, referencing, visual display of results etc. Due to logistical problems of providing a single hands-on physics experiment for approximately 250 students, an activity was devised which involved students viewing a 14-minute video of an experiment to measure the speed of light in two blocks of glass using a laser beam. The video was placed on Blackboard in Podcast and QuickTime formats and a DVD was placed in the Gardens Point and Carseldine libraries. Students were given a document (via Blackboard) which included data collected from the experiment shown in the video and instructions on how to analyse the data and write a short scientific report with a 200-word abstract, introduction, method, results, discussion and reference sections. The document also included a photograph and diagram of the experimental set-up for students to include in their reports. Students were required to work in teams of two or three although an allowance was made for students who couldn’t manage to find a partner or who were intent on writing the report on their own. A survey was included in the document on Blackboard. Of the 242 enrolled students, 191 (78.3%) performed the prac and 112 (58.6%) of these students responded to the survey. The survey results are presented in this poster
    corecore