1,564 research outputs found

    The SIKS/BiGGrid Big Data Tutorial

    Get PDF
    The School for Information and Knowledge Systems SIKS and the Dutch e-science grid BiG Grid organized a new two-day tutorial on Big Data at the University of Twente on 30 November and 1 December 2011, just preceding the Dutch-Belgian Database Day. The tutorial is on top of some exciting new developments in large-scale data processing and data centers, initiated by Google, and followed by many others such as Yahoo, Amazon, Microsoft, and Facebook. The course teaches how to process terabytes of data on large clusters, and discusses several core computer science topics adapted for big data, such as new file systems (Google File System and Hadoop FS), new programming paradigms (MapReduce), new programming languages and query languages (Sawzall, Pig Latin), and new 'noSQL' databases (BigTable, Cassandra and Dynamo)

    Multi-model Data Management : What's New and What's Next?

    Get PDF
    TutorialAs more businesses realized that data, in all forms and sizes, is critical to making the best possible decisions, we see the continued growth of systems that support massive volume of non-relational or unstructured forms of data. Nothing shows the picture more starkly than the Gartner Magic quadrant for operational database management systems, which assumes that, by 2017, all leading operational DBMSs will offer multiple data models, relational and NoSQL, in a single DBMS platform. Having a single data platform for managing both well-structured data and NoSQL data is beneficial to users; this approach reduces significantly integration, migration, development, maintenance, and operational issues. Therefore, a challenging research work is how to develop efficient consolidated single data management platform covering both relational data and NoSQL to reduce integration issues, simplify operations, and eliminate migration issues. In this tutorial, we review the previous work on multi-model data management and provide the insights on the research challenges and directions for future work. The slides and more materials of this tutorial can be found at http://udbms.cs.helsinki.fi/?tutorials/edbt2017.Peer reviewe

    Navigating Workload Compatibility Between a Recommender System and a NoSQL Database: An Interactive Tutorial

    Get PDF
    In this tutorial, the issue of compatibility between a big data storage technology and an analytic workload is explored using a fictitious streaming company as an example. The tutorial offers an interactive approach to help students understand the importance of considering workload compatibility when adopting new technologies. We provide instructors with two Jupyter Notebooks that analyze the compatibility, a detailed instructor guide on how to execute these notebooks, lessons learned, and appendices containing solutions and explanations. This tutorial provides a valuable resource for instructors teaching courses in database systems, big data, and analytic concepts, helping students develop practical skills to navigate the complexities of big data technologies effectively

    Teaching Big Data Management – An Active Learning Approach for Higher Education

    Get PDF
    Since big data analytics has become an imperative for business success in the digital economy, universities face the challenge to train data scientists and data engineers on various technological and managerial skills. In addition to traditional lectures, active learning formats ensure a practice-oriented education enabling students to handle novel big data technologies. In this paper, we present a big data management syllabus for master students in the field of big data analytics, which includes various hands-on and action learning elements. The course encompasses seven lectures and nine tutorials and takes place at Chemnitz University of Technology. It covers a broad range of big data applications and facilitates knowledge on various cognitive levels. The paper gives an overview of the course content and assigns learning objectives to lectures and tutorials using Krathwohl’s revised taxonomy. Finally, we present the feedback, which we have received by the students over the years

    Teaching Tip: Teaching NoSQL Databases in a Database Course for Business Students

    Get PDF
    NoSQL databases have been used in organizations for decades. Few database textbooks on the market, however, have suitable materials about NoSQL beyond general introductions for typical business students. In fact, users of the typical NoSQL systems on the software market need to have certain computer programming skills. This teaching tip introduces a small unit on NoSQL databases in a traditional database course for students in all business majors. The unit uses a Microsoft Excel-based NoSQL database example to explain the basis of NoSQL, describes the four essential types of NoSQL databases, and discusses representative NoSQL database management systems on the software market. As this unit does not require computer programming skills, it can be easily integrated into an existing relational database course for business students. The unit was tested twice. Students have demonstrated positive first-hand practice experiences of NoSQL beyond general concepts of NoSQL

    Integrating NoSQL in the Classroom

    Get PDF
    With the increasing popularity of big data, more and more organizations are turning to NoSQL databases as their preferred system for handling the unique demands of capturing and storing massive amounts of data. The likelihood that employees in all sizes of organizations will encounter NoSQL databases is growing every year. College students need to be exposed to this technology and begin to have a functional understanding of how it works and how to use it. This paper offers a teaching case for college instructors to integrate NoSQL into their existing database courses

    Designing a Modern Software Engineering Training Program with Cloud Computing

    Get PDF
    The software engineering industry is trending towards cloud computing. For our project, we assessed the various tools and practices used in modern software development. The main goals of this project were to create a reference model for developing cloud-based applications, to program a functional cloud-based prototype, and to develop an accompanying training manual. These materials will be incorporated into the software engineering courses at WPI, namely CS 3733 and CS 509
    • …
    corecore