14 research outputs found

    AsterixDB: A Scalable, Open Source BDMS

    Full text link
    AsterixDB is a new, full-function BDMS (Big Data Management System) with a feature set that distinguishes it from other platforms in today's open source Big Data ecosystem. Its features make it well-suited to applications like web data warehousing, social data storage and analysis, and other use cases related to Big Data. AsterixDB has a flexible NoSQL style data model; a query language that supports a wide range of queries; a scalable runtime; partitioned, LSM-based data storage and indexing (including B+-tree, R-tree, and text indexes); support for external as well as natively stored data; a rich set of built-in types; support for fuzzy, spatial, and temporal types and queries; a built-in notion of data feeds for ingestion of data; and transaction support akin to that of a NoSQL store. Development of AsterixDB began in 2009 and led to a mid-2013 initial open source release. This paper is the first complete description of the resulting open source AsterixDB system. Covered herein are the system's data model, its query language, and its software architecture. Also included are a summary of the current status of the project and a first glimpse into how AsterixDB performs when compared to alternative technologies, including a parallel relational DBMS, a popular NoSQL store, and a popular Hadoop-based SQL data analytics platform, for things that both technologies can do. Also included is a brief description of some initial trials that the system has undergone and the lessons learned (and plans laid) based on those early "customer" engagements

    Do Guest Workers Have an Impact on The Wages of Native Workers in Saudi Arabia?

    No full text
    Saudi Arabia, like many other countries that have a shortage in their labor force, brought temporary guest workers from different countries to work mostly in the private sector

    Spatial indexing in the Era of Social Media

    No full text
    The rapid adoption of smart phones and the social media boom has increased the interest in location-based services. A new set of applications and popular online services that utilize users' locations have been created, and many ordinary people are increasingly interacting with these services on a daily basis through their smart phones, tablets, cameras, etc., where most of those gadgets come equipped with GPS sensors. The new complex features provided by those applications and the scale of the massive data handled by them impose new and interesting challenges for spatial databases. In this thesis, we present spatial indexing and query processing techniques in response to some of these challenges. First, we study how to support approximate keyword search on spatial data. There are many popular websites that support keyword search on their spatial data, such as business listings and photos. In these systems, users may experience difficulties finding the entities they are looking for if they do not know their exact spelling, such as the name of a restaurant. We develop three algorithms for constructing a specialized index that can answer location- based approximate keyword queries, successively improving the time and space efficiency by exploiting the textual and spatial properties of the data. We experimentally demonstrate the efficiency of our techniques on real, large datasets. Second, we introduce a framework for converting an in-place update, disk-based data structure to a deferred-update, append-only data structure. We show that converting an R-tree index (and other non-totally ordered index) to an LSM index is non-trivial if the resultant index is expected to have performant read and write operations. Our framework enables the "LSM-ification" of any kind of index structure that supports certain primitive operations, enabling the index to ingest data efficiently. We have implemented our framework in the context of the AsterixDB system as a way to extend both the R-tree and the inverted keyword index to LSM-based indexes. Our results have shown that using an LSM-based version of the R-tree can significantly outperform its conventional counterpart for both ingestion and query speed. Third, we study how to optimize the performance of query workloads that favor recent data. There are many use cases where users of a database system are mostly interested in querying recent data. We propose a solution that exploits the natural partitioning property that LSM-based indexes provide for its components, allowing us to filter out many components when answering queries. Our solution is generalizable to any LSM-based index structure including LSM R-trees, and has been implemented in the context of the AsterixDB system. Our experiments show that we can reduce query times by up to 99% for selective range predicates

    Attitudes and habits of patients with type 1 diabetes during fasting Ramadan

    No full text
    Objective: Fasting Ramadan is associated with changes in lifestyle patterns of patients with diabetes who choose to perform fasting. We aimed to determine the attitude and habits of patients with type 1 diabetes during fasting Ramadan. Methods: The study comprised a prospective cohort of patients with type 1 diabetes who were on insulin pump or multiple daily insulin injections (MDI) regimen. Patient questionnaires included the frequency of self-monitoring of blood glucose (SMBG), the need to make changes in insulin regimen by patients, timings of insulin administration, performing carbohydrate counting and levels of physical activity. Results: A total of 156 patients were studied (61 patients on insulin pump and 95 patients on MDI). Patients on pump therapy performed SMBG more frequently than those on MDI regimen (4.8 ± 1.4 and 3.7 ± 1.7 times per day, respectively, P = 0.001) and were more likely to perform carbohydrate counting (32.7% and 8.4% of pump and MDI patients, respectively, P < 0.001). There was no difference in the percentage of patients who made changes in insulin doses (74.5% of the pump group and 77.3% of MDI patients) or those who had any level of physical activity (12.5% of the pump group and 21.1% of the MDI group). The timing of administering meal insulin in relation to sunset meal was variable with a preference to taking the injection immediately at sunset. There was no difference in glucose control between both groups as measured by frucotsamine levels or the number of days that patients have to stop fasting. Conclusion: Fasting Ramadan is associated with significant and variable changes in the attitude and behaviors of patients with type 1 diabetes with no difference in glucose control between patients on insulin pump or MDI regimen. Further studies are needed to define the role of education and its effect on these attitudes and patient care in this population.ClinicalTrials.gov Identifier: NCT01941238
    corecore