278 research outputs found

    Integrating R and Hadoop for Big Data Analysis

    Get PDF
    Analyzing and working with big data could be very diffi cult using classical means like relational database management systems or desktop software packages for statistics and visualization. Instead, big data requires large clusters with hundreds or even thousands of computing nodes. Offi cial statistics is increasingly considering big data for deriving new statistics because big data sources could produce more relevant and timely statistics than traditional sources. One of the software tools successfully and wide spread used for storage and processing of big data sets on clusters of commodity hardware is Hadoop. Hadoop framework contains libraries, a distributed fi le-system (HDFS), a resource-management platform and implements a version of the MapReduce programming model for large scale data processing. In this paper we investigate the possibilities of integrating Hadoop with R which is a popular software used for statistical computing and data visualization. We present three ways of integrating them: R with Streaming, Rhipe and RHadoop and we emphasize the advantages and disadvantages of each solution.Comment: Romanian Statistical Review no. 2 / 201

    Non-stationary iterative methods for solving macroeconomic numeric models

    Get PDF
    Macroeconometric modeling was influenced by the development of new and efficient computational techniques. Rational Expectations models, a particular class of macroeconometric models, give raise to very large systems of equations, the solution of which requires heavy computations. Therefore, such models are an interesting testing ground for the numerical methods addressed in this research. The most difficult problem is to obtain the solution of the linear system that arises during the Newton step. As an alternative to the direct methods, we propose non-stationary iterative methods, also called Krylov methods, to solve these models. Numerical experiments conducted by authors confirm the interesting features of these methods: low computational complexity and storage requirements

    USING SIMULTANEOUS EQUATIONS MODELS TO ANALYZE THE CAUSES OF CORRUPTION AND ITS IMPLICATIONS

    Get PDF
    For a country that is in the process of integration into EU structures, reducing the corruption is a good sign for attracting foreign investment and developing the economic environment. The paper estimates the parameters of a simultaneous equation model based on data sets obtained at a sample of employees in public administration. Statistical sample consist in 407 people and the maximum allowable error was estimated at ± 2.5%. For the effective development of the statistical questionnaire we identified major themes of public administration that are directly related to the problem of corruption: managing the institution, the civil service, transparency in the system, the decentralization process, causes and effects of corruption and the quality of the reform in the public administration. Based on the questionnaire we defined primary and secondary variables that have been used to define the model with simultaneous equations. For the variables in the model they have been divided into endogenous and exogenous

    A Comparative Study of Some Features of Higher Education in Romania, Bulgaria and Hungary

    Get PDF
    The transition from the centralized economy to the market economy has determined important changes in higher education in the countries of the former socialist bloc. In a relatively short period an impressive growth in the number of students has been recorded without taking into account the evolution of the number of pupils in high school and demographic phenomena, such as increased migration, reduced birth rate, and so on. In a medium and long time horizon the current increases in the number of students cannot be sustained. For the analysis of the data series we used a number of econometric techniques.regression models, cointegration, higher education statistics, education expenditures, ADF, unit roots

    Decomposition of the Inequality of Income Distribution by Income Types - Application for Romania

    Full text link
    This paper identifies the salient factors that characterize the inequality income distribution for Romania. Data analysis is rigorously carried out using sophisticated techniques borrowed from classical statistics (Theil). Decomposition of the inequalities measured by the Theil index is also performed. This study relies on an exhaustive (11.1 million records for 2014) data-set for total personal gross income of Romanian citizens.Comment: 12 pages, 5 figures, 4 tables, 49 reference
    • 

    corecore