135 research outputs found

    Statistical models in prognostic modelling with many skewed variables and missing data: a case study in breast cancer

    Get PDF
    Prognostic models have clinical appeal to aid therapeutic decision making. In the UK, the Nottingham Prognostic Index (NPI) has been used, for over two decades, to inform patient management. However, it has been commented that NPI is not capable of identifying a subgroup of patients with a prognosis so good that adjuvant therapy with potential harmful side effects can be withheld safely. Tissue Microarray Analysis (TMA) now makes possible measurement of biological tissue microarray features of frozen biopsies from breast cancer tumours. These give an insight to the biology of tumour and hence could have the potential to enhance prognostic modelling. I therefore wished to investigate whether biomarkers can add value to clinical predictors to provide improved prognostic stratification in terms of Recurrence Free Survival (RFS). However, there are very many biomarkers that could be measured, they usually exhibit skewed distribution and missing values are common. The statistical issues raised are thus number of variables being tested, form of the association, imputation of missing data, and assessment of the stability and internal validity of the model. Therefore the specific aim of this study was to develop and to demonstrate performance of statistical modelling techniques that will be useful in circumstances where there is a surfeit of explanatory variables and missing data; in particular to achieve useful and parsimonious models while guarding against instability and overfitting. I also sought to identify a subgroup of patients with a prognosis so good that a decision can be made to avoid adjuvant therapy. I aimed to provide statistically robust answers to a set of clinical question and develop strategies to be used in such data sets that would be useful and acceptable to clinicians. A unique data set of 401 Estrogen Receptor positive (ER+) tamoxifen treated breast cancer patients with measurement for a large panel of biomarkers (72 in total) was available. Taking a statistical approach, I applied a multi-faceted screening process to select a limited set of potentially informative variables and to detect the appropriate form of the association, followed by multiple imputations of missing data and bootstrapping. In comparison with the NPI, the final joint model derived assigned patients into more appropriate risk groups (14% of recurred and 4% of non-recurred cases). The actuarial 7-year RFS rate for patients in the lowest risk quartile was 95% (95% C.I.: 89%, 100%). To evaluate an alternative approach, biological knowledge was incorporated into the process of model development. Model building began with the use of biological expertise to divide the variables into substantive biomarker sets on the basis of presumed role in the pathway to cancer progression. For each biomarker family, an informative and parsimonious index was generated by combining family variables, to be offered to the final model as intermediate predictor. In comparison with NPI, patients into more appropriate risk groups (21% of recurred and 11% of non-recurred patients). This model identified a low-risk group with 7-year RFS rate at 98% (95% C.I.: 96%, 100%)

    Clinical Environment Assessment Based on DREEM Model from the Viewpoint of Interns and Residents of Hospitals Affiliated with Kerman University of Medical Sciences, Iran

    Get PDF
    Background & Objective: Clinical environments have a crucial role on medical students' training. Thus, the aim of this study was to assess clinical environments based on the (Dundee Ready Education Environment Measure) DREEM model from the viewpoint of interns and residents in hospitals affiliated with Kerman University of Medical Sciences, Iran, in 2012. Methods: This was a descriptive-analytic study. The data collection tool was the DREEM Questionnaire with 50 questions (5-point Likert scale) in the 5 domains of learning, teachers, educational environment, student's academic self-perceptions, and student's social self-perceptions. The study environment consisted of 4 main wards (internal, surgical, pediatrics, and gynecology) of hospitals affiliated with Kerman University of Medical Sciences. The study subjects consisted of 63 interns and 73 residents. Data was analyzed in SPSS software using Students' t-test and ANOVA. Results: Mean score of perception of educational environment in interns was 161.17 ± 22.30 and in residents was 157.45 ± 21.14. The comparison of different areas of clinical environment evaluation only showed a significant difference between the two groups in the area of student's social self-perceptions (P < 0.05). The interns' score was higher than that of the residents. No significant differences were observed between hospitals and the studied wards. Conclusion: The students' perceptions of their educational environment in clinical wards were desirable. Despite different literature's recommendation of using DREEM in order to evaluate weaknesses and strengths of clinical environments, the concurrent use of other methods and instruments for the assessment of the efficacy of this questionnaire is recommended. Key Words: DREEM model, Assessment, Residents, Interns, Ira

    A Guide to Selecting the Appropriate Statistical Tests for Proposals and Articles in Medical Sciences

    Get PDF
    Background & Objective: The main purpose of medical researches is to answer a research question or to solve a problem to promote the health of a society. The first objective is to answer the research question correctly with minimal errors. The second objective is the publication of the results in order to generalize them to a population and use in a wider dimension. To achieve these objectives, using biostatistics is necessary. Despite the importance of biostatistics in medical research, researchers have limited understanding of it or due to its complications they refrain from its use. Statistics help the researcher in different levels of research including writing a proposal and interpretation of other papers. Moreover, biostatisticians and epidemiologists also play a very important role in the preparation of manuscripts for publication. The present article has eloquently described the most important statistical tests in medical research with applied examples. Keywords Selecting statistical tests Parametric tests Non-parametric test

    Estimation of the Active Network Size of Kermanian Males

    Get PDF
    Background: Estimation of the size of hidden and hard-to-reach sub-populations, such as drug-abusers, is a very important but difficult task. Network scale up (NSU) is one of the indirect size estimation techniques, which relies on the frequency of people belonging to a sub-population of interest among the social network of a random sample of the general population. In this study, we estimated the social network size of Kermanian males (C) as one of the main prerequisites for using NSU. Methods: A 500 random sample of Kermanian males between 18 and 45 years old were interviewed. We asked the size of their active networks using direct questions. In addition, we received the frequency of six names from the vital registry office among Kermanian males, and we estimated C indirectly using the received frequencies and the frequency of these names among the networks of our sample. Findings: Although different methods showed quite different Cs between 100 and 350, the best estimation for C was 303, which means that on average each Kermanian male knows around 303 males between the age range of 18 and 45 years. The estimated C did not have any strong association with the demographic variables of our subjects. Conclusion: Using the estimated C we may use the NSU technique to assess the frequency of many important hidden sub-populations such as drug-abusers and those who have sexual contact with men and women. Keywords: Size estimation, Social network, Networking, Addiction, Hidden population, Hard to reach population

    Application of Random Forest Survival Models to Increase Generalizability of Decision Trees: A Case Study in Acute Myocardial Infarction

    Get PDF
    Background. Tree models provide easily interpretable prognostic tool, but instable results. Two approaches to enhance the generalizability of the results are pruning and random survival forest (RSF). The aim of this study is to assess the generalizability of saturated tree (ST), pruned tree (PT), and RSF. Methods. Data of 607 patients was randomly divided into training and test set applying 10-fold cross-validation. Using training sets, all three models were applied. Using Log-Rank test, ST was constructed by searching for optimal cutoffs. PT was selected plotting error rate versus minimum sample size in terminal nodes. In construction of RSF, 1000 bootstrap samples were drawn from the training set. C-index and integrated Brier score (IBS) statistic were used to compare models. Results. ST provides the most overoptimized statistics. Mean difference between C-index in training and test set was 0.237. Corresponding figure in PT and RSF was 0.054 and 0.007. In terms of IBS, the difference was 0.136 in ST, 0.021 in PT, and 0.0003 in RSF. Conclusion. Pruning of tree and assessment of its performance of a test set partially improve the generalizability of decision trees. RSF provides results that are highly generalizable

    Pattern of Alcohol Consumption among Men Consumers in Kerman, Iran, in 2014

    Get PDF
    Background: Alcohol consumption is a potential risk factor with acute and chronic health consequences and social impacts, which is more prominent among men. There is no precise statistics on the scope of alcohol consumption in Iran; however, there is some evidences showing an increasing trend, particularly among young generation. In order to evaluate the scope of this issue in Kerman, a large city in the south-east of Iran, this exploratory study was designed to approach a group of people having an experience of alcohol use.Methods: Samples were recruited to the study using a snowball sampling. 200 eligible subjects were questioned about the type of alcohol consumed, frequency of use, and other factors associated with alcohol consumption. In order to maximize the validity of responses, data were collected through self-administered questionnaires.Findings: The main alcoholic drinks consumed by individuals were the homemade distillates (46%), wine (22%), beer (14%), distilled spirits (11%), and medical alcohol (7%), respectively. The majority of individuals participating in the study (73%) used mostly homemade drinks; moreover, 63%, 26%, 9%, and 2% of subjects took monthly or less, two to four times a month, two to three times a week, and at least four times a week, respectively. Only 2% of the subjects were heavy consumers of alcoholic beverages.Conclusion: Due to the lack of control over homemade alcoholic beverages, its high levels can be a huge potential risk. Furthermore, it seems that both factors of access and price to be very effective in the amount of alcoholics taken by individuals. Therefore, further studies in this area will help to reduce the harm caused by alcohol consumption

    Estimating the Visibility Rate of Alcohol Consumption: A Case Study in Shiraz, Iran

    Get PDF
    Background: Network Scale Up (NSU) is applied in many settings to estimate the size of hidden populations.The visibility of alcohol consumption - as a hidden behavior - in Iran has not been yet set. Our aim is to estimatethe visibility factor (VF) of alcohol consumption in Iran which is an Islamic country in the Middle East.Methods: Ninety persons who had a history of alcohol consumption were recruited. Relationships in networkwere aligned in three main subgroups: immediate family, extended family, and non-family. According to thegame of contact methodology, participants answered questions about total and aware number of personsthey know in each relationship category. VF was calculated by dividing total number of people aware aboutthe respondent’s alcohol consumption by total number of respondent’s social network. The 95% confidenceintervals (CIs) were computed through bootstrapping.Findings: The mean and standard deviation (SD) of participants’ age was 32.9 ± 10.2, the sex ratio was 3.Overall VF (95% CI) was 40% (33% to 47%). VF was estimated at 44% and 23% among men and women’snetwork, respectively. The immediate family was the highest informed group, followed by non-family andextended family members.Conclusion: The visibility of alcohol consumption in Iran was not high. This is due to religious and legalprohibitions around i

    Evaluating Occupational Exposure of Workers for Metallurgy with Alkanol Amines

    Get PDF
    Liquids being used in metallurgy are a composition of dangerous chemicals including Alkanol amines. Alkanol amines include Mono-, Di- and 3- ethanol amine. Alkanol amines are used as lubricant in metallurgy. Dermal absorption of these chemical substances is so important and some studies are being done about carcinogenesis of these chemical substances. Meanwhile, ethanol amine has been recognized as a factor causing occupational asthma. The present study was done on 29 turnery and rolling workers in Cupper Industrial Complex of Sarcheshmeh in descriptive- sectional manner. Data related to concentration of Alkanol amines in the atmosphere were gathered with the method proposed by NIOSH and data for pulmonary function were extracted from spirometry experiments. Demographic data were obtained from medical files of the workers. Statistical tests were carried out using software SPSS. In this study, workers' Time Weighted Average (TWA) individual exposure to Mono-ethanol amine (MEA) with density scope 0.03- 1.16, exposure to Di-ethanol amine (DEA) with density scope 0.36-1.35 and exposure to TEA with density scope 0.49-1.28 equal 0.54, 0.87 and 0.85 mg/m3 respectively without occupational group separation for each. Also, FVC reduction in studied individuals without occupational group separation was 3.17% (SD= 6.55%). The results indicated that workers' Time Weighted Average exposure to Mono-Di-Tri- ethanol amine was lower than occupational legal limit. In rolling process, exposure to Alkanol amines is lower compared to other processes of metallurgy because of semi- enclosure of this process. Having done Pearson correlation test to determine relation between individuals' work experience and FVC reduction, it was observed that there is no meaningful relation between these two variables

    Evaluating Occupational Exposure of Workers for Metallurgy with Alkanol Amines

    Get PDF
    Liquids being used in metallurgy are a composition of dangerous chemicals including Alkanol amines. Alkanol amines include Mono-, Di- and 3- ethanol amine. Alkanol amines are used as lubricant in metallurgy. Dermal absorption of these chemical substances is so important and some studies are being done about carcinogenesis of these chemical substances. Meanwhile, ethanol amine has been recognized as a factor causing occupational asthma. The present study was done on 29 turnery and rolling workers in Cupper Industrial Complex of Sarcheshmeh in descriptive- sectional manner. Data related to concentration of Alkanol amines in the atmosphere were gathered with the method proposed by NIOSH and data for pulmonary function were extracted from spirometry experiments. Demographic data were obtained from medical files of the workers. Statistical tests were carried out using software SPSS. In this study, workers' Time Weighted Average (TWA) individual exposure to Mono-ethanol amine (MEA) with density scope 0.03- 1.16, exposure to Di-ethanol amine (DEA) with density scope 0.36-1.35 and exposure to TEA with density scope 0.49-1.28 equal 0.54, 0.87 and 0.85 mg/m3 respectively without occupational group separation for each. Also, FVC reduction in studied individuals without occupational group separation was 3.17% (SD= 6.55%). The results indicated that workers' Time Weighted Average exposure to Mono-Di-Tri- ethanol amine was lower than occupational legal limit. In rolling process, exposure to Alkanol amines is lower compared to other processes of metallurgy because of semi- enclosure of this process. Having done Pearson correlation test to determine relation between individuals' work experience and FVC reduction, it was observed that there is no meaningful relation between these two variables
    • …
    corecore