14 research outputs found
SegRap2023: A Benchmark of Organs-at-Risk and Gross Tumor Volume Segmentation for Radiotherapy Planning of Nasopharyngeal Carcinoma
Radiation therapy is a primary and effective NasoPharyngeal Carcinoma (NPC)
treatment strategy. The precise delineation of Gross Tumor Volumes (GTVs) and
Organs-At-Risk (OARs) is crucial in radiation treatment, directly impacting
patient prognosis. Previously, the delineation of GTVs and OARs was performed
by experienced radiation oncologists. Recently, deep learning has achieved
promising results in many medical image segmentation tasks. However, for NPC
OARs and GTVs segmentation, few public datasets are available for model
development and evaluation. To alleviate this problem, the SegRap2023 challenge
was organized in conjunction with MICCAI2023 and presented a large-scale
benchmark for OAR and GTV segmentation with 400 Computed Tomography (CT) scans
from 200 NPC patients, each with a pair of pre-aligned non-contrast and
contrast-enhanced CT scans. The challenge's goal was to segment 45 OARs and 2
GTVs from the paired CT scans. In this paper, we detail the challenge and
analyze the solutions of all participants. The average Dice similarity
coefficient scores for all submissions ranged from 76.68\% to 86.70\%, and
70.42\% to 73.44\% for OARs and GTVs, respectively. We conclude that the
segmentation of large-size OARs is well-addressed, and more efforts are needed
for GTVs and small-size or thin-structure OARs. The benchmark will remain
publicly available here: https://segrap2023.grand-challenge.orgComment: A challenge report of SegRap2023 (organized in conjunction with
MICCAI2023
Accuracy of Patient Health Questionnaire-9 (PHQ-9) for screening to detect major depression: individual participant data meta-analysis
Objective: To determine the accuracy of the Patient Health Questionnaire-9 (PHQ-9) for screening to detect major depression.
Design: Individual participant data meta-analysis.
Data sources: Medline, Medline In-Process and Other Non-Indexed Citations, PsycINFO, and Web of Science (January 2000-February 2015).
Inclusion criteria: Eligible studies compared PHQ-9 scores with major depression diagnoses from validated diagnostic interviews. Primary study data and study level data extracted from primary reports were synthesized. For PHQ-9 cut-off scores 5-15, bivariate random effects meta-analysis was used to estimate pooled sensitivity and specificity, separately, among studies that used semistructured diagnostic interviews, which are designed for administration by clinicians; fully structured interviews, which are designed for lay administration; and the Mini International Neuropsychiatric (MINI) diagnostic interviews, a brief fully structured interview. Sensitivity and specificity were examined among participant subgroups and, separately, using meta-regression, considering all subgroup variables in a single model.
Results: Data were obtained for 58 of 72 eligible studies (total n=17 357; major depression cases n=2312). Combined sensitivity and specificity was maximized at a cut-off score of 10 or above among studies using a semistructured interview (29 studies, 6725 participants; sensitivity 0.88, 95% confidence interval 0.83 to 0.92; specificity 0.85, 0.82 to 0.88). Across cut-off scores 5-15, sensitivity with semistructured interviews was 5-22% higher than for fully structured interviews (MINI excluded; 14 studies, 7680 participants) and 2-15% higher than for the MINI (15 studies, 2952 participants). Specificity was similar across diagnostic interviews. The PHQ-9 seems to be similarly sensitive but may be less specific for younger patients than for older patients; a cut-off score of 10 or above can be used regardless of age..
Conclusions: PHQ-9 sensitivity compared with semistructured diagnostic interviews was greater than in previous conventional meta-analyses that combined reference standards. A cut-off score of 10 or above maximized combined sensitivity and specificity overall and for subgroups.
Registration: PROSPERO CRD42014010673
Discrimination in Singapore job advertisements.
Singapore is a culturally rich society. At first glance, diverse racial and nationality groups in this city state seem harmonious. On closer examination, we realise that job market discrimination exists. While there have been many comments and debates about discrimination in the job market, few studies have been done to probe deeper into whether there is an existence of discrimination in Singapore labour market. The aim of this paper is to look at employers’ hiring patterns in advertisements which could imply discrimination. The job-advertisement methodology was chosen because employers in Singapore are not strictly governed by anti-discrimination legislation, Instead, they are only required to adhere to the non-mandatory Tripartite Guidelines, whose basic purpose is to ensure non-discriminatory job advertisements. Thus, employers are not encouraged to specify for certain requirements like gender, race or age. Without such legislations, the Tripartite Guidelines allow employers to ask for certain things such as photographs, language proficiency or state preferences for Singaporean nationality. Given such freely usage of requirements in job advertisements, the adoption of the job-advertisement approach in this study will enable us to detect discrimination at the initial stage of the hiring process. Job advertisements from an online job portal were collected over a span of one week as our sample. In the first model, we assessed the association between specification for both Mandarin and photograph in job advertisements. The intuition behind the first model is that when employers ask for Mandarin, it may be due to business needs where employees are required to liaise with Chinese clientele, or it may simply be employers’ desire to recruit Chinese candidates. In the latter case, there should be evidence that employers are more likely to ask for photograph when they ask for Mandarin to filter out non-Chinese candidates. Our results prove that there is relationship between Mandarin and photograph specification in job advertisements even after taking into account confounders. In the second model, we looked at which job industries have less preference for Singaporeans. The rationale behind this model is to determine whether the non-specification for Singaporeans in the job advertisements is due to the shortage of local labour or lack of foreign talents, or simply because employers do not favour Singaporeans.Bachelor of Art
Erratum: Equivalency of the diagnostic accuracy of the PHQ-8 and PHQ-9: A systematic review and individual participant data meta-analysis (Journal of Physical Chemistry (2019) DOI: 10.1017/S0033291719001314)
This article was published in Psychological Medicine with incorrect author information. MD Inagaki should be Masatoshi Inagaki and MD Stafford should be Lesley Stafford. This has since been updated as per the above
Equivalency of the diagnostic accuracy of the PHQ-8 and PHQ-9 : a systematic review and individual participant data meta-analysis
BACKGROUND: Item 9 of the Patient Health Questionnaire-9 (PHQ-9) queries about thoughts of death and self-harm, but not suicidality. Although it is sometimes used to assess suicide risk, most positive responses are not associated with suicidality. The PHQ-8, which omits Item 9, is thus increasingly used in research. We assessed equivalency of total score correlations and the diagnostic accuracy to detect major depression of the PHQ-8 and PHQ-9. METHODS: We conducted an individual patient data meta-analysis. We fit bivariate random-effects models to assess diagnostic accuracy. RESULTS: 16 742 participants (2097 major depression cases) from 54 studies were included. The correlation between PHQ-8 and PHQ-9 scores was 0.996 (95% confidence interval 0.996 to 0.996). The standard cutoff score of 10 for the PHQ-9 maximized sensitivity + specificity for the PHQ-8 among studies that used a semi-structured diagnostic interview reference standard (N = 27). At cutoff 10, the PHQ-8 was less sensitive by 0.02 (-0.06 to 0.00) and more specific by 0.01 (0.00 to 0.01) among those studies (N = 27), with similar results for studies that used other types of interviews (N = 27). For all 54 primary studies combined, across all cutoffs, the PHQ-8 was less sensitive than the PHQ-9 by 0.00 to 0.05 (0.03 at cutoff 10), and specificity was within 0.01 for all cutoffs (0.00 to 0.01). CONCLUSIONS: PHQ-8 and PHQ-9 total scores were similar. Sensitivity may be minimally reduced with the PHQ-8, but specificity is similar
External validation of a shortened screening tool using individual participant data meta-analysis: A case study of the Patient Health Questionnaire-Dep-4
Shortened versions of self-reported questionnaires may be used to reduce respondent burden. When shortened screening tools are used, it is desirable to maintain equivalent diagnostic accuracy to full-length forms. This manuscript presents a case study that illustrates how external data and individual participant data meta-analysis can be used to assess the equivalence in diagnostic accuracy between a shortened and full-length form. This case study compares the Patient Health Questionnaire-9 (PHQ-9) and a 4-item shortened version (PHQ-Dep-4) that was previously developed using optimal test assembly methods. Using a large database of 75 primary studies (34,698 participants, 3,392 major depression cases), we evaluated whether the PHQ-Dep-4 cutoff of ≥ 4 maintained equivalent diagnostic accuracy to a PHQ-9 cutoff of ≥ 10. Using this external validation dataset, a PHQ-Dep-4 cutoff of ≥ 4 maximized the sum of sensitivity and specificity, with a sensitivity of 0.88 (95% CI 0.81, 0.93), 0.68 (95% CI 0.56, 0.78), and 0.80 (95% CI 0.73, 0.85) for the semi-structured, fully structured, and MINI reference standard categories, respectively, and a specificity of 0.79 (95% CI 0.74, 0.83), 0.85 (95% CI 0.78, 0.90), and 0.83 (95% CI 0.80, 0.86) for the semi-structured, fully structured, and MINI reference standard categories, respectively. While equivalence with a PHQ-9 cutoff of ≥ 10 was not established, we found the sensitivity of the PHQ-Dep-4 to be non-inferior to that of the PHQ-9, and the specificity of the PHQ-Dep-4 to be marginally smaller than the PHQ-9