82 research outputs found

    Design and Evaluation of the Corpus of Everyday Japanese Conversation

    Affiliations: National Institute for Japanese Language and Linguistics; Graduate School of Humanities, Chiba University.
    We constructed the Corpus of Everyday Japanese Conversation (CEJC) and published it in March 2022. The CEJC is designed to contain various kinds of everyday conversations in a balanced manner to capture their diversity. The CEJC features not only audio but also video data to facilitate precise understanding of the mechanism of real-life social behavior. The publication of a large-scale corpus of everyday conversations that includes video data is a new approach. The CEJC contains 200 hours of speech, 577 conversations, about 2.4 million words, and a total of 1675 conversants. In this paper, we present an overview of the corpus, including the recording method and devices, structure of the corpus, formats of video and audio files, transcription, and annotations. We then report some results of the evaluation of the CEJC in terms of conversant and conversation attributes. We show that the CEJC includes a good balance of adult conversants in terms of gender and age, as well as a variety of conversations in terms of conversation forms, places, activities, and numbers of conversants. (Conference paper)

    DDBJ launches a new archive database with analytical tools for next-generation sequence data

    The DNA Data Bank of Japan (DDBJ) (http://www.ddbj.nig.ac.jp) collected and released 1 701 110 entries/1 116 138 614 bases between July 2008 and June 2009. Highlighted data releases from DDBJ included the complete genome sequence of an endosymbiont within protist cells in the termite gut and Cap Analysis Gene Expression tags for human and mouse deposited from the Functional Annotation of the Mammalian cDNA consortium. In this period, we started a novel user announcement service using Really Simple Syndication (RSS) to deliver a list of data released from DDBJ on a daily basis. Comprehensive visualization of DDBJ release data was attempted using a word cloud program. Moreover, a new archive for sequencing data from next-generation sequencers, the ‘DDBJ Read Archive’ (DRA), was launched. Concurrently, for read data registered in DRA, a semi-automatic annotation tool called the ‘DDBJ Read Annotation Pipeline’ was released as a preliminary step. The pipeline consists of two parts: basic analysis for reference genome mapping and de novo assembly, and high-level analysis of structural and functional annotations. These new services will aid users’ research and provide easier access to DDBJ databases.

    How many times can patients tolerate reoperation?

    The frequency of resection for the recurrence of colorectal cancer has not been investigated in previous studies. Likewise, the related postoperative complications and the limit for indicating surgical resection have not been reported. Herein, we report the complications of a highly frequent surgical approach for rectal cancer recurrence, i.e., exceeding three reoperations, based on our clinical experience. We included 15 cases exceeding two operations for the local recurrence of colorectal cancer from 2014 to 2019. We examined the postoperative complications classified as Clavien–Dindo IIIb. The complication rates were 0 (0.0%), 0 (0.0%), 2 (13.3%), 3 (37.5%), and 0 (0.0%) for the primary, 1st recurrent, 2nd recurrent, 3rd recurrent, and 4th recurrent operation groups, respectively (p = 0.027). It is important to exercise caution in handling cases exceeding two reoperations (exceeding three reoperations including the primary operation).

    Microbe-Specific C3b Deposition in the Horseshoe Crab Complement System in a C2/Factor B-Dependent or -Independent Manner

    Complement C3 plays an essential role in the opsonization of pathogens in the mammalian complement system, whereas the molecular mechanism underlying C3 activation in invertebrates remains unknown. To understand the molecular mechanism of C3b deposition on microbes, we characterized two types of C2/factor B homologs (designated TtC2/Bf-1 and TtC2/Bf-2) identified from the horseshoe crab Tachypleus tridentatus. Although the domain architectures of TtC2/Bf-1 and TtC2/Bf-2 were identical to those of mammalian homologs, they contained five-repeated and seven-repeated complement control protein domains at their N-terminal regions, respectively. TtC2/Bf-1 and TtC2/Bf-2 were synthesized and glycosylated in hemocytes and secreted to hemolymph plasma, where they existed in a complex with C3 (TtC3), and their activation by microbes was absolutely Mg2+-dependent. Flow cytometric analysis revealed that TtC3b deposition was Mg2+-dependent on Gram-positive bacteria or fungi, but not on Gram-negative bacteria. Moreover, this analysis demonstrated that Ca2+-dependent lectins (C-reactive protein-1 and tachylectin-5A) were required for TtC3b deposition on Gram-positive bacteria, and that a Ca2+-independent lectin (Tachypleus plasma lectin-1) was indispensable for TtC3b deposition on fungi. In contrast, a horseshoe crab lipopolysaccharide-sensitive protease, factor C, was necessary and sufficient to deposit TtC3b on Gram-negative bacteria. We conclude that plasma lectins and factor C play key roles in microbe-specific TtC3b deposition in a C2/factor B-dependent or -independent manner.

    Transcription Criteria and Preparation Methods for the Corpus of Everyday Japanese Conversation (『日本語日常会話コーパス』における転記の基準と作成手法)

    Affiliations: Adjunct Researcher, Spoken Language Division, Research Department, NINJAL; Doctoral Student, Chiba University / Adjunct Researcher, Spoken Language Division, Research Department, NINJAL; Adjunct Researcher, Center for Corpus Development, NINJAL; Center for Corpus Development, NINJAL; Spoken Language Division, Research Department, NINJAL.
    This paper describes the transcription criteria and preparation methods for the Corpus of Everyday Japanese Conversation, which has been under construction since fiscal year 2016 and will contain 200 hours of various types of naturally occurring everyday conversations in a balanced distribution. Everyday conversation is full of expressions that are extremely informal, hard to hear, or hard to understand, so clear transcription criteria are needed to ensure homogeneous quality across a large number of transcribers. Efficient working methods are also needed to transcribe as much as 200 hours of conversation within a limited period. In this project, we examined procedures for efficient work while transcribing actual conversation data, and developed tools and revised the transcription criteria along the way. This paper presents the resulting transcription criteria and the methods put in place to carry out the work efficiently.

    Analysis of gut microbiome, host genetics, and plasma metabolites reveals gut microbiome-host interactions in the Japanese population

    Interaction between the gut microbiome and host plays a key role in human health. Here, we perform a metagenome shotgun-sequencing-based analysis of Japanese participants to reveal associations between the gut microbiome, host genetics, and plasma metabolome. A genome-wide association study (GWAS) for microbial species (n = 524) identifies associations between the PDE1C gene locus and Bacteroides intestinalis and between TGIF2 and TGIF2-RAB5IF gene loci and Bacteroides acidifaciens. In a microbial gene ortholog GWAS, agaE and agaS, which are related to the metabolism of carbohydrates forming the blood group A antigen, are associated with blood group A in a manner depending on the secretor status determined by the East Asian-specific FUT2 variant. A microbiome-metabolome association analysis (n = 261) identifies associations between bile acids and microbial features such as bile acid metabolism gene orthologs including bai and 7β-hydroxysteroid dehydrogenase. Our publicly available data will be a useful resource for understanding gut microbiome-host interactions in an underrepresented population.
    Citation: Tomofuji Yoshihiko, Kishikawa Toshihiro, Sonehara Kyuto, et al. Analysis of gut microbiome, host genetics, and plasma metabolites reveals gut microbiome-host interactions in the Japanese population. Cell Reports 42, 113324 (2023); https://doi.org/10.1016/j.celrep.2023.113324

    Involvement of the Precuneus/Posterior Cingulate Cortex Is Significant for the Development of Alzheimer’s Disease: A PET (THK5351, PiB) and Resting fMRI Study

    Background: Imaging studies in Alzheimer’s disease (AD) have yet to answer the underlying questions concerning the relationship among tau retention, neuroinflammation, network disruption and cognitive decline. We compared the spatial retention patterns of 18F-THK5351 and resting state network (RSN) disruption in patients with early AD and healthy controls. Methods: We enrolled 23 11C-Pittsburgh compound B (PiB)-positive patients with early AD and 24 11C-PiB-negative participants as healthy controls. All participants underwent resting state functional MRI and 18F-THK5351 PET scans. We used scaled subprofile modeling/principal component analysis (SSM/PCA) to reduce the complexity of multivariate data and to identify patterns that exhibited the largest statistical effects (variances) in THK5351 concentration in AD and healthy controls. Findings: SSM/PCA identified a significant spatial THK5351 pattern composed mainly of three clusters, the precuneus/posterior cingulate cortex (PCC) and the right and left dorsolateral prefrontal cortex (DLPFC), which accounted for 23.6% of the total subject voxel variance of the data and had 82.6% sensitivity and 79.1% specificity in discriminating AD from healthy controls. There was a significant relationship between the intensity of the 18F-THK5351 covariation pattern and cognitive scores in AD. The spatial patterns of 18F-THK5351 uptake showed significant similarity with intrinsic functional connectivity, especially in the PCC network. Seed-based connectivity analysis from the PCC showed a significant decrease in connectivity over widespread brain regions in AD patients. An evaluation of an autopsied AD patient with Braak stage V showed that 18F-THK5351 retention corresponded to tau deposition, monoamine oxidase-B (MAO-B) and astrogliosis in the precuneus/PCC. Interpretation: We identified an AD-specific spatial pattern of 18F-THK5351 retention in the precuneus/PCC, an important connectivity hub region in the brain. Disruption of the functional connections of this important network hub may play an important role in developing dementia in AD.

    DOCK2 is involved in the host genetics and biology of severe COVID-19

    Press release: "Japan COVID-19 Task Force" (コロナ制圧タスクフォース) elucidates the mechanism by which the COVID-19 susceptibility gene DOCK2 contributes to severe disease: a therapeutic target for COVID-19 discovered using Asia's largest biorepository. Kyoto University press release. 2022-08-10.
    Identifying the host genetic factors underlying severe COVID-19 is an emerging challenge. Here we conducted a genome-wide association study (GWAS) involving 2,393 cases of COVID-19 in a cohort of Japanese individuals collected during the initial waves of the pandemic, with 3,289 unaffected controls. We identified a variant on chromosome 5 at 5q35 (rs60200309-A), close to the dedicator of cytokinesis 2 gene (DOCK2), which was associated with severe COVID-19 in patients less than 65 years of age. This risk allele was prevalent in East Asian individuals but rare in Europeans, highlighting the value of genome-wide association studies in non-European populations. RNA-sequencing analysis of 473 bulk peripheral blood samples identified decreased expression of DOCK2 associated with the risk allele in these younger patients. DOCK2 expression was suppressed in patients with severe cases of COVID-19. Single-cell RNA-sequencing analysis (n = 61 individuals) identified cell-type-specific downregulation of DOCK2 and a COVID-19-specific decreasing effect of the risk allele on DOCK2 expression in non-classical monocytes. Immunohistochemistry of lung specimens from patients with severe COVID-19 pneumonia showed suppressed DOCK2 expression. Moreover, inhibition of DOCK2 function with CPYPP increased the severity of pneumonia in a Syrian hamster model of SARS-CoV-2 infection, characterized by weight loss, lung oedema, enhanced viral loads, impaired macrophage recruitment and dysregulated type I interferon responses. We conclude that DOCK2 has an important role in the host immune response to SARS-CoV-2 infection and the development of severe COVID-19, and could be further explored as a potential biomarker and/or therapeutic target.