16 research outputs found

    Learning to Program with Natural Language

    Full text link
    Large Language Models (LLMs) have shown remarkable performance in various basic natural language tasks, which raises hope for achieving Artificial General Intelligence. For completing the complex task, we still need a program for the task first and then ask LLMs to follow the program to generate the specific solution. We propose using natural language as a new programming language to describe task procedures, making them easily understandable to both humans and LLMs. ~The LLM is capable of directly generating natural language programs, but these programs may still contain factual errors or incomplete steps. Therefore, we further propose the Learning to Program (\text{LP}) method to ask LLMs themselves to learn the natural language program based on the training dataset of the complex task first and then use the learned program to guide the inference. Our experiments on the reasoning tasks of five different reasoning types (8 datasets) demonstrate the effectiveness of our approach. Further, our analysis experiment shows that the learned program can be directly used to guide another LLM to improve its performance, which reveals a new transfer learning paradigm.Comment: Work in progres

    Random Silicon Sampling: Simulating Human Sub-Population Opinion Using a Large Language Model Based on Group-Level Demographic Information

    Full text link
    Large language models exhibit societal biases associated with demographic information, including race, gender, and others. Endowing such language models with personalities based on demographic data can enable generating opinions that align with those of humans. Building on this idea, we propose "random silicon sampling," a method to emulate the opinions of the human population sub-group. Our study analyzed 1) a language model that generates the survey responses that correspond with a human group based solely on its demographic distribution and 2) the applicability of our methodology across various demographic subgroups and thematic questions. Through random silicon sampling and using only group-level demographic information, we discovered that language models can generate response distributions that are remarkably similar to the actual U.S. public opinion polls. Moreover, we found that the replicability of language models varies depending on the demographic group and topic of the question, and this can be attributed to inherent societal biases in the models. Our findings demonstrate the feasibility of mirroring a group's opinion using only demographic distribution and elucidate the effect of social biases in language models on such simulations.Comment: 25 pages, 4 figures, 19 Table

    The 5th International Conference on Biomedical Engineering and Biotechnology (ICBEB 2016)

    Get PDF

    MOOCs as a Research Agenda: Changes Over Time

    No full text
    MOOCs (massive open online courses) have attracted considerable attention from researchers. Fueled by constant change and developments in educational technology, the trends of MOOCs have varied greatly over the years. To detect and visualize the developments and changes in MOOC research, 4,652 articles published between 2009 and 2021 were retrieved from Web of Science and Scopus with the aid of CiteSpace. This study sought to explore the number of publications, co-citation network, cluster analysis, timeline analysis, burstness analysis, and dual-map overlays based on co-citation relationships. The first finding was that the number of publications on MOOCs had increased consistently, and grew especially quickly between 2013 and 2015. Second, the main topic of the top 10 co-cited studies revolved around the problem of learner continuance. Third, blended programs, task-technology fit, and comparative analysis have emerged as popular subjects. Fourth, the development of MOOC research has followed distinct phases, with 2009 to 2012 the starting phase, 2013 to 2015 the high growth phase, 2016 to 2018 the plateau phase, and 2019 to 2021 another peak phase. Lastly, both cluster analysis and dual-map overlays provided empirical evidence of cross-disciplinary research. Our findings provided an in-depth and dynamic understanding of the development and evolution of MOOC research and also proposed novel ideas for future studies

    Exploring Implicit Feedback for Open Domain Conversation Generation

    No full text
    User feedback can be an effective indicator to the success of the human-robot conversation. However, to avoid to interrupt the online real-time conversation process, explicit feedback is usually gained at the end of a conversation. Alternatively, users' responses usually contain their implicit feedback, such as stance, sentiment, emotion, etc., towards the conversation content or the interlocutors. Therefore, exploring the implicit feedback is a natural way to optimize the conversation generation process. In this paper, we propose a novel reward function which explores the implicit feedback to optimize the future reward of a reinforcement learning based neural conversation model. A simulation strategy is applied to explore the state-action space in training and test. Experimental results show that the proposed approach outperforms the Seq2Seq model and the state-of-the-art reinforcement learning model for conversation generation on automatic and human evaluations on the OpenSubtitles and Twitter datasets

    Structural Topic Model Analysis of Mask-Wearing Issue Using International News Big Data

    No full text
    Media plays an important role in the acquisition of health information worldwide. This was particularly evident in the face of the COVID-19 epidemic. Relatedly, it is practical and desirable for people to wear masks for health, fashion, and religious regions. However, depending on cultural differences, people naturally accept wearing a mask, or they look upon it negatively. In 2020, the COVID-19 pandemic led to widespread mask-wearing mandates worldwide. In the case of COVID-19, wearing a mask is strongly recommended, so by analyzing the news data before and after the spread of the epidemic, it is possible to see how the direction of crisis management is being structured. In particular, by utilizing big data analysis of international news data, discourses around the world can be analyzed more deeply. This study collected and analyzed 58,061 international news items related to mask-wearing from 1 January 2019 to 31 December 2020. The collected dataset was compared before and after the World Health Organization’s pandemic declaration by applying structural topic model analysis. The results revealed that prior to the declaration, issues related to the COVID-19 outbreak were emphasized, but afterward, issues related to movement restrictions, quarantine management, and local economic impacts emerged

    PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion

    Full text link
    The growing dependence on Large Language Models (LLMs) for finishing user instructions necessitates a comprehensive understanding of their robustness to complex task completion in real-world situations. To address this critical need, we propose the PowerPoint Task Completion Robustness benchmark (PPTC-R) to measure LLMs' robustness to the user PPT task instruction and software version. Specifically, we construct adversarial user instructions by attacking user instructions at sentence, semantic, and multi-language levels. To assess the robustness of Language Models to software versions, we vary the number of provided APIs to simulate both the newest version and earlier version settings. Subsequently, we test 3 closed-source and 4 open-source LLMs using a benchmark that incorporates these robustness settings, aiming to evaluate how deviations impact LLMs' API calls for task completion. We find that GPT-4 exhibits the highest performance and strong robustness in our benchmark, particularly in the version update and the multilingual settings. However, we find that all LLMs lose their robustness when confronted with multiple challenges (e.g., multi-turn) simultaneously, leading to significant performance drops. We further analyze the robustness behavior and error reasons of LLMs in our benchmark, which provide valuable insights for researchers to understand the LLM's robustness in task completion and develop more robust LLMs and agents. We release the code and data at \url{https://github.com/ZekaiGalaxy/PPTCR}.Comment: LLM evaluation, Multi-turn, Multi-language, Multi-modal benchmar

    What Motivates Users to Keep Using Social Mobile Payments?

    No full text
    Due to the rapid diffusion of social mobile payment (SMP), the current research explores the post-adoption behavior of SMP users. It proposes a research model to determine the core predictors of users’ continuance intentions to use SMPs. Through the analysis of survey data from South Korea, it indicates that satisfaction strongly and positively affects users’ continuance intentions. Moreover, satisfaction is influenced by perceived usefulness (PU), security, and enjoyment. Interestingly, although perceived ease of use (PEU) does not directly affect satisfaction, it can indirectly influence satisfaction via users’ PU and perceived enjoyment. Furthermore, perceived ubiquity has strong effects on users’ PU and PEU. The study also discusses meaningful implications, and provides suggestions for future studies

    Teammate Familiarity in Distributed Computer-Supported Collaborative Learning: The Mediating Role of Social Presence

    No full text
    Owing to the limitations of computer-mediated communication (CMC), distributed CSCL (Computer-supported collaborative learning) has not always been as effective as desired. Despite recognizing the significance of group composition, the exploration of the function of teammate familiarity in distributed educational settings is restricted. This study explored the influence of teammate familiarity and social presence in a distributed CSCL setting by conducting an online survey of 288 Korean university students with experience in distributed CSCL. The results indicate that teammate familiarity increased the social presence experienced by students among their peers. Social presence subsequently enhanced teamwork satisfaction and, ultimately, increased self-assessed knowledge gain. More importantly, the relationship between teammate familiarity and teamwork satisfaction was mediated by social presence. Social media platforms and class webpages were the most widely used channels for students to get to know their teammates. Our study provided insights for improving the effectiveness of distributed CSCL and a framework for investigating social presence in satisfaction building in various contexts, including online education

    Bibliometric study on environmental, social, and governance research using CiteSpace

    No full text
    This paper offers an overview of the status of and emerging trends in environmental, social, and governance (ESG) research through a bibliometric approach using CiteSpace. In particular, our study aimed to elucidate the overall intellectual structure of the environmental, social, and governance academic field. To this end, we performed a topic search related to the environmental, social, and governance field and gathered published articles (2007–2021) from the Web of Science. Subsequently, we identified productive authors, institutes, and countries/regions to determine main research forces in the environmental, social, and governance field. Additionally, we conducted a co-citation analysis to identify highly cited authors, journals, and literatures in the environmental, social, and governance field. Furthermore, we performed a literature-co-citation-based cluster analysis and literature citation burst analysis to confirm the main themes and hotspots of the environmental, social, and governance field. These analyses can contribute to the investigations of key contributing forces in the environmental, social, and governance field at the author, institution, country/region, and journal levels and provide insights into the knowledge structures and orientations of the environmental, social, and governance field for future research.Published versionThis research was supported by a National Research Foundation of Korea (NRF) grant funded by the Korean Government (NRF2020R1A2C1014957)
    corecore