15,282 research outputs found
Reinforcement Learning from Passive Data via Latent Intentions
Passive observational data, such as human videos, is abundant and rich in
information, yet remains largely untapped by current RL methods. Perhaps
surprisingly, we show that passive data, despite not having reward or action
labels, can still be used to learn features that accelerate downstream RL. Our
approach learns from passive data by modeling intentions: measuring how the
likelihood of future outcomes change when the agent acts to achieve a
particular task. We propose a temporal difference learning objective to learn
about intentions, resulting in an algorithm similar to conventional RL, but
which learns entirely from passive data. When optimizing this objective, our
agent simultaneously learns representations of states, of policies, and of
possible outcomes in an environment, all from raw observational data. Both
theoretically and empirically, this scheme learns features amenable for value
prediction for downstream tasks, and our experiments demonstrate the ability to
learn from many forms of passive data, including cross-embodiment video data
and YouTube videos.Comment: Accompanying website at https://dibyaghosh.com/icvf
Reinforcement Learning Based Minimum State-flipped Control for the Reachability of Boolean Control Networks
To realize reachability as well as reduce control costs of Boolean Control
Networks (BCNs) with state-flipped control, a reinforcement learning based
method is proposed to obtain flip kernels and the optimal policy with minimal
flipping actions to realize reachability. The method proposed is model-free and
of low computational complexity. In particular, Q-learning (QL), fast QL, and
small memory QL are proposed to find flip kernels. Fast QL and small memory QL
are two novel algorithms. Specifically, fast QL, namely, QL combined with
transfer-learning and special initial states, is of higher efficiency, and
small memory QL is applicable to large-scale systems. Meanwhile, we present a
novel reward setting, under which the optimal policy with minimal flipping
actions to realize reachability is the one of the highest returns. Then, to
obtain the optimal policy, we propose QL, and fast small memory QL for
large-scale systems. Specifically, on the basis of the small memory QL
mentioned before, the fast small memory QL uses a changeable reward setting to
speed up the learning efficiency while ensuring the optimality of the policy.
For parameter settings, we give some system properties for reference. Finally,
two examples, which are a small-scale system and a large-scale one, are
considered to verify the proposed method
A Design Science Research Approach to Smart and Collaborative Urban Supply Networks
Urban supply networks are facing increasing demands and challenges and thus constitute a relevant field for research and practical development. Supply chain management holds enormous potential and relevance for society and everyday life as the flow of goods and information are important economic functions. Being a heterogeneous field, the literature base of supply chain management research is difficult to manage and navigate. Disruptive digital technologies and the implementation of cross-network information analysis and sharing drive the need for new organisational and technological approaches. Practical issues are manifold and include mega trends such as digital transformation, urbanisation, and environmental awareness.
A promising approach to solving these problems is the realisation of smart and collaborative supply networks. The growth of artificial intelligence applications in recent years has led to a wide range of applications in a variety of domains. However, the potential of artificial intelligence utilisation in supply chain management has not yet been fully exploited. Similarly, value creation increasingly takes place in networked value creation cycles that have become continuously more collaborative, complex, and dynamic as interactions in business processes involving information technologies have become more intense.
Following a design science research approach this cumulative thesis comprises the development and discussion of four artefacts for the analysis and advancement of smart and collaborative urban supply networks. This thesis aims to highlight the potential of artificial intelligence-based supply networks, to advance data-driven inter-organisational collaboration, and to improve last mile supply network sustainability. Based on thorough machine learning and systematic literature reviews, reference and system dynamics modelling, simulation, and qualitative empirical research, the artefacts provide a valuable contribution to research and practice
Corporate Social Responsibility: the institutionalization of ESG
Understanding the impact of Corporate Social Responsibility (CSR) on firm performance as it relates to industries reliant on technological innovation is a complex and perpetually evolving challenge. To thoroughly investigate this topic, this dissertation will adopt an economics-based structure to address three primary hypotheses. This structure allows for each hypothesis to essentially be a standalone empirical paper, unified by an overall analysis of the nature of impact that ESG has on firm performance. The first hypothesis explores the evolution of CSR to the modern quantified iteration of ESG has led to the institutionalization and standardization of the CSR concept. The second hypothesis fills gaps in existing literature testing the relationship between firm performance and ESG by finding that the relationship is significantly positive in long-term, strategic metrics (ROA and ROIC) and that there is no correlation in short-term metrics (ROE and ROS). Finally, the third hypothesis states that if a firm has a long-term strategic ESG plan, as proxied by the publication of CSR reports, then it is more resilience to damage from controversies. This is supported by the finding that pro-ESG firms consistently fared better than their counterparts in both financial and ESG performance, even in the event of a controversy. However, firms with consistent reporting are also held to a higher standard than their nonreporting peers, suggesting a higher risk and higher reward dynamic. These findings support the theory of good management, in that long-term strategic planning is both immediately economically beneficial and serves as a means of risk management and social impact mitigation. Overall, this contributes to the literature by fillings gaps in the nature of impact that ESG has on firm performance, particularly from a management perspective
People make Places
For centuries Glasgow, as a bucolic fishing village and ecclesiastical centre on the banks of the River Clyde, held little of strategic significance. When success and later threats came to the city, it was as a consequence of explosive growth during the industrial era that left a significant civic presence accompanied by social and environmental challenges. Wartime damage to the fabric of the city and the subsequent implementation of modernist planning left Glasgow with a series of existential threats to the lives and the health of its people that have taken time to understand and come to terms with.
In a few remarkable decades of late 20th century regeneration, Glasgow began to be put back together. The trauma of the second half of the 20th century is fading but not yet a distant memory. Existential threats from the climate emergency can provoke the reaction “what, again?” However, the resilience built over the last 50 years has instilled a belief that a constructive, pro-active and creative approach to face this challenge along with the recognition that such action can be transformational for safeguarding and improving people’s lives and the quality of their places. A process described as a just transition that has become central to Glasgow’s approach.
Of Scotland’s four big cities, three are surrounded by landscape and sea only Glasgow is surrounded by itself. Even with a small territory, Glasgow is still the largest of Scotland’s big cities and by some margin. When the wider metropolitan area is considered, Glasgow is – like Birmingham, Manchester and Liverpool – no mean city.
People make Places begins with a review of the concept and complexities of place, discusses why these matter and reviews the growing body of evidence that place quality can deliver economic, social and environmental value. The following chapters focus on the history and evolution of modern Glasgow in four eras of 19th and early 20th century industrialisation, de- industrialisation and modernism in mid 20th century, late 20th century regeneration and a 21st century recovery towards transition and renaissance, and document the process, synthesis and the results of a major engagement programme and to explore systematic approaches to place and consensus building around the principal issues.
The second half of the work reflects on a stocktaking of place in contemporary Glasgow, looking at the city through the lenses of an international, metropolitan and everyday city, concluding with a review of the places of Glasgow and what may be learned from them revealing some valuable insights presented in a series of Place Stories included.
The concluding chapter sets out the findings of the investigation and analysis reviewing place goals, challenges and opportunities for Glasgow over the decades to 2030 and 2040 and ends with some recommendations about what Glasgow might do better to combine place thinking and climate awareness and setting out practical steps to mobilise Glasgow’s ‘place ecosystem’
Peer teachers Taking the Lead in Classroom Instruction: Program Creation and Challenges Faced
Purpose – This paper discusses a program to train undergraduate students as near peer teachers delivering course-embedded information literacy instruction to undergraduate students.
Design/methodology/approach – The approach involved the development and delivery of a curriculum combining information literacy concepts and teaching pedagogy. Significant student feedback was gathered which determined the final program structure.
Findings – While the curriculum was successful in developing students’ information literacy competencies and pedagogical skills, stakeholder buy-in and the COVID-19 pandemic hindered the program. Additionally, the goal of the program - solo student teaching, was not realized.
Originality – Peer teaching is widely implemented in many disciplines, however, its application in academic libraries has focused more on peer reference, rather than peer teaching. This case study adds to the body of literature on this topic related to student peer teaching in academic libraries
Exploring Potential Domains of Agroecological Transformation in the United States
There is now substantial evidence that agroecology constitutes a necessary pathway towards socially just and ecologically resilient agrifood systems. In the United States, however, agroecology remains relegated to the margins of research and policy spaces. This dissertation explores three potential domains of agroecological transformation in the US. Domains of transformation are sites of contestation in which agroecology interfaces with the industrial agrifood system; these material and conceptual spaces may point to important pathways for scaling agroecology. To explore this concept, I examine formal agroecology education (Chapter 1), extension services and statewide discourses around soil health (Chapter 2), and models of farmland access not based on private property (Chapter 3). While these constitute three distinct topics, I seek to demonstrate that they are linked by similar forces that enable and constrain the extent to which these domains can be sites of agroecological transformation.
First, I use case study methodology to explore the evolution of an advanced undergraduate agroecology course at the University of Vermont. I examine how course content and pedagogy align with a transformative framing of agroecology as inherently transdisciplinary, participatory, action-oriented, and political. I find that student-centered pedagogies and experiential education on farms successfully promote transformative learning whereby students shift their understanding of agrifood systems and their role(s) within them. In my second chapter, I zoom out to consider soil health discourses amongst farmers and extension professionals in Vermont. Using co-created mental models and participatory analysis, I find that a singular notion of soil health based on biological, chemical, and physical properties fails to capture the diverse ways in which farmers and extension professionals understand soil health. I advocate for a principles-based approach to soil health that includes social factors and may provide a valuable heuristic for mobilizing knowledge towards agroecology transition pathways. My third chapter, conducted in collaboration with the national non-profit organization Agrarian Trust, considers equitable farmland access. Through semi-structured interviews with 13 farmers and growers across the US, I explore both farmer motivations for engaging with alternative land access models (ALAMs) and the potential role(s) these models may play within broader transformation processes. I argue that ALAMs constitute material and conceptual ‘third spaces’ within which the private property regime is challenged and new identities and language around land ownership can emerge; as such, ALAMs may facilitate a (re)imagining of land-based social-ecological relationships.
I conclude the dissertation by identifying conceptual and practical linkages across the domains explored in Chapters 1-3. I pay particular attention to processes that challenge neoliberal logics, enact plural ways of knowing, and prefigure just futures. In considering these concepts, I apply an expansive notion of pedagogy to explore how processes of teaching and (un)learning can contribute to cultivating foundational capacities for transition processes
Interactive System-wise Anomaly Detection
Anomaly detection, where data instances are discovered containing feature
patterns different from the majority, plays a fundamental role in various
applications. However, it is challenging for existing methods to handle the
scenarios where the instances are systems whose characteristics are not readily
observed as data. Appropriate interactions are needed to interact with the
systems and identify those with abnormal responses. Detecting system-wise
anomalies is a challenging task due to several reasons including: how to
formally define the system-wise anomaly detection problem; how to find the
effective activation signal for interacting with systems to progressively
collect the data and learn the detector; how to guarantee stable training in
such a non-stationary scenario with real-time interactions? To address the
challenges, we propose InterSAD (Interactive System-wise Anomaly Detection).
Specifically, first, we adopt Markov decision process to model the interactive
systems, and define anomalous systems as anomalous transition and anomalous
reward systems. Then, we develop an end-to-end approach which includes an
encoder-decoder module that learns system embeddings, and a policy network to
generate effective activation for separating embeddings of normal and anomaly
systems. Finally, we design a training method to stabilize the learning
process, which includes a replay buffer to store historical interaction data
and allow them to be re-sampled. Experiments on two benchmark environments,
including identifying the anomalous robotic systems and detecting user data
poisoning in recommendation models, demonstrate the superiority of InterSAD
compared with state-of-the-art baselines methods
- …