
    Calendar.help: Designing a Workflow-Based Scheduling Agent with Humans in the Loop

    Although information workers may complain about meetings, they are an essential part of their work life. Consequently, busy people spend a significant amount of time scheduling meetings. We present Calendar.help, a system that provides fast, efficient scheduling through structured workflows. Users interact with the system via email, delegating their scheduling needs to the system as if it were a human personal assistant. Common scheduling scenarios are broken down using well-defined workflows and completed as a series of microtasks that are automated when possible and executed by a human otherwise. Unusual scenarios fall back to a trained human assistant who executes them as unstructured macrotasks. We describe the iterative approach we used to develop Calendar.help, and share the lessons learned from scheduling thousands of meetings during a year of real-world deployments. Our findings provide insight into how complex information tasks can be broken down into repeatable components that can be executed efficiently to improve productivity.
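
    As a rough illustration of the microtask-with-human-fallback design described above, the sketch below routes a scheduling request through a structured workflow, automating each step when a handler succeeds and otherwise handing it to a person; the task names, handlers, and escalation rule are hypothetical stand-ins, not Calendar.help's actual implementation.

```python
# Illustrative sketch (not Calendar.help's code): structured microtasks with
# automation first and a human fallback; unusual cases escalate to a macrotask.
from dataclasses import dataclass
from typing import Callable, Optional


def ask_human_worker(task_name: str, request: dict) -> dict:
    # Placeholder for handing a task to a human assistant.
    print(f"[human] {task_name}: {request}")
    return {"handled_by": "human", "task": task_name}


@dataclass
class Microtask:
    name: str
    automated_handler: Optional[Callable[[dict], Optional[dict]]] = None

    def run(self, request: dict) -> dict:
        # Try automation first; fall back to a human worker when it declines.
        if self.automated_handler is not None:
            result = self.automated_handler(request)
            if result is not None:
                return result
        return ask_human_worker(self.name, request)


def extract_attendees(request: dict) -> Optional[dict]:
    # Trivial automated extraction; returns None when it is not confident.
    emails = [t for t in request.get("email_body", "").split() if "@" in t]
    return {"attendees": emails} if emails else None


WORKFLOW = [Microtask("extract_attendees", extract_attendees),
            Microtask("propose_times"),   # no automation yet -> human
            Microtask("send_invite")]


def schedule(request: dict):
    # Common scenarios run through the workflow; anything that raises is
    # escalated to a human assistant as an unstructured macrotask.
    try:
        return [task.run(request) for task in WORKFLOW]
    except Exception:
        return [ask_human_worker("unstructured_macrotask", request)]


if __name__ == "__main__":
    print(schedule({"email_body": "Please meet with alice@example.com next week"}))
```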

    Investigation of Team Formation in Dynamic Social Networks

    The Team Formation Problem (TFP) in Social Networks (SN) is to assemble a group of individuals who match the requirements of given tasks under some constraints. It has several applications, including academic collaborations, healthcare, and human resource management. These problems are highly challenging because each individual has his or her own demands and objectives that might conflict with team objectives. The major contribution of this dissertation is a computational framework to discover teams of experts in various applications and to predict the potential for future collaboration from a given SN. Inspired by an evolutionary search technique using higher-order cultural evolution, a framework is proposed using Knowledge-Based Cultural Algorithms to identify teams from co-authorship and industrial settings. This model reduces the search domain while guiding the search direction by extracting situational knowledge and updating it in each evolution. Motivated by the above results, this research examines palliative care multidisciplinary networks to identify and measure the performance of the optimal team of care providers in a highly dynamic and unbalanced SN of volunteer, community, and professional caregivers. Thereafter, a visualization framework is designed to explore and monitor the evolution of the structure of the care networks; it helps to identify isolated patients, imbalanced resource allocation, and uneven service distribution in the network. This contribution is recognized by Hospice and the Windsor Essex Compassion Care Community in partnership with the Faculty of Nursing.

    In each setting, several cost functions are evaluated to measure the performance of the teams. To support this study, the temporal nature of two important evaluation metrics is analyzed in Dynamic Social Networks (DSN): dynamic communication cost and dynamic expertise level. Afterward, a novel generic framework for TFP is designed by incorporating essential cost functions, including the above dynamic cost functions. Multi-Objective Cultural Algorithms (MOCA) are used for this purpose. In each generation, the algorithm keeps track of the best solutions and enhances exploration by driving the mutation direction towards unexplored areas. The experimental results come closest to those of the exact algorithm and outperform well-known search methods.

    Subsequently, this research focuses on predicting suitable members for the teams in the future, which is essentially a real-time application of Link Prediction. Learning the temporal behavior of each vertex in a given DSN can be used to predict the individual's future connections with the teams. A probability function is introduced based on the activeness of the individual. To quantify the activeness score, this study examines how actively each vertex interacts with new and existing vertices in the DSN. It incorporates two more objective functions: the weighted shortest distance and the weighted common neighbor index. Because this is technically a classification problem, deep learning methods have been observed to be the most effective solution; the model is trained and tested with a Multilayer Perceptron and achieves an AUC above 93%. Besides this, analyzing the common neighbors of any two vertices that are expected to connect has a high impact on predicting the links. A new method is introduced that extracts the subgraph of common neighbors and examines the features of each vertex in the subgraph to predict future links. The sequence of the subgraphs' adjacency matrices of the DSN can be ordered temporally and treated as a video, and is tested with Convolutional Neural Networks and Long Short-Term Memory networks for the prediction. The obtained results are compared against heuristic and state-of-the-art methods, with the AUC reaching above 96%. In conclusion, the knowledge-based evolutionary approach performs well in searching through an SN and recommending effective teams of experts to complete given tasks successfully in terms of time and accuracy; however, it does not address the prediction problem, whereas deep learning methods perform well in predicting the future collaboration of the teams.
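
    For concreteness, the following minimal sketch computes two link-prediction features of the kind named above for a candidate pair of vertices; the specific weighting scheme and the toy graph are illustrative assumptions, not the dissertation's exact formulas.

```python
# Illustrative link-prediction features on a small weighted graph.
import networkx as nx

G = nx.Graph()
# Edge weights stand in for interaction strength between individuals.
G.add_weighted_edges_from([("a", "b", 3.0), ("b", "c", 1.0),
                           ("a", "c", 2.0), ("c", "d", 4.0)])


def weighted_common_neighbor_index(g: nx.Graph, u: str, v: str) -> float:
    # Sum of edge weights from each common neighbor to both endpoints.
    return sum(g[u][w]["weight"] + g[v][w]["weight"]
               for w in nx.common_neighbors(g, u, v))


def weighted_shortest_distance(g: nx.Graph, u: str, v: str) -> float:
    try:
        return nx.shortest_path_length(g, u, v, weight="weight")
    except nx.NetworkXNoPath:
        return float("inf")


# Features for candidate pair ("a", "d") that could feed an MLP classifier.
print(weighted_common_neighbor_index(G, "a", "d"))  # common neighbor: c
print(weighted_shortest_distance(G, "a", "d"))
```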

    Democratizing Self-Service Data Preparation through Example Guided Program Synthesis

    The majority of real-world data we can access today have one thing in common: they are not immediately usable in their original state. Trapped in a swamp of data usability issues like non-standard data formats and heterogeneous data sources, most data analysts and machine learning practitioners have to burden themselves with "data janitor" work, writing ad-hoc Python, Perl or SQL scripts, which is tedious and inefficient. It is estimated that data scientists or analysts typically spend 80% of their time preparing data, a significant amount of human effort that could be redirected to better goals. In this dissertation, we address this problem by harnessing knowledge such as examples and other useful hints from the end user. We develop program synthesis techniques guided by heuristics and machine learning, which effectively make data preparation less painful and more efficient for data users, particularly those with little to no programming experience. Data transformation, also called data wrangling or data munging, is an important task in data preparation, seeking to convert data from one format to a different (often more structured) format. Our system Foofah shows that allowing end users to describe their desired transformation by providing small input-output transformation examples can significantly reduce the overall user effort. The underlying program synthesizer can often find meaningful data transformation programs within a reasonably short amount of time. Our second system, CLX, demonstrates that sometimes the user does not even need to provide complete input-output examples, but only needs to label desirable examples if they exist in the original dataset. The system is still capable of suggesting reasonable and explainable transformation operations to fix non-standard data format issues in a dataset full of heterogeneous data with varied formats. PRISM, our third system, targets the data preparation task of data integration, i.e., combining multiple relations to formulate a desired schema. PRISM allows the user to describe the target schema using not only high-resolution (precise) constraints of complete example data records in the target schema, but also (imprecise) constraints of varied resolutions, such as incomplete data record examples with missing values, value ranges, or multiple possible values in each element (cell), so as to require less familiarity with the database contents from the end user. PhD dissertation, Computer Science & Engineering, University of Michigan, Horace H. Rackham School of Graduate Studies. http://deepblue.lib.umich.edu/bitstream/2027.42/163059/1/markjin_1.pd
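
    The following small sketch conveys the general flavour of example-guided synthesis for data transformation: enumerate short compositions of string operations until one reproduces all of the user's input-output examples. The operation set and search strategy are generic assumptions for illustration, not the actual algorithms of Foofah, CLX, or PRISM.

```python
# Generic programming-by-example sketch: brute-force search over short
# compositions of simple string operations consistent with the examples.
from itertools import product

OPS = {
    "strip":      lambda s: s.strip(),
    "lower":      lambda s: s.lower(),
    "dash2slash": lambda s: s.replace("-", "/"),
    "first_tok":  lambda s: s.split()[0] if s.split() else s,
}


def synthesize(examples, max_len=3):
    """Return the shortest op sequence mapping every input to its output."""
    for length in range(1, max_len + 1):
        for seq in product(OPS, repeat=length):
            def run(s, seq=seq):
                for name in seq:
                    s = OPS[name](s)
                return s
            if all(run(inp) == out for inp, out in examples):
                return seq
    return None


# A couple of small input-output examples often pin down a program.
examples = [("  2020-01-05 ", "2020/01/05"), ("1999-12-31", "1999/12/31")]
print(synthesize(examples))  # e.g. ('strip', 'dash2slash')
```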

    The design and development of an information technology application for community transport: a case study approach

    The aim of the research has been to develop a single modular software package for all modes of Community Transport (CT) operations. In order to achieve this aim, the research has set out to examine CT operations, the scope for computerisation of those operations and existing software for them, to analyse the requirements of CT operators through a collaborative process, to develop a data model which supports their operations, and to implement a software package based on this model which provides both a data management system and operational functions. [Continues.]

    Data-driven conceptual modeling: how some knowledge drivers for the enterprise might be mined from enterprise data

    As organizations perform their business, they analyze, design and manage a variety of processes represented in models with different scopes and scales of complexity. Specifying these processes requires a certain level of modeling competence. However, this requirement is not always matched by adequate capability in the person(s) responsible for defining and modeling an organization or enterprise operation. On the other hand, an enterprise typically collects various records of all events that occur during the operation of its processes. Records, such as the start and end of the tasks in a process instance, state transitions of objects impacted by the process execution, the messages exchanged during the process execution, etc., are maintained in enterprise repositories as various logs, such as event logs, process logs, effect logs, message logs, etc. Furthermore, the growth rate in the volume of the data generated by enterprise process execution has increased manyfold in just a few years. On top of this, models are often considered the dashboard view of an enterprise: they represent an abstraction of the underlying reality of an enterprise and serve as knowledge drivers through which an enterprise can be managed. Data-driven extraction offers the capability to mine these knowledge drivers from enterprise data and leverage the mined models to establish the set of enterprise data that conforms with the desired behaviour. This thesis aims to generate models or knowledge drivers from enterprise data to enable a dashboard view of the enterprise and provide support for analysts. The rationale for this stems from the requirement to improve an existing process or to create a new one; as noted above, models can also serve as a collection of effectors through which an organization or an enterprise can be managed. The enterprise data referred to above have been identified as process logs, effect logs, message logs, and invocation logs. The approach in this thesis is to mine these logs to generate process, requirement, and enterprise architecture models, and to show how goals get fulfilled based on collected operational data. The research question has been formulated as follows: is it possible to derive the knowledge drivers from the enterprise data that represent the running operation of the enterprise, or, in other words, is it possible to use the available data in the enterprise repository to generate the knowledge drivers? Chapter 2 reviews the literature that provides the necessary background for exploring this research question. Chapter 3 presents how process semantics can be mined. Chapter 4 suggests a way to extract a requirements model. Chapter 5 presents a way to discover the underlying enterprise architecture, and Chapter 6 presents a way to mine how goals get orchestrated. The overall findings are discussed in Chapter 7 to derive some conclusions.
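
    As one concrete example of what mining process semantics from such logs can involve, the sketch below counts directly-follows relations between activities per case; the event-log schema (case id, activity, timestamp) is an assumption for illustration and is not taken from the thesis.

```python
# Basic process-mining step: build directly-follows counts from an event log.
from collections import Counter, defaultdict

event_log = [
    # (case_id, activity, timestamp)
    ("c1", "receive_order", 1), ("c1", "check_stock", 2), ("c1", "ship", 3),
    ("c2", "receive_order", 1), ("c2", "check_stock", 2),
    ("c2", "back_order", 3), ("c2", "ship", 4),
]

# Group events by case and order them by timestamp to obtain traces.
traces = defaultdict(list)
for case, activity, ts in sorted(event_log, key=lambda e: (e[0], e[2])):
    traces[case].append(activity)

# Count how often activity a is directly followed by activity b.
directly_follows = Counter()
for trace in traces.values():
    for a, b in zip(trace, trace[1:]):
        directly_follows[(a, b)] += 1

for (a, b), n in directly_follows.items():
    print(f"{a} -> {b}: {n}")
```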

    JobHam-place with smart recommend job options and candidate filtering options

    Due to the increasing number of graduates, many applicants struggle to find a job, and employers have difficulty filtering job applicants, which might negatively impact their effectiveness. However, most job-hunting websites lack integrated job recommendation and CV filtering or ranking functionality. Thus, a smart job-hunting platform combining the above functionality will be developed in this project, containing job recommendation, CV ranking and even a job dashboard for skills and job applicants. Job recommendation and CV ranking start with automatic keyword extraction and end with the job/CV ranking algorithm. Automatic keyword extraction is implemented by the Job2Skill and CV2Skill models based on BERT. Job2Skill consists of two components, a text encoder and GRU-based layers, while CV2Skill is mainly based on BERT and fine-tunes the pre-trained model on the Resume-Entity dataset. To match skills from CVs and job descriptions and rank lists of jobs and candidates, job/CV ranking algorithms are provided that compute the occurrence ratio of skill words based on TF-IDF scores and the match ratio of the total skill numbers. In addition, some advanced features have been integrated into the website to improve the user experience, such as a calendar and the sweetalert2 plugin, along with basic features for the job application process, such as job application tracking and interview arrangement.
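
    A minimal sketch of the kind of skill-overlap ranking described above is given below; the scoring formula (an IDF-weighted overlap multiplied by a plain match ratio) and the toy data are simplified assumptions, not the project's exact job/CV ranking algorithm.

```python
# Toy CV ranking: IDF-weighted skill overlap scaled by a match ratio.
import math
from collections import Counter


def idf_weights(documents):
    # documents: list of skill lists; returns an IDF weight per skill.
    n = len(documents)
    df = Counter(skill for doc in documents for skill in set(doc))
    return {skill: math.log((1 + n) / (1 + c)) + 1 for skill, c in df.items()}


def rank_cvs(job_skills, cvs):
    idf = idf_weights([job_skills] + list(cvs.values()))
    scores = {}
    for name, cv_skills in cvs.items():
        overlap = set(job_skills) & set(cv_skills)
        weighted = sum(idf[s] for s in overlap)
        match_ratio = len(overlap) / len(set(job_skills)) if job_skills else 0.0
        scores[name] = weighted * match_ratio
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


job = ["python", "sql", "nlp"]
cvs = {"cv_a": ["python", "sql", "excel"], "cv_b": ["nlp", "python", "sql"]}
print(rank_cvs(job, cvs))  # cv_b ranks first: it covers all three skills
```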

    Generating Literal and Implied Subquestions to Fact-check Complex Claims

    Verifying complex political claims is a challenging task, especially when politicians use various tactics to subtly misrepresent the facts. Automatic fact-checking systems fall short here, and their predictions like "half-true" are not very useful in isolation, since we have no idea which parts of the claim are true and which are not. In this work, we focus on decomposing a complex claim into a comprehensive set of yes-no subquestions whose answers influence the veracity of the claim. We present ClaimDecomp, a dataset of decompositions for over 1000 claims. Given a claim and its verification paragraph written by fact-checkers, our trained annotators write subquestions covering both explicit propositions of the original claim and its implicit facets, such as asking about additional political context that changes our view of the claim's veracity. We study whether state-of-the-art models can generate such subquestions, showing that these models generate reasonable questions to ask, but predicting the comprehensive set of subquestions from the original claim without evidence remains challenging. We further show that these subquestions can help identify relevant evidence to fact-check the full claim and derive the veracity through their answers, suggesting that they can be useful pieces of a fact-checking pipeline.
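
    Purely as an illustration of how yes-no subquestion answers could feed a veracity judgement, the sketch below aggregates answers into a coarse label; this aggregation rule is an assumption of this example, not a method proposed in the paper.

```python
# Illustrative aggregation of subquestion answers into a coarse veracity label.
def aggregate_veracity(answers):
    """answers: list of booleans, True if a subquestion supports the claim."""
    if not answers:
        return "unverifiable"
    support = sum(answers) / len(answers)
    if support >= 0.8:
        return "true"
    if support >= 0.4:
        return "half-true"
    return "false"


# A claim whose explicit propositions hold but whose implicit facets do not.
print(aggregate_veracity([True, True, False]))  # half-true
```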

    Determining systematic differences in human graders for machine learning-based automated hiring

    Firms routinely utilize natural language processing combined with other machine learning (ML) tools to assess prospective employees through automated resume classification based on pre-codified skill databases. The rush to automation can, however, backfire by encoding unintentional bias against groups of candidates. We run two experiments with human evaluators from two different countries to determine how cultural differences may affect hiring decisions. We use hiring materials provided by an international skill-testing firm that runs hiring assessments for Fortune 500 companies. The company conducts a video-based interview assessment using machine learning, which grades job applicants automatically based on verbal and visual cues. Our study has three objectives: to compare the automatic assessments of the video interviews to assessments of the same interviews by human graders in order to assess how they differ; to examine which characteristics of human graders may lead to systematic differences in their assessments; and to propose a method to correct human evaluations using automation. We find that systematic differences can exist across human graders and that some of these differences can be accounted for by an ML tool if measured at the time of training.

    Technology assessment of advanced automation for space missions

    Six general classes of technology requirements derived during the mission definition phase of the study were identified as having maximum importance and urgency: autonomous, world-model-based information systems; learning and hypothesis formation; natural language and other man-machine communication; space manufacturing; teleoperators and robot systems; and computer science and technology.

    Creating business value from big data and business analytics : organizational, managerial and human resource implications

    This paper reports on a research project, funded by the EPSRC’s NEMODE (New Economic Models in the Digital Economy, Network+) programme, that explores how organizations create value from their increasingly big data and the challenges they face in doing so. Three case studies are reported of large organizations with a formal business analytics group and data volumes that can be considered ‘big’. The case organizations are MobCo, a mobile telecoms operator, MediaCo, a television broadcaster, and CityTrans, a provider of transport services to a major city. Analysis of the cases is structured around a framework in which data and value creation are mediated by the organization’s business analytics capability. This capability is then studied through a sociotechnical lens of organization/management, process, people, and technology. From the cases, twenty key findings are identified. In the area of data and value creation these are: 1. Ensure data quality; 2. Build trust and permissions platforms; 3. Provide adequate anonymization; 4. Share value with data originators; 5. Create value through data partnerships; 6. Create public as well as private value; 7. Monitor and plan for changes in legislation and regulation. In organization and management: 8. Build a corporate analytics strategy; 9. Plan for organizational and cultural change; 10. Build deep domain knowledge; 11. Structure the analytics team carefully; 12. Partner with academic institutions; 13. Create an ethics approval process; 14. Make analytics projects agile; 15. Explore and exploit in analytics projects. In technology: 16. Use visualization as story-telling; 17. Be agnostic about technology while the landscape is uncertain (i.e., maintain a focus on value). In people and tools: 18. Data scientist personal attributes (curious, problem focused); 19. Data scientist as ‘bricoleur’; 20. Data scientist acquisition and retention through challenging work. With regard to what organizations should do if they want to create value from their data, the paper further proposes: a model of the analytics eco-system that places the business analytics function in a broad organizational context; and a process model for analytics implementation together with a six-stage maturity model.