46 research outputs found

    GC-28 Modern Web Scraping

    This project was developed for the IT7993 Capstone class in the May semester of 2021. The goal of the project is to scrape the names of all key professionals of organizations listed on the open990.org website and insert that information into a structured database for query and analysis. The Key Professionals dataset aims to include global coverage of key investor and consultant professionals involved in making an investment decision, beginning with US-based companies. The overarching aim of this project is to create a one-stop center for institutional asset management distribution intelligence: the one spot to go for mandates, documentation and profiles of consultants, investors, and managers with key technical contact information, by including coverage within the eVestment network for US investors and consultants. From end to end, the key professional database project consists of creating a web crawler to retrieve information from the open990 website, wrangling the data into the desired structure, and inserting it into a database for comprehensive data analysis. The primary data source is the open990.org website. The team was given a list of organization names as scraping targets. Each organization has a page within the open990 website with the organization information, including the names of the key professionals, which is the target data. Scraping data from the open990 website posed several challenges. First, the website is built entirely in JavaScript, which requires specific techniques to render and scrape. Second, the different organization pages have different data structures, which complicates parsing. Third, most of the data is in tables that are delivered through a backend API. Fourth, because the tables are delivered from a backend API, the HTML tags used for the data are not unique, so identifying and parsing specific data using HTML tags was not possible.
Lastly, by observing the network traffic using the Chrome browser tools and examining the HAR data returned from Splash, we discovered that the website is delivered through Cloudflare servers, which we believe blocked some of our attempts to scrape the data. Cloudflare is a content delivery network featuring robust security services. The complexity of the webpage is an example of how modern, secure web development will change the landscape and require web scrapers to develop more advanced methods of automation. Advisor(s): Dr Meng Han. Topic(s): Data/Data Analytics. IT 799

    Risk Factors for Cardiovascular Diseases in Aircrew

    The relation of atherosclerotic cardiovascular disease (ASCVD) to not only traditional but also new and emergent risk factors has been assessed in aircrew. Total flight hours (TFH), high altitude and weightlessness exposure have been counted among traditional risk factors for CVD in aircrew. The risk factors do not act in isolation. To predict the 10-year global CV risk, several scores are being applied, based either on traditional CVD risk factors only or also including new and emergent risk factors. To prevent aircrew from developing CVD, one should focus on the control of behavioral and metabolic risks as well as the polymorphe treatment of high CV risk individuals.

    Investigation of phase transformations and corrosion resistance in Co/CoCo2O4 nanowires and their potential use as a basis for lithium-ion batteries

    The paper is devoted to the study of the effect of thermal annealing on the structural properties and phase composition of metallic Co nanostructures, as well as the prospects of their use as anode materials for lithium-ion batteries. During the study, a four-stage phase transition in the structure of the nanowires was established, consisting of the successive transformations (Co-FCC/Co-HCP) → (Co-FCC) → (Co-FCC/CoCo2O4) → (CoCo2O4), accompanied by uniform oxidation of the nanowire structure as the temperature rises above 400 °C. An increase in temperature to 700 °C leads to partial destruction of the oxide layer and surface degradation of the nanostructures. During life tests, it was found that the lifetime of the oxide nanostructures exceeds 500 charge/discharge cycles, whereas for the initial nanostructures and those annealed at 300 °C the lifetimes are 297 and 411 cycles, respectively. The prospects of using Co/CoCo2O4 nanowires as the basis for lithium-ion batteries are shown. © 2019, The Author(s).

    Use of ring-expanded diamino- and diamidocarbene ligands in copper catalyzed azide-alkyne "click" reactions

    The two-coordinate ring-expanded N-heterocyclic carbene copper(I) complexes [Cu(RE-NHC)2]+ (RE-NHC = 6-Mes, 7-o-Tol, 7-Mes) have been prepared and shown to be effective catalysts under neat conditions for the 1,3-dipolar cycloaddition of alkynes and azides. In contrast, the cationic diamidocarbene analogue [Cu(6-MesDAC)2]+ and the neutral species [(6-MesDAC)CuCl]2 and [(6-MesDAC)2(CuCl)3] show good activity when the catalysis is performed on water.