14,074 research outputs found

    Exploration of Feature Selection Techniques in Machine Learning Models on HPTLC Images for Rule Extraction

    Get PDF
    Research related to Biology often utilizes machine learning models that are ultimately uninterpretable by the researcher. It would be helpful if researchers could leverage the same computing power but instead gain specific insight into decision-making to gain a deeper understanding of their domain knowledge. This paper seeks to select features and derive rules from a machine learning classification problem in biochemistry. The specific point of interest is five species of Glycyrrhiza, or Licorice, and the ability to classify them using High-Performance Thin Layer Chromatography (HPTLC) images. These images were taken using HPTLC methods under varying conditions to provide eight unique views of each species. Each view contains 24 samples with varying counts of the individual species. There are a few techniques applied for feature selection and rule extraction. The first two are based on methods recently pioneered and presented as “Binary Encoding of Random Forests” and “Rule Extraction using Sparse Encoding” (Liu 2012). In addition, an independently developed technique called “Interval Extraction and Consolidation” was applied, which was conceptualized due to the particular nature of the dataset. Altogether, these techniques used in consort with standard machine learning models could narrow a feature space from around one-thousand candidates to only ten. These ten most critical features were then used to derive a set of rules for the classification of the five species of licorice. Regarding feature selection, compared to standard model parameter optimization, the Binary Encoding of Random Forests performed similarly, if not much better, in reducing the feature space in almost all cases. Additionally, the application of Interval Extraction and Consolidation excelled in further simplifying the reduced feature space, often by another factor of five to ten. The selected features were then used for relatively simple rule extraction using decision trees, allowing for a more interpretable model

    Gas chromatography on self assembled single walled carbon nanotubes

    Get PDF
    Carbon nanotubes (CNTs) are nano-sized carbon-based sorbents, which have high surface area, large aspect ratio, can be self-assembled and are known to be stable at high temperatures. It is therefore conceivable that separation techniques, such as, gas chromatography (GC) can benefit from their unique properties and nano-scale interactions. Self-assembly, in-contrast to packing these materials in a tube, prevents them from agglomeration and thus facilitates in retaining their nano-characteristics. In this research, novel substrates, such as, steel tubings, on a scaled-up level have been explored for the self-assembly process of CNTs, for applications such as gas chromatography, where the CNTs served as stationary phases. In the first part of this research, the self-assembly of multi-walled carbon nanotubes (MWCNTs) on the inside wall of long stainless steel tubings was studied. The CNTs were deposited by the chemical vapor deposition (CVD) using ethylene as the carbon source and the iron nanostructures in the stainless steel as the catalyst. Variation in uniformity in terms of the thickness and morphology of the deposited film and surface coverage were studied along the length of a tube by scanning electron microscopy (SEM). The effects of process conditions, such as flow rate and deposition time on the coating thickness, were studied. The catalytic effect of the iron nanostructures depended on surface conditioning of the tubing. It was found that the pretreatment temperature influenced the quality of the nanotube coating. The morphology of the CNT deposit supported the base-growth scheme and VLS (vapor—liquid—solid) growth mechanisms of CNTs. This study served as the basis for the development of CNTs in the larger scale application. Scaled up self-assembly of single-walled carbon nanotubes (SWCNTs) was studied in long tubes and finally they were used as GC columns. The strategy for selective SWCNT growth required the prevention of iron in the bulk steel from participating in the catalytic CVD process, as the presence of iron always led to MWCNT formation. Consequently, silica lined stainless steel tubings, such as, SilcosteelTM and SulfinertTM were selected. A SWCNT film with an average thickness of 300 nm was self-assembled by a unique single-step, catalytic CVD process consisting of dissolved cobalt and molybdenum salts in ethanol, where ethanol served as the precursor and cobalt and molybdenum as catalysts. Such large-scale assembly required process and catalyst optimization. A variety of organic compounds with varying polarity were separated at high resolution and the column efficiency demonstrated around 1000 theoretical plates/in, comparable to commercial GC columns. Evaluation of Van’t Hoff and Van deemter plots suggested that the CNTs followed classical chromatography behavior. Comparison of capacity factors (k’) and isosteric heats of adsorption (ΔHS) with a packed column containing a commercial sorbent (Carbopack CTM) showed comparable results. This demonstrated high capacity and strong sorbate-sorbent interactions on the SWCNT phase. Evaluation of the McReynolds constants suggested that the SWCNT was a non-polar phase. The high surface area of the SWCNT media allowed separations of gases, and at the same time, its high thermal stability (\u3e425°C) permitted separations of higher molecular weights at higher temperatures, thus extending the range of conventional chromatography on the same column

    Kromatografsko profiliranje primarnih metabolita kao alat za kemotaksonomsku klasifikaciju uzoraka sjemenki bobičastog voća

    Get PDF
    Research background. Considering the importance of consumption of berry fruits with proven health-beneficial properties and difficulties in quality control of products of specific botanical and geographic origin, a fingerprint method was developed, based on advanced data analysis (pattern recognition, classification), in order to relate the variability of nutrients in the selected cultivars to primary metabolite profile. Experimental approach. Forty-five samples of genuine berry fruit cultivars (strawberry, raspberry, blackberry, black currant, blueberry, gooseberry, chokeberry, cape gooseberry and goji berry) were characterized according to chromatographic profiles of primary metabolites (sugars, lipids and fatty acids) obtained by three chromatographic techniques (high-performance thin-layer chromatography, gas chromatography coupled to mass spectrometry, and high-performance anion-exchange chromatography with pulsed amperometric detection). Results and conclusions. Comprehensive analysis allowed monitoring and identification of metabolites belonging to polar lipids, mono-, di- and triacylglycerols, free fatty acids, free sterols, sterol esters, mono- to heptasaccharides and sugar alcohols. Chemical fingerprint of berry seeds showed the uniformity of primary metabolites within each fruit species, but revealed differences depending on the botanical origin. All three chromatographic methods provided a discriminative, informative and predictive metabolomics methodology, which proved to be useful for chemotaxonomic classification. Novelty and scientific contribution. A novel methodology for the identification of bioactive compounds from primary metabolites of natural products was described. The proposed untargeted metabolite profiling approach could be used in the future as a routine method for tracing of novel bioactive compounds. The knowledge of metabolite composition obtained in this study can provide a better assessment of genotypic and phenotypic differences between berry fruit species and varieties, and could contribute to the development of new breeding programs.Pozadina istraživanja. Imajući u vidu značaj konzumiranja bobičastog voća dokazanog blagotvornog učinka na organizam, ali i teškoće u kontroli kvalitete proizvoda specifičnog botaničkog i geografskog podrijetla, u radu je predložena metodologija zasnovana na kemijskom profiliranju i naprednoj analizi podataka (prepoznavanje obrazaca i klasifikacija), koja bi se mogla koristiti za procjenu autentičnosti određenih vrsta na osnovu njihovog profila primarnih metabolita. Eksperimentalni pristup. Ukupno je okarakterizirano 45 uzoraka različitih sorata bobičastog voća (jagoda, malina, kupina, crni ribiz, borovnica, ogrozd, aronija, peruanska jagoda i goji) na osnovu kemijskih profila primarnih metabolita (šećera, lipida i masnih kiselina) dobivenih pomoću triju kromatografskih tehnika (tankoslojnom kromatografijom velike učinkovitosti, plinskom kromatografijom spregnutom s masenom spektrometrijom i ionskom kromatografijom s pulsnom amperometrijskom detekcijom). Rezultati i zaključci. Sveobuhvatnom kemijskom analizom identificirane su različite klase metabolita: polarni lipidi, mono-, di- i triacilgliceroli, slobodne masne kiseline, slobodni steroli, sterolni esteri, mono- do heptasaharidi i šećerni alkoholi. Rezultati pokazuju da uzorci koji pripadaju istoj biljnoj vrsti imaju sličan kemijski profil, a različite vrste imaju različit sastav primarnih metabolita. Sve tri kromatografske metode pružaju diskriminativnu, informativnu i prediktivnu metabolomičku metodologiju primjenjivu u kemotaksonomskoj klasifikaciji. Novina i znanstveni doprinos. Opisana je nova metodologija identifikacije bioaktivnih spojeva iz primarnih metabolita prirodnih proizvoda. Predloženi pristup neciljanog profiliranja metabolita mogao bi se koristiti kao rutinska metoda pronalaska novih bioaktivnih spojeva. Poznavanje sastava metabolita omogućuje bolju procjenu genotipskih i fenotipskih razlika između sorata bobičastog voća, što može pridonijeti razvoju novih programa oplemenjivanja

    Engineering and validation of a novel lipid thin film for biomembrane modeling in lipophilicity determination of drugs and xenobiotics

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Determination of lipophilicity as a tool for predicting pharmacokinetic molecular behavior is limited by the predictive power of available experimental models of the biomembrane. There is current interest, therefore, in models that accurately simulate the biomembrane structure and function. A novel bio-device; a lipid thin film, was engineered as an alternative approach to the previous use of hydrocarbon thin films in biomembrane modeling.</p> <p>Results</p> <p>Retention behavior of four structurally diverse model compounds; 4-amino-3,5-dinitrobenzoic acid (ADBA), naproxen (NPX), nabumetone (NBT) and halofantrine (HF), representing 4 broad classes of varying molecular polarities and aqueous solubility behavior, was investigated on the lipid film, liquid paraffin, and octadecylsilane layers. Computational, thermodynamic and image analysis confirms the peculiar amphiphilic configuration of the lipid film. Effect of solute-type, layer-type and variables interactions on retention behavior was delineated by 2-way analysis of variance (ANOVA) and quantitative structure property relationships (QSPR). Validation of the lipid film was implemented by statistical correlation of a unique chromatographic metric with Log P (octanol/water) and several calculated molecular descriptors of bulk and solubility properties.</p> <p>Conclusion</p> <p>The lipid film signifies a biomimetic artificial biological interface capable of both hydrophobic and specific electrostatic interactions. It captures the hydrophilic-lipophilic balance (HLB) in the determination of lipophilicity of molecules unlike the pure hydrocarbon film of the prior art. The potentials and performance of the bio-device gives the promise of its utility as a predictive analytic tool for early-stage drug discovery science.</p

    Chemometric Classification Of Herb � Orthosiphon Stamineus According To Its Geographical Origin Using Virtual Chemical Sensor Based Upon Fast GC

    Get PDF
    An analytical method using Electronic Nose (E-nose) instrument for analysis of volatile organic compound from Orthosiphon stamineus raw samples have been developed

    Currency security and forensics: a survey

    Get PDF
    By its definition, the word currency refers to an agreed medium for exchange, a nation’s currency is the formal medium enforced by the elected governing entity. Throughout history, issuers have faced one common threat: counterfeiting. Despite technological advancements, overcoming counterfeit production remains a distant future. Scientific determination of authenticity requires a deep understanding of the raw materials and manufacturing processes involved. This survey serves as a synthesis of the current literature to understand the technology and the mechanics involved in currency manufacture and security, whilst identifying gaps in the current literature. Ultimately, a robust currency is desire
    corecore