42 research outputs found

    Selectivity estimation on set containment search

    Full text link
    © Springer Nature Switzerland AG 2019. In this paper, we study the problem of selectivity estimation on set containment search. Given a query record Q and a record dataset S, we aim to accurately and efficiently estimate the selectivity of set containment search of query Q over S. The problem has many important applications in commercial fields and scientific studies. To the best of our knowledge, this is the first work to study this important problem. We first extend existing distinct value estimating techniques to solve this problem and develop an inverted list and G-KMV sketch based approach IL-GKMV. We analyse that the performance of IL-GKMV degrades with the increase of vocabulary size. Motivated by limitations of existing techniques and the inherent challenges of the problem, we resort to developing effective and efficient sampling approaches and propose an ordered trie structure based sampling approach named OT-Sampling. OT-Sampling partitions records based on element frequency and occurrence patterns and is significantly more accurate compared with simple random sampling method and IL-GKMV. To further enhance performance, a divide-and-conquer based sampling approach, DC-Sampling, is presented with an inclusion/exclusion prefix to explore the pruning opportunities. We theoretically analyse the proposed techniques regarding various accuracy estimators. Our comprehensive experiments on 6 real datasets verify the effectiveness and efficiency of our proposed techniques

    QuickSel: Quick Selectivity Learning with Mixture Models

    Full text link
    Estimating the selectivity of a query is a key step in almost any cost-based query optimizer. Most of today's databases rely on histograms or samples that are periodically refreshed by re-scanning the data as the underlying data changes. Since frequent scans are costly, these statistics are often stale and lead to poor selectivity estimates. As an alternative to scans, query-driven histograms have been proposed, which refine the histograms based on the actual selectivities of the observed queries. Unfortunately, these approaches are either too costly to use in practice---i.e., require an exponential number of buckets---or quickly lose their advantage as they observe more queries. In this paper, we propose a selectivity learning framework, called QuickSel, which falls into the query-driven paradigm but does not use histograms. Instead, it builds an internal model of the underlying data, which can be refined significantly faster (e.g., only 1.9 milliseconds for 300 queries). This fast refinement allows QuickSel to continuously learn from each query and yield increasingly more accurate selectivity estimates over time. Unlike query-driven histograms, QuickSel relies on a mixture model and a new optimization algorithm for training its model. Our extensive experiments on two real-world datasets confirm that, given the same target accuracy, QuickSel is 34.0x-179.4x faster than state-of-the-art query-driven histograms, including ISOMER and STHoles. Further, given the same space budget, QuickSel is 26.8%-91.8% more accurate than periodically-updated histograms and samples, respectively

    On the Variability of the Length Weight Relationship for Atlantic Bluefin Tuna, Thunnus thynnus (L.)

    Full text link
    Following extensive review, a model of the Atlantic bluefin tuna (ABFT), Thunnus thynnus (L.), length–weight relationship for the eastern Atlantic and Mediterranean (RW = 0.0000188 SFL3.01247; Ec 1) is presented on the basis of samples of ABFT spawners, with an average value of index K = 2.03 ± 0.15SD, collected by the Atlantic traps of Portugal and Spain in the Strait of Gibraltar (1963; 1996–1998; 2000–2012), and a set of samples of juvenile fishes from ICCAT–GBYP (n = 707). The resulting model (Ec 1), together with the model used for the eastern stock assessment (RW = 0.000019607 SFL3.0092; Ec 2) and a recently adopted by ICCAT Standing Committee on Research and Statistics (SCRS) (RW = 0.0000315551 SFL2.898454; EAST) are analyzed in using a bi-variant sample [SFL (cm), RW (kg)] of 474 pairs of data with the aim of validating them and establishing which model(s) best fit the reality represented by the sample and, therefore, will have the greatest descriptive and predictive power. The result of the analysis indicates that the model EAST clearly underestimates the weight of spawning ABFT and that model Ec 2 overestimates it slightly, being model Ec 1 that best explains the data of the sample. The result of the classical statistical analysis is confirmed by means of the quantile regression technique, selecting the quantiles 5, 25, 50, 75, and 95%. Other fisheries and biological indicators also conclude that the model EAST gradually underestimates the weight of ABFT spawners (of 2–3 m) by 9–12.5 %, and does not meet the criterion that for RW = 725 kg (Wmax), SFL = 319.93 ± 11.3 cm (Lmax).Cort, JL.; Estruch Fuster, VD.; Neves Dos Santos, M.; Di Natale, A.; Abid, N.; De La Serna, JM. (2015). On the Variability of the Length Weight Relationship for Atlantic Bluefin Tuna, Thunnus thynnus (L.). Reviews in Fisheries Science & Aquaculture. 23(1):23-38. doi:10.1080/23308249.2015.1008625S2338231Aguado-Giménez, F., & García-García, B. (2005). Changes in some morphometric relationships in Atlantic bluefin tuna (Thunnus thynnus thynnus Linnaeus, 1758) as a result of fattening process. Aquaculture, 249(1-4), 303-309. doi:10.1016/j.aquaculture.2005.04.064Block, B. A., Teo, S. L. H., Walli, A., Boustany, A., Stokesbury, M. J. W., Farwell, C. J., … Williams, T. D. (2005). Electronic tagging and population structure of Atlantic bluefin tuna. Nature, 434(7037), 1121-1127. doi:10.1038/nature03463Chapman, E. W., Jørgensen, C., & Lutcavage, M. E. (2011). Atlantic bluefin tuna (Thunnus thynnus): a state-dependent energy allocation model for growth, maturation, and reproductive investment. Canadian Journal of Fisheries and Aquatic Sciences, 68(11), 1934-1951. doi:10.1139/f2011-109Cort, J. L., Arregui, I., Estruch, V. D., & Deguara, S. (2014). Validation of the Growth Equation Applicable to the Eastern Atlantic Bluefin Tuna,Thunnus thynnus(L.), UsingLmax, Tag-Recapture, and First Dorsal Spine Analysis. Reviews in Fisheries Science & Aquaculture, 22(3), 239-255. doi:10.1080/23308249.2014.931173Cort, J. L., Deguara, S., Galaz, T., Mèlich, B., Artetxe, I., Arregi, I., … Idrissi, M. (2013). Determination ofLmaxfor Atlantic Bluefin Tuna,Thunnus thynnus(L.), from Meta-Analysis of Published and Available Biometric Data. Reviews in Fisheries Science, 21(2), 181-212. doi:10.1080/10641262.2013.793284Fraser, K.Possessed. World Record Holder for Bluefin Tuna. Kingstown, Nova Scotia: T & S Office Essentials and printing, 243 pp. (2008).Fromentin, J.-M., & Powers, J. E. (2005). Atlantic bluefin tuna: population dynamics, ecology, fisheries and management. Fish and Fisheries, 6(4), 281-306. doi:10.1111/j.1467-2979.2005.00197.xHattour, A.Contribution a l’étude des Scombridés de Tunisie. Université de Tunis. Faculté des Sciences, 168 pp. (1979).Karakulak, S., Oray, I., Corriero, A., Deflorio, M., Santamaria, N., Desantis, S., & De Metrio, G. (2004). Evidence of a spawning area for the bluefin tuna (Thunnus thynnus L.) in the eastern Mediterranean. Journal of Applied Ichthyology, 20(4), 318-320. doi:10.1111/j.1439-0426.2004.00561.xKoenker, R., & Bassett, G. (1978). Regression Quantiles. Econometrica, 46(1), 33. doi:10.2307/1913643Koenker, R. (2005). Quantile Regression. doi:10.1017/cbo9780511754098Milatou, N., & Megalofonou, P. (2014). Age structure and growth of bluefin tuna (Thunnus thynnus, L.) in the capture-based aquaculture in the Mediterranean Sea. Aquaculture, 424-425, 35-44. doi:10.1016/j.aquaculture.2013.12.037Perçin, F., & Akyol, O. (2009). Lengthâ weight and lengthâ length relationships of the bluefin tuna,Thunnus thynnusL., in the Turkish part of the eastern Mediterranean Sea. Journal of Applied Ichthyology, 25(6), 782-784. doi:10.1111/j.1439-0426.2009.01288.xPercin, F., & Akyol, O. (2010). Some Morphometric Relationships in Fattened Bluefin Tuna, Thunnus thynnus L., from the Turkish Aegean Sea. Journal of Animal and Veterinary Advances, 9(11), 1684-1688. doi:10.3923/javaa.2010.1684.1688Rooker, J. R., Alvarado Bremer, J. R., Block, B. A., Dewar, H., de Metrio, G., Corriero, A., … Secor, D. H. (2007). Life History and Stock Structure of Atlantic Bluefin Tuna (Thunnus thynnus). Reviews in Fisheries Science, 15(4), 265-310. doi:10.1080/10641260701484135Sinovcic, G., Franicevic, M., Zorica, B., & Cikes-Kec, V. (2004). Length-weight and length-length relationships for 10 pelagic fish species from the Adriatic Sea (Croatia). Journal of Applied Ichthyology, 20(2), 156-158. doi:10.1046/j.1439-0426.2003.00519.xTičina, V., Grubišić, L., Šegvić Bubić, T., & Katavić, I. (2011). Biometric characteristics of small Atlantic bluefin tuna (Thunnus thynnus, Linnaeus, 1758) of Mediterranean Sea origin. Journal of Applied Ichthyology, 27(4), 971-976. doi:10.1111/j.1439-0426.2011.01752.

    A practical guide to photoacoustic tomography in the life sciences

    Get PDF
    The life sciences can benefit greatly from imaging technologies that connect microscopic discoveries with macroscopic observations. One technology uniquely positioned to provide such benefits is photoacoustic tomography (PAT), a sensitive modality for imaging optical absorption contrast over a range of spatial scales at high speed. In PAT, endogenous contrast reveals a tissue's anatomical, functional, metabolic, and histologic properties, and exogenous contrast provides molecular and cellular specificity. The spatial scale of PAT covers organelles, cells, tissues, organs, and small animals. Consequently, PAT is complementary to other imaging modalities in contrast mechanism, penetration, spatial resolution, and temporal resolution. We review the fundamentals of PAT and provide practical guidelines for matching PAT systems with research needs. We also summarize the most promising biomedical applications of PAT, discuss related challenges, and envision PAT's potential to lead to further breakthroughs

    Intraperitoneal drain placement and outcomes after elective colorectal surgery: international matched, prospective, cohort study

    Get PDF
    Despite current guidelines, intraperitoneal drain placement after elective colorectal surgery remains widespread. Drains were not associated with earlier detection of intraperitoneal collections, but were associated with prolonged hospital stay and increased risk of surgical-site infections.Background Many surgeons routinely place intraperitoneal drains after elective colorectal surgery. However, enhanced recovery after surgery guidelines recommend against their routine use owing to a lack of clear clinical benefit. This study aimed to describe international variation in intraperitoneal drain placement and the safety of this practice. Methods COMPASS (COMPlicAted intra-abdominal collectionS after colorectal Surgery) was a prospective, international, cohort study which enrolled consecutive adults undergoing elective colorectal surgery (February to March 2020). The primary outcome was the rate of intraperitoneal drain placement. Secondary outcomes included: rate and time to diagnosis of postoperative intraperitoneal collections; rate of surgical site infections (SSIs); time to discharge; and 30-day major postoperative complications (Clavien-Dindo grade at least III). After propensity score matching, multivariable logistic regression and Cox proportional hazards regression were used to estimate the independent association of the secondary outcomes with drain placement. Results Overall, 1805 patients from 22 countries were included (798 women, 44.2 per cent; median age 67.0 years). The drain insertion rate was 51.9 per cent (937 patients). After matching, drains were not associated with reduced rates (odds ratio (OR) 1.33, 95 per cent c.i. 0.79 to 2.23; P = 0.287) or earlier detection (hazard ratio (HR) 0.87, 0.33 to 2.31; P = 0.780) of collections. Although not associated with worse major postoperative complications (OR 1.09, 0.68 to 1.75; P = 0.709), drains were associated with delayed hospital discharge (HR 0.58, 0.52 to 0.66; P < 0.001) and an increased risk of SSIs (OR 2.47, 1.50 to 4.05; P < 0.001). Conclusion Intraperitoneal drain placement after elective colorectal surgery is not associated with earlier detection of postoperative collections, but prolongs hospital stay and increases SSI risk

    Resilient monotone submodular function maximization

    No full text
    In this paper, we focus on applications in machine learning, optimization, and control that call for the resilient selection of a few elements, e.g. features, sensors, or leaders, against a number of adversarial denial-of-service attacks or failures. In general, such resilient optimization problems are hard, and cannot be solved exactly in polynomial time, even though they often involve objective functions that are monotone and submodular. Notwithstanding, in this paper we provide the first scalable algorithm for their approximate solution, that is valid for any number of attacks or failures, and which, for functions with low curvature, guarantees superior approximation performance. Notably, the curvature has been known to tighten approximations for several non-resilient maximization problems, yet its effect on resilient maximization had hitherto been unknown. We complement our theoretical analyses with supporting empirical evaluations

    Immune cell imaging using multi-spectral optoacoustic tomography.

    No full text
    Multispectral optoacoustic tomography (MSOT) offers the potential to image in high-resolution cells tagged with optical labels. In contrast to single wavelength imaging, multispectral excitation and spectral unmixing can differentiate labeled moieties over tissue absorption in the absence of background measurements. This feature can enable longitudinal cellular biology studies well beyond the depths reached by optical microscopy. However, the relation between spectrally resolved fluorescently labeled cells and optoacoustic detection has not been systematically investigated. Herein, we measured titrations of fluorescently labeled cells and establish the optoacoustic signal generated by these cells as a function of cell number and across different cell types. We then assess the MSOT sensitivity to resolve cells implanted in animals

    Olympic agents

    No full text
    We present an agent-oriented middle-tier architecture deployed during the realisation of the Athens 2004 Olympics results internet broadcasting. The system involved the online processing of messages (XML in nature) and their publishing to the www.athens2004.com internet site. Those messages were containing the Games intermediate and final results and were originated from the Olympic venues. For the accomplishment of this task a number of systems and applications needed to be integrated. Also the domain posed some unique problems regarding the fact that for the first time in the history of the Games a real time approach for broadcasting results was deployed and furthermore due to the reliability and performance requirements of the system. Various enterprise application integration patterns were used in conjunction with an agent oriented design approach. Asynchronous intercommunicating agents were deployed for realizing the architectural components of the system. © Springer-Verlag Berlin Heidelberg 2007

    Olympic agents

    No full text
    corecore