4,682 research outputs found

    Image based crop row detection using global optimization methods

    Get PDF
    U ovoj doktorskoj disertaciji naglasak je stavljen na razvoj metoda za prepoznavanje redova usjeva na slici. Cilj je razviti metodu koja posjeduje sljedeca svojstva: robusnost s obzirom na prisutnost korova i sjene, mogucnost prepoznavanja redova usjeva za razlicite kulture u razlicitim stadijima rasta, mogucnost prepoznavanja razlicitog broja proizvoljno razmaknutih redova usjeva te mogucnost prepoznavanja ravnih i zakrivljenih redova usjeva. Razvijene su dvije metode koje su nazvane CRDI metoda i TMGEM metoda. CRDI metoda zasnovana je na inkrementalnoj metodi za trazenje priblizno optimalne particije skupa podataka te primjeni DIRECT algoritma za globalnu optimizaciju. Metoda je pogodna za prepoznavanje ravnih redova usjeva, pri cemu broj redova usjeva mora biti unaprijed poznat. TMGEM metoda zasnovana je racunanju podudaranja s predloskom i minimizaciji funkcije globalne energije primjenom dinamickog programiranja. Funkcija globalne energije, uz podatke sa slike, koristi i prethodno znanje o geometrijskoj strukturi redova usjeva, cime se postize veca tocnost prepoznavanja redova usjeva. Predlozena metoda ima mogucnost prepoznavanja proizvoljnog broja redova usjeva za razlicite kulture u razlicitim stadijima rasta te pri razlicitim geometrijama polja. Nadalje, predlozena TMGEM metoda robusna je na zakrivljenost redova usjeva. Osim navedenih metoda, u radu je predlozen novi evaluacijski okvir za usporedbu metoda prepoznavanja redova usjeva koji ukljucuje bazu slika redova usjeva, postupak rucnog generiranja referentnih vrijednosti redova usjeva te kriterije za odredivanje tocnosti prepoznavanja redova usjeva. Na slikama u bazi snimljeni su redovi usjeva kukuruza, celera, krumpira, luka, suncokreta i soje. Koristenjem predlozenog evaluacijskog okvira moguce je ekasnije i objektivnije usporediti novo razvijene metode za prepoznavanje redova usjeva s postojecim metodama. U eksperimentalnoj evaluaciji CRDI metoda je usporedena s IMLD i HT metodom na umjetno generiranim skupovima podataka. Tocnost prepoznavanja odredena je primjenom CRHID kriterija, a rezultati pokazuju da CRDI metoda prepoznaje redove usjeva sa znatno vecom tocnoscu u odnosu na ostale razmatrane metode. TMGEM metoda testirana je na 225 stvarnih slika redova usjeva iz baze slika te je usporedena s HT, HTT i LR metodom. Tocnost prepoznavanja redova usjeva odredena je koristenjem CRDA kriterija, a rezultati pokazuju da predlozena metoda znacajno nadmasuje ostale tri razmatrane metode pri prepoznavanju ravnih redova usjeva te je robusna na zakrivljenost redova usjeva.The topic of this doctoral thesis is the development of image based crop row detection methods. The main goal of the research is to develop a method which is: highly insensitive to the presence of weeds and shadows, capable of detecting crop rows of dierent crop types at dierent stages of growth, capable of detecting straight and curved crop rows and insensitive to the number and spacing of crop rows. Two methods are proposed entitled CRDI and TMGEM. CRDI method is based on incremental method of searching for an approximate globally optimal partition of a set of data points and on the DIRECT algorithm for global optimization. The method is capable of detecting straight crop rows, wherein the number of crop rows must be known in advance. TMGEM method is based on template matching followed by global energy minimization with dynamic programming technique. For accurate crop row detection, the global energy function combines image evidence and prior knowledge about the geometric structure of crop rows. The proposed method is insensitive to the number and spacing of crop rows and is capable of detecting crop rows of dierent crop types at dierent stages of growth. Furthermore, the proposed TMGEM method is capable of detecting curved crop rows. A new evaluation framework is proposed that consists of a crop row image database, manual ground truth image creation approach and two crop row detection performance measures. The image database includes images of dierent crop types including maize, celery, potato, onion, sun ower and soybean. The proposed evaluation framework enables ecient and objective comparison of new crop row detection methods with existing ones. The experimental evaluation of CRDI method includes comparison with IMLD and HT method on synthetic datasets, based on the proposed CRHID performance measure. The results show that CRDI method outperforms other considered methods. TMGEM method is evaluated on a set of 225 real-world crop row images from the image database and it is compared with HT, HTT and LR method. The proposed CRDA measure is used as a performance measure in the comparison. The results show that TMGEM signicantly outperforms the other considered methods in straight crop row detection and is capable of detecting curved crop rows

    Detect or Track: Towards Cost-Effective Video Object Detection/Tracking

    Full text link
    State-of-the-art object detectors and trackers are developing fast. Trackers are in general more efficient than detectors but bear the risk of drifting. A question is hence raised -- how to improve the accuracy of video object detection/tracking by utilizing the existing detectors and trackers within a given time budget? A baseline is frame skipping -- detecting every N-th frames and tracking for the frames in between. This baseline, however, is suboptimal since the detection frequency should depend on the tracking quality. To this end, we propose a scheduler network, which determines to detect or track at a certain frame, as a generalization of Siamese trackers. Although being light-weight and simple in structure, the scheduler network is more effective than the frame skipping baselines and flow-based approaches, as validated on ImageNet VID dataset in video object detection/tracking.Comment: Accepted to AAAI 201

    Automatic Brain Tumor Segmentation by Deep Convolutional Networks and Graph Cuts

    Get PDF
    Brain tumor segmentation in magnetic resonance imaging (MRI) is helpful for diagnostics, growth rate prediction, tumor volume measurements and treatment planning of brain tumor. The difficulties for brain tumor segmentation are mainly due to high variation of brain tumors in size, shape, regularity, location, and their heterogeneous appearance (e.g., contrast, intensity and texture variation for different tumors). Due to recent advances in deep convolutional neural networks for semantic image segmentation, automatic brain tumor segmentation is a promising research direction. This thesis investigates automatic brain tumor segmentation by combining deep convolutional neural network with regularization by a graph cut. We investigate several deep convolutional network structures that have been successful in semantic and medical image segmentation. Since the tumor pixels account for a very small portion in the whole brain slice, segmenting the tumor from the background is a highly imbalanced dense prediction task. We use a loss function that takes the imbalance of the training data into consideration. In the second part of the thesis, we improve the segmentation results of a deep neural network by using optimization framework with graph cuts. The graph cut framework can improve segmentation boundaries by making them more smooth and regular. The main issue when using the segmentation results of convolutional neural networks for the graph cut optimization framework is to convert tumor probabilities learned by a convolutional network into data terms. We investigate several possible ways that take into consideration the segmentation artifacts by convolutional neural networks. In experiments, we present the segmentation results by different deep convolutional neural network structures, e.g., fully convolutional neural network, dilated residual network and UNet. Also, we compare the combination of U-Net with different data terms for graph cut regularization to improve the neural network segmentation results. Experimental results show that the U-Net performs best with the intersection over union (IoU) for tumors of 0.7286. The IoU for tumors is improved to 0.7530 by training on three slices. Also, the IoU for tumors is improved to 0.7713 by U-Net with balanced loss function. The IoU for tumors is further improved to 0.8078 by graph cut regularization
    corecore