
    Hyper: Distributed Cloud Processing for Large-Scale Deep Learning Tasks

    Training and deploying deep learning models in real-world applications requires processing large amounts of data. This becomes challenging when the data grows to hundreds of terabytes or even petabyte scale. We introduce a hybrid distributed cloud framework that provides a unified view of multiple clouds and on-premise infrastructure for processing tasks at scale on both CPU and GPU compute instances. The system implements a distributed file system and a failure-tolerant task scheduler, independent of the programming language and deep learning framework used, and makes it possible to exploit cheap, unstable cloud resources to significantly reduce costs. We demonstrate the scalability of the framework on pre-processing, distributed training, hyperparameter search, and large-scale inference tasks utilizing 10,000 CPU cores and 300 GPU instances with an overall processing power of 30 petaflops.
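
    The abstract above describes a failure-tolerant scheduler that retries work when cheap, preemptible cloud instances disappear. The following minimal Python sketch illustrates that retry-on-preemption idea under assumed names; Task, PreemptedError, and run_on_spot_instance are hypothetical illustrations, not the Hyper API.

        # Minimal sketch of retry-on-preemption scheduling over cheap,
        # unstable workers. All names here are hypothetical illustrations.
        import random
        from dataclasses import dataclass

        class PreemptedError(Exception):
            """Raised when a spot/preemptible worker is reclaimed by the cloud."""

        @dataclass
        class Task:
            task_id: int
            payload: str

        def run_on_spot_instance(task: Task) -> str:
            # Stand-in for dispatching work to a cheap, unstable cloud instance.
            if random.random() < 0.3:  # simulate preemption
                raise PreemptedError(f"worker lost while running task {task.task_id}")
            return f"result of task {task.task_id}"

        def schedule(tasks, max_retries=5):
            results = {}
            for task in tasks:
                for _attempt in range(max_retries):
                    try:
                        results[task.task_id] = run_on_spot_instance(task)
                        break
                    except PreemptedError:
                        continue  # reschedule the task on another worker
                else:
                    raise RuntimeError(f"task {task.task_id} failed after {max_retries} retries")
            return results

        print(schedule([Task(i, f"chunk-{i}") for i in range(5)]))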

    PZnet: Efficient 3D ConvNet Inference on Manycore CPUs

    Convolutional nets have been shown to achieve state-of-the-art accuracy in many biomedical image analysis tasks. Many tasks within the biomedical analysis domain involve analyzing volumetric (3D) data acquired by CT, MRI, and microscopy. To deploy convolutional nets in practical working systems, it is important to solve the efficient inference problem: one should be able to apply an already-trained convolutional network to many large images using limited computational resources. In this paper we present PZnet, a CPU-only engine that can be used to perform inference for a variety of 3D convolutional net architectures. PZnet outperforms MKL-based CPU implementations of PyTorch and TensorFlow by more than 3.5x for the popular U-net architecture. Moreover, for 3D convolutions with low feature map counts, cloud CPU inference with PZnet outperforms cloud GPU inference in terms of cost efficiency.
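
    For context on the comparison above, the sketch below shows the kind of workload being accelerated: 3D convolutional inference on CPU with PyTorch, one of the MKL-backed baselines the abstract mentions. It is not PZnet itself; the block structure and tensor shapes are illustrative assumptions.

        # Baseline sketch: 3D conv inference on CPU with PyTorch (not PZnet).
        import torch
        import torch.nn as nn

        class TinyUNetBlock(nn.Module):
            """A single 3D conv block of the kind found in U-net-style architectures."""
            def __init__(self, in_ch=1, out_ch=8):
                super().__init__()
                self.block = nn.Sequential(
                    nn.Conv3d(in_ch, out_ch, kernel_size=3, padding=1),
                    nn.ReLU(inplace=True),
                    nn.Conv3d(out_ch, out_ch, kernel_size=3, padding=1),
                    nn.ReLU(inplace=True),
                )

            def forward(self, x):
                return self.block(x)

        model = TinyUNetBlock().eval()
        volume = torch.randn(1, 1, 64, 64, 64)  # batch, channel, depth, height, width
        with torch.no_grad():                   # inference only, no autograd overhead
            out = model(volume)
        print(out.shape)                        # torch.Size([1, 8, 64, 64, 64])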

    Deep Lake: a Lakehouse for Deep Learning

    Traditional data lakes provide critical data infrastructure for analytical workloads by enabling time travel, running SQL queries, ingesting data with ACID transactions, and visualizing petabyte-scale datasets on cloud storage. They allow organizations to break down data silos, unlock data-driven decision-making, improve operational efficiency, and reduce costs. However, as deep learning takes over common analytical workflows, traditional data lakes become less useful for applications such as natural language processing (NLP), audio processing, computer vision, and other applications involving non-tabular datasets. This paper presents Deep Lake, an open-source lakehouse for deep learning applications developed at Activeloop. Deep Lake retains the benefits of a vanilla data lake with one key difference: it stores complex data, such as images, videos, and annotations, as well as tabular data, in the form of tensors and rapidly streams the data over the network to (a) the Tensor Query Language, (b) an in-browser visualization engine, or (c) deep learning frameworks without sacrificing GPU utilization. Datasets stored in Deep Lake can be accessed from PyTorch, TensorFlow, and JAX, and integrate with numerous MLOps tools.
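
    A minimal usage sketch based on the public deeplake Python package (Deep Lake 3.x) follows; the exact method names, tensor names, and dataset path are assumptions that may differ between versions, so treat it as illustrative rather than canonical.

        # Sketch of streaming a Deep Lake dataset into PyTorch, assuming the
        # deeplake 3.x API; names and paths are illustrative assumptions.
        import deeplake

        # Load a dataset hosted on Activeloop storage and stream it over the network.
        ds = deeplake.load("hub://activeloop/mnist-train")

        # Wrap the dataset as a PyTorch dataloader without materializing it locally.
        dataloader = ds.pytorch(batch_size=32, shuffle=True, num_workers=2)

        for batch in dataloader:
            # Batches are dicts keyed by tensor name ("images", "labels" assumed here).
            images, labels = batch["images"], batch["labels"]
            # ... feed the batch to a training or inference loop ...
            break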