Search CORE

21 research outputs found

FaaSwap: SLO-Aware, GPU-Efficient Serverless Inference via Model Swapping

Author: Chen Dong
Chen Ruichuan
Li Zhuohao
Luo Xiaonan
Nie Dapeng
Wang Ao
Wang Wei
Yang Haoran
Yu Haoxuan
Yu Minchen
Publication venue
Publication date: 06/06/2023
Field of study

The dynamic request patterns of machine learning (ML) inference workloads have driven an increasing trend towards exploiting serverless computing for scalable ML model serving. However, today's serverless platforms lack efficient support for GPUs -- provisioning functions on GPUs incurs extremely high overhead, forcing them to keep long-running even when idling for reduced cold starts. This leads to significant resource waste to perform ML inference and hinders the pay-per-use billing for GPUs. In this paper, we present FaaSwap, a serverless platform enabling fine-grained, request-level GPU sharing for resource-efficient ML inference. FaaSwap leverages model swapping to support fast inference execution at low resource cost. It keeps models in a host which has a large amount of cheap memory and quickly swaps models to GPUs when requested, reducing per-function keep-alive cost and enabling efficient GPU sharing across much more functions. FaaSwap also supports swapping models between GPUs for load balancing and improved inference performance. In FaaSwap, we design sophisticated request scheduling and memory management algorithms that efficiently exploit model swapping to reduce GPU cost and meet latency service-level objectives (SLOs) for all inference functions. We have implemented and integrated FaaSwap into Alibaba Cloud Function Compute (FC), one of the world's largest commercial serverless platform. Evaluation results show that FaaSwap can achieve low-latency model swapping, efficiently share a GPU across hundreds of functions, and satisfy per-function latency SLOs at scale

arXiv.org e-Print Archive

PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization

Author: Chen Hao
Jiang Chaoya
Wang Cunxiang
Wang Jindong
Wang Yidong
Xie Rui
Xie Xing
Yang Linyi
Ye Wei
Yu Zhuohao
Zeng Zhengran
Zhang Shikun
Zhang Yue
Publication venue
Publication date: 08/06/2023
Field of study

Instruction tuning large language models (LLMs) remains a challenging task, owing to the complexity of hyperparameter selection and the difficulty involved in evaluating the tuned models. To determine the optimal hyperparameters, an automatic, robust, and reliable evaluation benchmark is essential. However, establishing such a benchmark is not a trivial task due to the challenges associated with evaluation accuracy and privacy protection. In response to these challenges, we introduce a judge large language model, named PandaLM, which is trained to distinguish the superior model given several LLMs. PandaLM's focus extends beyond just the objective correctness of responses, which is the main focus of traditional evaluation datasets. It addresses vital subjective factors such as relative conciseness, clarity, adherence to instructions, comprehensiveness, and formality. To ensure the reliability of PandaLM, we collect a diverse human-annotated test dataset, where all contexts are generated by humans and labels are aligned with human preferences. Our results indicate that PandaLM-7B achieves 93.75% of GPT-3.5's evaluation ability and 88.28% of GPT-4's in terms of F1-score on our test dataset. PandaLM enables the evaluation of LLM to be fairer but with less cost, evidenced by significant improvements achieved by models tuned through PandaLM compared to their counterparts trained with default Alpaca's hyperparameters. In addition, PandaLM does not depend on API-based evaluations, thus avoiding potential data leakage. All resources of PandaLM are released at https://github.com/WeOpenML/PandaLM

arXiv.org e-Print Archive

1分子力学測定によるSecM翻訳アレスト解除機構の解析

Author: Yang Zhuohao
楊倬皓
Publication venue
Publication date
Field of study

Institutional Repositories DataBase (IRDB)

Nascent SecM Chain Outside the Ribosome Reinforces Translation Arrest

Author: Ryo Iizuka (214078)
Takashi Funatsu (214085)
Zhuohao Yang (711592)
Publication venue
Publication date: 01/01/2015
Field of study

<div>SecM, a bacterial secretion monitor protein, contains a specific amino acid sequence at its C-terminus, called arrest sequence, which interacts with the ribosomal tunnel and arrests its own translation. The arrest sequence is sufficient and necessary for stable translation arrest. However, some previous studies have suggested that the nascent chain outside the ribosome affects the stability of translation arrest. To clarify this issue, we performed in vitro translation assays with HaloTag proteins fused to the C-terminal fragment of E. coli SecM containing the arrest sequence or the full-length SecM. We showed that the translation of HaloTag proteins, which are fused to the fragment, is not effectively arrested, whereas the translation of HaloTag protein fused to full-length SecM is arrested efficiently. In addition, we observed that the nascent SecM chain outside the ribosome markedly stabilizes the translation arrest. These results indicate that changes in the nascent polypeptide chain outside the ribosome can affect the stability of translation arrest; the nascent SecM chain outside the ribosome stabilizes the translation arrest.</div

Directory of Open Access Journals

PubMed Central

FigShare

In vitro translation of HaloTag proteins harbouring the arrest sequence.

Author: Ryo Iizuka (214078)
Takashi Funatsu (214085)
Zhuohao Yang (711592)
Publication venue
Publication date
Field of study

(A) Halo-L8-SecM133–170 (lane 1), Halo-L17-SecM133–170 (lane 2), Halo-L26-SecM133–170 (lane 3), Halo-pD-L8-SecM133–170 (lane 4) and Halo-SecM1–170 (lane 5) were translated in the presence of HaloTag TMR Ligand using the PURExpress ΔRibosome Kit at 37°C for 20 min. Puromycin (1 mg/mL) was added to the reaction mixture at 0 min, and the reaction mixture was incubated at 37°C for 3 min. Aliquots were withdrawn before the addition of puromycin and after 3-min incubation and subjected to NuPAGE. Polypeptides labelled with HaloTag TMR Ligand were detected using Molecular Imager FX. Black and white arrowheads indicate the translation arrest products (polypeptidyl-tRNA) and released products, respectively. The results shown are representative of three independent experiments with similar results. (B) Myc-Halo-L8-SecM133–170 (lane 1), myc-Halo-L17-SecM133–170 (lane 2), myc-Halo-L26-SecM133–170 (lane 3), myc-Halo-pD-L8-SecM133–170 (lane 4) and myc-Halo-SecM1–170 (lane 5) were translated in the absence of HaloTag TMR Ligand using the PURExpress ΔRibosome Kit at 37°C for 20 min. Puromycin (1 mg/mL) was added at 0 min, and the reaction mixture was incubated at 37°C for 3 min. Aliquots were withdrawn before the addition of puromycin and after a 3-min incubation and subjected to NuPAGE. Myc-tagged polypeptides were detected by western blotting with anti-c-myc-tag. Black and white arrowheads indicate the translation arrest products (polypeptidyl-tRNA) and released products, respectively. The results shown are representative of three independent experiments with similar results. (C) Fractions of translation arrest products in the absence (left) and the presence of puromycin (right). Filled bars, fluorescence detection using HaloTag TMR Ligand; open bars, detection by western blotting. Error bars represent the standard deviation (SD) of three independent experiments. The asterisk indicates statistical significance as determined by the Student's t-test (p < 0.05).</p

FigShare

Lifetimes of the translation arrest of HaloTag proteins harbouring the arrest sequence.

Author: Ryo Iizuka (214078)
Takashi Funatsu (214085)
Zhuohao Yang (711592)
Publication venue
Publication date
Field of study

(A-D) Time-course analyses of polypeptidyl-tRNA remaining after the addition of puromycin. Halo-L17-SecM133–170 (A), Halo-L26-SecM133–170 (B), Halo-pD-L8-SecM133–170 (C) and Halo-SecM1–170 (D) were translated in the presence of HaloTag TMR Ligand using the PURExpress ΔRibosome Kit at 37°C for 20 min. Puromycin (1 mg/mL) was added to the reaction mixture at 0 min, and the mixture was incubated at 37°C. Aliquots removed at the indicated time points were subjected to NuPAGE. Polypeptides labelled with HaloTag TMR Ligand were detected using Molecular Imager FX. Black and white arrowheads indicate the translation arrest products (polypeptidyl-tRNA) and released products, respectively. (E) Plots of the fraction of polypeptidyl-tRNA remaining in the presence of puromycin as a function of time. Squares, Halo-L17-SecM133–170; diamonds, Halo-L26-SecM133–170; triangles, Halo-pD-L8-SecM133–170; circles, Halo-SecM1–170. Data points represent means ± SD of three independent experiments. The solid and dotted lines show the fit to the data obtained using a single exponential function. The lifetimes of the translation arrest of Halo-L17-SecM133–170, Halo-L26-SecM133–170, Halo-pD-L8-SecM133–170 and Halo-SecM1–170 were 5.6 ± 0.066, 11 ± 0.22, 9.4 ± 0.63 and 51 ± 1.6 min, respectively (the errors represent fitting errors). (F) Time-course analysis of myc-SecM1–170 polypeptidyl-tRNA remaining after the addition of puromycin. Myc-SecM1–170 was translated using the PURExpress ΔRibosome Kit at 37°C for 40 min. Puromycin (1 mg/mL) was added at 0 min, and the mixture was incubated at 37°C. Aliquots were withdrawn at indicated time points and subjected to NuPAGE. Myc-SecM1–170 was detected by western blotting with anti-c-myc-tag. Black and white arrowheads indicate the translation arrest products (polypeptidyl-tRNA) and released products, respectively. (G) The fraction of myc-SecM1–170 polypeptidyl-tRNA remaining in the presence of puromycin as a function of time. Data points with error bars represent means ± SD for three independent experiments. The solid line shows the fit to the data obtained using a single exponential function. The lifetime of the translation arrest of myc-SecM1–170 was 48 min ± 4.3 min (the error corresponds to fitting error).</p

FigShare

In vitro translation of HaloTag proteins with mutated arrest sequence.

Author: Ryo Iizuka (214078)
Takashi Funatsu (214085)
Zhuohao Yang (711592)
Publication venue
Publication date
Field of study

Each protein construct, with or without a mutation (R163A or P166A) in the arrest sequence, was translated in the presence of HaloTag TMR Ligand using the PURExpress ΔRibosome Kit at 37°C for 20 min. Puromycin (1 mg/mL) was added at 0 min, and the reaction mixture incubated at 37°C for 3 min. Aliquots were withdrawn before and 3 min after the addition of puromycin and subjected to NuPAGE. Polypeptides labelled with HaloTag TMR Ligand were detected using Molecular Imager FX. A, Halo-L8-SecM133–170; B, Halo-L17-SecM133–170; C, Halo-L26-SecM133–170; D, Halo-pD-L8-SecM133–170; E, Halo-SecM1–170. Black and white arrowheads indicate the translation arrest products (polypeptidyl-tRNA) and released products, respectively. The results shown are representative of three independent experiments with similar results.</p

FigShare

Optimization of Ni0.95−xZnxCo0.05Fe1.90Mn0.02O4 ceramics with promising magneto-dielectric properties for VHF antenna miniaturization

Author: Chuanhu Wang
Lie Liu
Ling Bing Kong
Zhihong Yang
Zhuohao Xiao
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 01/01/2018
Field of study

Magnetic, dielectric and DC conductive properties of Ni0.95−xZnxCo0.05Fe1.90Mn0.02O4 (with x=0-0.20 at an interval of 0.05) ferrite ceramics were studied, in order to develop magneto-dielectric materials with almost equal values of relative permeability and permittivity, for the miniaturization of HF (3–30MHz) and VHF (30–90MHz and 100–300MHz) antennas. The ferrite ceramics were prepared by using the conventional two-step sintering process. The real part of relative permeability is increased almost linearly with increasing concentration of Zn, while that of relative permittivity keeps nearly unchanged. It is found that promising magneto-dielectric materials, with close values of real permeability and permittivity over 30–90 MHz (VHF), can be obtained for the samples at Zn concentrations between x=0.05 and x=0.10

Directory of Open Access Journals

DR-NTU (Digital Repository of NTU)

Rapid processing of ferrite ceramics with promising magneto-dielectric characteristics

Author: Kong Ling Bing
Liu Lie
Wang Chuanhu
Xiao Zhuohao
Yang Zhihong
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 01/01/2017
Field of study

Ferrite ceramics, Ni0.88Zn0.07Co0.05Fe1.98O4, with the addition of 4wt.% Bi2O3 as sintering aid, were fabricated by using a simple one-step processing without involving the step of calcination. X-ray diffraction (XRD) results indicated that single phase ferrite ceramics can be achieved after sintering at 1000∘C for 2h. The samples demonstrated relative densities in the range of 97–99%. Desired magneto-dielectric properties have been approached by adjusting the sintering temperature and sintering time duration. This technique is believed to be applicable to other ceramic materials.Published versio

Directory of Open Access Journals

DR-NTU (Digital Repository of NTU)

Microfluidic preparation of optical sensors for biomedical applications

Author: Chong Wang
Jiali Wang
Luoran Shang
Qiao Wang
Xinyuan Yang
Zhuohao Zhang
Publication venue: 'Wiley'
Publication date: 01/02/2023
Field of study

Abstract Optical biosensors are platforms that translate biological information into detectable optical signals, and have extensive applications in various fields due to their characteristics of high sensitivity, high specificity, dynamic sensing, etc. The development of optical sensing materials is an important part of optical sensors. In this review, we emphasize the role of microfluidic technology in the preparation of optical sensing materials and the application of the derived optical sensors in the biomedical field. We first present some common optical sensing mechanisms and the functional responsive materials involved. Then, we describe the preparation of these sensing materials by microfluidics. Afterward, we enumerate the biomedical applications of these optical materials as biosensors in disease diagnosis, drug evaluation, and organ‐on‐a‐chip. Finally, we discuss the challenges and prospects in this field

Directory of Open Access Journals