9 research outputs found

Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts

Author: Daxberger Erik
Du Xianzhi
Eichner Marcin
Emmersberger Michael
Gunter Tom
Pang Ruoming
Toshev Alexander
Weers Floris
Yang Yinfei
Zhang Bowen
Publication date: 08/09/2023

Sparse Mixture-of-Experts models (MoEs) have recently gained popularity due to their ability to decouple model size from inference efficiency by only activating a small subset of the model parameters for any given input token. As such, sparse MoEs have enabled unprecedented scalability, resulting in tremendous successes across domains such as natural language processing and computer vision. In this work, we instead explore the use of sparse MoEs to scale down Vision Transformers (ViTs) to make them more attractive for resource-constrained vision applications. To this end, we propose a simplified and mobile-friendly MoE design where entire images rather than individual patches are routed to the experts. We also propose a stable MoE training procedure that uses super-class information to guide the router. We empirically show that our sparse Mobile Vision MoEs (V-MoEs) can achieve a better trade-off between performance and efficiency than the corresponding dense ViTs. For example, for the ViT-Tiny model, our Mobile V-MoE outperforms its dense counterpart by 3.39% on ImageNet-1k. For an even smaller ViT variant with only 54M FLOPs inference cost, our MoE achieves an improvement of 4.66%.

arXiv.org e-Print Archive

Financial Stability Monitoring

Author: Acharya
Adam Ashcraft
Adam Copeland
Adi Sunderam
Adrian
Allen Malz
Ana Fostel
Andrew Haughwout
Antoine Martin
Antonio Falato
Arvind Krishnamurthy
Atif Mian
Ben Bernanke
Bengt Holmström
Carlos Arteta
Celso Brunetti
Charles Goodhart
Charles Himmelberg
Claudio Borio
Daniel Covitz
Daniel M. Covitz
Dimitrios Bisias
Don H Kim
Douglas W Diamond
Duygan-Bump
Edward Altman
Edwin Elton
Gabriel Jiménez
Gary B Gorton
Gene Amromin
Giovanni Dell'
Ian Christensen
Ignazio Angeloni
Ing Cheng
Javier Bianchi
Jens Christensen
Jeremy C Stein
Jeremy Stein
Jing-Zhi Huang
John
John Geanakoplos
John H Cochrane
John Y Campbell
Karl Case
Kenneth Froot
Kenneth Garbade
Kiyotaki Nobuhiro
Kristopher Gerardi
Marcin Kacperczyk
Mark Gertler
Markus Brunnermeier
Matthew J Eichner
Michael J Fleming
Michael Kiley
Michael Woodford
Monika Piazzesi
Morgan Ricks
Narayana Kocherlakota
Nellie Liang
Niccola Gennaioli
Paolo Angelini
Patrick E Mccabe
Rama Cont
Richard Bookstaber
Robert Novy-Marx
Robin Greenwood
Rochelle M Edge
Rodney Garratt
Russell Wermers
Samuel G Hanson
Sean D Campbell
Simon Gilchrist
Song Han
Stefano Eusepi
Steffanie Brady
Stephen Morris
Steve Sharpe
Steven N Kaplan
Sudheer Chava
Teodora Paligorova
Tim Opler
Tobias Adrian
Victoria Ivashina
William Bassett
William Dudley
Xin Huang
Zhiguo He
Zoltan Pozsar
Publication venue: Elsevier BV
Publication date: 01/01/2013

Crossref

Articulated Pose Estimation of Multiple Persons

Author: Eichner Marcin
Publication venue: ETH
Publication date: 01/01/2012

Repository for Publications and Research Data

Video Retrieval by Mimicking Poses

Author: Andrew Zisserman
C. V. Jawahar
Marcin Eichner
Nataraj Jammalamadaka
Vittorio Ferrari
Publication date: 01/01/2012

We describe a method for real-time video retrieval where the task is to match the 2D human pose of a query. A user can form a query by (i) interactively controlling a stickman on a web-based GUI, (ii) uploading an image of the desired pose, or (iii) using the Kinect and acting out the query himself. The method is scalable and is applied to a dataset of 18 films totaling more than three million frames. The real-time performance is achieved by searching for approximate nearest neighbors to the query using a random forest of K-D trees. Apart from the query modalities, we introduce two other areas of novelty. First, we show that pose retrieval can proceed using a low-dimensional representation. Second, we show that the precision of the results can be improved substantially by combining the outputs of independent human pose estimation algorithms. The performance of the system is assessed quantitatively over a range of pose queries.

CiteSeerX

Crossref

Improvement of solid particle erosion and corrosion resistance using TiAlSiN/Cr multilayer coatings

Author: Abadias
Anaraki
ASTM-G76 ASTM International
Bhowmick
Bobzin
Bonu
Borawski
Bousser
Cao
Carvalho
Chipatecua
Creus
Dachen Deng
Dang
DeMasi-Marcin
Du
Ehiasarian
Eichner
El-Rahman
Fenker
Feuerstein
Finnie
Gachon
Grewal
Gu
Guo
Guodong Li
Hassani
Holleck
Hui Peng
Hutchings
Jiabin Gu
Khanna
Kong
Kouznetsov
Leyland
Li
Lin
Liu
Liuhe Li
Ma
Matthews
Meng
Meng Ai
Messier
Mori
Münz
Naveed
Oka
Pang
Parameswaran
Park
Patsalas
Peipei Zhang
Pelleg
Potts
Poursaeidi
Purandare
Rajendran
Reedy
Reinhard
Ruff
Savaloni
Selvadurai
Sida Luo
Song
Su
Swadźba
Tan
Ting
Vaz
Velicu
Vepřek
Wang
Wieciński
Xie
Yang
Ye Xu
Yi Xu
Yoo
Zhou
Publication venue: Elsevier BV

Crossref