Western Mortgage Loan Corporation v. Cottonwood Construction Company, a Corporation, et al. : Appellant's Brief
Intermediate Appeal from Interlocutory Pretrial Rulings of the 3rd District Court for Salt Lake County, Honorable Aldon J. Anderson, Judge
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Using vision-language models (VLMs) in web development presents a promising
strategy to increase efficiency and unblock no-code solutions: by providing a
screenshot or a sketch of a UI, a VLM could generate the code to reproduce it,
for instance in a language like HTML. Despite the advancements in VLMs for
various tasks, the specific challenge of converting a screenshot into a
corresponding HTML code has been minimally explored. We posit that this is mainly
due to the absence of a suitable, high-quality dataset. This work introduces
WebSight, a synthetic dataset consisting of 2 million pairs of HTML codes and
their corresponding screenshots. We fine-tune a foundational VLM on our dataset
and show proficiency in converting webpage screenshots to functional HTML code.
To accelerate research in this area, we open-source WebSight.
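As a quick orientation to the released data, here is a minimal sketch of streaming a few WebSight pairs from the Hugging Face Hub. The repo id "HuggingFaceM4/WebSight" matches the public release; the column names used below are assumptions and may differ across dataset versions.

```python
# Minimal sketch: stream a few screenshot/HTML pairs from WebSight.
# Streaming avoids materializing all ~2 million pairs locally.
from datasets import load_dataset

ds = load_dataset("HuggingFaceM4/WebSight", split="train", streaming=True)

for example in ds.take(3):
    html = example["text"]    # HTML source of the page (assumed column name)
    image = example["image"]  # rendered screenshot as a PIL image (assumed column name)
    print(html[:200])
```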
What matters when building vision-language models?
The growing interest in vision-language models (VLMs) has been driven by
improvements in large language models and vision transformers. Despite the
abundance of literature on this subject, we observe that critical decisions
regarding the design of VLMs are often not justified. We argue that these
unsupported decisions impede progress in the field by making it difficult to
identify which choices improve model performance. To address this issue, we
conduct extensive experiments around pre-trained models, architecture choice,
data, and training methods. Our consolidation of findings includes the
development of Idefics2, an efficient foundational VLM of 8 billion parameters.
Idefics2 achieves state-of-the-art performance within its size category across
various multimodal benchmarks, and is often on par with models four times its
size. We release the model (base, instructed, and chat) along with the datasets
created for its training.
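For readers who want to try the released checkpoint, a minimal inference sketch with the transformers library follows. The model id "HuggingFaceM4/idefics2-8b" and the chat-template usage mirror the public model card; the screenshot URL is a hypothetical placeholder.

```python
# Minimal sketch: run image+text generation with Idefics2 via transformers.
import requests
from PIL import Image
from transformers import AutoProcessor, AutoModelForVision2Seq

processor = AutoProcessor.from_pretrained("HuggingFaceM4/idefics2-8b")
model = AutoModelForVision2Seq.from_pretrained(
    "HuggingFaceM4/idefics2-8b", device_map="auto"
)

# Placeholder URL; substitute any webpage screenshot.
image = Image.open(
    requests.get("https://example.com/screenshot.png", stream=True).raw
)

# Interleave the image and an instruction in a chat-style prompt.
messages = [{
    "role": "user",
    "content": [
        {"type": "image"},
        {"type": "text", "text": "Describe this webpage."},
    ],
}]
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=[image], return_tensors="pt").to(model.device)

generated = model.generate(**inputs, max_new_tokens=256)
print(processor.batch_decode(generated, skip_special_tokens=True)[0])
```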
- …