Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Web2Code: A Large-scale Webpage-to-Code Datasetand Evaluation Framework for Multimodal LLMs

Authors
윤석민
Issue Date
Dec-2024
Publisher
NeurIPS Foundation
Citation
Conference on Neural Information Processing Systems, pp 1 - 24
Pages
24
Indexed
FOREIGN
Journal Title
Conference on Neural Information Processing Systems
Start Page
1
End Page
24
URI
https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/121999
DOI
10.48550/arXiv.2406.20098 Focus to learn more
Abstract
Multimodal large language models (MLLMs) have shown impressive success across modalities such as image, video, and audio in a variety of understanding and generation tasks.we propose Web2Code, a benchmark consisting of a new large-scale webpage-to-code dataset for instruction tuning and an evaluation framework for the webpage understanding and HTML code translation abilities of MLLMs. For dataset construction, we leveraging pretrained LLMs to enhance existing webpage-to-code datasets as well as generate a diverse pool of new webpages rendered into images.To evaluate model performance in these tasks, we develop an evaluation framework for testing MLLMs' abilities in webpage understanding and web-to-code generation.Extensive experiments show that our proposed dataset is beneficial not only to our proposed tasks but also in the general visual domain, while previous datasets result in worse performance. We hope our work will contribute to the development of general MLLMs suitable for web-based content generation and task automation.
Files in This Item
Go to Link
Appears in
Collections
COLLEGE OF COMPUTING > DEPARTMENT OF ARTIFICIAL INTELLIGENCE > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Yun, Sukmin photo

Yun, Sukmin
ERICA 소프트웨어융합대학 (DEPARTMENT OF ARTIFICIAL INTELLIGENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE