Web2Code: A Large-scale Webpage-to-Code Datasetand Evaluation Framework for Multimodal LLMs
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 윤석민 | - |
dc.date.accessioned | 2025-01-13T06:00:21Z | - |
dc.date.available | 2025-01-13T06:00:21Z | - |
dc.date.issued | 2024-12 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/121999 | - |
dc.description.abstract | Multimodal large language models (MLLMs) have shown impressive success across modalities such as image, video, and audio in a variety of understanding and generation tasks.we propose Web2Code, a benchmark consisting of a new large-scale webpage-to-code dataset for instruction tuning and an evaluation framework for the webpage understanding and HTML code translation abilities of MLLMs. For dataset construction, we leveraging pretrained LLMs to enhance existing webpage-to-code datasets as well as generate a diverse pool of new webpages rendered into images.To evaluate model performance in these tasks, we develop an evaluation framework for testing MLLMs' abilities in webpage understanding and web-to-code generation.Extensive experiments show that our proposed dataset is beneficial not only to our proposed tasks but also in the general visual domain, while previous datasets result in worse performance. We hope our work will contribute to the development of general MLLMs suitable for web-based content generation and task automation. | - |
dc.format.extent | 24 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | NeurIPS Foundation | - |
dc.title | Web2Code: A Large-scale Webpage-to-Code Datasetand Evaluation Framework for Multimodal LLMs | - |
dc.type | Article | - |
dc.identifier.doi | 10.48550/arXiv.2406.20098 Focus to learn more | - |
dc.identifier.bibliographicCitation | Conference on Neural Information Processing Systems, pp 1 - 24 | - |
dc.citation.title | Conference on Neural Information Processing Systems | - |
dc.citation.startPage | 1 | - |
dc.citation.endPage | 24 | - |
dc.type.docType | Proceeding | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | foreign | - |
dc.subject.keywordPlus | Computer Vision and Pattern Recognition (cs.CV) | - |
dc.subject.keywordPlus | Artificial Intelligence (cs.AI) | - |
dc.subject.keywordPlus | Computation and Language (cs.CL) | - |
dc.identifier.url | https://mbzuai-llm.github.io/webpage2code/ | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.