Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

Yun, Sukmin

doi:10.48550/arXiv.2406.20098

Full metadata record

DC Field	Value	Language
dc.contributor.author	Yun, Sukmin	-
dc.date.accessioned	2025-01-13T06:00:21Z	-
dc.date.available	2025-01-13T06:00:21Z	-
dc.date.issued	2024-12	-
dc.identifier.uri	https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/121999	-
dc.description.abstract	Multimodal large language models (MLLMs) have shown impressive success across modalities such as image, video, and audio in a variety of understanding and generation tasks. We propose Web2Code, a benchmark consisting of a new large-scale webpage-to-code dataset for instruction tuning and an evaluation framework for the webpage understanding and HTML code translation abilities of MLLMs. For dataset construction, we leverage pretrained LLMs to enhance existing webpage-to-code datasets as well as generate a diverse pool of new webpages rendered into images. To evaluate model performance in these tasks, we develop an evaluation framework for testing MLLMs' abilities in webpage understanding and web-to-code generation. Extensive experiments show that our proposed dataset is beneficial not only to our proposed tasks but also in the general visual domain, while previous datasets result in worse performance. We hope our work will contribute to the development of general MLLMs suitable for web-based content generation and task automation.	-
dc.format.extent	24	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	NeurIPS Foundation	-
dc.title	Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs	-
dc.type	Article	-
dc.identifier.doi	10.48550/arXiv.2406.20098	-
dc.identifier.bibliographicCitation	Conference on Neural Information Processing Systems, pp 1 - 24	-
dc.citation.title	Conference on Neural Information Processing Systems	-
dc.citation.startPage	1	-
dc.citation.endPage	24	-
dc.type.docType	Proceeding	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	foreign	-
dc.subject.keywordPlus	Computer Vision and Pattern Recognition (cs.CV)	-
dc.subject.keywordPlus	Artificial Intelligence (cs.AI)	-
dc.subject.keywordPlus	Computation and Language (cs.CL)	-
dc.identifier.url	https://mbzuai-llm.github.io/webpage2code/	-

Appears in Collections: COLLEGE OF COMPUTING > DEPARTMENT OF ARTIFICIAL INTELLIGENCE > 1. Journal Articles

Related Researcher

Yun, Sukmin: ERICA College of Computing (Department of Artificial Intelligence)
