Cited 0 time in
ACR: Adaptive Confidence Re-Scoring for Reliable Answer Selection Among Multiple Candidates
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Jeong, Eunhye | - |
| dc.contributor.author | Choi, Yong Suk | - |
| dc.date.accessioned | 2025-10-02T02:00:10Z | - |
| dc.date.available | 2025-10-02T02:00:10Z | - |
| dc.date.issued | 2025-08 | - |
| dc.identifier.issn | 2076-3417 | - |
| dc.identifier.issn | 2076-3417 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208862 | - |
| dc.description.abstract | With the improved reasoning capabilities of large language models (LLMs), their applications have rapidly expanded across a wide range of tasks. In recent question answering tasks, performance gains have been achieved through Self-Consistency, where LLMs generate multiple reasoning paths and determine the final answer via majority voting. However, this approach can fail when the correct answer is generated but does not appear frequently enough to be selected, highlighting its vulnerability to inconsistent generations. To address this, we propose Adaptive Confidence Re-scoring (ACR)-a method that adaptively evaluates and re-scores candidate answers to select the most trustworthy one when LLMs fail to generate consistent reasoning. Experiments on arithmetic and logical reasoning benchmarks show that ACR maintains or improves answer accuracy while significantly reducing inference cost. Compared to existing verification methods such as FOBAR, ACR reduces the number of inference calls by up to 95%, while improving inference efficiency-measured as accuracy gain per inference call-by a factor of 2x to 17x, depending on the dataset and model. | - |
| dc.format.extent | 16 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | MDPI | - |
| dc.title | ACR: Adaptive Confidence Re-Scoring for Reliable Answer Selection Among Multiple Candidates | - |
| dc.type | Article | - |
| dc.publisher.location | 스위스 | - |
| dc.identifier.doi | 10.3390/app15179587 | - |
| dc.identifier.scopusid | 2-s2.0-105015529723 | - |
| dc.identifier.wosid | 001569542100001 | - |
| dc.identifier.bibliographicCitation | Applied Sciences-basel, v.15, no.17, pp 1 - 16 | - |
| dc.citation.title | Applied Sciences-basel | - |
| dc.citation.volume | 15 | - |
| dc.citation.number | 17 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 16 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | Y | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Chemistry | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Materials Science | - |
| dc.relation.journalResearchArea | Physics | - |
| dc.relation.journalWebOfScienceCategory | Chemistry, Multidisciplinary | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Multidisciplinary | - |
| dc.relation.journalWebOfScienceCategory | Materials Science, Multidisciplinary | - |
| dc.relation.journalWebOfScienceCategory | Physics, Applied | - |
| dc.subject.keywordPlus | Natural language processing systems | - |
| dc.subject.keywordPlus | Question answering | - |
| dc.subject.keywordAuthor | natural language processing | - |
| dc.subject.keywordAuthor | question answering | - |
| dc.subject.keywordAuthor | large language models | - |
| dc.subject.keywordAuthor | prompt engineering | - |
| dc.subject.keywordAuthor | verification | - |
| dc.identifier.url | https://www.mdpi.com/2076-3417/15/17/9587 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
