Cited 0 time in
Think Just Enough: Leveraging Self-Assessed Confidence for Adaptive Reasoning in Language Models
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Kim, Junyeob | - |
| dc.contributor.author | Lee, Sang-Goo | - |
| dc.contributor.author | Kim, Taeuk | - |
| dc.date.accessioned | 2026-06-01T05:00:08Z | - |
| dc.date.available | 2026-06-01T05:00:08Z | - |
| dc.date.issued | 2026-03 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212912 | - |
| dc.description.abstract | Recent reinforcement learning (RL)-trained language models have demonstrated strong performance on complex reasoning tasks by producing long and detailed reasoning traces. However, despite these advancements, they often struggle with finding the right balance in reasoning length: some terminate prematurely before reaching a correct answer (underthinking), while others continue reasoning beyond necessity, leading to inefficiency or even degraded accuracy (overthinking).To address these challenges, we propose a method for optimizing reasoning length via self-assessed confidence. By prompting the model to evaluate its own confidence at intermediate reasoning steps, we enable dynamic stopping once sufficient reasoning is achieved.Experiments across multiple reasoning benchmarks show that our approach improves computational efficiency without compromising answer quality. Furthermore, we find that confidence estimates from RL-trained reasoning models are more reliable than those from standard LLMs, making it a valuable internal signal for controlling reasoning depth. | - |
| dc.format.extent | 7 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Association for Computational Linguistics (ACL) | - |
| dc.title | Think Just Enough: Leveraging Self-Assessed Confidence for Adaptive Reasoning in Language Models | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.18653/v1/2026.findings-eacl.263 | - |
| dc.identifier.scopusid | 2-s2.0-105039052399 | - |
| dc.identifier.bibliographicCitation | 19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026, pp 5000 - 5006 | - |
| dc.citation.title | 19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026 | - |
| dc.citation.startPage | 5000 | - |
| dc.citation.endPage | 5006 | - |
| dc.type.docType | Conference paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.subject.keywordPlus | Benchmarking | - |
| dc.subject.keywordPlus | Computational linguistics | - |
| dc.subject.keywordPlus | Machine learning | - |
| dc.subject.keywordPlus | Natural language processing systems | - |
| dc.identifier.url | https://aclanthology.org/2026.findings-eacl.263/ | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
