Think Just Enough: Leveraging Self-Assessed Confidence for Adaptive Reasoning in Language Models

Kim, Junyeob; Lee, Sang-Goo; Kim, Taeuk

doi:10.18653/v1/2026.findings-eacl.263

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Think Just Enough: Leveraging Self-Assessed Confidence for Adaptive Reasoning in Language Models

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim, Junyeob	-
dc.contributor.author	Lee, Sang-Goo	-
dc.contributor.author	Kim, Taeuk	-
dc.date.accessioned	2026-06-01T05:00:08Z	-
dc.date.available	2026-06-01T05:00:08Z	-
dc.date.issued	2026-03	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/212912	-
dc.description.abstract	Recent reinforcement learning (RL)-trained language models have demonstrated strong performance on complex reasoning tasks by producing long and detailed reasoning traces. However, despite these advancements, they often struggle with finding the right balance in reasoning length: some terminate prematurely before reaching a correct answer (underthinking), while others continue reasoning beyond necessity, leading to inefficiency or even degraded accuracy (overthinking).To address these challenges, we propose a method for optimizing reasoning length via self-assessed confidence. By prompting the model to evaluate its own confidence at intermediate reasoning steps, we enable dynamic stopping once sufficient reasoning is achieved.Experiments across multiple reasoning benchmarks show that our approach improves computational efficiency without compromising answer quality. Furthermore, we find that confidence estimates from RL-trained reasoning models are more reliable than those from standard LLMs, making it a valuable internal signal for controlling reasoning depth.	-
dc.format.extent	7	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Association for Computational Linguistics (ACL)	-
dc.title	Think Just Enough: Leveraging Self-Assessed Confidence for Adaptive Reasoning in Language Models	-
dc.type	Article	-
dc.identifier.doi	10.18653/v1/2026.findings-eacl.263	-
dc.identifier.scopusid	2-s2.0-105039052399	-
dc.identifier.bibliographicCitation	19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026, pp 5000 - 5006	-
dc.citation.title	19th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2026	-
dc.citation.startPage	5000	-
dc.citation.endPage	5006	-
dc.type.docType	Conference paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.subject.keywordPlus	Benchmarking	-
dc.subject.keywordPlus	Computational linguistics	-
dc.subject.keywordPlus	Machine learning	-
dc.subject.keywordPlus	Natural language processing systems	-
dc.identifier.url	https://aclanthology.org/2026.findings-eacl.263/	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Kim, Taeuk photo

Kim, Taeuk: COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE