Rethinking the Uncertainty: A Critical Review and Analysis in the Era of Large Language Models

Beigi, Mohammad; Wang, Sijia; Shen, Ying; Lin, Zihao; Kulkarni, Adithya; He, Jianfeng; Chen, Feng; Jin, Ming; Cho, Jin-Hee; Zhou, Dawei; Lu, Chang-Tien; Huang, Lifu

Computer Science > Artificial Intelligence

arXiv:2410.20199 (cs)

[Submitted on 26 Oct 2024]

Title:Rethinking the Uncertainty: A Critical Review and Analysis in the Era of Large Language Models

Authors:Mohammad Beigi, Sijia Wang, Ying Shen, Zihao Lin, Adithya Kulkarni, Jianfeng He, Feng Chen, Ming Jin, Jin-Hee Cho, Dawei Zhou, Chang-Tien Lu, Lifu Huang

View PDF HTML (experimental)

Abstract:In recent years, Large Language Models (LLMs) have become fundamental to a broad spectrum of artificial intelligence applications. As the use of LLMs expands, precisely estimating the uncertainty in their predictions has become crucial. Current methods often struggle to accurately identify, measure, and address the true uncertainty, with many focusing primarily on estimating model confidence. This discrepancy is largely due to an incomplete understanding of where, when, and how uncertainties are injected into models. This paper introduces a comprehensive framework specifically designed to identify and understand the types and sources of uncertainty, aligned with the unique characteristics of LLMs. Our framework enhances the understanding of the diverse landscape of uncertainties by systematically categorizing and defining each type, establishing a solid foundation for developing targeted methods that can precisely quantify these uncertainties. We also provide a detailed introduction to key related concepts and examine the limitations of current methods in mission-critical and safety-sensitive applications. The paper concludes with a perspective on future directions aimed at enhancing the reliability and practical adoption of these methods in real-world scenarios.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2410.20199 [cs.AI]
	(or arXiv:2410.20199v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2410.20199

Submission history

From: Sijia Wang [view email]
[v1] Sat, 26 Oct 2024 15:07:15 UTC (9,135 KB)

Computer Science > Artificial Intelligence

Title:Rethinking the Uncertainty: A Critical Review and Analysis in the Era of Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Rethinking the Uncertainty: A Critical Review and Analysis in the Era of Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators