SimpleStrat: Diversifying Language Model Generation with Stratification

Wong, Justin; Orlovskiy, Yury; Luo, Michael; Seshia, Sanjit A.; Gonzalez, Joseph E.

arXiv:2410.09038 (cs)

[Submitted on 11 Oct 2024]

Abstract:Generating diverse responses from large language models (LLMs) is crucial for applications such as planning/search and synthetic data generation, where diversity provides distinct answers across generations. Prior approaches rely on increasing temperature to increase diversity. However, contrary to popular belief, we show that this approach not only produces lower-quality individual generations as temperature increases, but also depends on the model's next-token probabilities being similar to the true distribution of answers. We propose SimpleStrat, an alternative approach that uses the language model itself to partition the space into strata. At inference, a random stratum is selected and a sample is drawn from within that stratum. To measure diversity, we introduce CoverageQA, a dataset of underspecified questions with multiple equally plausible answers, and assess diversity by measuring the KL divergence between the output distribution and the uniform distribution over valid ground-truth answers. As computing the probability per response/solution for proprietary models is infeasible, we measure recall on ground-truth solutions. Our evaluation shows that using SimpleStrat achieves 0.05 higher recall compared to GPT-4o and a 0.36 average reduction in KL divergence compared to Llama 3.
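
The sampling step and the two diversity measures named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: in the paper the language model itself proposes the strata and generates the answer, whereas here both are stubbed with a hypothetical toy table (TOY_STRATA, toy_sample_within). The function names, the choice of natural-log units, the direction KL(empirical || uniform), and the decision to ignore responses outside the valid answer set are all assumptions made for illustration.

```python
import math
import random
from collections import Counter

# Hypothetical toy strata for an underspecified question such as
# "Name a US state." In SimpleStrat the LLM would propose the partition.
TOY_STRATA = {
    "West": ["California", "Oregon"],
    "East": ["New York", "Maine"],
}


def toy_sample_within(stratum, rng):
    """Stand-in for an LLM call conditioned on the chosen stratum."""
    return rng.choice(TOY_STRATA[stratum])


def stratified_sample(strata, sample_within, rng):
    """Select one stratum uniformly at random, then sample inside it."""
    stratum = rng.choice(strata)
    return sample_within(stratum, rng)


def kl_to_uniform(responses, valid_answers):
    """KL(empirical || uniform) over valid answers, in nats.

    Assumption: responses that are not valid answers are ignored.
    Answers that never occur contribute zero to the sum.
    """
    counts = Counter(r for r in responses if r in valid_answers)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no valid responses to score")
    uniform = 1.0 / len(valid_answers)
    return sum(
        (c / total) * math.log((c / total) / uniform)
        for c in counts.values()
    )


def recall(responses, valid_answers):
    """Fraction of valid answers that appear at least once."""
    return len(set(responses) & set(valid_answers)) / len(valid_answers)


# Draw a batch of stratified samples and score it with both measures.
rng = random.Random(0)
valid = [a for answers in TOY_STRATA.values() for a in answers]
batch = [
    stratified_sample(list(TOY_STRATA), toy_sample_within, rng)
    for _ in range(200)
]
print("KL to uniform:", kl_to_uniform(batch, valid))
print("Recall:", recall(batch, valid))
```

A KL value of zero means the sampled answers are spread exactly uniformly over the valid set, and a recall of 1.0 means every valid answer appeared at least once. Recall needs only the sampled text, which is why it remains usable when per-response probabilities are unavailable.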

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2410.09038 [cs.CL]
	(or arXiv:2410.09038v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.09038

Submission history

From: Justin Wong
[v1] Fri, 11 Oct 2024 17:54:14 UTC (8,105 KB)
