Query-Focused Submodular Demonstration Selection for In-Context Learning in Large Language Models

Research output: Chapter in Book/Report/Conference proceedingsConference proceedingpeer-review

Abstract

The increase in dataset and parameter size of large language models has given rise to an emergent ability known as In-context Learning (ICL). This approach allows models to perform tasks based on human instructions and a few demonstration examples in a prompt. ICL differs from traditional fine-tuning methods by enabling the adaptation of pretrained models to new tasks without modifying their core parameters or requiring gradient updates. Despite its potential, the intri-cacies of ICL, particularly the methods for choosing effective demonstration examples to enhance predictive performance, are not fully understood, with prior research often relying on random selection. Our research addresses this gap in two ways. Firstly, we advocate the use of query-focused submodular mutual information functions for selecting demonstration examples in ICL. These functions help identify examples that are both diverse and representative, thereby improving few-shot performance in comparison to random and zero-shot baselines. Our experiments validate this approach. Secondly, we introduce an interactive tool to explore the impact of hyperparameters on model performance. These parameters include the quantity and generation methods of demonstration examples, and their influence on data manifolds and clusters. Our results show that carefully chosen examples can lead to performance improvements of up to 20%. For instance, in sentiment classification, we observed an f1-score of 88.35% compared to 51.95%, and in topic classification, 90.56% versus 31.38%.

Original languageEnglish
Title of host publication2023 31st Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350360219
DOIs
Publication statusPublished - 2023
Event31st Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2023 - Letterkenny, Ireland
Duration: 7 Dec 20238 Dec 2023

Publication series

Name2023 31st Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2023

Conference

Conference31st Irish Conference on Artificial Intelligence and Cognitive Science, AICS 2023
Country/TerritoryIreland
CityLetterkenny
Period7/12/238/12/23

Keywords

  • Data Selection
  • In-context Learning
  • Language Models
  • Submodular Optimization
  • Visualization

Fingerprint

Dive into the research topics of 'Query-Focused Submodular Demonstration Selection for In-Context Learning in Large Language Models'. Together they form a unique fingerprint.

Cite this