login
Home / Papers / Touché-25-Advertisement-in-Retrieval-Augmented-Generation

Touché-25-Advertisement-in-Retrieval-Augmented-Generation

1283 Citations2024
Payal Bajaj, Daniel Campos, Nick Craswell

The size of the dataset and the fact that the questions are derived from real user search queries distinguishes MS MARCO from other well-known publicly available datasets for machine reading comprehension and question-answering.

Abstract

Data for Sub-Task 1 of the Advertisement in Retrieval-Augmented Generation task at Touché 2025. The dataset contains segments retrieved from the segmented version of MS MARCO V2.1. The queries used in retrieval are taken from the Webis Generated Native Ads 2024 dataset.