Data Contamination of LLMs

Context

A major challenge in evaluating modern LLMs is determining whether a model has previously seen benchmark data during training. This project focuses on detecting and mitigating training data contamination.

Motivation

Students will develop frameworks that:

Goal

Requirements

Contact