Behavior-Aware Data Valuation for LLMs at Scale
EB2 3001, 890 Oval Drive, Raleigh, NC, United States
Abstract: Large Language Models (LLMs) depend on massive datasets whose quality and influence remain largely opaque. Data valuation offers principled methods to…