Skip to content

Month: September 2024

The Cross-Sectionality Problem in Machine Learning Benchmarking Datasets

(Written with Claude-3.5-Sonnet. I have checked and edited the content. -Joni) Summary of the Issue The holdout paradigm, commonly implemented as a train-test split, is a fundamental technique in machine learning for assessing model performance. However, when applied to cross-sectional data, it can lead to significant challenges in terms of model generalizability. Cross-sectional data, by its nature, provides a snapshot…

The Agency Theory of Chat-Based AI Personas: Insights from Economics

Written in collaboration with Claude-3.5.-Sonnet. All information checked by human author. Background Because of the utility and value of large language models (LLMs) in tasks supporting persona creation and users’ interaction with personas, chat-based AI personas are becoming increasingly prevalent. However, these personas present unique challenges that can be understood through the lens of agency theory (also known as the…