Large Language Models: An Applied Econometric Framework

December 09, 2024 · Declared Dead · 🏛 Social Science Research Network

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Jens Ludwig, Sendhil Mullainathan, Ashesh Rambachan arXiv ID 2412.07031 Category econ.EM Cross-listed cs.AI Citations 33 Venue Social Science Research Network Last Checked 3 months ago

Abstract

Large language models (LLMs) enable researchers to analyze text at unprecedented scale and minimal cost. Researchers can now revisit old questions and tackle novel ones with rich data. We provide an econometric framework for realizing this potential in two empirical uses. For prediction problems -- forecasting outcomes from text -- valid conclusions require ``no training leakage'' between the LLM's training data and the researcher's sample, which can be enforced through careful model choice and research design. For estimation problems -- automating the measurement of economic concepts for downstream analysis -- valid downstream inference requires combining LLM outputs with a small validation sample to deliver consistent and precise estimates. Absent a validation sample, researchers cannot assess possible errors in LLM outputs, and consequently seemingly innocuous choices (which model, which prompt) can produce dramatically different parameter estimates. When used appropriately, LLMs are powerful tools that can expand the frontier of empirical economics.

📄 View on arXiv 🌐 View on ar5iv 📑 PDF 🎉 Report Code Found

Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

📜 Similar Papers

In the same crypt — econ.EM

R.I.P. 👻 Ghosted

Design-based Analysis in Difference-In-Differences Settings with Staggered Adoption

Susan Athey, Guido Imbens

econ.EM 🏛 J.E 📚 731 cites 7 years ago

R.I.P. 👻 Ghosted

Machine Learning Advances for Time Series Forecasting

Ricardo P. Masini, Marcelo C. Medeiros, Eduardo F. Mendes

econ.EM 🏛 Journal of economic surveys (Print) 📚 408 cites 5 years ago

R.I.P. 👻 Ghosted

Deep Neural Networks for Estimation and Inference

Max H. Farrell, Tengyuan Liang, Sanjog Misra

econ.EM 🏛 Econometrica 📚 261 cites 7 years ago

R.I.P. 👻 Ghosted

Take a Look Around: Using Street View and Satellite Images to Estimate House Prices

Stephen Law, Brooks Paige, Chris Russell

econ.EM 🏛 ACM TIST 📚 150 cites 7 years ago

R.I.P. 👻 Ghosted

Discrete Choice and Rational Inattention: a General Equivalence Result

Mogens Fosgerau, Emerson Melo, ... (+2 more)

econ.EM 🏛 International Economic Review 📚 97 cites 8 years ago

R.I.P. 👻 Ghosted

Estimating Heterogeneous Consumer Preferences for Restaurants and Travel Time Using Mobile Location Data

Susan Athey, David Blei, ... (+3 more)

econ.EM 🏛 arXiv 📚 69 cites 8 years ago

Died the same way — 👻 Ghosted

R.I.P. 👻 Ghosted

Federated Learning: Strategies for Improving Communication Efficiency

Jakub Konečný, H. Brendan McMahan, ... (+4 more)

cs.LG 🏛 arXiv 📚 5.2K cites 9 years ago

R.I.P. 👻 Ghosted

In-Datacenter Performance Analysis of a Tensor Processing Unit

Norman P. Jouppi, Cliff Young, ... (+73 more)

cs.AR 🏛 ISCA 📚 5.1K cites 9 years ago

R.I.P. 👻 Ghosted

Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning

Hoo-Chang Shin, Holger R. Roth, ... (+7 more)

cs.CV 🏛 IEEE TMI 📚 4.9K cites 10 years ago

R.I.P. 👻 Ghosted

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

cs.AI 🏛 AI 📚 4.9K cites 8 years ago