No video

Jonathan Mellon "Using LLMs to code open-text social survey responses at scale"

  Рет қаралды 234

Rohan Alexander

Rohan Alexander

Күн бұрын

Friday 8 March 2024, noon - 1pm
Jonathan Mellon, West Point
"Using LLMs to code open-text social survey responses at scale”
Open Access link to paper (published in Research & Politics):
Jonathan Mellon, Jack Bailey, Ralph Scott, James Breckwoldt, Marta Miori, Phillip Schmedeman
journals.sagep...
Replication code/data: dataverse.harv...
Includes all prompts/API calls/local models etc.
We compare the accuracy of six LLMs using a few-shot approach, three supervised learning algorithms (SVM, DistilRoBERTa, and a neural network trained on BERT embeddings), and a second human coder on the task of categorizing “most important issue” responses from the British Election Study Internet Panel into 50 categories. For the scenario where a researcher lacks existing training data, the accuracy of the highest-performing LLM (Claude-1.3: 93.9%) neared human performance (94.7%) and exceeded the highest-performing supervised approach trained on 1000 randomly sampled cases (neural network: 93.5%). In a scenario where previous data has been labeled but a researcher wants to label novel text, the best LLM’s (Claude-1.3: 80.9%) few-shot performance is only slightly behind the human (88.6%) and exceeds the best supervised model trained on 576,000 cases (DistilRoBERTa: 77.8%). PaLM-2, Llama-2, and the SVM all performed substantially worse than the best LLMs and supervised models across all metrics and scenarios. Our results suggest that LLMs may allow for greater use of open-ended survey questions in the future."
Jonathan Mellon is an Associate Professor at West Point’s Department of Systems Engineering and co-director of the British Election Study. His research focuses on improving measurement and causal inference in social science. He studies electoral behavior, online citizen engagement, and measuring public opinion. He holds a DPhil in Sociology from Nuffield College, University of Oxford.

Пікірлер
GraphRAG: The Most Incredible RAG Strategy Revealed
10:38
Mervin Praison
Рет қаралды 31 М.
What is Retrieval-Augmented Generation (RAG)?
6:36
IBM Technology
Рет қаралды 667 М.
Ouch.. 🤕
00:30
Celine & Michiel
Рет қаралды 48 МЛН
7 Days Stranded In A Cave
17:59
MrBeast
Рет қаралды 82 МЛН
Doing This Instead Of Studying.. 😳
00:12
Jojo Sim
Рет қаралды 35 МЛН
طردت النملة من المنزل😡 ماذا فعل؟🥲
00:25
Cool Tool SHORTS Arabic
Рет қаралды 11 МЛН
Bradley Congelio - Introduction to NFL Analytics with R
33:30
Rohan Alexander
Рет қаралды 193
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 938 М.
A Survey of Techniques for Maximizing LLM Performance
45:32
Abel Brodeur - Mass Reproducibility and Replicability: A New Hope
50:26
Kosuke Imai "Does AI help humans make better decisions?"
1:07:28
Rohan Alexander
Рет қаралды 90
AI, Machine Learning, Deep Learning and Generative AI Explained
10:01
IBM Technology
Рет қаралды 89 М.
Ouch.. 🤕
00:30
Celine & Michiel
Рет қаралды 48 МЛН