Large-Scale Data Collection Using ChatGPT API and R

Collecting novel, large-scale data has become essential to social science research. This workshop showed attendees how to use R and the ChatGPT API to automate large-scale data collection and cleaning, dramatically reducing the time and cost involved. It covered how ChatGPT can extract or summarize information from unstructured PDF files and which models (e.g., GPT-3.5, GPT-4, GPT-4 Turbo) are suitable for specific data collection tasks.
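
As a rough illustration of the kind of workflow described above, the following is a minimal sketch and not the workshop's actual material. It assumes the `pdftools` and `httr2` packages are installed, that an API key is stored in the `OPENAI_API_KEY` environment variable, and that R 4.1 or later is used (for the native pipe). The function names `build_extraction_body` and `extract_from_pdf`, the prompt wording, and the default model `gpt-3.5-turbo` are all illustrative assumptions.

```r
# Build the JSON body for a chat completion request.
# Pure function: no network access, so it can be inspected and tested offline.
build_extraction_body <- function(document_text, fields, model = "gpt-3.5-turbo") {
  prompt <- paste0(
    "Extract the following fields from the document and answer in JSON: ",
    paste(fields, collapse = ", "),
    "\n\nDocument:\n", document_text
  )
  list(
    model = model,
    temperature = 0,  # favor repeatable answers for data collection
    messages = list(
      list(role = "system", content = "You extract structured data from text."),
      list(role = "user", content = prompt)
    )
  )
}

# Read one PDF, send its text to the chat completions endpoint,
# and return the model's reply as a character string.
extract_from_pdf <- function(pdf_path, fields, model = "gpt-3.5-turbo",
                             api_key = Sys.getenv("OPENAI_API_KEY")) {
  # pdf_text() returns one string per page; join them into one document
  text <- paste(pdftools::pdf_text(pdf_path), collapse = "\n")
  resp <- httr2::request("https://api.openai.com/v1/chat/completions") |>
    httr2::req_auth_bearer_token(api_key) |>
    httr2::req_body_json(build_extraction_body(text, fields, model)) |>
    httr2::req_perform()
  httr2::resp_body_json(resp)$choices[[1]]$message$content
}

# Hypothetical usage (file name and fields are placeholders):
# extract_from_pdf("report.pdf", c("author", "year"))
```

To process a whole folder, the same function could be applied over `list.files()` with `lapply()`. Long PDFs may exceed a model's context limit, which is one reason the choice of model matters for a given collection task.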