Computational Social Science Series – Fall 2025
Introduction to R, R Projects, R Markdown and the Basics of LLMs
October 3 | 12-1:30pm | PUP 438
Advancing Social Science Research: A Workshop Series on AI, LLMs, and Computational Methods
“Introduction to R, R Projects, R Markdown and the Basics of LLMs”
Dr. Eric Stokan, Director, CS3 and Associate Professor, Department of Political Science
Session one
-
Install and set up R and RStudio
-
R fundamentals: objects, vectors, data frames, scripts, packages (tidyverse)
-
R Markdown/Quarto: document structure, code chunks, rendering to HTML/PDF
-
LLMs 101: what they are, common research use cases, and calling them through an R package called ellmer.
Takeaways
-
A working R/RStudio environment
-
A template R Markdown report you can reuse
-
A checklist for responsible LLM use in research

Inductive Analysis of Texts with Embeddings
November 5 | 12-1:30pm | Commons 329
Advancing Social Science Research: A Workshop Series on AI, LLMs, and Computational Methods
“Inductive Analysis of Texts with Embeddings”
Dr. Dustin Stoltz, Assistant Professor of Sociology and Cognitive Science at Lehigh University
Word or text embeddings are a central component in modern language models, including those powering generative AI. Embeddings represent word meanings as positions in space, where words that are closer together are used in similar contexts or evoke similar concepts — even if those words never actually co-occur. We navigate the meaning space created by embeddings directly using basic arithmetic, and in doing so, explore how meaning changes overtime or how meaning differs between different collections of texts.
Dustin Stoltz is an Assistant Professor of Sociology and Cognitive Science at Lehigh University. He studies a variety of topics in cultural and economic sociology and specializes in computational methods. Five copies of his recently published book Mapping Texts: Computational Text Analysis for the Social Sciences (coauthored by Marshall Taylor) was raffled off to workshop registrants.

Dealing with unstructured data (text, images), APIs, and API calls to LLMs
November 7 | 12-1:30pm | PUP 438
Advancing Social Science Research: A Workshop Series on AI, LLMs, and Computational Methods
“Dealing with unstructured data (text, images), APIs, and API calls to LLMs”
Dr. Eric Stokan, Director, CS3 and Associate Professor, Department of Political Science
- Used the R programming language to work with unstructured data (text and images)
- Used OpenAI (makers of ChatGPT) to analyze unstructured data through an API
- Compared LLMs to other Natural Lnaguage Processing techniques (topic models, Named Entity Recognition)
- Discussed improved ways to prompt LLMs

Prompt-engineering and Fine-tuning in R
December 12 | 12-1:30pm | PUP 438
Advancing Social Science Research: A Workshop Series on AI, LLMs, and Computational Methods
“Prompt-engineering and Fine-tuning in R”
Dr. Eric Stokan, Director, CS3 and Associate Professor, Department of Political Science
This session focused on the following topics:
1. Prompt engineering techniques
2. Discussion of fine-tuning techniques
3. Evaluation techniques for estimating the effectiveness of LLMs and LRMs
