11-697: Introduction to Question Answering with Large Language Models
Fall 2025: MW 12:30-1:50P, DH 1211
Instructors: Eric Nyberg and Teruko Mitamura
Teaching Assistant: Kimihiro Hasegawa
Course Description: This course is designed to be
accessible to Master's and advanced undergraduate students who seek
the basic skills necessary to implement practical Question
Answering (QA) applications using Large Language Models (LLMs) in
specific information domains. The syllabus includes learning
materials on the core concepts of QA and LLMs, and how they are
applied in closed commercial systems (e.g. ChatGPT) as well as
open systems (e.g. Llama, T5). Students complete a set of hands-on
exercises in Python that develop skills in applying LLMs to
various open-source QA datasets. The course is also a
prerequisite for 11-797 Question Answering (an advanced
project-oriented course).
Prerequisite Knowledge: A course in Statistics and Probability, and at least
intermediate Python programming skills.
Course Goals: Students acquire basic knowledge of QA approaches and tasks,
including Data Analysis, Solution Design, Metrics, Evaluation and Error Analysis.
Grading:
- Quizzes = 24% (8 x 3%)
- Homeworks = 66% (5 x 10% + 1 x 16%)
- Attendance/Class Participation = 10%
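As a quick check on the arithmetic above, the weights can be combined into a final percentage as in the minimal sketch below. Only the weights come from the Grading list. The function name final_grade, the 0-to-1 score scale, and the placement of the 16% homework last in the list are illustrative assumptions, not course policy.

```python
# Weights taken from the Grading list; everything else is an illustrative assumption.
QUIZ_WEIGHTS = [3.0] * 8                 # 8 quizzes x 3% each = 24%
HOMEWORK_WEIGHTS = [10.0] * 5 + [16.0]   # 5 x 10% + 1 x 16% = 66% (16% one assumed last)
PARTICIPATION_WEIGHT = 10.0              # attendance / class participation


def final_grade(quiz_scores, homework_scores, participation):
    """Return the weighted course percentage (hypothetical helper).

    Every score is a fraction in [0, 1]; this scale is an assumption,
    since the syllabus does not say how individual items are scored.
    """
    if len(quiz_scores) != len(QUIZ_WEIGHTS):
        raise ValueError("expected 8 quiz scores")
    if len(homework_scores) != len(HOMEWORK_WEIGHTS):
        raise ValueError("expected 6 homework scores")
    total = sum(w * s for w, s in zip(QUIZ_WEIGHTS, quiz_scores))
    total += sum(w * s for w, s in zip(HOMEWORK_WEIGHTS, homework_scores))
    total += PARTICIPATION_WEIGHT * participation
    return total


# The three components should account for the whole grade.
assert sum(QUIZ_WEIGHTS) + sum(HOMEWORK_WEIGHTS) + PARTICIPATION_WEIGHT == 100.0
```

For example, under these assumptions a student with half credit on every quiz and full credit elsewhere would lose 12 of the 24 quiz points.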
Outline of Learning Materials:
- Foundations (Course Prerequisites, Definitions, Concepts, etc.)
- A First Example: LLMs for QA (e.g. ChatGPT)
  - What are LLMs? How can LLMs be incorporated into QA systems?
  - What happens when ChatGPT is evaluated as a QA system?
- Survey of tasks, domains, methods
  - Tasks: factoid, list factoid, summary, yes/no, etc.
  - Domains: media collections, Q/A datasets, languages & distributions
  - Classic Methods and Pipelines: retrieval-based, NLP-based; multi-strategy architecture
- Task Curation & Evaluation
  - Curation (Dataset design, sourcing, preliminary analysis, bias)
  - Evaluation (Metrics & significance, overlap / error analysis & prioritization)
- Sparse and Dense Retrieval for QA
- Extractive vs. Generative QA
- Retrieval Augmented Generation and Prompt Engineering
- Supervised Fine-Tuning and Parameter-Efficient Fine-Tuning
- Putting it All Together: Practical Engineering of QA Systems
- Advanced Topics
  - Multi-Hop QA
  - Conversational QA
  - Multimodal QA
  - Generative LLMs
- Wrap-Up
  - Discuss learning objectives / outcomes, material covered, feedback
  - Discuss open challenges, possible project topics for 11-797
Last Updated: June 26, 2025