Logistics

Description and Objectives

Recent progress in artificial intelligence has been largely driven by advances in large language models (LLMs) and other generative methods. These models are often very large (e.g., 175 billion parameters for GPT-3) and require increasingly large amounts of training data (e.g., 300 billion tokens for ChatGPT). Training, serving, fine-tuning, and evaluating LLMs require sophisticated engineering with modern hardware and software stacks. Developing scalable systems for large language models is critical to advancing AI.

In this course, students will learn the essential skills to design and implement LLM systems. This includes algorithms and systems techniques for efficiently training LLMs on huge datasets, efficient embedding storage and retrieval, data-efficient fine-tuning, communication-efficient algorithms, efficient implementation of reinforcement learning from human feedback, acceleration on GPUs and other hardware, model compression for deployment, and online maintenance. We will cover the latest advances in LLM systems from machine learning, natural language processing, and systems research.

Time and Location

Monday and Wednesday, 12:30-1:50pm, TEP 1403 (in-person expected)

Office Hours

Day         Time         Location   Instructor/TA
Monday      5pm - 6pm    GHC 6403   Lei Li
Tuesday     5pm - 6pm    GHC 5417   Rishi Shah
Wednesday   3pm - 4pm    GHC 5417   Kath Choi
Thursday    4pm - 5pm    GHC 5417   Amogh Gyaneshwar
Friday      3pm - 4pm    TCS 244    Hector He

Starting Feb 3, office hours will switch to the following schedule, with the instructor and/or multiple TAs available at each session to answer questions.

Day      Time        Location
Monday   5pm - 6pm   GHC 6417
Friday   3pm - 4pm   GHC 6417

Prerequisites

We strongly recommend that you have previously taken either Deep Learning (11-785) or Advanced NLP (11-611 or 11-711). Please see the FAQ for alternative courses.

Class Format

Each class may contain some combination of the following:

  • Lectures
  • Code walkthrough (if available)
  • Small quiz problems
  • Homework review

Occasionally, we will invite industry speakers to present the latest advances and engineering practices in building real LLM systems.

Discussion Forum

We will use the Ed platform for discussions, and coming to office hours is also encouraged. Please do not email individual TAs or the instructor.

Textbook and Course Material

No textbook is required. A selected set of recent papers on LLM systems and algorithms will be provided. Students are expected to read the assigned material and papers before each lecture.

Homework, Exam and Grading

The course will have four graded components. Please submit your homework on Canvas.

Component      Percentage
Homework       10% each, 40% in total
Quiz           10%
Forum Answer   5% (+5%)
Project        45%
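
As an illustration only, the weights above can be combined into an overall percentage as sketched below. The function name `course_grade`, its parameters, and the reading of "(+5%)" as up to 5 bonus points for forum answers are assumptions made for this example, not official course policy; the teaching staff's own calculation takes precedence.

```python
# Weights (in percentage points) copied from the grading table.
WEIGHTS = {"homework": 40, "quiz": 10, "forum": 5, "project": 45}
# Assumption: "(+5%)" means up to 5 extra points for forum answers.
MAX_FORUM_BONUS = 5


def course_grade(homework_scores, quiz, forum, project, forum_bonus=0.0):
    """Return an overall percentage.

    homework_scores: four scores, each in [0, 100] (10% each).
    quiz, forum, project: scores in [0, 100].
    forum_bonus: fraction in [0, 1] of the assumed forum bonus earned.
    """
    if len(homework_scores) != 4:
        raise ValueError("expected exactly four homework scores")
    homework_avg = sum(homework_scores) / len(homework_scores)
    base = (
        WEIGHTS["homework"] * homework_avg
        + WEIGHTS["quiz"] * quiz
        + WEIGHTS["forum"] * forum
        + WEIGHTS["project"] * project
    ) / 100
    # Clamp the bonus fraction so it never exceeds the assumed cap.
    bonus = MAX_FORUM_BONUS * min(max(forum_bonus, 0.0), 1.0)
    return base + bonus


if __name__ == "__main__":
    print(course_grade([90, 85, 100, 95], quiz=80, forum=100, project=92))
```

Keeping the weights as integer percentage points makes it easy to check that they sum to 100 before any bonus is applied.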

Required Reading

Students are highly encouraged to read the material or paper before each session.

Computing Resources

Each student will receive $150 in credits for computation on the AWS cloud. We will also provide access to the Pittsburgh Supercomputing Center (PSC) clusters. Students may also use Google Colab if they are new users.

Policies


Late Day Policy

Each student may use at most 100 late hours on each individual assignment (counted in whole hours). Each late hour incurs a penalty of 1%. However, no late hours are allowed for the final project report.
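
The arithmetic of this policy can be sketched as follows. This is only one possible reading: the helper `late_penalized_score` is hypothetical, and it assumes that partial hours round up to the next whole hour, that the 1% penalty is subtracted as percentage points from a score out of 100, and that work submitted beyond 100 late hours receives no credit. Please confirm the exact rules with the teaching staff.

```python
import math

MAX_LATE_HOURS = 100      # cap stated in the policy
PENALTY_PER_HOUR = 1.0    # assumed: percentage points per late hour


def late_penalized_score(raw_score, hours_late):
    """Return the score (out of 100) after the late penalty.

    raw_score: score in [0, 100] before any penalty.
    hours_late: time past the deadline in hours (may be fractional).
    """
    if hours_late <= 0:
        return float(raw_score)
    # Assumption: late time is counted in whole hours, rounding up.
    whole_hours = math.ceil(hours_late)
    if whole_hours > MAX_LATE_HOURS:
        # Assumption: submissions past the cap receive no credit.
        return 0.0
    return max(0.0, raw_score - PENALTY_PER_HOUR * whole_hours)


if __name__ == "__main__":
    # 2.5 hours late is counted as 3 whole hours under these assumptions.
    print(late_penalized_score(90, 2.5))
```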

We still encourage everybody to complete their work by the designated deadlines. This prevents cascading tardiness from overwhelming both students and teaching staff.

Extensions

However, some situations call for extensions. Examples from the last few years include the following:

  • The death of a friend or family member
  • A wedding in the family
  • A serious accident
  • A surgery
  • A significant illness
  • A mental health crisis or episode
  • An important religious or national holiday

We care about you and your well-being more than we care about deadlines. If something difficult is happening in your life that makes it hard for you to complete an assignment on time, please contact us so we can talk. We have found that, often, the students who most need some leeway are those least likely to ask for it. It never hurts to ask. We will work out a plan so you can complete the requirements of the course with your physical and psychological health intact. Do not feel ashamed to reach out to us. We are eager to see you succeed.

Academic integrity

Any cheating or plagiarism will be dealt with according to the University policies on academic integrity. In general, high-level discussion of tools, concepts, and formalisms is acceptable collaboration and is encouraged. Sharing specific aspects of solutions or results with other students, or consulting work from previous semesters or other universities, is considered cheating. Using GitHub Copilot or any other AI tool is prohibited. You are responsible for any content you submit.

Disability

Many people have disabilities, including members of our own families. We see disabilities as deficits not in disabled people but in the institutions and societies that are structured such that they are disadvantaged. We wish to do our part to overcome this disparate treatment. If you have a disability (visible or invisible), please let us know as soon as possible (you don’t need to tell us the nature of the disability) and work with Disability Service to develop a set of accommodations which we can then approve. These may include extra time on exams, a quiet place in which to take an exam, alt text on all images, documents that work for people with differences in vision, sign language interpretation, captioning, etc.

Diversity, Equity, and Inclusion

Throughout human history, some people have been denied the rights and opportunities available to others on the basis of their race, gender, economic class, caste, ancestry, language community, age, religion, beliefs, political affiliation, and abilities (visible and invisible). A single course cannot undo the injustices of history, but we, as a teaching staff, are committed to fighting inequity and promoting inclusion. We encourage you to join us. If you feel that you, or those around you, have been treated unfairly based upon identity (or perceived identity) by us, by other members of the teaching staff, or by other students in the course, we ask that you share your experience with the Ethics Reporting Hotline. Students, faculty, and staff can anonymously file a report by calling 844-587-0793 or visiting cmu.ethicspoint.com.