Name	Name	Last commit message	Last commit date
parent directory ..
README.md	README.md

DATASCI 266: Natural Language Processing With Deep Learning

Course Overview
Course Evaluation
Final Project
Course Resources
Tentative Schedule and Readings

Course Overview

Course Description

Understanding language is fundamental to human interaction. Our brains have evolved language-specific circuitry that helps us learn it very quickly; however, this also means that we have great difficulty explaining how exactly meaning arises from sounds and symbols. This course provides an introduction to natural language processing, linguistic phenomena, and our attempts to analyze them using modern deep learning approaches. We cover a wide range of concepts with a focus on practical applications such as information extraction, machine translation, sentiment analysis, and summarization. The focus of this course prioritizes practical application over theory. Each week, the course is centered on Python notebooks that include exercises and code examples. The asynchronous lectures provide a foundation to build upon in the live sessions. We expect students to review the async material before attending their live session.

Course Prerequisites

MIDS 207 (Machine Learning): We assume you know what gradient descent is. We review simple linear classifiers and softmax at a high level, but make sure you've at least heard of these! You should also be comfortable with linear algebra, which we use for vector representations and when we discuss deep learning.
Language: All assignments are in Python using Jupyter notebooks, Google Colab, NumPy, TensorFlow, and Keras.
Time: There are three to four substantial assignments in this course as well as a term project. Make sure you give yourself enough time to be successful! In particular, you may be in for a rough semester if you have other significant commitments at work or home, or take both this course and any of 210 (Capstone), 261, or 271.

Course Goals and Objectives

By the completion of this course, students will be able to:

Understand and describe multiple facets of linguistic phenomena related to natural language processing.
Describe fundamental concepts, techniques, problems, and modern approaches in the domain of natural language processing (NLP).
Understand the assumptions, strengths, and limitations of NLP and related deep learning techniques, and make appropriate decisions about application of techniques and solutions to NLP problems.
Analyze textual data using a number of machine learning and deep learning-based NLP techniques.
Demonstrate familiarity with and comprehension of existing NLP techniques for solving practical problems.
Demonstrate an ability to stay current with a constantly evolving field: by developing a familiarity with the neural network architectures that underpin current state of the art, and an ability to seek out and understand new advances published in the field.
Demonstrate expertise in using existing libraries and tools related to NLP work, along with an ability to familiarize oneself to new libraries and tools that are key to NLP practitioners as the domain evolves.

Communication and Resources

Course website: GitHub datasci-w266/2023-spring-main
Ed Discussion: We'll use this for collective Q&A, and this will be the fastest way to reach the course staff. Note that you can post anonymously, or make posts visible only to instructors for private questions.
Email list for course staff (expect a somewhat slower response here): mids-nlp-instructors@googlegroups.com

Live Sessions

Section 1: Monday 6:30 - 8pm PST (Jennifer Zhu)
Section 2: Tuesday 2:00 - 3:30pm PST (Peter Grabowski)
Section 3: Tuesday 4 - 5:30pm PST (Daniel Cer)
Section 4: Wednesday 4 - 5:30pm PST (Natalie Ahn)
Section 5: Wednesday 6:30 - 8pm PST (Mike Tamir/Paul Spiegelhalter)
Section 6: Thursday 6:30 - 8pm PST (Mark Butler)

Teaching Staff Office Hours

Daniel Cer: Monday at noon PST
Rajiv Nair: Monday at 4pm PST
Peter Grabowski: Tuesday at 1pm PST
Gurdit Chahal: Wednesday at 2:30pm PST
Natalie Ahn: Wednesday at 6pm PST
Mike Tamir/Paul Spiegelhalter: Wednesday immediately after the live session
Jennifer Zhu: Thursday at 5:30pm PST
Mark Butler: Friday at 4pm PST

Office hours are for the whole class; students from any section are welcome to attend any of the times above.

Async Instructors

James Kunz
Joachim Rahmfeld
Mark Butler

Course Evaluation

Breakdown

Assignment	Topic	Release	Deadline	Weight
Assignment 0	Course Set Up GitHub Ed Discussion Google Cloud	Jan 9	Jan 15	0%
Assignment 1	Basic Neural Nets	Jan 13	Jan 22	5%
Assignment 2	Text Classification	Jan 27	Feb 12	15%
Assignment 3	Question Answering	Feb 17	Mar 5	15%
Assignment 4	Multimodal NLP	Mar 10	Mar 26	10%
Final Project	Final Project Guidelines		Apr 16	55%

Your assignment grade report can be found at https://w266grades.appspot.com. We will make an announcement on Ed Discussion when grades are available.

General Grading Philosophy

A word of warning: Given that we (effectively) release solutions to some parts of assignments in the form of unit tests, it shouldn't be surprising that most students earn high scores. Since the variance is so low, assignment scores aren't the primary driver of the final letter grade for most students. A good assignment score is necessary but not sufficient for a strong grade in the class. A well-structured, novel project with good analysis is what makes the difference between a high B/B+ and an A-/A.

As mentioned above, this course is a lot of work. Give it the time it deserves and you'll be rewarded intellectually and on your transcript.

Late Submission Policy

We recognize that sometimes things happen in life outside the course, especially in MIDS where we all have full-time jobs and family responsibilities to attend to. To help with these situations, we are giving you 5 "late days" to use throughout the term as you see fit. Each late day gives you a 24-hour (or any part thereof) extension to any deliverable in the course except the final project presentation or report. (UC Berkeley needs grades submitted very shortly after the end of classes.)

Once you run out of late days, each 24-hour period (or any part thereof) results in a 10 percentage-point deduction on that deliverable's grade.

You can use a maximum of 2 late days on any single deliverable. We will not be accepting any submissions more than 48 hours past the original due date, even if you have late days. (We want to be more flexible here, but your fellow students also want their graded assignments back promptly!)

We don't anticipate granting extensions beyond these policies. Plan your time accordingly!

More Serious Issues

If you run into a more serious issue that will affect your ability to complete the course, please email the instructors mailing list and cc MIDS Student Services. A word of warning: In previous sections, we have had students ask for an Incomplete (INC) grade because their lives were otherwise busy. Mostly we have declined, opting instead for the student to complete the course to the best of their ability and have a grade assigned based on that work. (MIDS prefers to avoid giving INCs, as they have been abused in the past.) The sooner you start this process, the more options we (and the department) have to help. Don't wait until you're suffering from the consequences to tell us what's going on!

Collaboration Policy/Academic Integrity

All students —undergraduate, graduate, professional full time, part time, law, etc.— must be familiar with and abide by the provisions of the "Student Code of Conduct" including those provisions relating to Academic Misconduct. All forms of academic misconduct, including cheating, fabrication, plagiarism or facilitating academic dishonesty will not be tolerated. The full text of the UC Berkeley Honor Code is available at: https://teaching.berkeley.edu/berkeley-honor-code and the Student Code of Conduct is available at: https://sa.berkeley.edu/student-code-of-conduct#102.01_Academic_Misconduct

We encourage studying in groups of two to four people. This applies to working on homework, discussing labs and projects, and studying. However, students must always adhere to the UC Berkeley Code of Conduct (http://sa.berkeley.edu/code-of-conduct ) and the UC Berkeley Honor Code (https://teaching.berkeley.edu/berkeley-honor-code ). In particular, all materials that are turned in for credit or evaluation must be written solely by the submitting student or group. Similarly, you may consult books, publications, or online resources to help you study. In the end, you must always credit and acknowledge all consulted sources in your submission (including other persons, books, resources, etc.)

Final Project

See the Final Project Guidelines

Attendance and Participation

We believe in the importance of the social aspects of learning —between students, and between students and instructors— and we recognize that knowledge-building does not solely occur on an individual level, but is built by social activity involving people and by members engaged in the activity. Participation and communication are key aspects of this course vital to the learning experiences of you and your classmates.

Therefore, we like to remind all students of the following requirements for live class sessions:

Students are required to join live class sessions from a study environment with video turned on and with a headset for clear audio, without background movement or background noise, and with an internet connection suitable for video streaming.
You are expected to engage in class discussions, breakout room discussions and exercises, and to be present and attentive for your and other teams’ in-class presentations.
Keep your microphone on mute when not talking to avoid background noise. Do your best to minimize distractions in the background video, and ensure that your camera is on while you are engaged in discussions.

That said, in exceptional circumstances, if you are unable to meet in a space with no background movement, or if your connection is poor, make arrangements with your instructor (beforehand if possible) to explain your situation. Sometimes connections and circumstances make turning off video the best option. If this is a recurring issue in your study environment, you are responsible for finding a different environment that will allow you to fully participate in classes, without distraction to your classmates.

Failure to adhere to these requirements will result in an initial warning from your instructor(s), followed by a possible reduction in grades or a failing grade in the course.

Diversity and Inclusion

Integrating a diverse set of experiences is important for a more comprehensive understanding of data science. We make an effort to read papers and hear from a diverse group of practitioners. Still, limits exist on this diversity in the field of data science. We acknowledge that it is possible that there may be both overt and covert biases in the material due to the lens through which it was created. We would like to nurture a learning environment that supports a diversity of thoughts, perspectives and experiences, and honors your identities (including race, gender, class, sexuality, religion, ability, veteran status, etc.) in the spirit of the UC Berkeley Principles of Community https://diversity.berkeley.edu/principles-community

To help us accomplish this, please contact us or submit anonymous feedback through I School channels if you have any suggestions to improve the quality of the course. If something was said in class (by anyone) or you experience anything that makes you feel uncomfortable, please talk to your instructors about it. If you feel like your performance in the class is being impacted by experiences outside of class, please don’t hesitate to come and talk with us. We want to be a resource for you. Also, anonymous feedback is always an option and may lead us to make a general announcement to the class, if necessary, to address your concerns. As a participant in teamwork and course discussions, you should also strive to honor the diversity of your classmates.

If you prefer to speak with someone outside of the course, the MIDS Academic Director Drew Paulin, the I School Assistant Dean of Academic Programs Catherine Cronquist Browning, and the UC Berkeley Office for Graduate Diversity are excellent resources. Also see the following: https://www.ischool.berkeley.edu/about/community.

Disability Services and Accommodations

If you need disability-related accommodations in this class, if you have emergency medical information you wish to share with me, or if you need special arrangements in case the building must be evacuated, please inform me as soon as possible.

The I School recognizes disability in the context of diversity, and the Disabled Students’ Program (DSP) equips students with appropriate accommodations and services to remove barriers to educational access. Students seeking accommodations in this class are responsible for completing the DSP application process to obtain an accommodation letter. You may reach the DSP at (510) 642-0518, or visit the website: https://dsp.berkeley.edu

Publishing Your Work

You are highly encouraged to use your program coursework to build an academic/professional portfolio.

Blog about your coursework (and other ideas) and share on the I School Medium channel
- Instructions are here on the intranet for students: https://www.ischool.berkeley.edu/intranet/connect
- Instructions here are public for alumni: https://www.ischool.berkeley.edu/alumni/stay-connected
Publish projects to your I School project portfolio gallery (for more than just the capstone).
Publish your work on LinkedIn and tag @UC Berkeley School of Information. Do NOT publish your homework assignments!
Publish in academic journals: Contact your professors for assistance. (Note that multiple review iterations are usually required; this can be a time-intensive endeavor.)
- For help writing professional academic papers students are encouraged to contact Sabrina Soracco, the Director of the Graduate Writing Center, in the Graduate Division -- see https://grad.berkeley.edu/staff/sabrina-soracco/ and https://grad.berkeley.edu/professional-development/graduate-writing-center/ -- the latter has links to resource guides, appointments with consultants, workshops, etc.
Publish your news (e.g., conference talks, awards, scholarships) to the I School internal newsletter.

Course Resources

We are not using any particular textbook for this course. We’ll list some relevant readings each week. Here are some general resources:

Speech and Language Processing (3rd edition draft) by Daniel Jurafsky and James H. Martin - free online!
Speech and Language Processing (2nd edition) by Daniel Jurafsky and James H. Martin - This edition does not cover neural net approaches
NLTK Book - Natural Language Processing with Python - accompanies NLTK (Natural Language Tool Kit) and includes useful, practical descriptions (with python code) of basic concepts.
Deep Learning (Goodfellow, Bengio, and Courville)

We’ll be posting materials to the course GitHub repo.

Note: This syllabus may be subject to change. We'll be sure to announce anything major on Ed Discussion.

Code References

The course will be taught in Python, and we'll be making heavy use of NumPy, TensorFlow, Keras, and Jupyter (IPython) notebooks. We'll also be using Git for distributing and submitting materials. If you want to brush up on any of these, we recommend:

Git tutorials: Introduction / Cheat Sheet, or interactive tutorial
Python / NumPy: Stanford's CS231n has an excellent tutorial.
TensorFlow: We'll go over some basics of TensorFlow and Keras in Assignment 1.
Effective TensorFlow 2 is a great reference, ranging from the absolute basics through advanced topics like multi-GPU training, tf.learn, and debugging.
You can also check out the tutorials on the TensorFlow website, but these can be somewhat confusing if you're not familiar with the underlying models. Also, look at the TensorFlow Keras Guide as we will be using Keras in this class.
Keras: The Keras Site has a variety of useful guides as well as code examples.

Miscellaneous Deep Learning and NLP References

Here are a few useful resources and papers that don’t fit under a particular week -- all optional, but interesting!

(Optional) Chris Olah’s blog and Distill
(Optional) GloVe: Global Vectors for Word Representation (Pennington, Socher, and Manning, 2014)
(Optional) Jay Alammar’s blog

Tentative Schedule and Readings

We'll update the table below with assignments as they become available, as well as additional materials throughout the semester. Keep an eye on GitHub for updates!

Dates are tentative: Assignments in particular may change topics and dates. (Updated slides for each week will be posted during the live session week.)

Live Session Slides [available with @berkeley.edu address]

Deliverables

Note: we will update this table as we release assignments. Each assignment will be released around the last live session of the week and due approximately 1 week later (for simple assignments) or 2 to 3 weeks later (for complex assignments).

	Topic	Release	Deadline
Assignment 0	Course Set-up GitHub Ed Discussion Google Cloud	Jan 9	Jan 15
Assignment 1	Assignment 1 Neural Networks	Jan 13	Jan 22
Assignment 2	Assignment 2 Text Classification	Jan 27	Feb 12
Assignment 3	Assignment 3 Multiclass Classification Summarization Question Answering	Feb 17	Mar 5
Project Proposal	Final Project Guidelines		Feb 18
Assignment 4	Assignment 4 Image Captioning	Mar 10	Mar 26
Project Reports			Extended to Apr 16 (hard deadline)
Project Presentations			In-class Apr 17-21

Course Schedule

	Async Material to Watch	Topics	Materials
Week 1 (Jan 9)	Introduction	Overview of NLP applications NLP tasks, model structures and neural architectures Ambiguity and grounding in language Introduction to word embeddings	Read: AI’s Language Problem (Technology Review) Read: The Rise and Fall of the English Sentence Skim: NLTK book chapter 1 (python and basics) Skim: NLTK book chapter 2 (data resources) Optional: The Interpreter (New Yorker) Optional: Introduction to Linguistic Typology Optional: The Illustrated Word2Vec Optional / fun: Tensorflow Playground
Week 2 (Jan 16)	Text Classification	Text classification approaches with neural networks Text classification with CNNs	Read: Understanding Convolutional Neural Networks for NLP (via Wayback Machine link) Read: Convolutional Neural Networks for Sentence Classification (Kim, 2014) Skim:The Illustrated Word2vec Optional / fun: Tensorflow Playground Optional: Natural Language Processing (almost) from Scratch (Collobert et al., 2011)
Week 3 (Jan 23)	Language and Context	Recurrent Neural Nets (RNNs) and language modeling Attention Context awareness and embeddings	Read: Deep contextualized word representations (Peters et. al., 2018) Skim:The Unreasonable Effectiveness of Recurrent Neural Networks Optional:Attention and Augmented Recurrent Neural Networks (section on “Attentional Interfaces” has an awesome visualization of an MT example, showing alignments) Optional: Natural Language Corpus Data (Peter Norvig)
Week 4 (Jan 30)	Pretrained Transformers	Transformer Architecture Pretrained Transformers and Language Models Pretrained Transformers and Context-Based Embeddings	Read: Attention is All You Need (Vaswani et. al., 2017) Skim: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding”, (Devlin et al, 2018) Skim: The Illustrated BERT, ELMo, and co. (How NLP Cracked Transfer Learning)”, (Alammar, 2018)
Week 5 (Feb 6)	Text Generation Models	Sequence to Sequence Architectures Pretrained Encoder-Decoder Transformer Architectures	Read: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer (Raffel et. al. 2019) Skim: Language Models are Few Shot Learners (Brown et. al. 2020) Skim: Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing (Liu et. al. 2021) Skim: Training language models to follow instructions with human feedback (Ouyang et. al. 2022)
Interlude (Extra Material)	Units of Meaning: Words, Morphology, Sentences	Edit distance for strings Tokenization Sentence splitting	Read: Sentence Boundary Detection and the Problem with the U.S. Skim: NLTK book chapter 3 (processing raw text) Skim: Natural Language Corpus Data (Peter Norvig) (if you didn't read in Week 3)
Week 6 (Feb 13)	Machine Translation	Challenges of Language Encoder Decoder architecutres for Neural Machine Translation Evaluation Subword Models	Read: Google’s Neural Machine Translation System Skim: Sequence to Sequence Learning with Neural Networks Skim: Neural Machine Translation by Jointly Learning to Align and Translate Skim: Beyond English-Centric Multilingual Machine Translation
Week 7 (Feb 20)	Question Answering and Summarization	Question Answering Architectures Extractive and Abstractive Summarization Evaluation	Read: Get To The Point: Summarization with Pointer-Generator Networks (See et al. 2017) Skim: Know What You Don’t Know: Unanswerable Questions for SQuAD (Rajpurkar et al. 2018) Skim: DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs (Dua et al. 2019) Skim: Hurdles to Progress in Long-form Question Answering (Krishna et al. 2021) Skim: A Survey on Automatic Text Summarization (Das and Martins, 2007)
Week 8 (Feb 27)	Linguistic Representation	Elements of Language Grammars Dependency Parsing Phrase based Parsing Evaluation	Read: SyntaxNet (Parsey McParseface) Read: A Fast and Accurate Dependency Parser using Neural Networks (Chen & Manning 2014) Read: A Minimal Span-Based Neural Constituency Parser (Stern et al. 2017) Skim: NLTK book chapter 8 (analyzing sentence structure) Optional: NLTK book chapter 5: Categorizing and Tagging Words Optional: A Universal Part-of-Speech Tagset Optional: Part-of-Speech Tagging from 97% to 100%: Is It Time for Some Linguistics? Optional: Accurate Unlexicalized Parsing (Klein & Manning 2003) Optional / reference: Penn Treebank Constituent Tags Play: Stanford parser (online demo)
Week 9 (Mar 6)	Entities and Linking	Named Entity Recognition Coreference Resolution Entity Linking Relation Extraction	Read: NLTK Book Chapter 7 (Extracting Information from Text) Read: SpanBERT: Improving Pre-training by Representing and Predicting Spans (Joshi et al. 2019) Optional: Simple Coreference Resolution with Rich Syntactic and Semantic Features (Haghighi and Klein 2009, rule-based coreference) Optional: Improving Coreference Resolution by Learning Entity-Level Distributed Representations (Clark and Manning 2016, neural coreference)
Week 10 (Mar 13)	Embedding-based Retrieval	Single- vs. multi-document retrieval Two tower models	Read: Web Search for a Planet (Google) Read: The Anatomy of a Large-Scale Hypertextual Web Search Engine Skim: Multilingual Universal Sentence Encoder for Semantic Retrieval Skim: "An Introduction to Information Retrieval", sections 6.2 and 6.3 Optional: PageRank (Page, et al. 1999)
Week 11 (Mar 20)	Multimodality in NLP	Multimodal Applications Image Captioning Visual Question Answering	Read: Show and Tell: A Neural Image Caption Generator (Vinyals et. al. 2014) Read: Learning Transferable Visual Models From Natural Language Supervision (Radford et. al 2021)
Break (Mar 27)	No Async	No class	No Readings
Week 12 (Apr 3)	ML Fairness and Privacy	Fairness Privacy	Read: How to make a racist AI without really trying Read: Data Decisions and Theoretical Implications when Adversarially Learning Fair Representations (Beutel et. al. 2017) Read: Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings (Bolukbasi et. al. 2016) Read: Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them (Gonen & Goldberg 2019)
Week 13 (Apr 10)	NLP in the Real World	Productization at Scale Infrastructure Metrics Failure Modes NLP Review	TBD
Week 14 (Apr 17)		In-class project presentations

Thanks for a great semester!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

syllabus

syllabus

README.md

DATASCI 266: Natural Language Processing With Deep Learning

Course Overview

Course Evaluation

Breakdown

General Grading Philosophy

Late Submission Policy

More Serious Issues

Collaboration Policy/Academic Integrity

Final Project

Attendance and Participation

Diversity and Inclusion

Disability Services and Accommodations

Publishing Your Work

Course Resources

Code References

Miscellaneous Deep Learning and NLP References

Tentative Schedule and Readings

Live Session Slides [available with @berkeley.edu address]

Deliverables

Course Schedule

Files

syllabus

Directory actions

More options

Directory actions

More options

Latest commit

History

syllabus

Folders and files

parent directory

README.md

DATASCI 266: Natural Language Processing With Deep Learning

Course Overview

Course Evaluation

Breakdown

General Grading Philosophy

Late Submission Policy

More Serious Issues

Collaboration Policy/Academic Integrity

Final Project

Attendance and Participation

Diversity and Inclusion

Disability Services and Accommodations

Publishing Your Work

Course Resources

Code References

Miscellaneous Deep Learning and NLP References

Tentative Schedule and Readings

Live Session Slides [available with @berkeley.edu address]

Deliverables

Course Schedule