CS 295: Statistical NLP Winter 2018

Paper Summaries

Everyone must submit three paper summaries, each on a paper chosen from the list of papers.

Details coming soon

Programming Homeworks

HW 1: Semi-supervised Text Classification

In many real-world applications, only a small number of instances are labeled, while a large number are unlabeled. Machine learning algorithms that can use the information in unlabeled instances are known as semi-supervised approaches. The first programming assignment requires you to implement one such algorithm, which should benefit from large amounts of unlabeled text.
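
To make the idea concrete, here is a minimal sketch of self-training, one common semi-supervised technique: train on the labeled data, label the unlabeled documents the model is confident about, add them to the training set, and repeat. This is an illustration only and not necessarily the approach the assignment expects. The function names (train_nb, predict_proba, self_train), the toy Naive Bayes classifier, whitespace tokenization, and the confidence threshold are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict


def train_nb(docs, labels):
    """Fit a toy multinomial Naive Bayes model (hypothetical helper)."""
    class_counts = Counter(labels)
    word_counts = defaultdict(Counter)
    vocab = set()
    for doc, label in zip(docs, labels):
        tokens = doc.lower().split()  # assumption: whitespace tokenization
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def predict_proba(model, doc):
    """Return {label: posterior probability} for a single document."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    log_scores = {}
    for label, n_docs in class_counts.items():
        total_words = sum(word_counts[label].values())
        score = math.log(n_docs / total_docs)
        for tok in doc.lower().split():
            if tok in vocab:  # words never seen in training are ignored
                # add-one (Laplace) smoothing
                count = word_counts[label][tok] + 1
                score += math.log(count / (total_words + len(vocab)))
        log_scores[label] = score
    # Convert log scores to normalized probabilities in a stable way.
    top = max(log_scores.values())
    exps = {label: math.exp(s - top) for label, s in log_scores.items()}
    norm = sum(exps.values())
    return {label: v / norm for label, v in exps.items()}


def self_train(labeled_docs, labels, unlabeled_docs, threshold=0.9, max_rounds=5):
    """Self-training: repeatedly add confidently pseudo-labeled documents."""
    docs, labs = list(labeled_docs), list(labels)
    pool = list(unlabeled_docs)
    model = train_nb(docs, labs)
    for _ in range(max_rounds):
        remaining, added = [], 0
        for doc in pool:
            probs = predict_proba(model, doc)
            best = max(probs, key=probs.get)
            if probs[best] >= threshold:
                docs.append(doc)   # trust the model's own prediction
                labs.append(best)
                added += 1
            else:
                remaining.append(doc)
        if added == 0:  # nothing confident enough; stop early
            break
        pool = remaining
        model = train_nb(docs, labs)  # retrain on labeled + pseudo-labeled
    return model
```

With only two labeled reviews, a word such as "fantastic" that appears solely in unlabeled text carries no signal for the initial model, but after self-training it becomes associated with the class of the documents it co-occurs with. The threshold controls the trade-off: a low value adds more pseudo-labels but risks reinforcing early mistakes.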

Due date: January 23, 2018
Description: PDF
Data: Kaggle (signup link on Canvas)
Source code: Github