/
index.html
130 lines (106 loc) · 5.58 KB
/
index.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
---
layout: clean
title: COMP90042
picture: matrix
---
<div class="page-header">
<h3>COMP90042 Web Search and Text Analysis</h3>
</div>
The aim for this subject is for students to develop an understanding of the main algorithms used in natural language processing and text retrieval, for use in a diverse range of
applications including text classification, information retrieval, machine translation, and question answering. Topics to be
covered include vector space models, part-of-speech tagging, n-gram language
modelling, syntactic parsing and neural sequence models. The programming language used is Python, see <a href="python.html">the
detailed configuration instructions</a> for more information on its use in the workshops, assignments and
installation at home.
<h4>Class hours</h4>
<p><b>Lectures</b>
<dl class="dl-horizontal">
<dt>Tue 3:15-4:15pm</dt>
<dd>Law Building-GM15 (David P. Derham Theatre)</dd>
<dt>Wed 1:00-2:00pm</dt>
<dd>Law Building-GM15 (David P. Derham Theatre)</dd>
</dl>
</p>
<p><b>Office hour</b>
<dl class="dl-horizontal">
<dt>Tue 11am-noon</dt>
<dd>Doug McDonell-9.02</dd>
</dl>
</p>
<p><b>Workshops</b> from week 2 onwards. You will be assigned to one of the following
timeslots
<table border="2" cellpadding="5" cellspacing="5" width="100%" style="background color:white;">
<tr bgcolor="#D1D1D1">
<th>Monday</th>
<th>Tuesday</th>
<th>Wednesday</th>
<th>Thursday</th>
<th>Friday</th>
</tr>
<tr>
<td><i>9-10am</i> Doug McDonell-502</td> <td><i>4:15-5:15pm</i> 221 Bouverie St-B113</td> <td><i>11-12pm</i> Elec. Engineering-121</td> <td><i>2:15-3:15pm</i> Alice Hoy-211</td> <td><i>10-11am</i> 221 Bouverie St-B117</td>
</tr>
<tr>
<td><i>11-12pm</i> Elec. Engineering-121</td> <td><i>5:15-6:15pm</i> 221 Bouverie St-B132</td> <td></td> <td><i>3:15-4:15pm</i> Alice Hoy-211</td> <td><i>3:15-4:15pm</i> Alice Hoy-210</td>
</tr>
<tr>
<td><i>1:15-2:15pm</i> 221 Bouverie St-B113</td> <td><i>6:15-7:15pm</i> Old Engineering-EDS4</td> <td></td> <td><i>5:15-6:15pm</i> 221 Bouverie St-B132</td> <td></td>
</tr>
<tr>
<td><i>5:15-6:15pm</i> 221 Bouverie St-B132</td> <td></td> <td></td> <td></td> <td></td>
</tr>
<tr>
<td><i>5:15-6:15pm</i> 221 Bouverie St-B116</td> <td></td> <td></td> <td></td> <td></td>
</tr>
<tr>
<td><i>6:15-7:15pm</i> Old Engineering-EDS4</td> <td></td> <td></td> <td></td> <td></td>
</tr>
</table>
<p>Please see <a href="workshop.html">the workshop page</a> for each week's worksheet.</p>
<p>The instructor for the subject is A/Prof. Trevor Cohn.
The senior tutor for the subject is Winn Chow and the head tutor for the subject is Ekaterina Vylomova.
The tutors for the subject are Ekaterina Vylomova, Winn Chow, Navnita Nandakumar, Nitika Mathur, Xudong Han, Zenan Zhai, Shivashankar Subramanian and Andrei Shcherbakov. </p>
<b>Questions</b>
<p>If you have questions, please post your questions to the discussion forum in the LMS. Given the size of the subject, it is not practical to respond to individual emails, and besides any question you have is likely to be relevant to other students. Otherwise, please talk to your tutor, or direct your queries to Winn, winn.chow1@unimelb.edu.au, or Ekaterina, ekaterina.vylomova@unimelb.edu.au, the senior/head tutors.</p>
<h3>Textbooks</h3>
There are several textbooks used for the class:
<dl class="dl-horizontal">
<dt>JM3</dt>
<dd>Jurafsky, Daniel S.; Martin, James H.;
<i><a href="https://web.stanford.edu/~jurafsky/slp3/">Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition</a></i>,
Third Edition (incomplete draft).
</dd>
<dt>E18</dt>
<dd>Eisenstein, Jacob; <i><a href="https://github.com/jacobeisenstein/gt-nlp-class/raw/master/notes/eisenstein-nlp-notes-10-15-2018.pdf">Natural Language Processing</a></i>, Draft textbook 15/10/18.
</dd>
<dt>IIR</dt>
<dd>Manning, Christopher D; Raghavan, Prabhakar; Schütze, Hinrich;
<i><a href="http://informationretrieval.org/">Introduction to information retrieval</a></i>,
Cambridge University Press 2008.
</dd>
<dt>K09</dt>
<dd>Koehn, Philipp;
<i><a href="https://www-cambridge-org.ezp.lib.unimelb.edu.au/core/books/statistical-machine-translation/94EADF9F680558E13BE759997553CDE5">Statistical Machine translation</a></i>,
Cambridge University Press 2009.
</dd>
</dl>
Much of the reading will be from JM3, although most NLP topics are also covered well by E18, so feel free to read both.
All of the above are either free, or available from the university library as an ebook using your student login (K09).
The texts will be linked below in the Materials column. We will be also using the NLTK software tools extensively
in this class, so we also recommend:
<dl class="dl-horizontal">
<dt>NLPP</dt>
<dd>Steven Bird, Ewan Klein, Edward Loper;
<i><a href="http://proquestcombo.safaribooksonline.com.ezp.lib.unimelb.edu.au/9780596803346">Natural Language Processing with Python</a></i>,
O'Reilly, 2009.
<a href="http://www.nltk.org/book/">Updated draft version</a>
</dd>
</dl>
</p>
Please see <a href="reading.html">the reading page</a> for each week's reading.
<h3>Assignments</h3>
There will be 3 short homework assignments released roughly every three weeks (starting Week 2), and a final project due near the end of the semester.
<h3>Schedule</h3>
We'll put the lecture slides up here as we cover the material, as well as pointers to the required reading.
<p>
{% include lecture-list.html lectures=site.lectures %}