
Machine Learning (CSE 446): Introduction

Noah Smith
© 2017 University of Washington
nasmith@cs.washington.edu

September 27, 2017


What VIPs are Saying about Machine Learning

“A breakthrough in machine learning would be worth ten Microsofts”—Bill Gates

“Machine learning is the next Internet”—Tony Tether (DARPA director)

“Machine learning is the hot new thing”—John Hennessy (Stanford president)

“Web rankings today are mostly a matter of machine learning”—Prabhakar Raghavan (Google VP)

“Machine learning is going to result in a real revolution”—Greg Papadopoulos (Sun CTO)

“Machine learning is today’s discontinuity”—Jerry Yang (Yahoo founder)


What is Learning?

- Predicting the future, given the past

- Generalizing to new scenarios

- Getting better with practice

To measure how well an algorithm has learned, we give it a test (sound familiar?).


Examples

- Categorizing documents (e.g., “is this email spam?”)

- Labeling images (e.g., “who’s in this picture?”)

- Predicting the future: weather, finance, medical outcomes

- Collecting sensor data and predicting values everywhere (e.g., energy use in a building)

- Recommending products (e.g., movies and books)

- Decision-making in the face of uncertainty (e.g., self-driving cars)

- Given an instance, finding similar ones (e.g., images)

- Finding structure or patterns in large datasets (e.g., clustering)


Today

ML is required for ...

- Video and image processing

- Speech and language processing

- Search engines

- Robot control

- Sensor networks

- Computational biology

- Medical and health analysis

When people say “AI” they almost always mean “ML.”

Trends: more data, faster processing and networks, new sensors and I/O devices, demand for customization.

Software is becoming too complex to write by hand.


Is it Magic?

More like gardening.

Growing successful plants (programs) requires:

- seeds (algorithms)

- nutrients (data)

- a gardener (ML expert)

Gardens are somewhat predictable, but not entirely, and our scientific understanding is still improving!


Inductive, Supervised Machine Learning

- Training: a learning algorithm is given a set of example input-output pairs (x, y) and produces a function f; the goal is for f(x) to recover y, for each example, and on future examples

- Testing: we apply f to new test examples (x, y) and measure how well f(x) matches y

[Figure: training data (x_i, y_i) goes into a learning algorithm, which produces f; f maps an input x to f(x), which is compared with y.]
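The training and testing steps can be sketched in a few lines of Python. This is only an illustration under assumptions: the learning algorithm here is 1-nearest-neighbor (picked because it is short, not because the slide prescribes it), and the names `train`, `accuracy`, and the toy weather data are all made up for this example.

```python
from typing import Callable, List, Sequence, Tuple

Example = Tuple[Sequence[float], str]  # one (x, y) pair


def train(training_data: List[Example]) -> Callable[[Sequence[float]], str]:
    """Learning algorithm: takes (x, y) pairs and returns a function f."""

    def f(x: Sequence[float]) -> str:
        # Predict the label of the closest training input (1-nearest neighbor).
        def squared_distance(example: Example) -> float:
            return sum((a - b) ** 2 for a, b in zip(example[0], x))

        _, nearest_y = min(training_data, key=squared_distance)
        return nearest_y

    return f


def accuracy(f: Callable[[Sequence[float]], str], test_data: List[Example]) -> float:
    """Testing: fraction of test examples where f(x) matches y."""
    correct = sum(1 for x, y in test_data if f(x) == y)
    return correct / len(test_data)


# Toy, made-up data: x = (hours of sun, mm of rain), y = a weather label.
training_data = [((9.0, 0.0), "dry"), ((8.0, 1.0), "dry"),
                 ((1.0, 12.0), "wet"), ((2.0, 9.0), "wet")]
test_data = [((7.5, 0.5), "dry"), ((0.5, 10.0), "wet"), ((5.0, 6.0), "dry")]

f = train(training_data)                 # training
test_accuracy = accuracy(f, test_data)   # testing on examples f has not seen
print(test_accuracy)
```

On this toy data f gets two of the three test examples right: it recovers y on every training example, but that does not guarantee it matches y on future ones.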


Inputs and Output

- x can be pretty much anything we can represent
  - To start, we’ll think of x as a bundle of attribute-value pairs, e.g., φ(x) = v.

- y can be
  - a real value (regression)
  - a label (classification)
  - an ordering (ranking)
  - a vector (multivariate regression)
  - a sequence/tree/graph (structured prediction)
  - ...
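One way to picture “a bundle of attribute-value pairs” is as a dictionary built by applying each attribute function φ to the raw input. The sketch below assumes x is an email subject line; the attribute names, the `represent` helper, and every example value of y are hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, List, Tuple, Union

Value = Union[bool, int, str]

# Hypothetical attributes: each phi maps a raw input x (text) to a value v.
attributes: Dict[str, Callable[[str], Value]] = {
    "mentions_money": lambda x: "money" in x.lower(),
    "num_words": lambda x: len(x.split()),
    "first_word": lambda x: x.split()[0],
}


def represent(x: str) -> Dict[str, Value]:
    """Turn x into its bundle of attribute-value pairs, phi(x) = v."""
    return {name: phi(x) for name, phi in attributes.items()}


x = "We Have Found Your Missing Money"
features = represent(x)
print(features)

# Made-up examples of the kinds of output y listed above.
y_regression: float = 4.2                          # a real value
y_classification: str = "spam"                     # a label
y_ranking: List[str] = ["doc3", "doc1", "doc2"]    # an ordering
y_multivariate: List[float] = [4.2, 11.5]          # a vector
y_structured: List[Tuple[str, str]] = [("We", "PRON"), ("Have", "AUX")]  # a sequence
```

The type of y is what distinguishes the problem formulations; the same attribute-value representation of x can be reused across them.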


Examples

- Predict rainfall in Seattle tomorrow.

- Is this email spam?

  From: [email protected]

  Subject: We Have Found Your Missing Money

  You are Owed Cash That You Dont Know About Find Unclaimed Money

- What zip code is in this image?


Administrivia


Bookmark These

Course website: http://courses.cs.washington.edu/courses/cse446/17au/

Canvas: https://canvas.uw.edu/courses/1173938

Textbook: http://ciml.info


Your Instructors

Noah (instructor):

- UW CSE professor since 2015, NIPS & ICML papers since 2008, professor since 2006, using ML since 1998

- Research interests: machine learning for structured problems in NLP, ML & NLP for social science

TAs: Kousuke, John, Deric, Patrick, Andrew, and Jane


Outline of CSE 446

- Problem formulations: classification, regression

- Techniques: decision trees, nearest neighbors, perceptron, linear models, probabilistic models, neural networks, kernel methods, clustering

- “Meta-techniques”: ensembles, expectation-maximization

- Understanding ML: limits of learning, practical issues, bias & fairness

- Recurring themes: (stochastic) gradient descent, bullshit detection


Project

- Teams of three

- Parts:
  1. Build and justify a new regression or binary classification dataset (due 10/17)
  2. Dataset review (part of A2) & class-wide selection (official datasets announced 11/3)
  3. Implement ML algorithms and compete in a bakeoff on ∼5 datasets (due 12/5)

- Don’t wait! Part 1 is already available on the course website.


Grading

- Assignments (five, 11% each)
  - Some pencil and paper, mostly programming
  - Graded mostly on attempt, not correctness
  - Five late days; no credit for late work after they are used up.

- Project (30%)
  - dataset and writeup (10%)
  - final writeup (15%)
  - bakeoff performance (5%)

- Final exam (15%), tentatively Wed. Dec. 13, 8:30–10:20 am


“Can I Take This Class?”

- Short answer: yes (if you can get past the wait list), but be warned.

- Official prerequisites (and linear algebra) are strongly advised.

- Be forthcoming with your potential teammates!

- We assume you’re a strong programmer and comfortable with math.

- We will move fast; lectures will focus on concepts and mathematics, quizzes are for review and implementation discussions.

- “Sink or swim.”

I’ve been told to give The Link on Friday.


To-Do List

- Quiz section meetings start tomorrow. Bring your laptop!

- Read: Daumé (2017, ch. 1)

- Academic integrity statement: on the course web page; upload your signed scan through Canvas.

- Form groups and register them on Canvas (People → Groups → Project Groups)


References I

Hal Daumé. A Course in Machine Learning (v0.9). Self-published at http://ciml.info/, 2017.
