Mid-Semester Evals. CS 188: Artificial Intelligence Spring Outline. Contest. Conditional Independence. Recap: Reasoning Over Time

Relevanta dokument
Isometries of the plane

Preschool Kindergarten

Adding active and blended learning to an introductory mechanics course

12.6 Heat equation, Wave equation

Module 1: Functions, Limits, Continuity

Provlektion Just Stuff B Textbook Just Stuff B Workbook

Problem som kan uppkomma vid registrering av ansökan

6 th Grade English October 6-10, 2014

Writing with context. Att skriva med sammanhang

1. Compute the following matrix: (2 p) 2. Compute the determinant of the following matrix: (2 p)

BOENDEFORMENS BETYDELSE FÖR ASYLSÖKANDES INTEGRATION Lina Sandström

Course evaluation SMD098, Lp2 2001

Kursutvärderare: IT-kansliet/Christina Waller. General opinions: 1. What is your general feeling about the course? Antal svar: 17 Medelvärde: 2.

Grafisk teknik IMCDP IMCDP IMCDP. IMCDP(filter) Sasan Gooran (HT 2006) Assumptions:

denna del en poäng. 1. (Dugga 1.1) och v = (a) Beräkna u (2u 2u v) om u = . (1p) och som är parallell

Grafisk teknik IMCDP. Sasan Gooran (HT 2006) Assumptions:

This exam consists of four problems. The maximum sum of points is 20. The marks 3, 4 and 5 require a minimum

Beijer Electronics AB 2000, MA00336A,

Användning av Erasmus+ deltagarrapporter för uppföljning

Kurskod: TAMS11 Provkod: TENB 28 August 2014, 08:00-12:00. English Version

Viktig information för transmittrar med option /A1 Gold-Plated Diaphragm

Grafisk teknik. Sasan Gooran (HT 2006)

Module 6: Integrals and applications

Flervariabel Analys för Civilingenjörsutbildning i datateknik

Kurskod: TAIU06 MATEMATISK STATISTIK Provkod: TENA 17 August 2015, 8:00-12:00. English Version

Information technology Open Document Format for Office Applications (OpenDocument) v1.0 (ISO/IEC 26300:2006, IDT) SWEDISH STANDARDS INSTITUTE

1/17/18. Dr. Ross Avilla

8 < x 1 + x 2 x 3 = 1, x 1 +2x 2 + x 4 = 0, x 1 +2x 3 + x 4 = 2. x 1 2x 12 1A är inverterbar, och bestäm i så fall dess invers.

Discovering!!!!! Swedish ÅÄÖ. EPISODE 6 Norrlänningar and numbers Misi.se

Webbregistrering pa kurs och termin

This is England. 1. Describe your first impression of Shaun! What kind of person is he? Why is he lonely and bullied?

#minlandsbygd. Landsbygden lever på Instagram. Kul bild! I keep chickens too. They re brilliant.

Om oss DET PERFEKTA KOMPLEMENTET THE PERFECT COMPLETION 04 EN BINZ ÄR PRECIS SÅ BRA SOM DU FÖRVÄNTAR DIG A BINZ IS JUST AS GOOD AS YOU THINK 05

English. Things to remember

Att stödja starka elever genom kreativ matte.

Kurskod: TAMS28 MATEMATISK STATISTIK Provkod: TEN1 05 June 2017, 14:00-18:00. English Version

FÖRBERED UNDERLAG FÖR BEDÖMNING SÅ HÄR

Health café. Self help groups. Learning café. Focus on support to people with chronic diseases and their families

Styrteknik: Binära tal, talsystem och koder D3:1

Kvalitetsarbete I Landstinget i Kalmar län. 24 oktober 2007 Eva Arvidsson

Mina målsättningar för 2015

Read Texterna består av enkla dialoger mellan två personer A och B. Pedagogen bör presentera texten så att uttalet finns med under bearbetningen.

CHANGE WITH THE BRAIN IN MIND. Frukostseminarium 11 oktober 2018

Support Manual HoistLocatel Electronic Locks


LUNDS TEKNISKA HÖGSKOLA Institutionen för Elektro- och Informationsteknik

Hur fattar samhället beslut när forskarna är oeniga?

Pre-Test 1: M0030M - Linear Algebra.

1. Unpack content of zip-file to temporary folder and double click Setup

1. Varje bevissteg ska motiveras formellt (informella bevis ger 0 poang)

Webbreg öppen: 26/ /

2.1 Installation of driver using Internet Installation of driver from disk... 3

Affärsmodellernas förändring inom handeln

Libers språklåda i engelska: Spel och lekar (7 9)

Chapter 2: Random Variables

Isolda Purchase - EDI

DVG C01 TENTAMEN I PROGRAMSPRÅK PROGRAMMING LANGUAGES EXAMINATION :15-13: 15

Make a speech. How to make the perfect speech. söndag 6 oktober 13

MO8004 VT What advice would you like to give to future course participants?

Solutions to exam in SF1811 Optimization, June 3, 2014

Service och bemötande. Torbjörn Johansson, GAF Pär Magnusson, Öjestrand GC

Unit course plan English class 8C

Consumer attitudes regarding durability and labelling

F ξ (x) = f(y, x)dydx = 1. We say that a random variable ξ has a distribution F (x), if. F (x) =

ISBN: Tommy Ohlsson Stockholm 2013

Förskola i Bromma- Examensarbete. Henrik Westling. Supervisor. Examiner

Matthew Thurley Industriell bildanalys (E0005E) Response rate = 65 %

Theory 1. Summer Term 2010

Utvärdering SFI, ht -13

Boiler with heatpump / Värmepumpsberedare

Tentamen i Matematik 2: M0030M.

Workplan Food. Spring term 2016 Year 7. Name:

CVUSD Online Education. Summer School 2010

samhälle Susanna Öhman

Solowheel. Namn: Jesper Edqvist. Klass: TE14A. Datum:

Självkörande bilar. Alvin Karlsson TE14A 9/3-2015

Second handbook of research on mathematics teaching and learning (NCTM)

Grammar exercises in workbook (grammatikövningar i workbook): WB p 121 ex 1-3 WB p 122 ex 1 WB p 123 ex 2

Team Captain s meeting for Intersport o Scandic Cup. Idre Fjäll 4-6 December 2015

Exempel på uppgifter från 2010, 2011 och 2012 års ämnesprov i matematik för årskurs 3. Engelsk version

In Bloom CAL # 8, sista varv och ihopsättning / last rows and assemble

CUSTOMER READERSHIP HARRODS MAGAZINE CUSTOMER OVERVIEW. 63% of Harrods Magazine readers are mostly interested in reading about beauty

Rep MEK föreläsning 2

Schenker Privpak AB Telefon VAT Nr. SE Schenker ABs ansvarsbestämmelser, identiska med Box 905 Faxnr Säte: Borås

The Arctic boundary layer

Quicksort. Koffman & Wolfgang kapitel 8, avsnitt 9

Ett hållbart boende A sustainable living. Mikael Hassel. Handledare/ Supervisor. Examiner. Katarina Lundeberg/Fredric Benesch

Calculate check digits according to the modulus-11 method

Questionnaire for visa applicants Appendix A

Read, work and talk! - och Lgr 11

Andy Griffiths Age: 57 Family: Wife Jill, 1 kid Pets: Cats With 1 million SEK he would: Donate to charity and buy ice cream

FYTA11-ma1, ht13. Respondents: 11 Answer Count: 9 Answer Frequency: 81,82 %

FORTA M315. Installation. 218 mm.

Uttagning för D21E och H21E

Urban Runoff in Denser Environments. Tom Richman, ASLA, AICP

Chapter 1 : Who do you think you are?

How to format the different elements of a page in the CMS :

Collaborative Product Development:

IKSU-kort Ordinarie avtal

and u = och x + y z 2w = 3 (a) Finn alla lösningar till ekvationssystemet

Transkript:

CS 88: Artificial Intelligence Spring 2 Lecture 2: HMMs and Particle Filtering 4/5/2 Pieter Abbeel --- UC Berkeley Many slides over this course adapted from Dan Klein, Stuart Russell, Andrew Moore Mid-Semester Evals Generally, things seem good! General Examples are appreciated in lecture Favorite aspect: projects (almost all) --- writtens significantly less preferred Office hours: Most common answers: Helpful. and Haven t gone. Some: too crowded. perhaps try a different office hour slot Section: Split between basically positive and don t go Assignments Written: median time 6hrs Programming: median time hrs Some people spend a lot more time though come talk to us if you are stuck Exams: Midterm: evening (3) vs in-class () or indifferent (8) Want to do the contest 2 Course contest Contest Outline HMMs: representation HMMs: inference Forward algorithm Particle filtering Fun! (And extra credit.) Regular tournaments Instructions posted soon! 3 8 Recap: Reasoning Over Time Stationary Markov models.7 X X 3 X 4 rain sun.3 Conditional Independence HMMs have two important independence properties: Markov hidden process, future depends on past via the present Current observation independent of all else given current state.7 X X 3 X 4 X 5 Hidden Markov models X X 3 X 4 X 5 E 2 E 3 E 4 E 5.3 X E P rain umbrella.9 rain no umbrella. sun umbrella.2 sun no umbrella.8 E 2 E 3 E 4 E 5 Quiz: does this mean that observations are independent given no evidence? [No, correlated by the hidden state]

Real HMM Examples Speech recognition HMMs: Observations are acoustic signals (continuous valued) States are specific positions in specific words (so, tens of thousands) Machine translation HMMs: Observations are words (tens of thousands) States are translation options Outline HMMs: representation HMMs: inference Forward algorithm Particle filtering Robot tracking: Observations are range readings (continuous) States are positions on a map (continuous) 3 Filtering / Monitoring Filtering, or monitoring, is the task of tracking the distribution B(X) (the belief state) over time Example from Michael Pfeiffer We start with B(X) in an initial setting, usually uniform As time passes, or we get observations, we update B(X) The Kalman filter was invented in the 6 s and first implemented as a method of trajectory estimation for the Apollo program t= Sensor model: never more than mistake Motion model: may not execute action with small prob. t= t=2 2

t=3 t=4 Inference Recap: Simple Cases X X t=5 Passage of Time Assume we have current belief P(X evidence to date) Example: Passage of Time As time passes, uncertainty accumulates Then, after one time step passes: X Or, compactly: T = T = 2 T = 5 Basic idea: beliefs get pushed through the transitions With the B notation, we have to be careful about what time step t the belief is about, and what evidence it includes Transition model: ghosts usually go clockwise 3

Observation Assume we have current belief P(X previous evidence): Then: X Example: Observation As we get observations, beliefs get reweighted, uncertainty decreases Or: Basic idea: beliefs reweighted by likelihood of evidence Unlike passage of time, we have to renormalize Before observation After observation Example HMM The Forward Algorithm We are given evidence at each time and want to know We can derive the following updates We can normalize as we go if we want to have P(x e) at each time step, or just once at the end Online Belief Updates Recap: Filtering Every time step, we start with current P(X evidence) We update for time: Elapse time: compute P( X t e :t- ) X We update for evidence: Observe: compute P( X t e :t ) Belief: <P(rain), P(sun)> E 2 X <.5,.5> Prior on X The forward algorithm does both at once (and doesn t normalize) lem: space is X and time is X 2 per time step E 2 <.82,.8> <.63,.37> Observe Elapse time <.88,.2> Observe 4

Outline Particle Filtering HMMs: representation HMMs: inference Forward algorithm Particle filtering 3 Filtering: approximate solution Sometimes X is too big to use exact inference X may be too big to even store B(X) E.g. X is continuous Solution: approximate inference Track samples of X, not all values Samples are called particles Time per step is linear in the number of samples But: number needed may be large In memory: list of particles, not states This is how robot localization works in practice......2..2.5 Particle Filtering: Elapse Time Each particle is moved by sampling its next position from the transition model This is like prior sampling samples frequencies reflect the transition probs Here, most samples move clockwise, but some move in another direction or stay in place This captures the passage of time If we have enough samples, close to the exact values before and after (consistent) Particle Filtering: Observe Slightly trickier: We don t sample the observation, we fix it This is similar to likelihood weighting, so we downweight our samples based on the evidence Note that, as before, the probabilities don t sum to one, since most have been downweighted (in fact they sum to an approximation of P(e)) Particle Filtering: Resample Particle Filtering Rather than tracking weighted samples, we resample N times, we choose from our weighted sample distribution (i.e. draw with replacement) This is equivalent to renormalizing the distribution Now the update is complete for this time step, continue with the next one Sometimes X is too big to use exact inference X may be too big to even store B(X) E.g. X is continuous X 2 may be too big to do updates Solution: approximate inference Track samples of X, not all values Samples are called particles Time per step is linear in the number of samples But: number needed may be large In memory: list of particles, not states This is how robot localization works in practice......2..2.5 5

Representation: Particles Particle Filtering: Elapse Time Our representation of P(X) is now a list of N particles (samples) Generally, N << X Storing map from X to counts would defeat the point Each particle is moved by sampling its next position from the transition model P(x) approximated by number of particles with value x So, many x will have P(x) =! More particles, more accuracy For now, all particles have a weight of Particles: (2,3) (3,2) (3,2) (2,) (2,) 37 This is like prior sampling samples frequencies reflect the transition probs Here, most samples move clockwise, but some move in another direction or stay in place This captures the passage of time If we have enough samples, close to the exact values before and after (consistent) Particle Filtering: Observe Particle Filtering: Resample Slightly trickier: Don t do rejection sampling (why not?) We don t sample the observation, we fix it This is similar to likelihood weighting, so we downweight our samples based on the evidence Note that, as before, the probabilities don t sum to one, since most have been downweighted (in fact they sum to an approximation of P(e)) Rather than tracking weighted samples, we resample N times, we choose from our weighted sample distribution (i.e. draw with replacement) This is equivalent to renormalizing the distribution Now the update is complete for this time step, continue with the next one Old Particles: w=. (2,) w=.9 (2,) w=.9 (3,) w=.4 (3,2) w=.3 (2,2) w=.4 (,) w=.4 (3,) w=.4 (2,) w=.9 (3,2) w=.3 Old Particles: (2,) w= (2,) w= (2,) w= (3,2) w= (2,2) w= (2,) w= (,) w= (3,) w= (2,) w= (,) w= Robot Localization In robot localization: We know the map, but not the robot s position Observations may be vectors of range finder readings State space and readings are typically continuous (works basically like a very fine grid) and so we cannot store B(X) Particle filtering is a main technique [Demos] 4 6