chuong 2 - tac tu
DESCRIPTION
trí tuệ nhân tạoTRANSCRIPT
-
Tr Tu Nhn To
Nguyn Nht Quang
Vin Cng ngh Thng tin v Truyn thngTrng i hc Bch Khoa H Ni
Nm hc 2009-2010
-
Ni dung mn hc: Gii thiu v Tr tu nhn to
Tc t nh ngha Mi trng cng vic Cc kiu mi trngg Cc kiu tc t
Gii quyt vn : Tm kim, Tha mn rng buc
Logic v suy din
Biu din tri thc
Suy din vi tri thc khng chc chn
Hc my
Lp k hoch Lp k hoch
Tr tu nhn to2
-
Tc t - nh ngha g Tc t l bt c ci g (con ngi, ngi my, software
robots, cc b n nhit,) c kh nng cm nhn (nhn bit) , , ) g ( )mi trng xung quanh n thng qua cc b phn cm bin(sensors) v hnh ng ph hp theo mi trng thngqua cc b phn hot ng (actuators)qua cc b phn hot ng (actuators)
Tc t con ngi Cc b phn cm bin: mt, tai, v mt s b phn c th khc Cc b phn hot ng: tay, chn, ming, v mt s b phn c
th khc
Tc t ngi my Tc t ngi my Cc b phn cm bin: cc my quay (cameras), cc b truy tm
tn hiu hng ngoiCc b phn hot ng cc loi ng c (motors) Cc b phn hot ng: cc loi ng c (motors)
3Tr tu nhn to
-
Tc t v Mi trngg
Hm tc t: l hm nh x t lch s nhn thc ti cc hnh ng:
f: P* A Chng trnh tc t: hot ng (chy) da trn kin trc
thc t ca hm f
Tc t = Kin trc + Chng trnh4Tr tu nhn to
-
V d: Th gii ca my ht bi
Cc nhn thcCc nhn thc V tr v mc sch s V d: [A, Bn], [B, Sch]
Cc hnh ng Di chuyn (my ht bi) sang tri, sang phi, ht bi, hoc khng
lm g clm g c
5Tr tu nhn to
-
Tc t my ht biy Bng hnh ng ca tc t my ht bi
Chui cc nhn thc Hnh ng g[A, Sch] Di chuyn sang phi[A, Bn] Ht bi[B, Sch] Di chuyn sang tri[ , ] y g[B, Bn] Ht bi[A, Sch], [A, Sch] Di chuyn sang phi[A, Sch], [A, Bn] Ht bi[A, Sch], [A, Bn] Ht bi. . .
function Reflex-Vacuum-Agent( [location,status]) returns an actiong ( [ , ])if status = Dirty then return Suckelse if location = A then return Rightelse if location = B then return Leftelse if location = B then return Left
6Tr tu nhn to
-
Tc t hp l (1)p ( ) Tc t cn phn u lm ng vic cn lm, da
trn nhng g n nhn thc (nhn bit) c v datrn nhng g n nhn thc (nhn bit) c v da trn cc hnh ng m n c th thc hin
Mt hnh ng ng (hp l) l hnh ng gip cho tc g g ( p ) g g pt t c thnh cng cao nht i vi mc tiu t ra
nh gi hiu qu hot ng: l tiu chun nh gi th h t h t t t tmc thnh cng trong hot ng ca mt tc t
V d: Tiu ch nh gi hiu qu hot ng ca mt tc t my ht bi c th l: mc lm sch, thi gian ht bi, mc in
nng tiu tn, mc ting n gy ra,
7Tr tu nhn to
-
Tc t hp l (2)
Tc t hp lp Vi mi chui nhn thc c c,
Mt tc t hp l cn phi la chn mt hnh ng Mt tc t hp l cn phi la chn mt hnh nggip cc i ha tiu ch nh gi hiu qu hot ng ca tc t ,
Da trn cc thng tin c cung cp bi chui nhn thc v cc tri thc c s hu bi tc t
8Tr tu nhn to
-
Tc t hp l (3) S hp l S thng sut mi th
S thng sut mi th = Bit tt c mi th vi tri thc v hn S thng sut mi th = Bit tt c mi th, vi tri thc v hn V cc nhn thc c th khng cung cp tt c cc thng tin lin
quan
Cc tc t c th thc hin cc hnh ng nhm thay i cc nhn thc trong tng lai, vi mc ch thu c cc thng tin hu ch (v d: thu thp thng tin, khm ph g ( p g ptri thc)
Tc t t tr (autonomous agent) l mt tc t m cc h h t h bi h h ki hhnh ng ca n c quyt nh bi chnh kinh nghim ca tc t (cng vi kh nng hc v thch nghi)
9Tr tu nhn to
-
Mi trng cng vic PEAS (1)
PEAS Performance measure: Tiu ch nh gi hiu qu hot ng
Environment: Mi trng xung quanh
Actuators: Cc b phn hnh ng Actuators: Cc b phn hnh ng
Sensors: Cc b phn cm bin
thit k t t t th i h (h l) t ti thit k mt tc t thng minh (hp l), trc tin cn phi xc nh (thit lp) cc gi tr ca cc thnh phn ca PEAS
10Tr tu nhn to
-
Mi trng cng vic PEAS (2)
V d: Thit k mt tc t li xe taxi t ng
h i hi h t (P) t h h nh gi hiu qu hot ng (P): an ton, nhanh, ng lut giao thng, mc hi lng ca khch hng, ti u li nhun,
Mi trng xung quanh (E): cc con ng (ph), cc phng tin khc cng tham gia giao thng, nhng ngi i b, cc khch hng, g g g
Cc b phn hnh ng (A): bnh li, chn ga, phanh, n tn hiu, ci xe,
Cc b phn cm bin (S): my quay (cameras), ng h tc , GPS, ng h o khong cch qung ng, cc b cm bin ng c,
11Tr tu nhn to
-
Mi trng cng vic PEAS (3)
V d: Thit k mt tc t chun on y tnh gi hiu qu hot ng (P): mc sc khenh gi hiu qu hot ng (P): mc sc khe
ca bnh nhn, cc tiu ha cc chi ph, cc vic kin co,
Mi trng xung quanh (E): bnh nhn, bnh vin, nhn vin y t, Cc b phn hnh ng (A): hin th trn mn hnhCc b phn hnh ng (A): hin th trn mn hnh cc cu hi, cc xt nghim, cc chun on, cc iu tr, cc ch dn,
Cc b phn cm bin (S): bn phm nhp vo cc thng tin v triu chng, cc tr li ca bnh nhn i vi cc cu hii vi cc cu hi,
12Tr tu nhn to
-
Mi trng cng vic PEAS (4)
V d: Thit k mt tc t nht vt
nh gi hiu qu hot ng (P): t l (bao nhiu phn trm) cc vt c t vo ng cc thng
Mi trng xung quanh (E): dy chuyn chuyn ng trn c cc vt, cc thng ng
Cc b phn hnh ng (A): cnh tay v bn tay c kt ni
Cc b phn cm bin (S): my quay (camera), cc b cm bin cc gc (cc hng)
13Tr tu nhn to
-
Mi trng cng vic PEAS (5)
V d: Thit k mt tc t dy ting Anh tng tc
nh gi hiu qu hot ng (P): cc i ha im thi ting Anh ca hc vin
Mi trng xung quanh (E): mt nhm hc vin
Cc b phn hnh ng (A): hin th mn hnh ccCc b phn hnh ng (A): hin th mn hnh cc bi tp, cc gi , sa (cha) bi tp
Cc b phn cm bin (S): bn phmCc b phn cm bin (S): bn phm
14Tr tu nhn to
-
Mi trng cng vic PEAS (6)
V d: Thit k mt tc t lc th rc (spam emails filt i )filtering)
nh gi hiu qu hot ng (P): kh nng lc th ( t f f l iti f l ti )rc (amount of error: false positives, false negatives)
Mi trng xung quanh (E): email server and clients
Cc b phn hnh ng (A): nh du th rc, gi thng bo
Cc b phn cm bin (S): nhn v phn tch ni dung cc emails
15Tr tu nhn to
-
Cc kiu mi trng (1)g ( ) C th quan st c hon ton (hay c th quan st c
mt phn)? Cc b cm bin ca mt tc t cho php n truy cp ti trng thi y
ca mi trng ti mi thi im
Xc nh (hay ngu nhin)? ( y g ) Trng thi tip theo ca mi trng c xc nh hon ton da trn
trng thi hin ti v hnh ng ca tc t (ti trng thi hin ti ny) Nu mt mi trng l xc nh, ngoi tr i vi cc hnh ng ca ccg g g
tc t khc, th gi l mi trng chin lc
Phn on (hay lin tip)? Kinh nghim ca tc t c chia thnh cc giai on (chng/hi) Kinh nghim ca tc t c chia thnh cc giai on (chng/hi) Mi giai on bao gm vic nhn thc ca tc t v hnh ng m n
thc hin mi giai on vic la chn hnh ng thc hin ch ph thuc vo mi giai on, vic la chn hnh ng thc hin ch ph thuc vo
giai on (khng ph thuc vo cc giai on khc)
16Tr tu nhn to
-
Cc kiu mi trng (2) Tnh (hay ng)?
Mi trng khng thay i trong khi tc t cn nhc (xem nn Mi trng khng thay i trong khi tc t cn nhc (xem nna ra hnh ng no)
Mi trng bn ng (semi-dynamic) l mi trng m khi thigian tri qua th n (mi trng) khng thay i nhng hiu qugian tri qua th n (mi trng) khng thay i, nhng hiu quhot ng ca tc t th thay i V d: cc chng trnh tr chi c tnh gi
Ri (h li )? Ri rc (hay lin tc)? Mt tp hu hn cc nhn thc v cc hnh ng, c nh
ngha phn bit r rng
Tc t n l (hay a tc t)? Mt tc t hot ng c lp (khng ph thuc / lin h vi cc
tc t khc) trong mt mi trngtc t khc) trong mt mi trng
17Tr tu nhn to
-
Cc kiu mi trng V dChi c Chi c khng Li xe taxi tnh gi tnh gig g
Quan st y ? c c khngXc nh? chin lc chin lc khngPhn on? khng khng khngPhn on? khng khng khngTnh? bn ng c khngRi rc? c c khngT t ? kh kh khTc t n? khng khng khng
Kiu ca mi trng c nh hng quyt nh i vi vic thit ktc ttc t
Mi trng trong thc t thng c cc c im: ch c th quanst c mt phn, ngu nhiu, lin tip, thay i (ng), lin tc, atc ttc t
18Tr tu nhn to
-
Cc kiu tc t
4 kiu tc t c bn
Tc t phn x n gin (simple reflex agents)
T t h d t h h ( d l b d fl Tc t phn x da trn m hnh (model-based reflex agents)
Tc t da trn mc tiu (goal-based agents)
Tc t da trn li ch (utility-based agents)( y g )
19Tr tu nhn to
-
Tc t phn x n gin (1)p g ( )
20Tr tu nhn to
-
Tc t phn x n gin (2)Tc t phn x n gin:
H h th t t (l t) i ki h hHnh ng theo mt quy tc (lut) c iu kin ph hpvi trng thi hin thi (ca mi trng)
function SIMPLE-REFLEX-AGENT(percept)static: rules (tp cc lut c dng: iu kin-hnh ng)
state INTERPRET-INPUT(percept)rule RULE-MATCH(state, rules)action RULE-ACTION[rule]return action
21Tr tu nhn to
-
Tc t phn x da trn m hnh (1)
22Tr tu nhn to
-
Tc t phn x da trn m hnh (2)
Tc t phn x da trn m hnh: S dng mt m hnh ni b gim st trng thi hin ti ca S dng mt m hnh ni b gim st trng thi hin ti ca
mi trng La chn hnh ng: ging nh i vi tc t phn x n gin
function REFLEX-AGENT-WITH-STATE(percept)static: state (m t trng thi hin ti ca mi trng)
rules (tp cc lut c dng: iu kin hnh ng)rules (tp cc lut c dng: iu kin-hnh ng)action (hnh ng gn nht)
state UPDATE-STATE(state action percept)state UPDATE-STATE(state, action, percept)rule RULE-MATCH(state, rules)action RULE-ACTION[rule]
23Tr tu nhn to
return action
-
Tc t da trn mc tiu (1) Bit v trng thi hin ti ca mi trng: cha
Cn bit thm thng tin v mc tiuCn bit thm thng tin v mc tiuTrng thi hin ti ca mi trng: mt ng t, xe taxi c th r
tri, r phi, hoc i thng
Thng tin v mc tiu: xe taxi cn i ti ch n ca hnh khch
Tc t da trn mc tiu Tc t da trn mc tiuTheo di trng thi hin ti ca mi trng
Lu gi mt tp cc mc tiu (cn t c)u g t tp cc c t u (c t c)
Chn hnh ng cho php (rt cuc) s t n cc mc tiu
24Tr tu nhn to
-
Tc t da trn mc tiu (2) ( )
25Tr tu nhn to
-
Tc t da trn li ch (1) Trong nhiu mi trng, thng tin v cc mc tiu khng nh gi hiu qu ca cc hnh ng nh gi hiu qu ca cc hnh ngC rt nhiu chui cc hnh ng cho php taxi i n ch (tc
l t n mc tiu)
Nhng: chui hnh ng no nhanh hn, an ton hn, ng tin cy hn, chi ph thp hn?
Cn s nh gi li ch i vi tc t
Hm li ch (utility function)nh x t chui cc trng thi ca mi trng ti mt gi tr s
thc (th hin mc li ch i vi tc t)
26Tr tu nhn to
-
Tc t da trn li ch (2) ( )
27Tr tu nhn to
-
Tc t c kh nng hc (1) Kh nng hc cho php tc t ci thin hiu qu hot ng ca nng ca n
4 thnh phn to nn mt tc t c kh nng hc Thnh phn hnh ng: m nhim vic la chn cc hnh ng
Thnh phn nh gi (bnh lun): nh gi hiu qu hot ng
Thnh phn hc: gip ci thin hiu qu hot ng - da trn cc nh gi, thay i (ci thin) thnh phn hnh ng
Thnh phn sn sinh kinh nghim: c nhim v xut cc hnh p s s g c ut cc ng gip sn sinh ra (dn n) cc kinh nghim mi
28Tr tu nhn to
-
Tc t c kh nng hc (2)g ( )
29Tr tu nhn to
-
C s tri thc ca tc t Mt c s tri thc (a knowledge base) l mt tp cc mnh
(pht biu) c biu din trong mt ngn ng hnh thc, cung cp tri thc (hiu bit) cho mt tc t
Tc t khai thc c s tri thc (m n s hu) trong qu trnh a ra cc hnh nga ra cc hnh ng
Cc tc t c th c xem xt mc Tri thc: Tc t bit nhng g? Cc mc tiu ca tc t l g?
Ci t h thng: Cc cu trc d liu trong c s tri thc? Cc gii thut Ci t h thng: Cc cu trc d liu trong c s tri thc? Cc gii thut s dng cc tri thc ny?
Tc t cn c kh nngTh th ( ht) t i th i Thu thp (cp nht) cc tri thc mi
Cp nht vic biu din (bn trong tc t) i vi mi trng xung quanh Suy din ra cc thuc tnh n ca mi trng xung quanh
S l h h h l Suy lun a ra cc hnh ng hp l
30Tr tu nhn to
-
a tc t (1) Mi trng hot ng: Cng tc (hp tc) hay l Cnh
tranh (i khng)?tranh (i khng)?
Trong nhiu bi ton thc t, mi trng hot ng lun thay i (bin ng) tc t cn cp nhtthay i (bin ng) tc t cn cp nht
Cn mt m hnh biu din k hoch ca cc tc t khc
Cc tc t cng tc Cng chia s cc mc tiu hoc cc k hoch V d: Lp k hoch (cho hot ng nhm) trong tr chi tennis V d: Lp k hoch (cho hot ng nhm) trong tr chi tennis nh i
Cc c ch cng tc: Phn tch v phn phi cc nhim v cho mi tc tmi tc t
31Tr tu nhn to
-
a tc t (2) Cc tc t cnh tranh
V d: chi c V d: chi c Mi tc t phi nhn bit c s tn ti (v hot ng) ca cc
tc t khcMi t t t h t (d ) k h h ( t ) Mi tc t tnh ton (d on) c cc k hoch ca (mt s) cc tc t khc
Mi tc t tnh ton (d on) c nh hng ca cc k hoch t t kh i i k h h b th ca cc tc t khc i vi k hoch ca bn thn n
Mi tc t quyt nh hnh ng ti u i vi d on nh hng ny
32Tr tu nhn to
-
Tc t - Tng ktg Tc t tng tc vi mi trng thng qua cc b phn cm
bin v cc b phn hnh ng p g Hm tc t m t cc hnh ng m tc t thc hin trong
cc tnh hung Mt tc t hp l s cc i ha hiu qu hot ng ca n Cc chng trnh tc t ci t (thc hin) cc hm tc t
C t PEAS h i t i Cc m t PEAS xc nh mi trng cng vic Cc mi trng c phn loi da theo cc tiu ch: C th
quan st c? Xc nh? Phn on? tnh? Ri rc? Tc t q n l?
Cc kin trc tc t c bn: Phn x, Phn x theo trng thi Da trn mc tiu Da trn li chthi, Da trn mc tiu, Da trn li ch
33Tr tu nhn to