nhom08 chuong05 svm
DESCRIPTION
bài tập máy họcTRANSCRIPT
-
TRNG I HC KHOA HC T NHIN
KHOA CNG NGH THNG TIN
Chuyn ngnh Khoa hc My tnh
Lp Cao hc Kha 22
BO CO
MN MY HC
SUPPORT VECTORS MACHINE
TP.HCM, 2012
-
TRNG I HC KHOA HC T NHIN
KHOA CNG NGH THNG TIN
Chuyn ngnh Khoa hc My tnh
Lp Cao hc Kha 22
BO CO
MN MY HC
SUPPORT VECTORS MACHINE
T GIA DIU 1211013
NGUYN KHA 1211030
HUNH DUY KHOA 1211031
L ANH T 1211078
TRN THIN VN 1211084
GING VIN HNG DN
TS.TRN THI SN
-
i
MC LC
MC LC ................................................................................................................... 1
DANH MC CC HNH ........................................................................................... 3
DANH MC CC BNG ......................................................................................... 5
Phn 1 Support Vectors Classifier .............................................................................. 1
1.1. Tuyn tnh ............................................................................................. 1
1.1.1. Bi ton ......................................................................................... 1
1.1.2. Gii quyt...................................................................................... 2
1.1.3. Nhn t Largrange(Lagrange multiplier) ..................................... 6
1.1.4. Soft Margin ................................................................................... 7
1.2. SVC phi tuyn ....................................................................................... 8
1.2.1. Bi ton ......................................................................................... 8
1.2.2. V d minh ha ........................................................................... 10
1.2.3. Th thut Kernel ......................................................................... 11
1.2.4. Mt s v d hm kernel ............................................................. 13
1.3. Multiple Classification ........................................................................ 14
1.3.1. One Versus the Rest: .................................................................. 14
1.3.2. Pairwise Support Vector Machines ............................................ 17
1.3.3. Error-Correcting Output Coding (Thomas G.Dietterich,
Ghumlum Bakiri) ........................................................................ 19
1.4. Sequential Minimal Optimization (SMO) .......................................... 21
1.4.1. Tm kim theo hng (Direction Search) ................................... 22
1.4.2. Thut ton [6] ............................................................................. 23
-
ii
Phn 2 Support Vectors Regression .......................................................................... 24
2.1. Bi ton hi quy (Regression)............................................................. 24
2.2. Hm li ................................................................................................ 24
2.3. SVR ............................................................................................... 25
Phn 3 Th vin h tr lp trnh v cc hng pht trin......................................... 28
3.1. Cc th vin h tr lp trnh SVM ..................................................... 28
3.2. Mt s hng nghin cu ................................................................... 29
a. Hiu qu tnh ton ....................................................................... 29
b. La chn hm kernel .................................................................. 29
c. Hc SVM c cu trc ................................................................. 30
Ti liu tham kho ..................................................................................................... 32
-
iii
DANH MC CC HNH
Hnh 1-1 Cc dy c th chia tp d liu [1] .............................................................. 2
Hnh 1-2 S lc v hyperlane [1] .............................................................................. 2
Hnh 1-3 Minh ho Largrange Multiflier ................................................................... 6
Hnh 1-4 Vi im nhiu trong b d liu .................................................................. 7
Hnh 1-5 (a) SVC tuyn tnh vi Soft Margin. (b) SVC phi tuyn. ............................ 8
Hnh 1-6 Mt mt phn tch phi tuyn trong khng gian gi thuyt c th tr thnh
1 siu phng trong khng gian c trng. ................................................................... 9
Hnh 1-7 V d minh ha SVM phi tuyn. (a)Khng gian gi thuyt. (b)Khng gian
................................................................................................................................... 10
Hnh 1-8 Siu phng phn lp trong khng gian c trng ...................................... 11
Hnh 1-9 D liu phn b dng ng cong ............................................................. 14
Hnh 1-10 Cc classifiers ca OVR .......................................................................... 15
Hnh 1-11 Trng hp c nhiu fi(x) > 0 (chm hi ) v khng c f(i)>0 (chm
hi xanh). ................................................................................................................... 15
Hnh 1-12 : Minh ha fuzzy 1. .................................................................................. 16
Hnh 1-13 Cc ng en l kt qu ca s dng fuzzy. ......................................... 16
Hnh 1-14 Cc classifiers ca Pairwise. .................................................................... 17
Hnh 1-15 Trng hp c nhiu i tha mn iu kin chn lp ............................... 18
Hnh 1-16 Cy DDAG .............................................................................................. 18
Hnh 1-17 Kt qu DDAG ........................................................................................ 19
Hnh 1-18 Kt qu fsvm. ........................................................................................... 19
Hnh 1-19 Kt qu codewords................................................................................... 20
Hnh 1-20 Hamming decoding .................................................................................. 21
-
iv
Hnh 1-21 Cho im bt u v mt hng u kh thi, tm kim theo hng s cc
i ha hm , vi vi lun tha iu kin KKT
[6] .............................................................................................................................. 22
Hnh 2-1 ng hi quy ca tp im cho trc. .................................................... 24
Hnh 2-2 Nhn thy cc t trng cho ngng li, c hm tri u, c hm phi n
ngng mi tnh li, c hm tuyn tnh, c hm phi tuyn. ..................................... 25
Hnh 2-3 Hm li ................................................................................................... 26
Hnh 3-1 Phn lp bng LibSVM ............................................................................. 28
-
v
DANH MC CC BNG
Bng 1-1 Mt s thut ton ti u tm [6] ........................................................ 21
-
1
Phn 1
Support Vectors Classifier
- SVM Support Vectors Machine l mt m hnh hc c gim st, trong lnh
vc my hc.
- SVM thng c s dng phn lp d liu (classification), hoc phn
ch hi quy (regression annalysis). L nn tng cho nhiu thut ton trong
khai thc d liu.
- SVM c gii thiu vi Vladimir Vapnik v cc ng s vo nm 1979.
Paper c cng b chnh thc vo nm 1995.
- tng chnh ca phng php SVM l phn chia b d liu vo cc phn
lp bng siu phng (hyperlane). T tng chnh, nhiu phng php ci
tin tu bin t phng php nguyn thu, cho nhiu trng hp s dng
khc nhau.
1.1. Tuyn tnh
Vi b d liu c th chia thnh 2 lp mt cch tuyn tnh (two-class linearly
separable data), SVC s tm ra mt siu phng (hyperlane) chia b d liu
bng cch cc i ho bin (Maximal Margin).
1.1.1. Bi ton
- Cho tp im D = {(xi, yi)} , D l mt tp c th phn lp tuyn tnh
(separable data set). Trong , l vector i din cho phn t th
i ,vi mi s c nh nhn { }.
- Bi ton t ra l tm dy rng nht c th phn chia tp d liu
ban u thnh 2 lp
-
2
Hnh 1-1 Cc dy c th chia tp d liu [1]
1.1.2. Gii quyt
- Trong khng gian 2 chiu, mt dy phn cch c th c biu din
bng 1 ng thng (l ng thng chnh gia dy - hyperlane)
- Hyperlane c biu din di phng trnh
Hnh 1-2 S lc v hyperlane [1]
-
3
l vector php tuyn ca hyperlane
- l khong cch gia 2 l ca dy phn cch. Nh vy, dy phn
cch rng nht s l dy c ch s ln nht.
- c tnh bng 2 ln khong cch ngn nht t cc im trong tp
d liu n hyperlane (xem hnh Error! Reference source not
found.). Cc im gn hyperlane nht l nhng im nm trn 2 l, s
c ngha cho vic tnh khong cch , c gi l Support Vector.
thit lp cng thc tnh trc ht, cn tnh c khong cch
t 1 im trong tp d liu n hyperlane.
- Ta c unit vector :
- Gi x l mt im nm trn hyperlane, x l mt im trong tp d
liu, ta c biu thc sau :
suy ra
- V x nm trn hyperlane, nn
(
)
suy ra
- r l khong cch t mt im x trong tp d liu n hyperlane (cn
c gi l geometric margin)
-
4
- Gi
- Gi r l khong cch ngn nht t tp d liu n hyperlane, khi ,
ta lun c :
- Nhn thy, gi tr r c lp vi b ( ,b),
Vy, c th chn sao cho . Khi , biu
thc trn c thu gn nh sau :
c gi l functional margin
*** functional margin c ln ph thuc vo t l scale ca vector ,
cn geometric margin th c lp vi t l ny.
*** Ch , ta c . iu ny khng thay i hyperlane
(phng trnh v hon ton tng ng. Mc ch rt
gn ny ch nhm mc ch d dng cho vic tnh ton v sau.
T , cho mt mu n hyperlane, ta tnh c :
Geometric margin
Functional margin
Suy ra
-
5
T , tm c biu thc cho margin :
Nh vy, tm dy phn cch rng nht, cng ng ngha vic tm margin
ln nht. Tr thnh bi ton cc i ho , hay cc tiu ho
(primal
problem) :
{
gii quyt bi ton ny, ta p dng nhn t largrange (Lagrange
multipliers)
[ ]
|w| cc tr khi L t cc tr trn w v b
{
Tng ng
{
thay vo biu thc Lagrange pha trn v tip tc tm cc tr trn ,
a bi ton v dng dual problem
Thay w =
-
6
ma
Vi rng buc
i n
Vn Convex quadratic programming optimization
iu kin Karush-Kuhn-Tucker (complementary slackness conditions):
[ ] i n
Gii bng thut ton Sequential Minimal Optimization (SMO) tm
b . T , tm c w,b bng cc biu thc pha trn.
*** Sau khi gii xong bi ton ti u ho, ta tm c b , th ch c
nhng gi tr tng ng vi nhng im nm trn 2 bin (Support
Vector) mi c gi tr khc 0, cn gi tr nhng im cn li iu
bng 0.
1.1.3. Nhn t Largrange(Lagrange multiplier)
- Gi thit :
{
- Xt rng buc
Vector php tuyn
Vector tip tuyn
Suy ra Hnh 1-3 Minh ho Largrange
Multiflier
-
7
Khi , f(x,y) s t cc tr khi .
{
v song song
Suy ra
Vit li, cch khc
Biu thc trn c ngha, f(x,y) t cc tr th
*** C th tm hiu thm v nhn t Largrange trong cc ti liu [2] [3]
1.1.4. Soft Margin
Trong thc t, tp d liu khng lun tho tnh cht separable (phn chia c),
m i khi, c mt vi im b nhiu (inseparable points) . V vy, t phng php
nguyn thu ban u, cn thm mt vi ci tin chp nhn nhng im nhiu ny.
Hnh 1-4 Vi im nhiu trong b d liu
-
8
chp nhn cc nhiu, cng thc ban u c tu chnh thm vi tham s
min
Trong :
Slack variable m t nhiu cho phn lp .
C nh hng ca li (C cng ln th mc nh hng ca cc
im nhiu cng ln, ngc li C cng nh, th iu kin ni lng hn,
d dng chp nhn nhiu hn).
Rng buc functional margin s c dng :
,
1.2. SVC phi tuyn
1.2.1. Bi ton
Phn trc trnh by v phng php SVC tuyn tnh vi tng chnh l tm
mt siu phng vi margin ln nht phn tch d liu. Tuy nhin, bi ton ti u
ch gii c nu tp d liu phn tch tuyn tnh c. Trong thc t, tp d liu
a s l khng phn chia tuyn tnh c bng siu phng (Hnh 1-5).
Hnh 1-5 (a) SVC tuyn tnh vi Soft Margin. (b) SVC phi tuyn.
-
9
gii quyt vn d liu khng kh tch tuyn tnh, chng ta c hai cch tip
cn. Cch tip cn th nht ( c trnh by) l s dng phng php cc i ha
bin mm (soft margin) nh Hnh 1-5(a). Tuy nhin, trong cc trng hp d liu
nh Hnh 1-5(b) th cch tip cn bin mm khng kh dng.
Trong trng hp , chng ta phi s dng mt cch tip cn khc. tng chnh
ca hng tip cn ny l nh x cc mu trong khng gian ban u (khng gian
gi thuyt) sang 1 khng gian c s chiu ln hn (khng gian c trng). Sau p
dng phng php SVM tuyn tnh tm ra 1 siu phng phn hoch trong khng
gian c trng. Siu phng ny s ng vi mt phi tuyn trong khng gian gi
thuyt. Hnh 1-6 minh ha cho phng php ny. Ta thy, d liu, sau khi c nh
x t khng gian 2 chiu sang khng gian 3 chiu, c th phn tch hon ton bng
1 siu phng trong khng gian mi.
Hnh 1-6 Mt mt phn tch phi tuyn trong khng gian gi thuyt c th tr thnh 1
siu phng trong khng gian c trng.
-
10
1.2.2. V d minh ha
Hnh 1-7 V d minh ha SVM phi tuyn. (a)Khng gian gi thuyt. (b)Khng gian
Gi s ta c tp d liu nh Hnh 1-7(a). Mc tiu t ra l tm 1 siu phng phn
lp d liu chnh xc. Ta thy trong khng gian gc, tp d liu ny khng th phn
tch tuyn tnh. Do , ta phi nh x vo khng gian c trng mi. Gi s ta c
hm nh x nh sau
( ) {
(
)
( )
D liu trong khng gian gi thuyt sau khi nh x vo khng gian c trng mi s
nh Hnh 1-7(b). Sau khi nh x, ta s s dng phng php SVC tuyn tnh tm
siu phng phn lp trong khng gian c trng. Quay li bi ton SVC tuyn tnh,
v thay cc gi tr bng . Tnh tng t nh trng hp phi tuyn ta c
w = (1,1) v b = -3. Th vo phng trnh siu phng ta c siu phng: y = 3 x
nh hnh Hnh 1-8. Sau khi c c phng trnh siu phng, ta s s dng n
phn lp tng t nh trng hp tuyn tnh.
-
11
Hnh 1-8 Siu phng phn lp trong khng gian c trng
1.2.3. Th thut Kernel
Tuy nhin, cch tip cn trn gp phi 1 vn l khng gian c trng c th c s
chiu ln hn rt nhiu khng gian gi thuyt (c th v hn chiu), v do tn
nhiu thi gian tnh ton. Tuy nhin, ta thy rng trong cc php tnh, (x) ch xut
hin di dng tch v hng tc l dng (x)(y) m khng xut hin n l. V
vy, thay v s dng dng tng minh ca (x) th ch cn s dng hm biu din
gi tr v hng (x)(y).
t K(x,y)=(x)(y), K(x,y) c gi l hm ht nhn (kernel function) [4].
Nh vy l ch cn bit dng ca hm ht nhn K(x,y) m khng cn bit c th nh
x (x). Lc hm phn lp tr thnh:
-
12
Tuy nhin khng th chn ty hm K(x,y) m phi chn K(x,y) sao cho tn ti mt
hm m K(x,y) = (x)(y). iu kin bo m vn ny l iu kin Mercer.
Sau y l 1 v d v K v :
Vi x = (x1,x2) R2, (x) = (1, , ,
, , ) R
6 th
K(x,y) = (x)(y) = (1+x.y)2.
nh x v khng gian ban u: Sau khi gii bi ton phi tuyn, ta c c siu
phng phn lp trong khng gian c trng. Da vo phng trnh siu phng ta
xc nh c cc im support vector. Sau , nh x cc support vector ny v
khng gian gi thuyt. Cui cng, t cc im support vector ta xc nh c
ng phn lp trong khng gian ban u.
Vn quan trng l tm hm kernel K(x,y) nh th no: R rng y l mt
vn ph thuc vo bi ton nhn dng. i vi nhng bi ton nhn dng n
gin trong s phn b cc mu ca 2 lp khng qu phc tp th c th tm hm
K(x,y) n gin sao cho s chiu ca l khng qu ln.
Cch xy dng mt hm kernel: Hm kernel phi tha mn nh l Mercer.
nh l ca Mercer [4] Cho K(x, x) l mt kernel i xng lin tc c xc nh
trong khong cch ng a x b v tng t cho x. Kernel K(x, x) c th c
m rng trong chui:
vi cc h s dng, i > 0 vi tt c i. S m rng ny l c c s v hi t, n l
cn thit v iu kin
gi cho tt c cc (-) m
-
13
C th tm tt cc c im hu ch nht trong vic xy dng kernel, m c gi
l Mercer kernel. l, i vi bt k tp hp con gii hn ngu nhin no ph
thuc vo khng gian u vo X, ma trn tng ng c xy dng bi hm kernel
K(x, x): l mt ma trn i xng v bn xc nh dng
(tr ring ca ma trn >= 0), c gi l mt ma trn Gram [5].
1.2.4. Mt s v d hm kernel
c tnh ca hm kernel:
Nu K1(x,y), K2(x,y) l cc hm kernel th K3(x,y) cng l hm kernel vi:
1. K3(x,y) = K1(x,y) + K2(x,y) [**]
2. K3(x,y) = aK1(x,y) vi a > 0 [**]
3. K3(x,y) = K1(x,y).K2(x,y) [**]
4. K3(x,y) = aK1(x,y) + bK2(x,y) vi a,b > 0 [**]
T cc cng thc trn c th suy ra mt s hm kernel thng dng nh sau:
1. Linear: ( )
2. Hm a thc bc p: ( ) ; ( )
3. Hm Gaussian (Radial-basis function):
( )
Trong trng hp no nn s dng hm kernel no l ty thuc vo s phn b ca
d liu. V d, nu d liu phn b dng ng cong nh Hnh 1-9 th chng ta phi
dng hm nhn a thc dng K(x,y)=(1+x.y)p.
-
14
Hnh 1-9 D liu phn b dng ng cong
Chiu khng gian c trng ng vi kernel ny l d =
. Kernel ny c th
chuyn tt c cc mt cong bc p trong khng gian Rn thnh siu phng trong khng
gian c trng.
Tm li phng php SVM phi tuyn l tm mt hm kernel K(x,y), sau gii bi
ton SVM tuyn tnh vi vic thay x1x2 = K(x1,x2) tm ra w v b.
1.3. Multiple Classification
1.3.1. One Versus the Rest:
1.3.1.1. Continuous Decision Functions
Hun luyn
Vi mt tp m class, xy dng mt tp classifier D1, ..., Dm. Mi mt fi s phn
bit 1 lp vi cc lp cn li.
-
15
Hnh 1-10 Cc classifiers ca OVR
Nh vy ta sau khi hun luyn ta s c tp D1,..,Dm classifier nhn din.
Nhn din:
ng vi mi gi tr x cn phn lp:
- x s thuc lp i nu c duy nht mt Di(x) > 0.
- Trng hp c nhiu hn mt Di(x) > 0 hoc khng c Di(x) >0, ta s dng
cch tnh:
Ngha l x thuc lp i vi Di(x) ln nht.
Hnh 1-11 Trng hp c nhiu fi(x) > 0 (chm hi )
v khng c f(i)>0 (chm hi xanh).
-
16
1.3.1.2. Fuzzy Support Vector Machines (FSVM):
S dng fuzzy gii quyt trng hp khng nhn din c khi c nhiu hn
mt Di(x) > 0 hoc khng c Di(x) >0.
S dng 2 fuzzy:
1. For i = j: mii(x)= { o i i o i
2. For i = j: mij(x)= { o i i o i
Hnh 1-12 : Minh ha fuzzy 1.
Sau khi tm c mij(x), ta tm mi(x): mi(x) = min mij(x) , j =1,...,n
x s thuc v lp: arg max mi(x), i=1,...,n
Hnh 1-13 Cc ng en l kt qu ca s dng fuzzy.
-
17
==> FSVM v SVMs with Continuous Decision Functions l tng ng.
1.3.2. Pairwise Support Vector Machines
Hun luyn
Vi mt tp m class, xy dng mt tp classifier D1, ..., Dn(n-1)/2. Mi mt fi s
phn bit 1 lp vi 1 lp khc trong tp.
Hnh 1-14 Cc classifiers ca Pairwise.
Nh vy sau khi hun luyn ta s c n(n-1)/2 classifier nhn din.
Nhn din:
nhn bit x thuc v lp no, ta tnh tt c cc gi tr Di(x), i=1,...,n(n-1)/2. x s
thuc v lp:
Di(x) =
arg max Di(x) , i=1,...,n.
Tuy nhin, s xy ra trng hp c nhiu i tha mn iu kin chn lp, khi m x
thuc v phn gch cho trong trng hp di.
-
18
Hnh 1-15 Trng hp c nhiu i tha mn iu kin chn lp
C nhiu phng php x l trng hp ny:
Decision Directed Acyclic Graph (Platt, Cristianini, and J. Shawe-Taylor)
Phng php ny xy dng mt cy quyt nh quyt nh da trn ph nh ca
i. Nh vy s khng gp trng hp khng th phn lp nh trn.
Hnh 1-16 Cy DDAG
Kt qu ca DDAG :
-
19
Hnh 1-17 Kt qu DDAG
Fuzzy SVM
Tng t trng hp OVR.
Hnh 1-18 Kt qu fsvm.
1.3.3. Error-Correcting Output Coding (Thomas G.Dietterich, Ghumlum
Bakiri)
Hun luyn
tng ca phng php ny l to ra cho mi lp trong m class mt chui nhi
phn duy nht gi l "codewords" t n hm phn lp f.
Mi hm phn lp f c xc nh da vo tng c s d liu.
V d: trong bi ton phn lp cc ch s t 0-9. Ta c th c cc hm f nh sau:
f1: phn lp 1,4,5.
f2: phn lp 1,3,5,7,9.
...
Sau khi xc nh cc hm phn lp f da vo d liu, ta tin hnh to cc
codewords cho cc lp. V d kt qu codewords:
-
20
Hnh 1-19 Kt qu codewords.
Lu : trng hp cc codewords c khong cch khc bit qu nh, th thng
thng s nh ngha thm ct (hm phn lp f) tng khong cch ch gia cc
codeword ny
Nhn din
Vi gi tr x cn nhn din, ta xc nh codeword ca x vi ci hm f xc nh.
Sau khi c codeword ca x ta so vi codewords ca d liu xc nh codeword
ca x gn vi lp i nht th x s thuc v lp i.
so snh codeword ca x vi codewords cc lp ta dng khong cch Hamming
(m s lng cc cc bit khc nhau).
1.3.3.1. Mt ci tin ca EOCO (Erin L. Allwein, Robert E. Schapire, Yoram Singer):
Thay v s dng codeword l 0,1 th tc gi s dng cc gi tr codeword l -1,0,+1.
Trong gi tr 0 biu din rng ta khng quan tm n hm phn lp f i vi lp.
xc nh gi tr codeword gn vi gi tr codewords lp no nht th tc gi s
dng phng php Hamming decoding:
dH(M(r), f (x)) = 1/2
Bn di l 1 v d tnh Hamming decoding:
-
21
Hnh 1-20 Hamming decoding
1.4. Sequential Minimal Optimization (SMO)
Nu nh gii h phng trnh theo phng php trn th s rt tn chi
ph v cc im khng phi l Support Vector s c gi tr xp x 0.
Cn c thut ton vi chi ph thp hn ti u ha bi ton cng vi cc
h s .
Bng 1-1 Mt s thut ton ti u tm [6]
Ra i nm 1999, l thut ton hiu qu u tin dng hun luyn tp d liu, ti
u cc h s (Trc c thut ton Chunk ca Vapnik v Chervonenkis).
Bi ton ban u :
Trc khi tm hiu thut ton, ta i vo mt khi nim mi :
-
22
1.4.1. Tm kim theo hng (Direction Search)
Gi s ta c tha mn iu kin ti u ca hm i ngu D().
Ta gi u={u1, u2, , un} l mt hng kh thi nu ta c th thm vo mt on
th vn tha mn D() ( ).
Gi l tt cc cc h s tha iu kin trn, nh vy [ ].
Nh vy, ta c c bi ton ti u .
ma
Hnh 1-21 Cho im bt u v mt hng u kh thi, tm kim theo hng s cc
i ha hm , vi vi lun tha iu kin KKT [6]
Xt hm D(+ ) l hm li.
Nh vy, theo cng thc Newton, vi s thay i rt nh, ta c th tnh c
nh sau [6] :
-
23
1.4.2. Thut ton [6]
Tm tt s lc :
u tin khi to cc tr s , v gk.
ng vi mi bc lp, tm hai ch s ca sao cho chng xa nhau nht. Sau
tin hnh tm kim c hng trn hai gi tr , ri tin hnh cp nht
li trng s ca o hm .
-
24
Phn 2
Support Vectors Regression
2.1. Bi ton hi quy (Regression)
Bi ton hi quy l bi ton kh ph bin, mc tiu l tm hm c trng cho mt
tp im ri rc ban u.
Hnh 2-1 ng hi quy ca tp im cho trc.
Cho tp im :
Hm hi quy s c dng :
[7]
Mc tiu l xy dng hm f vi li l thp nht.
2.2. Hm li
nh gi li cho hi quy, ngi ta a ra cc hm li. Dng tng qut :
[7]
Trong :
P l o xc sut trn tng quan ton b tp d liu
-
25
c(f,x,y)=(f(x) y2)
Mt s hm li thng dng :
Hnh 2-2 Nhn thy cc t trng cho ngng li, c hm tri u, c hm phi n
ngng mi tnh li, c hm tuyn tnh, c hm phi tuyn.
2.3. SVR
Hm mc tiu ti u s l :
-
26
Hnh 2-3 Hm li
[7]
Hm Lagrange
[7]
iu kin
Thay th cc iu kin trn vo ta hm i ngu
[7]
-
27
Gi tr c thay th v bin mt v ta c
iu kin KKT [7]
p dng cc iu kin trn, ta gii h phng trnh c cc * *
b c tnh theo cng thc
-
28
Phn 3
Th vin h tr lp trnh v cc hng pht trin
3.1. Cc th vin h tr lp trnh SVM
Hai th vin ph bin ci t thut ton SVM l LibSVM v SVMlight.
Trong LibSVM h tr nhiu nn tng h iu hnh khc nhau, cng nh
nhiu ngn ng lp trnh khc nhau (C++, Java) d dng ci tin v ng
dng. c bit LibSVM cho php tinh chnh nhiu tham s hn mt s phn
mm hoc th vin khc v h tr nhng b tham s mc nh h tr gii
quyt nhiu vn thc t mt cch hiu qu.
Hnh 3-1 Phn lp bng LibSVM
SVMlight l mt bn ci t khc ca SVM bng ngn ng C. SVMlight
thng qua k thut chn la hiu qu v kh thi nht v hai phng php tnh
ton hiu qu l nn v cache ca nh gi kernel. SVMlight gm hai
chng trnh C chnh: SVM_learn s dng hun luyn b phn lp v
SVM_classify kim chng.
-
29
Ngoi ra cn mt s b cng c c giao din hoc trc quan h tr thut
ton SVM nh Torch (C++), Spider (MATLAB) hay Weka (Java).
3.2. Mt s hng nghin cu
Trong mt thp k qua SVM pht trin rt nhanh v c l thuyt ln ng
dng nhng vn cn rt nhiu hng nghin cu trin vng. Sau y l mt s
hng chnh.
a. Hiu qu tnh ton
Mt trong nhng nhc im t u ca SVM l chi ph tnh ton cho
bc hun luyn ln, dn n kh p dng cho nhng b d liu ln. Tuy
nhin, vn ny c gii quyt thnh cng. Mt cch tip cn l
chia nh bi ton ti u ha thnh nhng bi ton nh hn m mi bi
ton ny ch lin quan n mt s bin c chn lc v th ti u ha
c gii quyt mt cch hiu qu. Tin trnh lp li cho n khi nhng
bi ton nh c gii quyt hon tt.
Mt vn gn y hn ca my hc SVM l tm mt mt cu ti tiu
bao quanh cc thc th. Nhng thc th khi nh x vo khng gian N-
chiu th hin mt li c th dng xy dng mt mt cu ti tiu bao
quanh. Gii quyt bi ton SVM trn nhng tp li s cho gii php tt
gn ng vi tc rt nhanh.
b. La chn hm kernel
Khi s dng hm kernel trong SVM, vic la chn hm kernel mt cch
tng qut phi tha nh l Mercer. Do , nhng hm kernel ph bin
thng thuc v mt trong ba loi: hm sigmoid, hm a thc v hm bn
knh c s. Gn y Pekalska v cc cng s a ra quan im mi v
vic thit k hm kernel da trn nh x mi lin quan gn. Nhng hm
kernel mi ny khng tha iu kin Mercer a ra cng nh khng gii
hn trong mt khng gian c trng v thc nghim ban u cho thy
-
30
hiu qua tt hn nhng hm kernel Mercer. Tuy nhin, nn tng l thuyt
ca th h hm kernel mi ny cn c nghin cu su hn na.
Ngoi ra, mt cch tip cn khc l s dng nhiu kernel hn l ch mt.
Thng qua s kt hp c th t c kt qu tt hn. Bng vic xc
lp ng hm mc tiu, vic la chn cc tham s cho kernel c th hin
thc cho php s dng nhiu kernel.
c. Hc SVM c cu trc
Cc i ha bin gia cc lp l ng lc ban u ca SVM. iu ny
dn n SVM tp trung vo vic phn tch cc lp ca mu hc nhng
khng quan tm n s phn b ca d liu trong tng lp. nh l
Khng c ba n tra min ph pht biu rng khng tn ti phng
php phn lp mu no tuyt i u th hn nhng phng php khc
hoc thm ch l so vi vic on m mt cch ngu nhin. Thc t cho
thy, ty vo tng bi ton, mi lp khc nhau c th c cu trc khc
nhau. B phn lp phi cn chnh ng bin sao cho khp vi cu trc
ca chng, c bit cho vn tng qut hot ca b phn lp. Tuy nhin,
SVM ban u khng quan tm n cu trc, dn n vic xc nh siu
phng phn cch mt cch cng nhc ngay gia nhng support vector,
dn n b phn lp khng ti u ha cho cc vn thc t.
Gn y, mt s thut ton c pht trin quan tm n cu trc ca
thng tin hn SVM ban u. Chng mang li mt quan im mi v b
phn lp, khi m n c th cm c cu trc ca s phn b d liu.
Nhng thut ton ny c chia thnh hai cch tip cn. Cch tip cn
th nh l manifold learning. Gi thit rng d liu thc t nm trong
nhng submanifold trong khng gian u vo, v thng thng cc thut
ton lin quan n Laplacian Support Vector Machine (LapSVM). Xy
dng LapSVM u tin thng qua th Laplacian trong mi lp. Sau ,
-
31
to ra cu trc manifold ca d liu tng ng vi ma trn Laplacian
trong SVM truyn thng.
Cch tip cn th hai l khai thc cc thut ton phn cm vi gi nh d
liu c cha nhiu cm vi thng tin phn phi. Gi nh ny dng nh
tng qut hn gi nh manifold. Mt cch tip cn gn y c bit n
l Structureed Large Margin Machine (SLMM). SLMM ng dng k
thut gom cm ly c thng tin v cu trc vo trong cc rng buc.
Mt s Large Margin Machines ph bin l Minimax Probibility Machine
(MPM), v Maxi-min Margin Machine (M4) c th xem nh mt dng
c bit ca SLMM. Thc nghim cho thy SLMM c kh nng phn
lp tt hn. Tuy nhin bi ton ti u ha ca SLMM l Second Order
Cone Programming (SOCP) thay v SP ca SVM, SLMM c chi ph tnh
ton cao hn bc hun luyn khi so snh vi SVM truyn thng. Hn
na khng n gin tng qut ha hay bi ton nhiu lp. T , mt
SVM c cu trc mi (SSVM) c pht trin. Kt qu l bi ton ti
u ha c th c gi vi QP nh SVM, v d dng m rng. Hn na,
SSVM cho thy v mt l thuyt v thc nghim tt hn SVM v
SLMM trong vn tng qut ha.
-
32
Ti liu tham kho
[1] Prabhakar Raghavan & Hinrich Schtze Christopher D. Manning. Standford.
[Online]. http://nlp.stanford.edu/IR-book/html/htmledition/support-vector-
machines-the-linearly-separable-case-1.html
[2] Jeff Knisley. Multivariable Calculus Online. [Online].
http://math.etsu.edu/multicalc/prealpha/Chap2/Chap2-9/index.htm
[3] Yan-Bin Jia, "Lagrange Multipliers," Nov 27, 2012.
[4] Hui Xue, Qiang Yang, and Songcan Chen. Chapter 3: Support Vector Machines.
[5] http://www.encyclopediaofmath.org/index.php/Gram_matrix.
[6] Lin, Leon Bottou, Chih-Jen, "Support Vector Machine Solvers".
[7] Bernhard Schlkopf, Alexander J. Smola, Learning with Kernels: Support Vector
Machines, Regularization, Optimization, and Beyond., 2001.