Liner Regression
trainSet: (x(i),y(i))
input: x
output: y
model: fw,b(x)=wx+b
cost funcation: J(w,b)=2m1i=1∑m(fw,b(x(i))−y(i))2
gradient descent: minw,bJ(w,b)
w′:=w−α∂w∂J(w,b):=w−α2m1i=1∑m∂w∂(fw,b(x(i))−y(i))2:=w−αm1i=1∑m[(fw,b(x(i))−y(i))x(i)]
b′:=b−α∂b∂J(w,b):=b−α2m1i=1∑m∂b∂(fw,b(x(i))−y(i))2:=b−αm1i=1∑m(fw,b(x(i))−y(i))
w=w′,b=b′ (Simultaneous update)
learning rate: α
Multiple features(variables)
using vectorization
trainSet: (x(i),y(i))
input: x
output: y
model: fw,b(x)=wTx+b
cost funcation: J(w,b)=2m1i=1∑m(fw,b(x(i))−y(i))2
gradient descent: minw,bJ(w,b)
wj′forj=0…n−1:=wj−α∂wj∂J(w,b):=wj−α2m1i=1∑m∂wj∂(fw,b(x(i))−y(i))2:=wj−αm1i=1∑m[(fw,b(x(i))−y(i))xj(i)]
b′:=b−α∂b∂J(w,b):=b−α2m1i=1∑m∂b∂(fw,b(x(i))−y(i))2:=b−αm1i=1∑m(fw,b(x(i))−y(i))
w=w′,b=b′ (Simultaneous update)
learning rate: α