4.1 Multiple features (variables)

notation:
$n$ = number of features
$x^{(i)}$ = input (features) of the $i$th training example
$x_j^{(i)}$ = value of feature $j$ in the $i$th training example
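
To make the indexing concrete, here is a minimal NumPy sketch with a made-up housing dataset (the values and column meanings are illustrative, not from the notes). Note that the notes use 1-based indices while NumPy is 0-based:

```python
import numpy as np

# Hypothetical training set: m = 3 examples, n = 2 features
# (columns: size in square feet, number of bedrooms -- made-up values)
X = np.array([[2104, 3],
              [1600, 3],
              [2400, 4]])

x_2 = X[1]       # x^{(2)}: feature vector of the 2nd training example -> [1600, 3]
x_1_2 = X[1, 0]  # x_1^{(2)}: value of feature 1 in the 2nd example -> 1600
print(x_2, x_1_2)
```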

hypothesis:

previously: $h_\theta(x) = \theta_0 + \theta_1 x$
Now: $h_\theta(x) = \theta_0 + \theta_1 x_1 + \theta_2 x_2 + \cdots + \theta_n x_n = \theta^T x$
for convenience of notation, define $x_0 = 1$, so that

$$x = \begin{pmatrix} x_0 \\ x_1 \\ x_2 \\ \vdots \\ x_n \end{pmatrix} \in \mathbb{R}^{n+1} \qquad \theta = \begin{pmatrix} \theta_0 \\ \theta_1 \\ \theta_2 \\ \vdots \\ \theta_n \end{pmatrix} \in \mathbb{R}^{n+1}$$
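
The vectorized form $\theta^T x$ is a single dot product. A minimal NumPy sketch (parameter values are made up for illustration):

```python
import numpy as np

def hypothesis(theta, x):
    """h_theta(x) = theta^T x, assuming x already includes x_0 = 1."""
    return theta @ x

# Illustrative (made-up) parameters and one feature vector with n = 2 features
theta = np.array([80.0, 0.1, 25.0])  # theta_0, theta_1, theta_2
x = np.array([1.0, 2104.0, 3.0])     # x_0 = 1 prepended for convenience

print(hypothesis(theta, x))  # 80 + 0.1*2104 + 25*3 = 365.4
```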
conclusion: a hypothesis of this form, with multiple input features, is called multivariate linear regression.

4.2 Gradient descent for multiple variables

hypothesis: