3.4 Orthogonal Basis and Gram-Schmidt

Orthogonal vectors

Nonzero orthogonal vectors are linearly independent, so they can serve as basis vectors.

Orthonormal

Dividing each orthogonal basis vector by its length gives an orthonormal basis.

The vectors $q_{1}, \dots, q_{n}$ are orthonormal if
$q_{i}^{T} q_{j} = \begin{cases} 0 & \text{whenever } i \neq j \quad \text{(orthogonality)} \\ 1 & \text{whenever } i = j \quad \text{(normalization)} \end{cases}$

A matrix with orthonormal columns is denoted by $Q$.
$Q = [\begin{matrix} q_{1} & q_{2} & \dots & q_{n} \end{matrix}]$

Orthogonal Matrices

If $Q$ has orthonormal columns, then $Q^{T} Q = I$.

An orthogonal matrix is a square matrix with orthonormal columns.

In that case $Q^{T}$ is the inverse of $Q$: $Q^{T} = Q^{-1}$.
If $Q$ is rectangular, $Q^{T}$ is only a left inverse of $Q$.

Example
Rotation matrix

A rotation of the axes through the angle $\theta$:

$Q = [\begin{matrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{matrix}], \quad Q^{T} = Q^{-1} = [\begin{matrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{matrix}]$

Orthogonal: $\cos\theta \sin\theta - \sin\theta \cos\theta = 0$
Orthonormal: $\sin^{2}\theta + \cos^{2}\theta = 1$
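The rotation example can be checked numerically. The following is a minimal sketch in plain Python; the helper names `rotation`, `transpose`, and `matmul` and the test angle are illustrative assumptions, not part of the original notes.

```python
import math

def rotation(theta):
    """Return the 2x2 rotation matrix Q for the angle theta (in radians)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]

def transpose(M):
    """Swap rows and columns of a matrix stored as a list of rows."""
    return [list(row) for row in zip(*M)]

def matmul(A, B):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

Q = rotation(0.7)              # 0.7 rad is an arbitrary test angle
QtQ = matmul(transpose(Q), Q)  # should be the 2x2 identity up to rounding
print(QtQ)
```

Up to rounding error, `QtQ` is the identity, which is the statement $Q^{T} Q = I$ for this particular $Q$.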

Example
Permutation matrix

If $P = [\begin{matrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 1 & 0 & 0 \end{matrix}]$ then $P^{-1} = P^{T} = [\begin{matrix} 0 & 0 & 1 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{matrix}]$

Geometrically, an orthogonal matrix $Q$ is the product of a rotation and a reflection.

A projection matrix generally shortens the vectors it acts on, whereas orthogonal matrices such as rotations and reflections preserve the length of every vector.

Length preservation: $‖ Q x ‖ = ‖ x ‖$ for every vector $x$.

This holds because $Q^{T} Q = I$: $‖ Q x ‖^{2} = (Q x)^{T} (Q x) = x^{T} Q^{T} Q x = x^{T} x = ‖ x ‖^{2}$

Inner products and angles are preserved as well.

Inner product (and angle) preservation: $(Q x)^{T} (Q y) = x^{T} Q^{T} Q y = x^{T} y$
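As a quick numerical illustration of both preservation properties, here is a small sketch that applies the permutation matrix $P$ from the example above to two vectors. The vectors `x` and `y` and the helper names are arbitrary choices made for this illustration.

```python
import math

def matvec(M, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]

def dot(u, v):
    """Inner product u^T v."""
    return sum(a * b for a, b in zip(u, v))

def norm(u):
    """Euclidean length of u."""
    return math.sqrt(dot(u, u))

# Permutation matrix from the example; its columns are orthonormal.
P = [[0, 1, 0],
     [0, 0, 1],
     [1, 0, 0]]

x = [1.0, 2.0, 2.0]   # arbitrary test vectors
y = [1.0, 0.0, 3.0]
Px, Py = matvec(P, x), matvec(P, y)

print(norm(x), norm(Px))       # lengths agree
print(dot(x, y), dot(Px, Py))  # inner products agree
```

A permutation only reorders components, so the sums of squares and products are unchanged; the same check works for any matrix with $Q^{T} Q = I$.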

Once a basis is known, any vector can be built as a combination of the basis vectors. When the basis is orthonormal, this becomes very simple.

The problem is to find the coefficients of the basis vectors.

For an arbitrary vector $b$,
$b = x_{1} q_{1} + x_{2} q_{2} + \dots + x_{n} q_{n}$

Multiply both sides by $q_{1}^{T}$. The left side becomes $q_{1}^{T} b$, and on the right side every term vanishes except $x_{1} q_{1}^{T} q_{1}$. Since $q_{1}^{T} q_{1} = 1$, this gives $x_{1} = q_{1}^{T} b$, and in general

$q_{i}^{T} b = x_{i}, \quad \text{since } q_{i}^{T} q_{j} = \begin{cases} 0, & i \neq j \\ 1, & i = j \end{cases}$

Every vector $b$ can therefore be written as
$b = (q_{1}^{T} b) q_{1} + (q_{2}^{T} b) q_{2} + \dots + (q_{n}^{T} b) q_{n}$

In matrix form this is $Q x = b$, so $x = Q^{-1} b$. Because $Q^{-1} = Q^{T}$, we get $x = Q^{T} b$.

$x = Q^{T} b = [\begin{matrix} q_{1}^{T} \\ ⋮ \\ q_{n}^{T} \end{matrix}] [\begin{matrix} b \end{matrix}] = [\begin{matrix} q_{1}^{T} b \\ ⋮ \\ q_{n}^{T} b \end{matrix}]$
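The formula $x_{i} = q_{i}^{T} b$ can be tried on a small case. The sketch below uses an orthonormal basis of $R^{2}$ and a vector `b` chosen purely for illustration; none of these numbers come from the original notes.

```python
import math

def dot(u, v):
    """Inner product u^T v."""
    return sum(a * b for a, b in zip(u, v))

s = 1 / math.sqrt(2)
qs = [[s, s], [s, -s]]  # an orthonormal basis q1, q2 of R^2 (illustrative choice)
b = [3.0, 1.0]          # the vector to expand (illustrative choice)

# Each coefficient is a single inner product: x_i = q_i^T b
coeffs = [dot(q, b) for q in qs]

# Rebuild b as the sum of x_i * q_i
rebuilt = [sum(c * q[k] for c, q in zip(coeffs, qs)) for k in range(len(b))]
print(coeffs, rebuilt)
```

No linear system is solved here: with an orthonormal basis, each coefficient is found independently of the others.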

Remark 1

Earlier, the projection of a vector $b$ onto the line through $a$ was written as $(a^{T} b / a^{T} a) a$.

Replacing $a$ by $q_{i}$ gives
$\frac{q_{i}^{T} b}{q_{i}^{T} q_{i}} q_{i} = (q_{i}^{T} b) q_{i} = x_{i} q_{i}$

So $(q_{i}^{T} b) q_{i}$ is the projection of $b$ onto $q_{i}$.

From this point of view, $b = (q_{1}^{T} b) q_{1} + (q_{2}^{T} b) q_{2} + \dots + (q_{n}^{T} b) q_{n}$ is the sum of the one-dimensional projections of $b$ onto each $q_{i}$.

Remark 2

Since $Q^{T} = Q^{-1}$, not only $Q^{T} Q = I$ but also $Q Q^{T} = I$.

The entries of $Q Q^{T}$ are the inner products of the rows of $Q$ with each other, so the rows are orthonormal as well.

In other words, if the columns of a square matrix are orthonormal, then so are its rows.

Rectangular Matrices with Orthogonal Columns

Chapter 3 has mostly dealt with rectangular matrices $A$, so consider rectangular matrices with orthonormal columns as well.

When $Q$ has more rows than columns ($m > n$), the system $Q x = b$ usually has no exact solution and has to be solved by least squares.

The key point is that $Q^{T} Q = I$ still holds. $Q^{T}$ is still a left inverse of $Q$.

So when $Q$ has orthonormal columns, the least-squares problem becomes much easier to solve.

$\begin{matrix} Q x & = & b & \text{rectangular system with no solution for most } b \\ Q^{T} Q \hat{x} & = & Q^{T} b & \text{normal equation for the best } \hat{x} \\ \hat{x} & = & Q^{T} b & \hat{x}_{i} \text{ is } q_{i}^{T} b \\ p & = & Q \hat{x} & \text{the projection of } b \text{ is } (q_{1}^{T} b) q_{1} + \dots + (q_{n}^{T} b) q_{n} \\ p & = & Q Q^{T} b & \text{the projection matrix is } P = Q Q^{T} \end{matrix}$

For $m > n$ the $m \times m$ matrix $Q Q^{T}$ is not the identity; it is the projection onto the column space of $Q$. When the columns of $Q$ are the first $n$ coordinate vectors, it has the block form

$Q Q^{T} = [\begin{matrix} I_{n} & 0 \\ 0 & 0 \end{matrix}]$

with the $n \times n$ identity in the upper left corner and zeros elsewhere.
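The steps $\hat{x} = Q^{T} b$ and $p = Q \hat{x}$ can be sketched for a $3 \times 2$ matrix $Q$. The columns `q1`, `q2` and the right side `b` below are made-up values for illustration, chosen so that the columns are orthonormal.

```python
def dot(u, v):
    """Inner product u^T v."""
    return sum(a * b for a, b in zip(u, v))

# Two orthonormal columns in R^3 (illustrative choice): Q is 3x2, so m > n.
q1 = [1 / 3, 2 / 3, 2 / 3]
q2 = [2 / 3, 1 / 3, -2 / 3]
b = [3.0, 0.0, 0.0]  # a right side that is not in the column space

# Least-squares solution: x_hat = Q^T b, one inner product per column
x_hat = [dot(q1, b), dot(q2, b)]

# Projection p = Q x_hat and error e = b - p
p = [x_hat[0] * q1[k] + x_hat[1] * q2[k] for k in range(3)]
e = [bk - pk for bk, pk in zip(b, p)]

print(x_hat, p)
print(dot(e, q1), dot(e, q2))  # the error is orthogonal to both columns
```

The error `e` being orthogonal to every column is what makes `p` the closest point to `b` in the column space.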

Example

Project $b = (x, y, z)$ onto the $xy$ plane.

$q_{1} = [\begin{matrix} 1 \\ 0 \\ 0 \end{matrix}] \text{ and } q_{1}^{T} b = x; \quad q_{2} = [\begin{matrix} 0 \\ 1 \\ 0 \end{matrix}] \text{ and } q_{2}^{T} b = y$

$P = q_{1} q_{1}^{T} + q_{2} q_{2}^{T} = [\begin{matrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 0 \end{matrix}], \text{ and } P b = [\begin{matrix} x \\ y \\ 0 \end{matrix}] \text{ (in the } xy \text{ plane)}$

Example

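When a straight line is fitted by least squares and the measurement times average to zero, the matrix of the fitting problem has orthogonal columns: the column of ones and the column of times have inner product $t_{1} + \dots + t_{m} = 0$.

See the textbook for details.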
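A small sketch of this situation, assuming the line is written as $C + D t$ (that notation, the times `t`, and the measurements `b` are assumptions made for this illustration). Because the two columns are orthogonal, each coefficient is a separate one-dimensional projection.

```python
def dot(u, v):
    """Inner product u^T v."""
    return sum(a * b for a, b in zip(u, v))

t = [-1.0, 0.0, 1.0]   # measurement times, chosen so that they average to zero
b = [1.0, 2.0, 4.0]    # made-up measurements
ones = [1.0] * len(t)  # first column of the fitting matrix; the second is t

print(dot(ones, t))    # zero inner product: the columns are orthogonal

# With orthogonal columns, each coefficient is (column . b) / (column . column)
C = dot(ones, b) / dot(ones, ones)  # intercept: the average of the measurements
D = dot(t, b) / dot(t, t)           # slope

residual = [bk - (C + D * tk) for bk, tk in zip(b, t)]
print(C, D, residual)
```

If the times do not average to zero, shifting them by their mean restores the orthogonality before fitting.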

The Gram-Schmidt Process

Given independent vectors $a, b, c$, it would be a great advantage if they formed an orthonormal basis.

The Gram-Schmidt process produces orthonormal vectors $q_{1}, q_{2}, q_{3}$ from the vectors $a, b, c$.

$q_{1}$ simply points in the direction of $a$.
- Divide $a$ by its length so that $q_{1}$ has length 1.
  $q_{1} = \frac{a}{‖ a ‖} \quad \text{(unit vector)}$
$q_{2}$ must be orthogonal to $q_{1}$.
- So if the second vector $b$ has a component in the direction of $q_{1}$, that component must be subtracted.
  $\text{Second vector: } B = b - (q_{1}^{T} b) q_{1} = (q_{2}^{T} b) q_{2} \text{ and } q_{2} = B / ‖ B ‖$
- $b = (q_{1}^{T} b) q_{1} + (q_{2}^{T} b) q_{2}$
$q_{3}$ is found in the same way.
$\text{Third vector: } C = c - (q_{1}^{T} c) q_{1} - (q_{2}^{T} c) q_{2} = (q_{3}^{T} c) q_{3} \text{ and } q_{3} = C / ‖ C ‖$

Subtracting from each new vector its components in the directions that are already settled is the Gram-Schmidt process.

For given vectors $a_{1}, \dots, a_{n}$:

$A_{j} = a_{j} - \sum_{i = 1}^{j - 1} (q_{i}^{T} a_{j}) q_{i} = (q_{j}^{T} a_{j}) q_{j}$
$a_{j} = \sum_{i = 1}^{j} (q_{i}^{T} a_{j}) q_{i}$
$q_{j} = \frac{A_{j}}{‖ A_{j} ‖} \quad \text{(normalization)}$

During the computation, $a_{j}$ may lie almost entirely in the span of the earlier vectors $a_{1}, \dots, a_{j - 1}$, so that $‖ A_{j} ‖$ is very small.
In that case, treat the vector as already contained in the earlier span and move on to the next vector.
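The process can be sketched in a few lines of Python. This is a minimal illustration of classical Gram-Schmidt, not a numerically robust routine; the function name `gram_schmidt`, the threshold `tol`, and the input vectors are assumptions made for this sketch.

```python
import math

def dot(u, v):
    """Inner product u^T v."""
    return sum(x * y for x, y in zip(u, v))

def gram_schmidt(vectors, tol=1e-12):
    """Return orthonormal vectors spanning the same space as `vectors`.

    `tol` is a hypothetical threshold: a vector whose remainder A_j is
    shorter than `tol` is treated as dependent on the earlier ones and skipped.
    """
    qs = []
    for a in vectors:
        A = list(a)
        for q in qs:
            coeff = dot(q, a)  # component of a along the settled direction q
            A = [Ak - coeff * qk for Ak, qk in zip(A, q)]
        length = math.sqrt(dot(A, A))
        if length < tol:
            continue           # a adds no new direction; move on
        qs.append([Ak / length for Ak in A])
    return qs

q1, q2, q3 = gram_schmidt([[1, 0, 1], [1, 0, 0], [2, 1, 0]])
print(q1, q2, q3)
```

In floating-point practice the modified variant, which projects the running remainder `A` instead of the original `a`, tends to lose less orthogonality; the classical form is shown here because it matches the formulas above.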

Example

Find $q_{1}, q_{2}, q_{3}$ from the given independent vectors $a, b, c$.

$a = [\begin{matrix} 1 \\ 0 \\ 1 \end{matrix}], b = [\begin{matrix} 1 \\ 0 \\ 0 \end{matrix}], c = [\begin{matrix} 2 \\ 1 \\ 0 \end{matrix}]$

$q_{1} = \frac{a}{‖ a ‖} = \frac{1}{\sqrt{2}} [\begin{matrix} 1 \\ 0 \\ 1 \end{matrix}]$

$B = b - (q_{1}^{T} b) q_{1} = [\begin{matrix} 1 \\ 0 \\ 0 \end{matrix}] - \frac{1}{\sqrt{2}} [\begin{matrix} \frac{1}{\sqrt{2}} \\ 0 \\ \frac{1}{\sqrt{2}} \end{matrix}] = \frac{1}{2} [\begin{matrix} 1 \\ 0 \\ -1 \end{matrix}]$

$q_{2} = \frac{B}{‖ B ‖} = \frac{1}{\sqrt{2}} [\begin{matrix} 1 \\ 0 \\ - 1 \end{matrix}]$

$C = c - (q_{1}^{T} c) q_{1} - (q_{2}^{T} c) q_{2} = [\begin{matrix} 2 \\ 1 \\ 0 \end{matrix}] - \sqrt{2} [\begin{matrix} \frac{1}{\sqrt{2}} \\ 0 \\ \frac{1}{\sqrt{2}} \end{matrix}] - \sqrt{2} [\begin{matrix} \frac{1}{\sqrt{2}} \\ 0 \\ \frac{-1}{\sqrt{2}} \end{matrix}] = [\begin{matrix} 0 \\ 1 \\ 0 \end{matrix}]$

$q_{3} = \frac{C}{‖ C ‖} = [\begin{matrix} 0 \\ 1 \\ 0 \end{matrix}]$

The Factorization $A = Q R$

Starting from the matrix $A$ with columns $a, b, c$, we obtained the matrix $Q$ with columns $q_{1}, q_{2}, q_{3}$.

There must be a third matrix that connects $A$ and $Q$.

The idea is to write the vectors $a, b, c$ as combinations of the $q$'s.

As shown above, $b$ is a combination of $q_{1}$ and $q_{2}$, and similarly $c$ is a combination of $q_{1}$, $q_{2}$, and $q_{3}$.

$b = (q_{1}^{T} b) q_{1} + (q_{2}^{T} b) q_{2}$
$c = (q_{1}^{T} c) q_{1} + (q_{2}^{T} c) q_{2} + (q_{3}^{T} c) q_{3}$

Written in matrix form, this gives a new factorization $A = Q R$.

$\text{QR factors: } A = [\begin{matrix} a & b & c \end{matrix}] = [\begin{matrix} q_{1} & q_{2} & q_{3} \end{matrix}] [\begin{matrix} q_{1}^{T} a & q_{1}^{T} b & q_{1}^{T} c \\ 0 & q_{2}^{T} b & q_{2}^{T} c \\ 0 & 0 & q_{3}^{T} c \end{matrix}] = Q R$

$R$ is an upper triangular matrix; it is called $R$ because its nonzero entries lie on and to the right of the diagonal.
The $Q R$ factorization resembles $A = L U$, except that the first factor $Q$ has orthonormal columns.
The lengths of $a, B, C$ are the diagonal entries of $R$.

The $Q R$ factorization simplifies computations with $A$.

$A^{T} A = R^{T} Q^{T} Q R = R^{T} R$
The normal equation $A^{T} A \hat{x} = A^{T} b$ reduces to a triangular system: $R^{T} R \hat{x} = R^{T} Q^{T} b$, that is, $R \hat{x} = Q^{T} b$ because $R$ is invertible.
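To make the factorization concrete, the sketch below builds $Q$ and $R$ by Gram-Schmidt and then solves $R \hat{x} = Q^{T} b$ by back substitution. The function names `qr_columns` and `back_substitute`, and the right side `rhs`, are assumptions made for this illustration; the columns are assumed to be independent.

```python
import math

def dot(u, v):
    """Inner product u^T v."""
    return sum(x * y for x, y in zip(u, v))

def qr_columns(cols):
    """Gram-Schmidt QR. `cols` lists the independent columns of A.

    Returns the columns of Q and the upper triangular R (as a list of rows),
    with R[i][j] = q_i^T a_j for i < j and R[j][j] = ||A_j||.
    """
    n = len(cols)
    qs = []
    R = [[0.0] * n for _ in range(n)]
    for j, a in enumerate(cols):
        A = list(a)
        for i, q in enumerate(qs):
            R[i][j] = dot(q, a)
            A = [Ak - R[i][j] * qk for Ak, qk in zip(A, q)]
        R[j][j] = math.sqrt(dot(A, A))
        qs.append([Ak / R[j][j] for Ak in A])
    return qs, R

def back_substitute(R, y):
    """Solve the upper triangular system R x = y from the last row upward."""
    n = len(y)
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(R[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (y[i] - s) / R[i][i]
    return x

cols = [[1.0, 0.0, 1.0], [1.0, 0.0, 0.0], [2.0, 1.0, 0.0]]  # columns a, b, c
qs, R = qr_columns(cols)

rhs = [4.0, 1.0, 1.0]                 # illustrative right side, equal to a + b + c
Qt_rhs = [dot(q, rhs) for q in qs]    # Q^T b, one inner product per column
x = back_substitute(R, Qt_rhs)
print(R)
print(x)
```

Here $A$ is square, so the triangular system reproduces the exact solution; for a tall $A$ the same two steps give the least-squares $\hat{x}$.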

Function Spaces and Fourier Series

Brief and optional section.

Extend vector spaces to function spaces.
Apply Gram-Schmidt orthogonalization in a function space.

1. Hilbert Space (Function space)

Extend the $n$-dimensional space $R^{n}$ to $R^{\infty}$:
$v = (v_{1}, v_{2}, v_{3}, \dots)$
Only vectors of finite length are included: $‖ v ‖^{2} = v_{1}^{2} + v_{2}^{2} + \dots < \infty$
The functions considered are defined on a finite interval.

Hilbert space is the vector space of those vectors in $R^{\infty}$ that have finite length.

Hilbert space is a vector space

$‖ v_{1} + v_{2} ‖ \leq ‖ v_{1} ‖ + ‖ v_{2} ‖ < \infty \;\to\; \text{closed under addition}$
$‖ c v_{1} ‖ = | c | \, ‖ v_{1} ‖ < \infty \;\to\; \text{closed under scalar multiplication}$

2. Lengths and Inner Products

A continuous function $f$ on an interval can be viewed as a vector with a continuum of components $f (x)$, one for each point of the interval.

The length of such a vector cannot be found by adding the squares of its components as before. To find the length of $f$, the summation has to be replaced by integration over the interval.

Example

$f (x) = \sin x, \quad 0 \leq x \leq 2 \pi$

$‖ f ‖^{2} = \int_{0}^{2 \pi} (f (x))^{2} d x = \int_{0}^{2 \pi} (\sin x)^{2} d x = \pi$

Replacing summation by integration also gives the inner product of two functions.

Example

$f (x) = \sin x, \quad g (x) = \cos x$

$(f, g) = \int_{0}^{2 \pi} f (x) g (x) d x = \int_{0}^{2 \pi} \sin x \cos x \, d x = 0$
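Both integrals can be approximated numerically, which mirrors the idea of a sum turning into an integral. The sketch below uses a midpoint rule; the function name `inner` and the number of sample points `n` are arbitrary choices for this illustration.

```python
import math

def inner(f, g, a=0.0, b=2 * math.pi, n=20000):
    """Midpoint-rule approximation of the integral of f(x) g(x) over [a, b].

    The finite sum over n sample points, scaled by the step h, plays the role
    of the vector inner product and approaches the integral as n grows.
    """
    h = (b - a) / n
    return h * sum(f(a + (k + 0.5) * h) * g(a + (k + 0.5) * h) for k in range(n))

norm_sq = inner(math.sin, math.sin)  # approximates ||sin x||^2, which is pi
sin_cos = inner(math.sin, math.cos)  # approximates (sin x, cos x), which is 0
print(norm_sq, sin_cos)
```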

3. Fourier Series

Series

Just as a vector space has basis vectors, a function space has basis functions.
Given basis functions, each function $x (t)$ can be written as a combination of them.
For functions, the combination is called a series.
$x (t) = \sum_{i = 1}^{\infty} a_{i} b_{i} (t)$

The most familiar example of a function basis is $1, t, t^{2}, \dots$ These functions are independent but not orthogonal.

The best-known orthogonal basis for functions consists of sines and cosines.

The expansion of a function in the orthogonal basis of sines and cosines is its Fourier series.

$f (x) = a_{0} + \sum_{n = 1}^{\infty} a_{n} \cos n x + \sum_{m = 1}^{\infty} b_{m} \sin m x$

Orthogonality

Let $m, n$ be positive integers with $m \neq n$, $m_{1} \neq m_{2}$, $n_{1} \neq n_{2}$.

$\int_{0}^{2 \pi} \cos n_{1} t \cos n_{2} t \, d t = \frac{1}{2} \int_{0}^{2 \pi} (\cos (n_{1} + n_{2}) t + \cos (n_{1} - n_{2}) t) \, d t = 0$

In the same way,

$\int_{0}^{2 \pi} \cos n t \sin m t \, d t = 0, \qquad \int_{0}^{2 \pi} \sin m_{1} t \sin m_{2} t \, d t = 0$

Coefficients

For given basis functions, the series coefficients that represent a particular function are unique.
Knowing the coefficients of a function is enough to reconstruct it.
To find a coefficient, multiply both sides by the basis function that carries that coefficient, then integrate from 0 to $2 \pi$.

Example

$f (x) = a_{0} + a_{1} \cos x + b_{1} \sin x + a_{2} \cos 2 x + b_{2} \sin 2 x + \dots$

To find $b_{1}$, multiply both sides by $\sin x$ and integrate from 0 to $2 \pi$.

$\int_{0}^{2 \pi} f (x) \sin x \, d x = a_{0} \int_{0}^{2 \pi} \sin x \, d x + a_{1} \int_{0}^{2 \pi} \cos x \sin x \, d x + b_{1} \int_{0}^{2 \pi} (\sin x)^{2} d x + \dots$

On the right side, every integral is zero except the one in which $\sin x$ multiplies itself.

Therefore
$b_{1} = \frac{\int_{0}^{2 \pi} f (x) \sin x \, d x}{\int_{0}^{2 \pi} (\sin x)^{2} d x} = \frac{(f, \sin x)}{(\sin x, \sin x)}$
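The coefficient formula can be tested on a function whose coefficients are known in advance. In the sketch below, the test function `f` (with $a_{0} = 2$, $b_{1} = 3$, $a_{2} = -1$) and the midpoint-rule helper `inner` are assumptions made for this illustration.

```python
import math

def inner(f, g, a=0.0, b=2 * math.pi, n=20000):
    """Midpoint-rule approximation of the integral of f(x) g(x) over [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (k + 0.5) * h) * g(a + (k + 0.5) * h) for k in range(n))

def f(x):
    """Test function with known coefficients: a0 = 2, b1 = 3, a2 = -1."""
    return 2.0 + 3.0 * math.sin(x) - math.cos(2 * x)

def one(x):
    """The constant basis function 1."""
    return 1.0

# Each coefficient is (f, basis) / (basis, basis)
b1 = inner(f, math.sin) / inner(math.sin, math.sin)
a0 = inner(f, one) / inner(one, one)
print(a0, b1)
```

The other terms of `f` drop out of each integral because of orthogonality, so the recovered values match the coefficients that were built into `f`.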

4. Gram-Schmidt for Functions

There are many basis functions other than sines and cosines, but most of them are not orthogonal.

Even the simplest polynomial functions $1, x, x^{2}, \dots$ are not orthogonal, so as the degree grows it becomes practically impossible to compute with the matrix that represents a given function $f (x)$ in this basis.

The remedy is to use Gram-Schmidt to build orthogonal polynomial basis functions.

First, choose a symmetric interval such as $-1 \leq x \leq 1$.

Then the odd powers of $x$ are orthogonal to the even powers.

$(1, x) = \int_{-1}^{1} x \, d x = 0, \qquad (x, x^{2}) = \int_{-1}^{1} x^{3} d x = 0$

Starting from the polynomial basis $1, x, x^{2}, \dots$, call the orthogonal basis $v_{1}, v_{2}, v_{3}, \dots$

$v_{1} = 1, v_{2} = x$, and these are already orthogonal:
$(v_{1}, v_{2}) = (1, x) = \int_{-1}^{1} x \, d x = 0$

Computing $v_{3}$:
$v_{3} = x^{2} - \frac{(1, x^{2})}{(1, 1)} 1 - \frac{(x, x^{2})}{(x, x)} x = x^{2} - \frac{\int_{-1}^{1} x^{2} d x}{\int_{-1}^{1} 1 \, d x} = x^{2} - \frac{1}{3}$

Checking the inner products with $v_{1}$ and $v_{2}$:

$(1, x^{2} - \frac{1}{3}) = \int_{-1}^{1} (x^{2} - \frac{1}{3}) d x = 0$
$(x, x^{2} - \frac{1}{3}) = \int_{-1}^{1} (x^{3} - \frac{1}{3} x) d x = 0$
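The same computation can be carried out exactly with rational arithmetic. In this sketch a polynomial is stored as a list of coefficients, lowest degree first; that representation and the names `inner` and `orthogonalize` are assumptions made for illustration. The result is orthogonal but not normalized, as in the text.

```python
from fractions import Fraction

def inner(p, q):
    """Exact integral over [-1, 1] of p(x) q(x) for coefficient lists p, q."""
    total = Fraction(0)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            k = i + j
            if k % 2 == 0:  # odd powers integrate to zero on [-1, 1]
                total += Fraction(pi) * Fraction(qj) * Fraction(2, k + 1)
    return total

def orthogonalize(max_degree):
    """Gram-Schmidt on 1, x, x^2, ..., x^max_degree without normalization."""
    basis = []
    for d in range(max_degree + 1):
        mono = [Fraction(0)] * d + [Fraction(1)]  # the monomial x^d
        v = list(mono)
        for u in basis:
            c = inner(u, mono) / inner(u, u)      # projection coefficient
            for k, uk in enumerate(u):
                v[k] -= c * uk
        basis.append(v)
    return basis

v1, v2, v3, v4 = orthogonalize(3)
print(v3)  # coefficients of x^2 - 1/3
print(v4)  # coefficients of x^3 - (3/5) x
```

Up to scaling, these are the first Legendre polynomials, which is what Gram-Schmidt on $1, x, x^{2}, \dots$ over $[-1, 1]$ produces.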

5. Best Straight Line

Not covered in class.

Summary

Set Hilbert space

$x (t) = \lim_{\Delta t \to 0} (x (a), x (a + \Delta t), x (a + 2 \Delta t), \dots, x (b)) \;\to\; x (t) \in R^{\infty}$

$x (t), y (t) \in H \ (a \leq t \leq b) \;\Rightarrow\; x (t) + y (t) \in H, \quad \alpha x (t) \in H$

Inner products
$(x (t), y (t)) = \lim_{\Delta t \to 0} \sum_{k = 0}^{(b - a) / \Delta t} x (a + k \Delta t) \, y (a + k \Delta t) \, \Delta t = \int_{a}^{b} x (t) y (t) d t$

Length
$‖ x (t) ‖^{2} = \int_{a}^{b} x^{2} (t) d t$

Orthogonality
$(x (t), y (t)) = \int_{a}^{b} x (t) y (t) d t = 0$

Series, Basis functions
$x (t) = \sum_{i = 1}^{\infty} a_{i} b_{i} (t)$
$x (t) = \sum_{i = 1}^{\infty} (q_{i} (t), x (t)) \, q_{i} (t) \quad \text{for an orthonormal basis}$
