How to Invent Hyperbolic Geometry
Rotation
Rotations are nice maps from a space to itself.
Well, but "nice" is subjective, and there's lots of arguably nice
maps that aren't rotations: translations and scaling are pretty
well-behaved, too. What makes rotations special? Let's focus on the fact that they
- Preserve lengths (unlike scaling)
- Are linear maps, which includes preserving the origin (unlike translations)
What do we mean by length? In euclidean space $\R^n$, the length of a vector $\bx$ is
$\sqrt{x_1^2 + \cdots + x_n^2}$. For simplicity, let's think about $\R^2$ and
only focus on squared length
\[|\bx|^2 = x_1^2 + \cdots + x_n^2\]
What linear maps $\R^2 \to \R^2$ preserve squared length? We're looking for
a matrix
\[ M = \begin{array}{cc} a&b\\c&d \end{array}\]
such that whenever we have a vector
\[ \bx = \begin{array}{c} x\\ y \end{array}\]
we find that $|M\bx|^2 = |\bx|^2$. This means
\[ x^2 + y^2 = \left|
\left(\begin{array}{cc} a&b\\c&d \end{array}\right)\left(\begin{array}{c} x\\ y \end{array}\right)
\right|^2
\]
\[ = \left|\begin{array}{c} ax + by \\ cx + dy \end{array}\right|^2 \]
\[ = (ax+by)^2 + (cx + dy)^2 \]
\[ = a^2x^2 + 2abxy + b^2y^2 + c^2x^2 + 2cdxy + d^2y^2 \]
\[ = (a^2 + c^2)x^2 + 2(ab+cd)xy + (b^2+d^2)y^2 \]
Since this needs to be true for every $x,y$, we have
\[ a^2 + c^2 = b^2 + d^2 = 1 \]
\[ ab+ cd = 0\]
This means that $(a,c)$ and $(b,d)$ are some points on the unit circle, so we can find a $\theta, \phi$
such that $a = \cos \theta$ and $c = \sin \theta$ and $b = \sin \theta$ and $d = \cos \theta$.
The condition that $ab+cd = 0$ means
\[ \sin \phi \cos \theta + \cos \phi \sin \theta = 0\]
which by trig identities means
\[ \sin (\phi + \theta) = 0\]
which means $\phi = 2\pi n - \theta$ for some $n \in \Z$, so without loss we might as well
set $\phi = - \theta$. This means our rotation matrix must look like
\[
R_\theta = \left(\begin{array}{cc} \cos \theta & - \sin\theta \\ \sin \theta &\cos \theta \end{array}\right)
\]
for some $\theta$.
Generalizing Length
What would happen if we instead said that the squared length of a vector was
\[ \left|\begin{array}{c} x \\ y \end{array}\right|^2 = k^2 x^2 + y^2 \]
for some constant $k$? Then the requirement that rotations preserve length would look like
\[ k^2x^2 + y^2 = \left|
\left(\begin{array}{cc} a&b\\c&d \end{array}\right)\left(\begin{array}{c} x\\ y \end{array}\right)
\right|^2
\]
\[ = \left|\begin{array}{c} ax + by \\ cx + dy \end{array}\right|^2 \]
\[ = k^2(ax+by)^2 + (cx + dy)^2 \]
\[ = k^2a^2x^2 + 2k^2abxy + k^2b^2y^2 + c^2x^2 + 2cdxy + d^2y^2 \]
\[ = (k^2a^2 + c^2)x^2 + 2(k^2ab+cd)xy + (k^2b^2+d^2)y^2 \]
Let's define $C = c/k$ and $B = kb$. This is equivalent to
\[ k^2x^2 + y^2 = (k^2a^2 + k^2C^2)x^2 + 2k(aB+Cd)xy + (B^2+d^2)y^2 \]
which means
\[ a^2 + C^2 = B^2 + d^2 = 1\]
\[ aB + Cd = 0\]
We can repeat the reasoning from the last section to conclude that there exists a $\theta$ such that
\[
\left(\begin{array}{cc} a&B\\C&d \end{array}\right)
= \left(\begin{array}{cc} \cos \theta & -\sin \theta\\ \sin\theta&\cos \theta \end{array}\right)
\]
but that's not what our actual transform is: it's
\[
R_{\theta;k} = \left(\begin{array}{cc} a&b\\c&d \end{array}\right)
= \left(\begin{array}{cc} a&B/k\\kC&d \end{array}\right)
= \left(\begin{array}{cc} \cos \theta & -(1/k)\sin \theta\\ k\sin\theta&\cos \theta \end{array}\right)
\]
Adjustment for $\theta$
Let's consider the derivative
\[{d\over d\theta} R_{\theta;k} \bx \adjust_{\theta=0} = {d\over
d\theta}\left(\begin{array}{cc} \cos \theta & -(1/k)\sin \theta\\
k\sin\theta&\cos \theta \end{array}\right) \left(\begin{array}{c}
x \\ y \end{array}\right) \adjust_{\theta=0}\] \[ = {d\over
d\theta}\left(\begin{array}{c}x \cos \theta -(y/k)\sin \theta \\ kx
\sin\theta + y \cos \theta \end{array}\right) \adjust_{\theta=0}\] \[
= \left(\begin{array}{c} -y/k \\ kx \end{array}\right) \]
Let's suppose we'd like to adjust the choice of $\theta$ so that the derivative when $(x,y) = (0,1)$ is constant, instead of varying with $k$. This means considering instead
\[R_{k\theta;k} = \left(\begin{array}{cc} \cos (k\theta) & -(1/k)\sin (k\theta)\\ k\sin(k\theta)&\cos (k\theta) \end{array}\right)\]
and then we find
\[{d\over d\theta} R_{k\theta;k} \bx = \left(\begin{array}{c} -y \\ k^2x \end{array}\right) \]
Hyperbolic Geometry
The conceptual leap at this point is the question: what happens if we consider imaginary numbers
and set $k = i$?
Then our notion of squared length becomes
\[ \left|\begin{array}{c} x \\ y \end{array}\right|^2 = y^2 - x^2 \]
And our rotation becomes
\[R_{i\theta;i} = \left(\begin{array}{cc} \cos (i\theta) & i\sin (i\theta)\\ i\sin(i\theta)&\cos (i\theta) \end{array}\right)
= \left(\begin{array}{cc} \cosh \theta & -\sinh \theta\\ -\sinh\theta&\cosh \theta \end{array}\right)\]
So just to clear the minus sign we could define
\[ H_{\theta} = R_{-i\theta;i} = \left(\begin{array}{cc} \cosh \theta & \sinh \theta\\ \sinh\theta&\cosh \theta \end{array}\right)\]
The Beltrami-Klein model
We could say that the hyperbolic plane is the set of all points in $\R^3$ such that $z > 0$
and $z^2 - x^2 - y^2 = 1$. We can see that
\[ H_{\theta;x} = \left(\begin{array}{cc}
\cosh \theta & 0 &\sinh \theta\\
0 & 0 & 0 \\
\sinh\theta&0 &\cosh \theta
\end{array}\right)\]
\[ H_{\theta;y} = \left(\begin{array}{cc}
0 & 0 & 0 \\
0 & \cosh \theta & \sinh \theta\\
0 & \sinh\theta&\cosh \theta
\end{array}\right)\]
both map the hyperbolic plane into itself. Although we've talked about these as generalizations
of rotation, we can think of them as translations in the $y$ and $x$ directions.
We say the Beltrami-Klein disk is the interior of the unit circle.
We can go back and forth between the hyperbolic plane and the disk like so:
If we have a point $(x, y, \sqrt{1 + x^2 + y^2})$ in the hyperbolic plane, then we output the point
$(x/\sqrt{1 + x^2 + y^2}, y/\sqrt{1 + x^2 + y^2})$.
If we have a point $(u,v)$ in the disk, then we are trying to find a point in the hyperbolic plane
$(x, y, \sqrt{1 + x^2 + y^2})$ such that
\[ u = x/\sqrt{1 + x^2 + y^2} \]
\[ v = y/\sqrt{1 + x^2 + y^2} \]
Trying to solve for $x,y$ leads us to
\[ u^2 = x^2/(1 + x^2 + y^2) \qquad v^2 = y^2/(1 + x^2 + y^2) \]
\[ u^2(1 + x^2 + y^2) = x^2 \qquad v^2(1 + x^2 + y^2) = y^2 \]
\[ x^2 (u^2-1) + y^2u^2 = -u^2\]
\[ x^2 v^2 + y^2(v^2-1) = -v^2\]
\[\left(\begin{array}{c}x^2\\y^2 \end{array}\right) = -
\left(\begin{array}{cc}u^2-1& u^2 \\ v^2 & v^2-1 \\ \end{array}\right)^{-1}
\left(\begin{array}{c}u^2\\v^2 \end{array}\right)\]
\[
= {1\over u^2 + v^2 - 1}
\left(\begin{array}{cc}v^2-1& -u^2 \\ -v^2 & u^2-1 \\ \end{array}\right)
\left(\begin{array}{c}u^2\\v^2 \end{array}\right)\]
\[
= {1\over u^2 + v^2 - 1}
\left(\begin{array}{c}v^2u^2 - u^2 - u^2v^2 \\ -u^2v^2 + u^2v^2 - v^2 \end{array}\right)\]
\[ = \left(\begin{array}{c} {u^2 \over 1 - u^2 - v^2} \\ {v^2 \over 1 - u^2 - v^2} \end{array}\right)\]
\[
\left(\begin{array}{c}x\\y \end{array}\right)
=
\left(\begin{array}{c} {u \over \sqrt{1 - u^2 - v^2}} \\ {v \over \sqrt{1 - u^2 - v^2}} \end{array}\right)\]
What happens if we take a point in the Beltrami-Klein disk, map it into the hyperbolic plane, hit it with $H_{\theta;x}$, then map it back? We start with $(u,v)$, produce the point
\[ {1\over\sqrt{1-u^2 - v^2}} \left(\begin{array}{c}u\\v\\1 \end{array}\right)\]
which gets turned by $H_{\theta;x}$ into
\[ {1\over\sqrt{1-u^2 - v^2}} \left(\begin{array}{c}cu+s\\v\\su+c \end{array}\right)\]
for $c = \cosh\theta$ and $s = \sinh \theta$. Transforming this back into the Beltrami-Klein disk
gives
\[ {1\over su+c} \left(cu+s , v \right) \]
\[= \left({cu+s \over su+c} , {v \over su+c}\right) \]
\[= \left({u+s/c \over us/c+1} , {v/c \over us/c+ 1}\right) \]
\[= \left({u+\tanh \theta \over 1 + u\tanh \theta} , {v \mathrel\mathrm{sech} \theta \over 1 + u\tanh \theta}\right) \]