Global choice and algebraic closures

It was pointed out to me today that I glazed over a set-theoretic point in my proof that every field has an algebraic closure. We appealed to Zorn’s lemma, which says:

Given a partially ordered set \mathcal P, if every chain \mathcal C\subseteq\mathcal P has an upper bound, then \mathcal P has a maximal element.

An assumption we made here that seems innocuous is that \mathcal P is a set. If it is a proper class (class and not a set), we may run into problems. So what if the collection of all field extensions of a given field is a proper class? Then we can’t use Zorn’s Lemma.

I think, though haven’t attempted to prove it, that this collection of field extensions is really a set, meaning we’re safe in applying Zorn’s lemma. But it’s an interesting diversion to see what happens if we try to prove Zorn’s lemma for classes.

It’s well known that Zorn’s lemma is equivalent to the axiom of choice, so it would be nice to find an “axiom of choice for classes.” Such a thing exists, and it’s called the “axiom of global choice.” It says the same thing about classes that the axiom of choice says about sets. So if you accept the axiom of global choice, we just need to prove a “global Zorn’s lemma.”

Unfortunately, the standard proof of Zorn’s lemma goes something like this: Assume it’s false. Then use the axiom of choice to construct (via transfinite induction) larger and larger elements. These form a well-ordering. We can keep going and we’ll eventually get to a well-ordering that has too many elements to be contained in the set \mathcal P.

The problem is that if we replace the axiom of choice with the axiom of global choice, we’ll be unable to make this well-ordering bigger than our “poclass” \mathcal P.

If you want more information on this sort of set theory stuff, you should read Zach Norwood’s posts (this is a great blog to which you should definitely subscribe):

Basic facts about AC, part I

Basic facts about AC, part II

On an unrelated note, coming soon will be a series of posts on some generalized abstract nonsense.

Fixed fields

If I have a field k and a field automorphism \sigma, I can ask about the fixed points of \sigma. That is, all of the x\in k such that \sigma(x)=x. Notice that if \sigma fixes x and y, then \sigma(x+y)=\sigma(x)+\sigma(y)=x+y and \sigma(xy)=\sigma(x)\sigma(y)=xy, so \sigma fixes x+y and xy. In fact, the set of fixed points of \sigma is itself a field! I just checked two of the rules. You can check the rest.

This field is called the fixed field of \sigma, and it’s super super important. If we have two automorphisms, \sigma and \tau, we can take the intersection of their fixed fields. The intersection of fields is always a field. What field is it? It’s the field consisting of everything fixed by both \sigma and \tau.

But why stop at two automorphisms? What if I take a group G of automorphisms of k. That has a fixed field. It’s called the fixed field of G. I could take the entire automorphism group of k, or I could just take a subgroup of it. The bigger the group I take, the more fields I intersect, so the smaller my fixed field is.

That’s an important idea. Big group means small fixed field. Small group means big fixed field. We’ll make this precise very soon.

Automorphisms of Q

We’ve talked a bit about automorphisms, but we haven’t seen very much of them, so I wanted to do an example. What is \mbox{Aut}(\mathbb Q) (the group of automorphisms of the field of rational numbers)?

It definitely contains the identity (a.k.a. lame) automorphism, since every field has that one. Is there anything else? No, and here is why. Let \phi be an automorphism of \mathbb Q. That is, let \phi\in\mbox{Aut}(\mathbb Q). Let p/q be a rational number. Then we can write p/q as

p/q=\frac{\overbrace{1+\cdots+1}^{p\ times}}{\underbrace{1+\cdots+1}_{q\ times}}

If we apply \phi,, we get that

\phi(p/q)=\frac{\overbrace{\phi(1)+\cdots+\phi(1)}^{p\ times}}{\underbrace{\phi(1)+\cdots+\phi(1)}_{q\ times}}=\frac{p\cdot\phi(1)}{q\cdot\phi(1)}=p/q.

This is to say, any field automorphism of a field extending \mathbb Q fixes \mathbb Q. Another way to say this is that we can reach any number in \mathbb Q just from 1 and operations which any field automorphism respects (namely addition and division).

This is to say, the automorphism group of the field of rationals is trivial!

Field automorphisms

It’s worth, I suppose, talking about automorphisms in general. An automorphism is a “renaming” of stuff so that it keeps the same structure. The “auto” means self. The “morph” means shape. The “same” is suppressed, because it really comes from the word “isomorphism,” which means same shape. So this is like a self-similar shape… but not in a fractally sense.

What is required for an automorphism? Well, any special constants need to stay the same. And the operations need to be “preserved,” whatever that means. For a field, it means that 0 and 1 are fixed, and that you can move + and \cdot outsied the function. More specifically, \phi is an automorphism of a field F if it is a function \phi:F\to F with

  • \phi(0)=0
  • \phi(1)=1
  • \phi(x+y)=\phi(x)+\phi(y)
  • \phi(xy)=\phi(x)\phi(y)
  • \phi is a bijection. That is, \phi doesn’t send two different points to the same point, and it sends something to every point. (Bijection is a fancy word meaning “1-1 and onto.”)
Can you think of any field automorphisms? In any field, you always have the trivial automorphism which sends a point x to x. It’s lame, but it counts.
What about something a bit more complicated? How about complex conjugation? What if \phi:\mathbb C\to \mathbb C by \phi(a+bi)=a-bi (for a,b\in \mathbb C)? Yes, this works. It’s not hard to check any of these rules, but I’m too lazy to do so.
How about something really crazy? Take the field \mathbb Q(\sqrt2), and the automorphism \psi defined by \psi(a+b\sqrt 2)=a-b\sqrt 2 (where a,b\in \mathbb Q). What? But that sends negative things to positive things? It doesn’t preserve the shape at all!
I assure you, it is a field automorphism. If you don’t believe me, check the rules yourself. The issue you’re potentially having, is that you’re thinking of \mathbb Q(\sqrt 2) as sitting inside \mathbb R, and thinking of the ordering on \mathbb R. But alas, I said nothing about order. It’s damn near impossible, but try to forget everything you know about the structure of \mathbb R, because it will only misguide you.

The intermediate field lattice

If I have two fields k and K, and k\subseteq K, we call k a subfield of K, but it turns out the proper way to think of it is the reverse. We don’t think of K as some starting field with k as a smaller one sitting inside. Instead, we think of k as the starting field, and K as a field extension of k. We could do this for any object, but for whatever reason, it works well in Galois theory.

So one thing we could ask is “what are all the fields between k and K? We can draw nice pictures called Hasse diagrams. The idea is, we put the bigger fields towards the top, and make it so that fields that contain each other are connected by a chain of lines.

Let’s do an example. Suppose we start with \mathbb Q and extend it to the field \mathbb Q(\sqrt 2,i). Here’s the picture:

You forgot about \mathbb Q(i\sqrt 2), didn’t you? Yeah, it’s hard. Man, if only there was a way to make sure we got all of them. Like, putting them in a one-to-one correspondence with, I don’t know, groups or something like that.

But seriously, how do we know we got everything? Well, degree 2 extensions can always be written as adjoining the square root of something (fun exercise I alluded to previously), and we can check that these are the only possibilities that lie inside \mathbb Q(\sqrt2,i). Since [\mathbb Q(\sqrt2,i),\mathbb Q]=4, all the extensions need to be of degree 1,2, or 4. Since 1 and 4 are stupid, we need only consider the degree 2 extensions, and we just checked that we got them all.

One more thing: the fact that \mathbb Q(\sqrt 2) is on the left and \mathbb Q(i) is on the right is arbitrary. I could order those three fields in any of the 6 different ways, and it would still be correct. My choice was arbitrary. All that matters is the containment relationships.

Constructing regular polygons

Yesterday, as usual, we saw some cool stuff. The gist of it was that for polynomials P that can’t be factored over \mathbb Q, if P(\alpha)=0 and P is a polynomial of degree n, then [\mathbb Q(\alpha):\mathbb Q]=n. Here we can replace \mathbb Q with any field k, but for constructibility, we only care about \mathbb Q.

What about constructing regular polygons? If I want to construct a regular N-gon, I need to construct \cos\left(\frac{2\pi}N\right). So for which N can I do this? I’m going to do something a little crazy. I’m going to consider the \zeta_N=\cos\left(\frac{2\pi}N\right)+i\sin\left(\frac{2\pi}N\right). This is a complex number, which you might recognize as an Nth root of unity. That is \left(\zeta\right)^N=1. What I want to know is the smallest polynomial which has \zeta_N as root. I’m not going to justify this, but if you take all the terms (x-\zeta_N^k) where k is relatively prime to N, and multiply them together, you get exactly this polynomial.

But the real question is the degree of this polynomial. It has one factor of x for each k relatively prime to N. We have a name for that. It’s the Euler totient function \phi(N). I have a prize for anyone who can tell me what the word “totient” means.

It turns out that by working over the complex numbers, I haven’t actually cheated (though I won’t justify this either), and the degree of [\mathbb Q\left(\cos\left(\frac{2\pi}N\right)\right):\mathbb Q]=[\mathbb Q(\zeta_N):\mathbb Q]=\phi(N). Well, we need \phi(N) to be a power of two if we have any chance of constructing it. Here are some facts about \phi(N) you might prove in an introductory number theory course:

  • If p is a prime dividing N, then p-1 divides \phi(N).
  • If n^2 divides N, then n divides \phi(N).
  • If N is odd, then \phi(2N)=\phi(N).
  • If N is even, then \phi(2N)=2\phi(N).
So the first thing we should do is break up N into its even and odd parts. Write N=2^k\cdot m where m is odd. Repeated applications of the previous rules tells us that \phi(N)=2^{k-1}\phi(m). So \phi(N) is a power of two if and only if \phi(k) is as well. Take some prime p which divides k. Then p-1 divides \phi(k), so we need p-1 to be a power of two. That is, p is a prime which is one more than a power of two.
We have a name for these types of primes. They’re called Fermat primes, and they seem to be exceedingly rare. We only know of 5 (namely 3, 5, 17, 257, and 65537). What’s more, we can only have each of these primes dividing N once (by the second rule).
So we do know of infinitely many constructible N-gons. Namely, the square, octagon, 16-gon, 32-gon, etc. But if you want to know how many constructible N-gons there were with N odd, you would be asking an extremely difficult question that has been open for nearly 200 years!
Okay. At long last, we’re ready to jump into Galois Theory. Hold on to your socks, because they’re about to get knocked off!

The classical Greek questions

Today we’ll answer these questions:

  1. Can we build a cube with twice the volume of a given cube?
  2. Can we trisect an arbitrary angle?
  3. Can we construct a square with the same area as the unit circle?
Here’s the first idea we will need. Let \alpha be the root of a cubic polynomial with coefficients in \mathbb Q. When I say a cubic polynomial, I want one that can’t be factored. So, while \sqrt{17} is a root of P(x)=x^3-x^2-17x+17, it’s silly, because P(x)=(x^2-17)(x-1). We call polynomials like P reducible because we can reduce it to smaller polynomials. We want an irreducible polynomial. In some sense, you should think of irreducible polynomials as “prime polynomials,” but don’t ever say this to an algebraist, because strictly speaking this is a different concept (but in our case they are the same).
I want to show that [\mathbb Q(\alpha):\mathbb Q]=3. In fact, if \alpha is the root of an irreducible polynomial with coefficients in \mathbb Q of degree n, then [\mathbb Q(\alpha):\mathbb Q]=n.

Proof:

Remember that [\mathbb Q(\alpha):\mathbb Q]=\dim_{\mathbb Q}\mathbb Q(\alpha). We’re considering \mathbb Q(\alpha) as a vector space over the field \mathbb Q, and want to know its dimension. Let P(x)=c_0+c_1x+\dots+c_nx^n be an irreducible polynomial with c_i\in\mathbb Q and P(\alpha)=0. The vector space \mathbb Q(\alpha) is definitely spanned by 1, \alpha, \alpha^2,\alpha^3,\dots. I claim that 1,\alpha,\dots, \alpha^{n-1} form a basis. This would be a basis with n elements, making [\mathbb Q(\alpha):\mathbb Q]=n.

We have a linear dependence c_0+c_1\alpha+\dots+c_n\alpha^n=0, so any basis must have fewer than n+1 elements. Now suppose we have a linear dependence among 1,\alpha,\alpha^2,\dots,\alpha^{n-1}. Say,

\lambda_0+\lambda_1\alpha+\lambda_2\alpha^2+\dots+\lambda_{n-1}\alpha^{n-1}

Then we have a polynomial Q(x)=\lambda_0+\lambda_1x+\lambda_2x^2+\dots+\lambda_{n-1}x^{n-1} of smaller degree with Q(\alpha)=0.

Why is this a problem? We can take combinations of P(x) and Q(x), and try to make the polynomial of smallest degree in this way. Let’s call it R(x)=A(x)P(x)+B(x)Q(x) for some A,B. This is the “greatest common divisor” of P and Q. You thought GCDs only existed for integers? Nope. There’s a general structure (of which the integers are an example) which always has a concept of a GCD.

The important thing is that R(x) is a divisor of P(x). But we assumed P(x) was irreducible, so the only possible divisors are P itself and 1. Since Q(x) has smaller degree than P(x), so does R(x), so it must be that R(x)=1. But then R(\alpha)\neq 0. What? Contradiction!

This means we have \{1,\alpha,\alpha^2,\dots,\alpha^{n-1}\}  is a basis, so [\mathbb Q(\alpha):\mathbb Q]=n.

\square

This “back-and-forth” between bases for vector spaces and polynomials is a really cool trick, and is a fun part of Galois theory.

Now we can answer the questions.

  1. NO! If we could construct a cube with twice the volume of a given cube, we could in particular construct a cube of volume 2, so we could construct the side length \sqrt[3]2. But x^3-2 is an irreducible polynomial (check that yourself) with \sqrt[3]2 as a root, so this would mean we have a degree 3 extension \mathbb Q(\sqrt[3]2)\subseteq \mathbb K. But every extension of \mathbb Q in \mathbb K has degree that is a power of two. Three is certainly not a power of two. Contradiction!
  2. NO! If we could construct arbitrary angles \theta given 3\theta, we could in general construct the length \cos\theta from \cos3\theta. If 3\theta=60^\circ, then \theta=20^\circ, which is a root of the polynomial 4x^3-3x-\frac12 (this comes from the triple-angle formula for cosine).
  3. NO! If we could, we could construct the side length \sqrt\pi. This is the most absurd, because then we could construct \pi which isn’t the root of any polynomial. Some might want to even write [\mathbb Q(\pi):\mathbb Q]=\infty, which is accurate. We can’t get to infinity by only finite constructions.
So there you have it. Field theory rocks.

More awesomeness of the tower rule

A few days ago we noticed that as we build up \mathbb K, by placing down intersection points, we never need to solve anything more complicated than the quadratic equation. We may need to do it a gazillion times, but every constructible number can be build by using nothing fancier than the quadratic equation.

Let’s do some magic. Suppose we construct some number \alpha. Maybe we had to put down points at a bazillion (which, in the metric system, can be more than a gazillion) intersections. Each time we place a point, it gives us a new constructible length. Let’s write down these lengths in the order they are constructed. Call them a_1,a_2,\dots, a_n=\alpha (here n takes the place of a gazillion). Let E_0=\mathbb Q, E_1=E_0(a_1)=\mathbb Q(a_1), E_2=E_1(a_2)=\mathbb Q(a_1,a_2), and so on, up until E_n=E_{n-1}(a_n)=\mathbb Q(a_1,\dots,a_n). Another way to describe this is to say that E_k is the smallest field containing \mathbb Q and a_i for 1\le i\le k.

The first thing to notice is that E_0\subseteq E_1\subseteq\dots\subseteq E_n. So what can we say about E_n? It’s a field? But wait! It’s actually a vector space too! It’s a vector space with scalar field E_m for any m! So let’s consider [E_n:E_0]=[E_n:\mathbb Q].

The magical tower rule tells us that [E_n:E_0]=[E_n:E_{n-1}][E_{n-1}:E_0]. But we can actually continue expanding this product a lot more:

[E_n:E_0]=[E_n:E_{n-1}][E_{n-1}:E_{n-2}]\cdots[E_1:E_0]

So it suffices to figure stuff out about [E_{k+1}:E_k]. This is the extension we get when we add one new point.

But we already know something about what we get when we put down a new point. It is either a quadratic extesnion (if a circle is involved), or a linear extension (if it is just two lines intersecting). Linear extensions are lame, because it’s not an extension at all. It’s just the same field back. But quadratic extensions are not so lame.

In any event, [E_{k+1}:E_k] is either 1 or 2, so [E_n:E_0] must be a power of two. This is particularly handy, and an awesomely beautiful result. We are now really ready to snatch some cool results.

Tomorrow we’ll solve all the classical greek questions. Wednesday we’ll figure out which regular polygons we can construct. Then on Thursday, as promised, we’ll really start Galois Theory. Sorry to delay it so long, but there’s a lot that needs to be said before we can really attack the problem we want to see (the unsolvability of the general quintic).

The Tower Rule

Yesterday, we noticed that [\mathbb Q(\sqrt 2):\mathbb Q]=2. We can use a similar argument to deduce that [\mathbb Q(\sqrt[4] 2):\mathbb Q(\sqrt2)]=2. What does that tell us about [\mathbb Q(\sqrt[4] 2):\mathbb Q]? You may have guessed that [\mathbb Q(\sqrt[4] 2):\mathbb Q]=4. Now you’re either thinking that 4=2+2 or 4=2\cdot2. Those of you thinking that 4=2\cdot 2 are correct!

In general, if I have fields E, F, and G, with E\subseteq F\subseteq G, I can think of G as a vector space over the field F and F as a vector space over E. What’s more, I can think of G as a vector space over the field E. Then I can look at the dimensions, and get what I declare to be the coolest result in general field theory:

[G:E]=[G:F][F:E]

Remember, if you want, you can think about this as \dim_EG=\dim_FG\cdot\dim_EF, but the former way I think looks nicer.

Proof:

Let \{a_1,a_2,\dots,a_m\} be a basis for the vector space F over the field E. Let \{b_1,b_2,\dots,b_n\} be a basis for the vector space G over the field F. We want to find a basis for the vector space G over the field E, and we want it to have m\cdot n elements. There’s really only one logical choice, so let’s hope it works. We’ll show that \mathcal B=\{a_ib_j\mid 1\leq i\leq m,\ 1\leq j\leq n\} is a basis for G over the field E.

So take v\in G. We know there is a unique way to write it v=f_1b_1+f_2b_2+\cdots+f_nb_n where f_i\in F. But then each of these can be written uniquely as f_j=e_{1,j}a_1+e_{2,j}a_2+\cdots+e_{m,j}a_m. So we have

v=\displaystyle\sum_{i=1}^m\sum_{j=1}^ne_{i,j}a_ib_j.

Nice! So \mathcal B spans G (with coefficients in the field E). Now lets check linear independence. Suppose

\displaystyle\sum_{i=1}^m\sum_{j=1}^n\lambda_{i,j}a_ib_j=0.

We can think of this once again as a vector space over the field F, and let f_j=\lambda_{1,j}a_1+\cdots+\lambda_{m,j}a_m \in F, giving us

\displaystyle\sum_{j=1}^n f_jb_j=0.

But \{b_1,\dots,b_n\} is a basis for the vector space G over the field f, and hence a linearly independent set, so it must be that for every j, 0=f_j=\lambda_{1,j}a_1+\cdots+\lambda_{m,j}a_m. Now we have \{a_1,\dots,a_n\} is a basis and hence linearly independent, so each of the \lambda_{i,j} must be zero, for any 1\le i\le m or 1\le j\le n. This is exactly what it means for \mathcal B to be linearly independent. WOOHOO!

\square

If you didn’t think that was cool, you have no business calling yourself a decent human being.

Rationals and the Square Root of 17

Lets think about a simple example. What is the smallest field that contains the rationals and \sqrt{17}? We write it as \mathbb Q(\sqrt{17}). In general, we would write k(\alpha) to mean the smallest field containing k and \alpha. So what is \mathbb Q(\sqrt{17})? Clearly, it must have every a+b\sqrt{17}, for all rational a, b. It turns out that this is exactly it:

\{a+b\sqrt{17}\mid a,b\in\mathbb Q\}=\mathbb Q(\sqrt{17})

Let’s check. We just need to make sure that we can invert any element other than zero. So if we have some a+b\sqrt{17},

\frac1{a+b\sqrt{17}}=\frac{a-b\sqrt{17}}{(a+b\sqrt{17})(a-b\sqrt{17})}=\frac a{a^2-{17}b^2}-\frac b{a^2-{17}b^2}\sqrt{17}

This is of the form x+y\sqrt{17}, so we are done! While I don’t want to check it here, it is in general true that if \alpha is a root of a polynomial p(x) with coefficients in a field k, then k(\alpha)=\{c_0+c_1\alpha+c_2\alpha^2+\cdots+c_n\alpha^n\mid c_i\in k\} for some big enough n.

For a moment let’s forget about how to multiply in general in \mathbb Q(\sqrt{17}). We still know how to add. Let’s even say we still remember how to multiply by rational numbers. Well then, what do you know! \mathbb Q(\sqrt{17}) is a vector space over \mathbb Q! So what is its dimension?

If you guessed 2, you guessed correctly. Clearly 1 and \sqrt{17} are linearly independent over \mathbb Q (that is, \sqrt{17} is irrational), so its at least dimension 2. Furthermore, \{1,\sqrt{17}\} span all of \mathbb Q(\sqrt{17}), so the dimension is at most 2, making it exactly 2.

You may have noticed that there was nothing special about \sqrt{17} here. We could have picked \sqrt r for any rational r. Well, almost any rational. It would be dumb to pick 9, since \sqrt{9}=3\in\mathbb Q. But so long as the square root is not rational, \dim_{\mathbb Q}\mathbb Q(\sqrt r)=2.

Mathematicians will sometimes write this as [\mathbb Q(\sqrt{17}):\mathbb Q]=2. Why? It makes tomorrow’s result look slightly nicer, and kind of mirrors other notation in group theory. Quit asking questions!

Follow

Get every new post delivered to your Inbox.