Category Archives: My Research

Posts related to my own research activities, including original results that I have been able to prove.

A question for students you dislike

At the recent CNTA conference, Professor Joe Silverman gave an explicit homomorphism of \text{GL}_3(\mathbb{R}) into \text{GL}_6(\mathbb{R}) which he jokingly called a great question to ask undergraduates to work out explicitly… if you don’t like them very much. In a similar vein, here is a question that one might ask undergraduates they don’t particularly like:

Let \mathbf{a} = (\alpha, \beta, \gamma) be a triple of co-prime integers which are not all zero and such that \alpha \gamma > 1. Prove that the ternary quadratic form given by:

K_{\mathbf{a}} (F) = \dfrac{1}{8\alpha^3} \bigg( 72 \beta^2 \gamma A^2 + 9 \alpha(\beta^2 + 4 \alpha \gamma) B^2 + 8 \alpha^3 C^2 - 18\beta (\beta^2 + 4 \alpha \gamma)AB
+ 12 \alpha (3 \beta^2 - 4 \alpha \gamma)AC - 24 \alpha^2 \beta BC \bigg)

takes on integer values whenever the triplet (A,B,C) lies in the lattice defined by the congruence conditions

4 \beta \gamma x - (2 \beta^2 + \alpha \gamma) y + 2 \alpha \beta z \equiv 0 \pmod{\alpha^2}


\gamma(2 \beta^2 + \alpha \gamma)x - \beta(\beta^2 + \alpha \gamma)y + \alpha \beta^2 z \equiv 0 \pmod{\alpha^3}.

I will reveal the solution in due time (and it does not involve explicit computation of congruences), but if come up with a solution let me know!



On binary forms II

The following joint paper of myself and my advisor Professor C.L. Stewart has been released on the arxiv. In this follow-up post, I would like to describe in some detail how to establish an asymptotic formula for the number of integers in an interval which are representable by a fixed binary form F with integer coefficients and non-zero discriminant.

There are essentially three ingredients which go into the proof, each established decades apart. The first essential piece of the puzzle was established by Kurt Mahler in the 1930’s. He showed that if we examine the number of integer points in the region \{(x,y) \in \mathbb{R}^2 : |F(x,y)| \leq Z\}, then the number of such points is closely approximated by the area of the region. Since the region is homogeneously expanding, the area itself is well-approximated by scaling the `fundamental region’ given by \{(x,y) \in \mathbb{R}^2 : |F(x,y)| \leq 1\}. Indeed, let A_F denote the area of this fundamental region and let N_F(Z) denote the number of integer pairs (x,y) \in \mathbb{Z}^2 such that |F(x,y)| \leq Z. Then Mahler proved that

\displaystyle N_F(Z) \sim A_F Z^{\frac{2}{d}}.

More precisely, he proved a very good error term. He showed that when d \geq 3, we have

\displaystyle N_F(Z) = A_F Z^{\frac{2}{d}} + O_F\left(Z^{\frac{1}{d-1}} \right).

The question then becomes is there some way to remove the redundancies in Mahler’s theorem? For example, if F has even degree, then F(x,y) = F(-x,-y) for all (x,y) \in \mathbb{R}^2, so the pairs (x,y), (-x,-y) \in \mathbb{Z}^2 represent the same integer. Is it true that this is the only way that this can happen? Unfortunately, the answer is no. For example, consider the binary form F_n = x^n + (x - y)(2x - y) \cdots (nx-y). Then clearly the points (1,1), \cdots, (1,n) all represent 1, and this construction works for any positive integer n. Therefore, there does not appear to be a simple way to count the multiplicities of points representing the same integer in Mahler’s theorem.

While examples like the above exist, perhaps it is possible that this happen sufficiently rare as to be negligible. For instance, if only O\left(Z^{2/d - \delta}\right) many points counted by N_F(Z) are such that there exist many `essentially different’ (precise definition to come) other points which represent the same integer, and even in the worst case there can be at most say O\left(Z^{\frac{\delta}{2}}\right) many essentially different pairs, then we have shown that in total, the contribution from these bad points to N_F(Z) is only O\left(Z^{\frac{2}{d} - \frac{\delta}{2}}\right), which is fine.

We shall now make some definitions. We say that an integer h is essentially represented by F if there exist two integer pairs (x_1, y_1), (x_2, y_2) for which F(x_1, y_1) = F(x_2, y_2) = h, then there exists an element

\displaystyle T = \begin{pmatrix} t_1 & t_2 \\ t_3 & t_4 \end{pmatrix} \in \text{GL}_2(\mathbb{Q}) such that

\begin{pmatrix} x_1 \\ y_1 \end{pmatrix} = T \begin{pmatrix} x_2 \\ y_2 \end{pmatrix}

and such that

F_T(x,y) = F(t_1 x + t_2 y, t_3 x + t_4 y) = F(x,y)

for all (x,y) \in \mathbb{C}^2. Otherwise, we say that h is not essentially represented.

Now put R_F(Z) to be the number of integers up to Z which are representable by F, and let R_F^{(1)}(Z) be the number of essentially represented integers and R_F^{(2)}(Z) be the number of non-essentially represented integers. If we can show that R_F(Z) \sim R_F^{(1)}(Z), then we are basically done. This amounts to showing that R_F^{(2)}(Z) is small compared to Z^{\frac{2}{d}}.

Christopher Hooley proved this for both the ‘easy cubic case’ and the ‘hard cubic case’. However, it was D.R. Heath-Brown who showed that R_F^{(2)}(Z) is always small compared to R_F^{(1)}(Z). This paved the way to our eventual success at this problem.

It remains to account for the interaction between those T \in \text{GL}_2(\mathbb{Q})  which fix F and R_F^{(1)}(Z). These elements are called the rational automorphisms of F and we denote them by \text{Aut} F = \text{Aut}_\mathbb{Q} F. The most novel contribution we made to this topic is that we accounted for the exact interaction between \text{Aut} F and R_F^{(1)}(Z) with the so-called ‘redundancy lemmas’. This will be discussed at a future time.

On binary forms

After months of silence, I am finally able to share the research I’ve been doing in the last few months. I’ve dropped hints before in this post and this other one. These are all part of a big project I’ve been working on jointly with my advisor on the representation of integers by binary forms. More recently, I have been working on a project with Cindy Tsang on counting binary quartic forms with small Galois groups. These are all connected by an insight into binary forms essentially due to Hooley.

Let F be a binary form of degree d, integer coefficients, and non-zero discriminant \Delta(F). Put R_F(Z) for the number of integers n in the interval [-Z,Z] for which the equation F(x,y) = n has a solution in integers x,y. Put N_F(Z) = \# \{(x,y) \in \mathbb{Z}^2 \text { s.t. } |F(x,y)| \leq Z\}. When d = 2 and F is positive definite, Gauss proved that N_F(Z) \sim A_1 Z. He conjectured, and then Landau proved, that R_F(Z) \sim A_2 Z (\log Z)^{-1} in this case. Thus most integers cannot be represented by F, and for each integer that can be represented, there are many representations on average.

However, this is very atypical behaviour. Indeed, quadratic forms are complete norm forms of degree 2. For incomplete norm forms, i.e., binary forms of degree at least 3, one should expect a totally different behaviour in that N_F(Z) and R_F(Z) are not that different. This was confirmed by Hooley in a significant paper in 1967 [Hoo1]. Indeed, he showed that when F is an irreducible binary cubic form such that \Delta(F) is not a perfect integer square, then N_F(Z) \sim R_F(Z). We shall refer to this as the `easy cubic case’. In 1986 [Hoo2], he went on to obtain the asymptotic formula for R_F(Z) for binary quartic forms of the shape F(x,y) = Ax^4 + Bx^2 y^2 + Cy^4. In this case, one has

R_F(Z) \sim \frac{1}{4} N_F(Z)

when A/C is not a perfect 4-th power of a rational number, and

\displaystyle R_F(Z) \sim \frac{1}{4} \left(1 - \frac{1}{2|AC|}\right) N_F(Z)

otherwise. We refer to this as the `easy quartic case’. Finally, he went on to deal with the case when F is a binary cubic form such that \Delta(F) is a square in \mathbb{Z}. In this case, he showed that there is a positive integer m which can be determined explicitly in terms of the coefficients of F such that

\displaystyle R_F(Z) \sim \left(1 - \frac{2}{3m} \right) N_F(Z).

We will refer to this as the `hard cubic case’.

There is a general theory which applies to all binary forms (of any degree at least three) which allows one recover both the |AC| term in the easy quartic case and the m in the hard cubic case. This will be fully explained in an joint paper by my advisor Professor Cameron Stewart and I. However, in the cubic and quartic cases specifically, one can resolve the problem more finely, in that not only can one describe the relationship between R_F(Z) and N_F(Z), but show that there is a positive rational number W_F which can be given explicitly in terms of the coefficients of F when F is a binary cubic or quartic form. My contribution to the cubic case is that we can find this rational constant even when F is not assumed to be irreducible. Indeed, we find that the most interesting behaviour actually occurs when F is completely reducible (but still with non-zero discriminant)! The reducibility of F seems to have no effect in the case of quartic forms.

The key observation is that the behaviour of the constant W_F is determined completely by the automorphism group of the binary form F in \text{GL}_2(\mathbb{Q}). This appears to be an extraordinary insight made by Hooley in his investigation of the hard cubic case. It appears that almost all authors either explicitly or implicitly assumed that it suffices to look only at the smaller group \text{GL}_2(\mathbb{Z}).

More details will be posted later.

Some problems for the new year

Part new year resolution and part a birthday present to myself (and those audience members interested), I’ve decided to write up some problems I’ve been thinking about but either don’t have the time or the techniques/knowledge to tackle at the present time. Hopefully they will keep me motivated into 2016, as well as anyone else who’s interested in them. In no particular order:

1) Stewart’s Conjecture: I have already discussed this problem in two earlier posts (here and here). The conjecture is due to my advisor, Professor Cameron Stewart, in a paper from 1991. The conjecture asserts that there exists a positive number c > 0 such that for all binary forms F(x,y) of degree d \geq 3, integer coefficients, and non-zero discriminant, there exists a positive number r_F which depends on F such that for all integers h with |h| \geq r_F, the equation F(x,y) = h has at most c solutions. In particular, the value of c does not depend on F nor d. A weaker version of this conjecture asserts the existence of a positive number c_d for every degree d \geq 3 for which the above holds.

I suspect that Chabauty’s method, applied to the estimation of integer points on hyperelliptic curves, is close to being able to solve this problem; see this paper by Balakrishnan, Besser, and Muller. However, there may be other tools that may be used without involving a corresponding curve. That said, since a positive answer to Stewart’s conjecture would have significant impact on the theory of rational points on hyperelliptic curves, it seems that the two problems are intrinsically intertwined.

2) Asymptotic Chen’s Theorem: This is related to a problem I’ve been thinking about lately. Chen’s theorem asserts that every sufficiently large even integer N can be written as the sum of a prime and a number which is the product of at most two primes. However, this simple statement hides the nature of the proof. The proof essentially depends on two parts, and (as far as I know) has not been improved on a fundamental level since Chen. The first is the very general Jurkat-Richert theorem, which can handle quite general sequences. Its input is some type of Bombieri-Vinogradov theorem, i.e., some type of positive level of distribution. It essentially churns out semi-primes of some order given a particular level of distribution. We will phrase the result slightly differently, in terms of the twin prime conjecture. Goldbach’s conjecture is quite related, and Chen actually proved the analogous statement for both the twin prime problem and Goldbach’s conjecture. Bombieri-Vinogradov provides the level 1/2, and with this level, the Jurkat-Richert theorem immediately yields that there exist infinitely many primes p such that p+2 is the product of at most three primes. Using this basic sieve mechanism and the Bombieri-Vinogradov theorem, it is impossible to breach the ‘three prime’ barrier. A higher level of distribution would do the trick, but so far, Bombieri-Vinogradov has not been improved in general (although Yitang Zhang‘s seminal work on bounded gaps between primes does provide an improvement in a special case). Thus, we require the second piece of the proof of Chen’s theorem, the most novel part of his proof. He was able to show that there aren’t too many primes p such that p+2 has exactly three prime factors, so few that the difference in number between those primes p where p+2 has at most three prime factors and those with exactly three prime factors can be detected. However, the estimation of these two quantities using sieves (Chen’s theorem does not introduce any technology that’s not directly related to sieves) produce terms with the same order of magnitude, so Chen’s approach destroys any hope of establishing an asymptotic formula for the number of primes p for which p+2 is the product of at most two primes. It would be a significant achievement to prove such an asymptotic formula, because it means there has been a significant improvement to the underlying sieve mechanism, or some other non-sieve technology has been brought in successfully to tackle the problem. Either case, it would be quite the thing to behold.

3) An interpolation between geometrically irreducible forms and decomposable forms: A celebrated theorem of Axel Thue is the statement that for any binary form F(x,y) with integer coefficients, degree d \geq 3, and non-zero discriminant and for any non-zero integer h, the equation F(x,y) = h has only finitely many solutions in integers x,y.  Thue’s theorem is ineffective, meaning one cannot actually find an upper bound for the number of solutions except to know that it must be finite. Thue’s theorem has been refined by many authors over the past century, with some of the sharpest results known today due to my advisor Cam Stewart and Shabnam Akhtari.

If one wishes to generalize Thue’s theorem to higher dimensions, then there are two obvious candidates. The more obvious one is to consider general homogeneous polynomials F(x_1, \cdots, x_n) in many variables. However, in this case Thue’s techniques do not generalize in an obvious way. Thue’s original argument reduced the problem to a diophantine approximation problem, i.e., to show that there are only finitely many rational numbers which are `very close’ to a given root of F. This exploits the fact that all binary forms can be factored into linear forms, a feature which is absent for general homogeneous polynomials in n \geq 3 variables. Thus, one needs to narrow the scope and instead consider decomposable forms, meaning homogeneous polynomials F(x_1, \cdots, x_n) which can be factored into linear forms over \mathbb{C}, say. To this end, significant progress has been made. Most notably, Schmidt’s subspace theorem was motivated by this precise question. Schmidt, Evertse, and several others have worked over the years to establish results which are quite close to the case of Thue equations, though significant gaps remain, but that’s a separate issue and we omit further discussion.

The question I have is whether there is a way to close the gap between what can be proved about decomposable forms and for general forms. The forms which are the most different from decomposable forms, which are essentially as degenerate as possible geometrically, are the ones that are the least degenerate; i.e., the geometrically irreducible forms. These are the forms that cannot be factored at all. Specifically, their lack of factorization is not because its factorability is hidden by some arithmetic or algebraic obstruction but because it is geometrically not reducible. Precisely, geometrically irreducible forms are those forms F(x_1, \cdots, x_n) which do not have factors of positive degree even over an algebraically closed field, say \mathbb{C}. For decomposable forms, a necessary condition is to ensure that the degree d exceeds the number of variables n; much like the condition d \geq 3 in the case of Thue’s theorem. However, absent from the case when n = 2 is the possibility that there are forms of degree exceeding one which behave `almost’ like linear forms, in a concrete sense. By this I mean we can show that as long as basic local conditions are satisfied, the form represents all integers. This has shown to be the case for forms whose degree is very small compared to the number of variables; the first such result is due to Birch, and has been improved steadily since then. Thus the interpolation I am wondering about is the following: let F(x_1, \cdots, x_n) be a homogeneous polynomial with integer coefficients and degree d \geq n+1, with no repeated factors of positive degree. Suppose that F factors, over \mathbb{C}, into forms of very small degree, say d' \ll \log n. Can we hope to establish finiteness results like we can for decomposable forms? This seems like a very interesting question.

If you are interested in any of these problems or if you have an idea as to how to approach any of them, please let me know!

Large sieve inequality

I am currently reading the book Opera de Cribro by John Friedlander and Henryk Iwaniec, and in particular studying the large sieve. One important thing to remember is that the “large sieve” is not really a sieve in the conventional sense. A ‘sieve’ typically refers to a choice of sieve weights, for example a combinatorial sieve is usually some way of defining sieve weights \lambda_d in such a way that \lambda_d = \mu(d) for some positive integers d, while \lambda_d = 0 for others. The large sieve does not involve a choice of sieve weights; and indeed, is usually independent from such choices (at least in its distilled from, the Bombieri-Vinogradov theorem).

The large sieve is actually just an inequality, which is not strictly number-theoretical. In fact, it applies equally well to any “well-spaced” points on the unit circle. The full force of this philosophy has recently been brought to bear on the Vinogradov Mean Value Theorem, in this paper. We write it in its most general form. We will adopt the convention e(x) = e^{2 \pi i x} and for a given sequence (a_n) of complex numbers, define S(\alpha) = \displaystyle \sum_{M < n \leq M+N} a_n e(\alpha n). Now suppose that \alpha_1, \cdots, \alpha_r are well-spaced real numbers with respect to some parameter \delta, meaning that for k \ne l, the number \alpha_k - \alpha_l is at least \delta away from an integer. We will write the distance of a real number \beta from an integer as \lVert \beta \rVert. In other words, we insist that if k \ne l, then \lVert \alpha_k - \alpha_l \rVert \geq \delta.

Moreover, it is clear that we can have at most \delta^{-1} many \alpha_j‘s. From the Cauchy-Schwarz inequality, we see that \lvert S(\alpha) \rvert^2 \leq N \sum_{M < n \leq M+N} |a_n|^2. Therefore, any upper bound for the term

\displaystyle \sum_r \lvert S(\alpha_r)\rvert^2

must include N, \delta^{-1}. The remarkable thing is that this is enough! Indeed, Selberg proved the following sharp form of the large sieve inequality:

\displaystyle \sum_r \lvert S(\alpha) \rvert^2 \leq (N + \delta^{-1} -1) \sum_n |a_n|^2.

This has the following striking number-theoretic interpretation. Consider all the rational numbers a/q with \gcd(a,q) = 1 and 1 \leq q \leq Q. Observe that any two such rationals differ by at most 1/Q^2, in other words, these rationals are Q^2-spaced. Then the large sieve inequality gives the following

\displaystyle \sum_{q \leq Q} \sum_{\substack{a \pmod{q} \\ \gcd(a,q) = 1}} \left \lvert S \left(\frac{a}{q}\right) \right \rvert^2 \leq \left(Q^2 + N - 1\right) \sum_n |a_n|^2.

There are striking consequences to this inequality, including the famous theorem of Linnik.

Solution to why that nonic is solvable

Previously, we claimed that for any quadruple a,b,c,d of rational integers, not all zero, the nonic polynomial

F(x) = x^9 + a x^8 + b x^7 + c x^6 + d x^5 - (126 - 56 a + 21 b - 6 c + d)x^4

- (84 - 28 a + 7 b - c)x^3 - (36 - 8a + c)x^2 - (9 - a)x - 1

is solvable, meaning that it is possible to determine the roots of F explicitly by radicals. By Galois theory, this is equivalent to the assertion that the Galois group of the Galois closure of F is a solvable group.

To do this, we need the following fact, which was proved by Bhargava and Yang in this paper as Theorem 4. The statement of their theorem is correct, and the proof is mostly correct, but there is a minor issue. The problem is that the stabilizer of F under \text{GL}_2(\mathbb{C}), which we will denote by \text{Aut}_\mathbb{C} F, need not be realizable as a subgroup of the Galois group of F, which we will denote by \text{Gal} (F). However, the argument they gave for the commuting action between elements of \text{Aut}_\mathbb{C} F and \text{Gal}(F) is correct.

We now consider a binary form F of the shape given above. We see that both \text{Aut}_\mathbb{C} F and \text{Gal}(F) act on the roots of F and can therefore be embedded via their action on the roots of F into S_9, the symmetric group on nine letters. If we restrict to \text{GL}_2(\mathbb{Q}) action and denote by \text{Aut} F = \text{Aut}_\mathbb{Q} F, then it follows from Galois theory that \text{Gal} (F) must be a subgroup of the centralizer of any element in \text{Aut} F in S_9.

We then check that, miraculously, \text{Aut} F always contains the following element of order 3:

U = \begin{pmatrix} 0 & 1 \\ -1 & -1 \end{pmatrix}.

We check that the only complex numbers fixed by this element are the roots of x^2 + x + 1. Therefore, if F is irreducible, then no root of F can be fixed by U. Relabelling the roots if necessary, we can assume that U can be realized in S_9 as

U = (123)(456)(789).

The centralizer C(U) of U is the stabilizer of U under the action by conjugation of S_9 on itself. The orbit of U is precisely the set of elements in S_9 of the same cycle type, therefore the orbit contains

\displaystyle \binom{9}{3}(2!) \binom{6}{3}(2!) (2!) \frac{1}{3!} = 2240.

By the orbit-stabilizer theorem, it follows that C(U) contains 9!/2240 = 162 = 2 \times 3^4 elements. Since \text{Gal} (F) is contained in C(U), it is solvable by Burnside’s theorem, which asserts that any finite group whose order is only divisible by two distinct primes is solvable.


A solvable nonic polynomial

Continuing from our demonstration that a certain sextic polynomial, which are not in general solvable, has an explicit factorization, we go on to describe how a class of degree 9 polynomials is solvable. Consider a,b,c,d to be rational integers, not all zero, and the nonic polynomial

F(x) = x^9 + a x^8 + b x^7 + c x^6 + d x^5 - (126 - 56 a + 21 b - 6 c + d)x^4

- (84 - 28 a + 7 b - c)x^3 - (36 - 8a + c)x^2 - (9 - a)x - 1.

The claim is that all such polynomials are in fact solvable!

I will reveal the argument a little later, but it’ll be interesting to see what kind of arguments readers can come up with.