- Split View
-
Views
-
Cite
Cite
Alina Carmen Cojocaru, Rachel Davis, Alice Silverberg, Katherine E. Stange, Arithmetic Properties of the Frobenius Traces Defined by a Rational Abelian Variety (with two appendices by J-P. Serre), International Mathematics Research Notices, Volume 2017, Issue 12, June 2017, Pages 3557–3602, https://doi.org/10.1093/imrn/rnw058
- Share Icon Share
Abstract
Let |$A$| be an abelian variety over |$\mathbb Q$| of dimension |$g$| such that the image of its associated absolute Galois representation |$\rho_A$| is open in |$\operatorname{GSp}_{2g}(\hat{ \mathbb Z})$|. We investigate the arithmetic of the traces |$a_{1, p}$| of the Frobenius at |$p$| in |$\operatorname{Gal}(\overline{\mathbb Q}/\mathbb Q)$| under |$\rho_A$|. In particular, we obtain upper bounds for the counting function |$\#\{p \leq x: a_{1, p} = t\}$| and we prove an Erdös-Kac-type theorem for the number of prime factors of |$a_{1, p}$|. We also formulate a conjecture about the asymptotic behaviour of |$\#\{p \leq x: a_{1, p} = t\}$|, which generalizes a well-known conjecture of Lang and Trotter from 1976 about elliptic curves.
1 Introduction
Given an abelian variety |$A/\mathbb Q$|, its reductions |$A_p/\mathbb F_p$| modulo primes encode deep arithmetic global information. A primary question related to these reductions concerns their |$p$|-Weil polynomials, in particular the coefficients of these polynomials.
In the simplest case when |$A$| has dimension 1, that is, when |$A$| is an elliptic curve over |$\mathbb Q$|, for each prime |$p$| of good reduction the |$p$|-Weil polynomial is |$ P_{A, p}(X) = X^2 - a_p X + p \in \mathbb Z[X], $| where |$ a_p := p + 1 - |A_p(\mathbb F_p)|. $| The coefficient |$a_p$| satisfies the Weil bound |$|a_p| < 2 \sqrt{p}$| and is of major significance in number theory. For example, it appears as the |$p$|th Fourier coefficient in the expansion of the weight 2 newform associated to |$A$|. The study of |$a_p$| comes in several flavours, some having led to well-known problems in arithmetic geometry, such as the Sato-Tate Conjecture from the 1960s (now a theorem) and the Lang-Trotter Conjecture on Frobenius traces from the 1970s (still open).
The constant |$c(A, t)$| has a precise heuristic description derived from the Chebotarev Density Theorem, combined with the Sato-Tate Conjecture when |$\operatorname{End}_{\overline{\mathbb Q}}(A) \simeq \mathbb Z$| and with a prime distribution law arising from works of Deuring and Hecke when |$\operatorname{End}_{\overline{\mathbb Q}}(A) \not\simeq \mathbb Z$|.
While the Lang-Trotter Conjecture remains open, several remarkable related results have been proven. When |$\operatorname{End}_{\overline{\mathbb Q}}(A) \not\simeq \mathbb Z$| (the CM case) and |$t \neq 0$|, upper bounds of the right order of magnitude can be proved using sieve methods. When |$\operatorname{End}_{\overline{\mathbb Q}}(A) \simeq \mathbb Z$| and |$t \neq 0$|, weaker upper bounds, unconditional or conditional (upon the Generalized Riemann Hypothesis, GRH), can be proved using effective versions of the Chebotarev Density Theorem; such bounds were first obtained by Serre [42, Theorem 20]. The currently best unconditional upper bound, |$\pi_A(x, t) \ll_A \frac{x (\log \log x)^2}{(\log x)^2}$|, was obtained by V. K. Murty [38, Theorem 5.1] (see [48] for an earlier result), while the currently best upper bound under GRH, |$\pi_A(x, t) \ll_A \frac{x^{\frac{4}{5}}}{(\log x)^{\frac{1}{5}}}$|, was obtained by Murty, Murty, and Saradha [37, Theorem 4.2] (for very recent improvements on the exponent of the |$\log x$| factor, see [52]). When |$\operatorname{End}_{\overline{\mathbb Q}}(A) \simeq \mathbb Z$| and |$t = 0$|, stronger results are known; in particular, the unconditional bounds |$\frac{\log \log \log x}{(\log \log \log \log x)^{1 + \varepsilon}} \ll_{\varepsilon} \pi_A(x, 0) \,{\ll}\, x^{\frac{3}{4}}$| were obtained by Fouvry and M. R. Murty [22, Theorem 1] and, respectively, by Elkies [19] using, as a key tool, Deuring’s characterization of supersingular primes [17].
Inspired by these works, the main goal of our article is to investigate the arithmetic of the Frobenius traces of a generic higher-dimensional abelian variety |$A/\mathbb Q$|; in particular:
(i) we will prove upper bounds for the generalization of the counting function |$\pi_A(x, t)$| and deduce results on the growth of the Frobenius traces;
(ii) we will determine the normal order of the sequence defined by the prime divisor function of the Frobenius traces, and, more generally, we will prove an Erdös-Kac-type result for this sequence;
(iii) under suitable hypotheses, we will formulate a generalization of (1).
Our main results mark only the beginning of such investigations in higher dimensions and we hope shall stimulate further research.
Our main setting and notation are as follows. Let |$A/\mathbb Q$| be a principally polarized abelian variety of dimension |$g$|. Let |$\overline{\mathbb Q}$| denote an algebraic closure of |$\mathbb Q$| and let |$\operatorname{End}_{\overline{\mathbb Q}}(A)$| denote the endomorphism ring of |$A$| over |$\overline{\mathbb Q}$|. Let |$N_A$| be the product of primes of bad reduction for |$A$|.
The reason we usually impose the restriction that our abelian varieties be principally polarized is for ease of notation. When the abelian variety is principally polarized, the image of the |$\ell$|-adic representation |$\rho_{A, \ell}$| lies in |$\operatorname{GSp}_{2g}(\mathbb Z_{\ell})$|. Without the restriction on the polarization, the image lies in a group that can be defined by replacing the matrix |$J_{2g}$| of Section 2.1 below with a matrix that has a more complicated description, and our results could be modified accordingly; see, for example, Section 2.3 of [43] for the group of symplectic similitudes in this general setting.
For any |$\varepsilon > 0$| we have:
- (i1) unconditionally,
- (i2) under GRH,
(ii) if |$t \neq \pm 2g$|, then (i1) and (i2) hold with |$\alpha$| replaced by |$\beta$|;
(iii) if |$t = 0$|,then (i1) and (i2) hold with |$\alpha$| replaced by |$\gamma$|.
We will actually prove a more general result, stated as Theorem 14 in Section 4, and that the case |$g =1$| of Theorem 1 is [42, Theorem 20, p. 189].
An immediate consequence of Theorem 1 concerns the non-lacunarity of the sequence |$(a_{1, p})_p$|:
We keep the setting and notation of Theorem 1. For any |$\varepsilon > 0$| we have:
- (i) unconditionally,
- (ii) under GRH,
Recall that |$\nu(n)$| denotes the number of distinct prime factors of a positive integer |$n$| and that an arithmetic function |$f(\cdot)$| is said to have normal order |$F(\cdot)$| if for all |$\varepsilon > 0$|, then |$ (1 - \varepsilon) F(n) < f(n) < (1 + \varepsilon) F(n) $| for all but a zero density subset of positive integers |$n$|. It is a classical result of Erdös, originating in work of Hardy and Ramanujan [25], that |$\nu(p-1)$| has normal order |$\log \log p$|. More generally, Erdös and Kac [20] proved that |$\nu(p-1)$| has a normal distribution. Variations of these results have also been obtained in arithmetic geometric contexts, including that of modular forms [36]. We now prove such results in the context of abelian varieties:
In particular, |$\nu(a_{1, p})$| has normal order |$\log \log p$|.
The case |$g = 1$| not only recovers but also generalizes the main theorem of [36] for weight 2 newforms that are not of CM type.
Finally, in Conjecture 4 below we propose a generalization of (1) to the case of higher-dimensional abelian varieties for which |$\Im \rho_A$| is open in |$\operatorname{GSp}_{2g}(\hat{\mathbb Z})$| and for which the following holds:
Equidistribution assumption: the normalized traces |$\frac{ a_{1, p}}{\sqrt{p}}$| are equidistributed on |$[-2g, 2g]$| with respect to the projection by the trace map of the (normalized) Haar measure of the unitary symplectic group |$\operatorname{USp}(2g)$|.
We propose:
If |$c(A, t) = 0$|, we interpret the asymptotic as saying that there are at most finitely many primes |$p$| such that |$a_{1, p} = t$|.
For a discussion about the possible growth of |$\pi_A(x, 0)$|, see Section 5.
The image of |$\rho_A$| is open in |$\operatorname{GSp}_{2g}(\hat{\mathbb Z})$| for a large class of abelian varieties. Indeed, in [43, 44] Serre showed that this holds whenever |$\operatorname{End}_{\overline{\mathbb Q}}(A) \simeq \mathbb Z$| and the dimension |$g$| of |$A$| is |$1$|, |$2$|, |$6$|, or an odd number. An open image result also holds when |$\operatorname{End}_{\overline{\mathbb Q}}(A) \simeq \mathbb Z$| and there exists a number field |$K$| such that the Néron model of |$A/K$| over the ring of integers of |$K$| has a semistable fibre of toric dimension 1; see [24]. As pointed out in [24, p. 704], for |$g \geq 2$| the image of |$\rho_A$| is open in |$\operatorname{GSp}_{2g}(\hat{\mathbb Z})$| for most abelian |$g$|-folds that arise as Jacobians of hyperelliptic curves defined by |$y^2 = f(x)$| with the degree |$n$| of the monic polynomial |$f \in \mathbb Z[x]$| equal to |$2 g + 1$| or |$2 g + 2$|. Specifically, the hypotheses in Hall’s Theorem are satisfied if the Galois group of |$f$| is |$S_n$|, or if there exists a rational prime |$p$| for which |$f (\mod p)$| has |$n-1$| distinct zeroes over an algebraic closure, one of which is a double zero; see Kowalski’s appendix in [24] and Zarhin’s article [50]. □
When |$\Im \rho_A$| is open in |$\operatorname{GSp}_{2g}(\hat{\mathbb Z})$|, the Equidistribution Assumption is a very special case of a general conjecture explained in Section 13 of [45, Conjecture 13.5] that generalizes the Sato-Tate Conjecture. See also [15, pp. 173–174, 797–804, 906] and [40]. □
Generalizations of the Lang-Trotter Conjecture (1) have been previously considered by other authors. For example, in [39], V. K. Murty addressed generalizations in the setting of modular forms, while in [28, pp. 421–423], Katz addressed generalizations in the setting of abelian varieties arising as Jacobians of genus |$g$| curves. Our conjecture encompasses a generic class of abelian varieties |$A$| and is precise in terms of both the growth in |$x$| and the constant depending on |$A$| and |$t$|. The potential vanishing of the constant |$c(A, t)$| is an important open problem in itself. In [28, p. 420], for instance, Katz discusses a general mechanism that leads to congruence obstructions for realizing |$a_{1, p} = t$|. We relegate this study to future work. □
The paper is structured as follows. In Section 2 we present some of the key results needed for proving Theorem 1, Corollary 2, and Theorem 3, and for arguing towards Conjecture 4. In Section 3 we prove Theorem 1 and Corollary 2 using the strategy of [42, Sections 7–8] and also with the help of the main result of Serre’s Appendix 1 of this aricle. In Section 4 we prove Theorem 3 following a general strategy of [5]. In Section 5 we provide our heuristic reasoning towards Conjecture 4 and address some connections with existing works. In Section 6 we provide computational data related to our theoretical investigations. J-P. Serre supplied two appendices: the first gives a result on the dimension of conjugacy classes in symplectic groups, while the second gives properties of a certain density function for unitary symplectic groups.
2 Generalities
2.1 Basic notation
we use |$p$| and |$\ell$| to denote rational primes; we write |$n | m^{\infty}$| to mean that all the prime divisors of |$n$| occur among the prime divisors of |$m$|, possibly with higher multiplicities; we write |$n || m$| to mean that |$n | m$|, but |$n^2 \nmid m$|; we write |$v_{\ell}(n)$| for the valuation of |$n$| at |$\ell$|.
We note that |$\operatorname{GSp}_{2}(R) = \operatorname{GL}_2(R)$|. We recall that |$\operatorname{GSp}_{2g}(R)$| has centre |$\{\mu I_{2g} : \mu \in R^{\times} \}$| and that, as an algebraic group, it has dimension |$2g^2 + g+1$|.
2.2 The Chebotarev density theorem
2.2.1 Finite extensions of a number field
We will use the following conditional effective version of this theorem:
In order to apply this theorem, the following variation of a result of Hensel [26], proved in [42], is useful:
2.2.2 |$\ell$|-adic extensions of a number field
In [42], Serre used the effective versions of the Chebotarev Density Theorem of Lagarias and Odlyzko [34] to deduce upper bounds for |$\pi_C(x, L/K)$| in the case of an |$\ell$|-adic Galois extension |$L/K$| of a number field |$K$|. We recall his main results below.
([42, Theorem 10, p. 151]) Keep the above setting and notation. Let |$0 \leq d < D$| be such that the Minkowski dimension of |$C$| satisfies |$ \dim_{\cal{M}} C \leq d. $| Define |$ \displaystyle{\alpha := \frac{D - d}{D}.} $|
- (i) Unconditionally, we haveIn particular, for any |$\varepsilon > 0$|, we have
- (ii) Under GRH for Dedekind zeta functions, we haveIn particular, for any |$\varepsilon > 0$|, we have
Serre obtains the following improvement in special cases:
Then |$(i)$| and |$(ii)$| of Theorem 10 hold with |$\beta_C$| in place of |$\alpha$|. □
Note that |$r_C \geq 0$|, hence |$\beta_C \geq \alpha$| and so Theorem 11 is Theorem 10 when |$\beta_C = \alpha$|. When |$r_C \geq 1$|, hence |$\beta_C > \alpha$|, Theorem 11 improves upon Theorem 10. This happens when |$C \cap Z(G) = \emptyset$|.
2.3 Abelian varieties
Property (2) links the |$p$|-Weil polynomial |$P_{A, p}(X)$| to the division fields of |$A$|, in particular to the Galois representation defining |$\rho_{A}$|.
We recall that:
- by the Néron-Ogg-Shafarevich criterion,(6)
- by the injectivity of the restriction of |$\bar{\rho}_{A, m}$| to |${\operatorname{Gal}(\mathbb Q(A[m])/\mathbb Q)}$|,(7)
In many cases, the image of the representation |$\rho_{A}$| is better understood. For example, as already mentioned in Remark 5 of Section 1, for several classes of abelian varieties |$A/\mathbb Q$| with a trivial endomorphism ring, |$\Im \rho_A$| is open in |$\operatorname{GSp}_{2g} (\hat{\mathbb Z})$|. In particular, for such |$A$| we have that:
|$ \Im \rho_{A, \ell}$| is open in |$\operatorname{GSp}_{2 g}(\mathbb Z_{\ell})$| for all rational primes |$\ell$|;
|$G(\ell) \simeq \operatorname{GSp}_{2 g}(\mathbb Z/\ell \mathbb Z)$| for all but finitely many rational primes |$\ell$|.
Let |$A/\mathbb Q$| be a principally polarized abelian variety of dimension |$g$| such that |$\Im \rho_A$| is open in |$\operatorname{GSp}_{2g}(\hat{\mathbb Z})$|.
- (i) There exists an integer |$m \geq 1$| such that |$\rho_A(\operatorname{Gal}(\overline{\mathbb Q}/\mathbb Q)) = \Pi^{-1}(G(m))$|, where we recall thatis the natural projection. Denote by |$m_A$| the least such integer.
- (ii) For all positive integers |$m_1, m_2$| with |$m_1 | m_A^{\infty} \; \; \text{and} \; \; (m_2, m_A) = 1,$| we have
- (iii) For all |$t \in \mathbb Z$| we haveIn particular, if |$t \neq 0$|, then
- (iv) For all |$t \in \mathbb Z$|, |$t \neq 0$|, we have
This completes the proof of (8).
This ensures the convergence of the infinite product |$\displaystyle\prod_{\ell} H_t(\ell)$|, proving (iii).
Indeed, the first assertion is trivial, while the second assertion follows by noting that, if |$t_1 \not\equiv 0 (\mod \ell)$| and |$t_2 \not\equiv 0 (\mod \ell)$|, then the endomorphism |$\left[t_2 t_1^{-1}\right]$| of |$\operatorname{GSp}_{2g}(\mathbb Z/\ell \mathbb Z)$| defined by multiplication by |$t_2 t_1^{-1}$| is a bijection satisfying that |$\left[t_2 t_1^{-1}\right] (C(\ell, t_1)) = C(\ell, t_2)$|.
It is now easy to show that (9) follows from this along with (8) for |$|C(\ell, 0)|$|.
Consequently, (9) holds for |$t=0$|, |$g \geq 2$|.
Note that |$M_1 = 1$| and that for |$s \ge 2$|, the first of the two conditions defining |$M_s$| gives |$\alpha_1$| linearly in terms of the other |$\alpha_i$|, while the second gives |$\alpha_2$| as a root of a quadratic in the remaining terms. Thus, if |$s \geq 2$|, then |$M_{s} \leq 2(\ell-1)^{s-2}$|. It follows that when |$r=2$|, the sum |$\sum_{\alpha \in \mathbb F_\ell^\times} K(\alpha)^r$| is bounded by an expression of leading degree at most |$2$| in |$\ell$| (by direct computation using |$M_1$|), and when |$r \ge 3$|, by an expression of leading degree at most |$r-1$| in |$\ell$|.
The quadratic function above is maximized when |$b = \lfloor \frac{g}{2} \rfloor$| and |$k = \lfloor \frac{g-2b+2}{2} \rfloor =1$|, with maximal value |$\frac{3g^2}{2} + \frac{g}{2} + 1$|; the bound (12) follows. This proves (9), and therefore the first part of (iii).
To prove the second part of (iii), observe that |$t \neq 0$| is divisible by at most finitely many primes, and so |$\displaystyle\prod_{\ell \nmid m_A} H_t \left( \ell^{v_{\ell}(t) + 1} \right)$| is a constant multiple of |$\displaystyle\prod_{ \ell} H_t(\ell)$|, hence finite by the first part of (iii).
We sketch a proof of the latter using arguments from [8]; we leave it as an exercise to the reader to derive these formulae using the aforementioned results of [32]. We will use these formulae in Remark 24.
Dividing by |$\ell-1$|, we deduce that |$N_{\ell,t} = (\ell-2)^2$| for any fixed nonzero |$t$|; this completes the proof of (22). □
3 Proof of Theorem 1
We will deduce Theorem 1 from the following more general result:
Let |$A/\mathbb Q$| be a principally polarized abelian variety of dimension |$g$| and let |$t \in \mathbb Z$|.
(i) Assume that there exists a prime |$\ell$| such that:
(a) |$G_\ell$| is open in |$\mathbb G_\ell$|;
(b) |$\exists \; 0 \leq d < \dim \mathbb G_{\ell} \; \text{such that} \; \dim_{{\cal{M}}} C_{\ell}(t) \leq d.$|
Define
Then for any |$\varepsilon > 0$| we have:
- (i1) unconditionally,(23)
- (i2) under GRH,(24)
(ii) If |$t \neq \pm 2g$|, assume that there exists a prime |$\ell$| such that:
(a) |$G_\ell$| is open in |$\mathbb G_\ell$|;
(b) |$\exists \; 0 \leq d < \dim \mathbb G_{\ell} \; \text{such that} \; \dim_{{\cal{M}}} C_{\ell}(t) \leq d;$|
(c) |$v_\ell(\frac{t}{2g}) \neq 0$|.
DefineThen |$r_{C_{\ell}(t)} > 0$| and the equations (23) and (24) hold with |$\alpha$| replaced by |$\beta$|.
(iii) If |$t=0$|, assume that there exists a prime |$\ell$| such that:
(a) |$G_\ell$| is open in |$\mathbb G_\ell$|;
(b) |$ \exists \; 0 \leq d < \dim P\mathbb G_{\ell} \; \text{such that} \; \dim_{{\cal{M}}} C'_{\ell}(0) \leq d.$|
DefineThen |$r_{C'_{\ell}(0)} > 0$| and the equations (23) and (24) hold with |$\alpha$| replaced by |$\gamma$|.\qedhere
Throughout the proof we let |$x > 0$|, to be thought of as approaching |$\infty$|.
It remains to estimate |$\pi_{C_{\ell}(t)}(x, L/\mathbb Q)$|, which we do by following the method of [42, Section 8].
We choose |$\ell$| as in the hypothesis of (i). Since |$G_\ell$| is open in |$\mathbb G_\ell$|, we have |$\dim G_{\ell} = \dim \mathbb G_{\ell}$|. We apply Theorem 10 to the extension |$L/\mathbb Q$| and the conjugacy set |$C_{\ell}(t)$| with |$D := \dim \mathbb G_{\ell}$|.
Therefore we can improve upon the result of (i) by applying Theorem 11 to the extension |$L/\mathbb Q$| and the conjugacy set |$C_{\ell}(t)$| with |$D := \dim \mathbb G_{\ell}$|.
Therefore we can improve upon the result of (i) by applying Theorem 11 to the extension |$L'/\mathbb Q$| and the conjugacy set |$C'_{\ell}(0)$| with |$D := \dim \mathbb G_{\ell} - 1$|. ∎
In our setting, by the openness assumption on |$\Im \rho_A$|, hypothesis (a) of Theorem 14 holds for any prime |$\ell$|. It remains to verify hypothesis (b) and to compute the values of |$\alpha$|, |$\beta$|, and |$\gamma$|.
To verify hypothesis (b) of either parts (i) or (ii), observe that |$\mathbb C_{\ell}(t)$| is a closed subvariety of the algebraic group |$\operatorname{GSp}_{2g}$| and so |$C_\ell(t)$| has a well-defined dimension strictly smaller than |$\dim \mathbb G_\ell$|. The bound applies to the Minkowski dimension |$\dim_{\mathcal{M}}C_\ell(t)$| also by [42, Theorem 8]. Part (b) follows with |$d := \dim \mathbb G_\ell - 1$|.
To verify hypothesis (b) of part (iii), observe that |$\Pi(\mathbb C_{\ell}(0))$| is a closed subvariety of the algebraic group |$P\mathbb G_{\ell}$| and so |$C'_{\ell}(0)$| has a well-defined dimension strictly smaller than |$\dim P\mathbb G_\ell$|. The bound applies to the Minkowski dimension |$\dim_{\mathcal{M}}C'_\ell(0)$| also by [42, Theorem 8]. Part (b) follows with |$d := \dim \mathbb G_\ell - 2$|.
Recalling that |$\dim \operatorname{GSp}_{2g} = 2g^2 + g + 1$|, we see that |$\alpha = \frac{1}{2 g^2 + g + 1}$|.
This completes the proof of Theorem 1. ∎
The uniformity in |$t$| of the bounds for |$\pi_A(x, t)$| provided by Theorem 1 was crucial in the above estimates. ∎
4 Proof of Theorem 3
Let |$A/\mathbb Q$| be a principally polarized abelian variety of dimension |$g$| such that |$\Im \rho_A$| is open in |$\operatorname{GSp}_{2g}(\hat{\mathbb Z})$|. We will investigate |$\nu(a_{1, p})$| via the method of moments, with the goal of proving:
With this, by adapting to our context the proof of the Erdös-Kac Theorem due to Billingsley [5] (see also [4] and the references therein for an accessible exposition), Theorem 3 is proved.
By (29), |$R_{\ell}$| models the event that |$\ell | a_{1, p}$| for some |$p$|. Our strategy then is to prove (27) by comparing |$\mathbb E\left( (R(y) - \log \log x)^k \right)$| and |$ \frac{1}{\pi(x)} \displaystyle\sum_{ p \leq x \atop{ p \nmid N_A \atop{ a_{1, p} \neq 0 } } } (\nu_y(a_{1, p}) - \log \log x)^k $| for each |$k \geq 1$|.
This completes the proof of Theorem 3.
The first and second moments of |$\nu(a_{1, p})$| may be estimated directly, without any comparison with the model defined by |$R_{\ell}$|. The strategy originates in Turán’s proof of the Hardy-Ramanujan Theorem, [47], and is summarized below. □
The cancellation of the |$\pi(x) (\log \log x)^2$| terms is essential and that the choice of |$\delta$| ensures that the largest emerging |$\operatorname{O}$|-term depending on |$y$|, namely |$\operatorname{O}_{A}\left( \frac{ x^{2 \delta (2 g^2 + g + 1)} }{\log x} x^{\frac{1}{2}} \right)$|, is sufficiently small; precisely, it is |$\ll_A \pi(x) \ll_A \pi(x) \log \log x$|.
The normal order of |$\nu(a_{1, p})$| may also be obtained via the ubiquitous large sieve; see [33, Proposition 2.15] for generalities related to such works. Moreover, the |$k$|th moments (27) may be estimated more precisely via sieve methods by applying the general result [23, Proposition 3]. □
5 Heuristic Reasoning for Conjecture 4
We devote this section to arguing heuristically towards Conjecture 4. Our main setting will be that of a principally polarized abelian variety |$A/\mathbb Q$| of dimension |$g$| for which |$\Im \rho_A$| is open in |$\operatorname{GSp}_{2g}(\hat{\mathbb Z})$| and which satisfies the Equidistribution Assumption. In particular, the function |$\Phi$| introduced in Section 1 is bounded, continuous, and nonzero on |$(-1, 1)$|; this was proved in more than one way in email communication between Katz [29] and Serre; in Appendix 2 we include a letter from Serre to Katz that contains such a proof.
This follows by viewing the expression inside the limit as a Riemann sum approximation of the integral |$\displaystyle\int_{-1}^{1} \Phi (\tau) \; d \tau = 1$|. For more details, see [35, pp. 31–32]. ∎
Here, the symbol |$\approx$| means equality deduced purely heuristically. The last line is simply notation, as introduced in Section 2.2.
When |$t = 0$| and |$g = 1$|, the limit over |$m \; \widetilde{\rightarrow} \; \infty$| exists and equals an infinite product by [35, Lemma 2, p. 34]; see Remark 22 below. When |$t = 0$| and |$g \geq 2$|, we are currently unable to make a similar statement and relegate such a study to future work.
We conclude this section with several remarks about the above conjecture.
In our notation |$\Phi(x)=\Psi(4x)\cdot 4$|, since one can rescale the variable and account for the fact that both functions are assumed to have integral 1. Therefore, |$\Phi(0)=\Psi(0) \cdot 4=\frac{256}{15 \pi^2}$|. □
For higher |$g$|, the function |$\Phi$| is shown to have a certain general form in Appendix 2. It may again be calculated explicitly using, for example, [49, Theorem 7.8.B] and [30, 5.0.4] (see also the upcoming [7]). □
For |$g=1$|, a more refined version of (1) was proposed in [2]; for higher |$g$|, similar refinements are relegated to future work. □
Variations of our Conjecture 4 may be formulated for non-generic classes of abelian varieties such as the case of a CM elliptic curve |$E/\mathbb Q$| (which was already considered in [35]); in such cases, both the assumption on the image of |$\rho_A$| and the Equidistribution Assumption must be modified appropriately. We relegate such endeavours to future work. □
6 Computations
The Lang-Trotter Conjecture as formulated in (40) has been supported by numerical evidence (see [8, 13, 35]). Among the main ensuing difficulties are the computations of the integer |$m_A$| and of the quotient |$\frac{m_A |C(m_A, t)|}{|G(m_A)|}$|. These may be resolved for |$g=1$| by working with a {Serre curve}, that is, an elliptic curve for which |$\left|\mathbb GL_2\left(\hat{\mathbb Z}\right) : \Im \rho_A\right| = 2$|. For such a curve, the integer |$m_A$| is the least common multiple of |$2$| and the discriminant of |$\mathbb Q\left(\sqrt{\Delta_A}\right)$|, where |$\Delta_A$| is the discriminant of any Weierstrass equation of |$A$|; see [27, Section 4, p. 1558]. As proved in [27] and later in [14], in more than one sense almost all elliptic curves are Serre curves. Examples of such curves, as exhibited by Serre in [41, pp. 310–311] and by Daniels in [16, p. 227], have been used for numerical computations in [16, p. 227].
For higher |$g$|, the investigation of |$m_A$| from a computational perspective is a solid problem in itself that remains to be tackled. In this section, while we do not provide numerical evidence for Conjecture 4, we do provide some computational data that complements our main theoretical results.
6.1 Values of |$\pi_A(x,t)$|
Figures 1 and 2 show the values of |$\pi_{A}(x, t)$| graphed versus |$\sqrt{x}/\log x$| for |$t \in \{0, 1\}$| and |$A \in \{J_1, J_2, J_3\}$|, where |$J_1, J_2, J_3$| are the Jacobians of the hyperelliptic curves listed in Table 1. Prediction (38) would imply that these graphs approximate a straight line, whose slope is determined by the constant in front of |$\sqrt{x}/\log x$|; the graphs are indeed consistent with this implication.
Jacobian . | Hyperelliptic curve . | Genus . | Comments . |
---|---|---|---|
|$J_1$| | |$y^2 = x^5 - x + 1$| | 2 | Good reduction outside |$\{ 2, 19, 151 \}$|, |$\operatorname{End}(J_1) \cong \mathbb Z$| [18, p. 509] |
|$J_2$| | |$y^2 = 4x^7 - 12x - 35$| | 3 | Everywhere semistable, |$\operatorname{End}(J_2) \cong \mathbb Z$| [51, p. 2] |
|$J_3$| | |$y^2 = 4x^9 - 8x - 39$| | 4 | Everywhere semistable, |$\operatorname{End}(J_3) \cong \mathbb Z$| [51, p. 2] |
Jacobian . | Hyperelliptic curve . | Genus . | Comments . |
---|---|---|---|
|$J_1$| | |$y^2 = x^5 - x + 1$| | 2 | Good reduction outside |$\{ 2, 19, 151 \}$|, |$\operatorname{End}(J_1) \cong \mathbb Z$| [18, p. 509] |
|$J_2$| | |$y^2 = 4x^7 - 12x - 35$| | 3 | Everywhere semistable, |$\operatorname{End}(J_2) \cong \mathbb Z$| [51, p. 2] |
|$J_3$| | |$y^2 = 4x^9 - 8x - 39$| | 4 | Everywhere semistable, |$\operatorname{End}(J_3) \cong \mathbb Z$| [51, p. 2] |
Jacobian . | Hyperelliptic curve . | Genus . | Comments . |
---|---|---|---|
|$J_1$| | |$y^2 = x^5 - x + 1$| | 2 | Good reduction outside |$\{ 2, 19, 151 \}$|, |$\operatorname{End}(J_1) \cong \mathbb Z$| [18, p. 509] |
|$J_2$| | |$y^2 = 4x^7 - 12x - 35$| | 3 | Everywhere semistable, |$\operatorname{End}(J_2) \cong \mathbb Z$| [51, p. 2] |
|$J_3$| | |$y^2 = 4x^9 - 8x - 39$| | 4 | Everywhere semistable, |$\operatorname{End}(J_3) \cong \mathbb Z$| [51, p. 2] |
Jacobian . | Hyperelliptic curve . | Genus . | Comments . |
---|---|---|---|
|$J_1$| | |$y^2 = x^5 - x + 1$| | 2 | Good reduction outside |$\{ 2, 19, 151 \}$|, |$\operatorname{End}(J_1) \cong \mathbb Z$| [18, p. 509] |
|$J_2$| | |$y^2 = 4x^7 - 12x - 35$| | 3 | Everywhere semistable, |$\operatorname{End}(J_2) \cong \mathbb Z$| [51, p. 2] |
|$J_3$| | |$y^2 = 4x^9 - 8x - 39$| | 4 | Everywhere semistable, |$\operatorname{End}(J_3) \cong \mathbb Z$| [51, p. 2] |
6.2 Converging products of Lemma 12
6.3 The normal order of |$\nu(a_{1,p})$|
Figure 3 shows the average number of |$\nu(a_{1,p})$| for |$J_1$| of Table 1, for |$p$| in the intervals |$[2^{i-1}, 2^i]$|, |$i = 2, \ldots, 21$|. The graphs of |$\log \log 2^i$| and |$\log \log 2^{i-1}$| are shown for comparison. Figure 4 presents histograms of the values of |$\nu(a_{1,p})$| for |$J_1$| in two intervals: |$[1, 2^{20}]$| and |$[2^{20},2^{21}]$|. The corresponding histograms for |$J_2$| and |$J_3$| are very similar.
Funding
This work was partially supported by the National Science Foundation under agreement No. DMS-0747724 (to A.C.C.), by the European Research Council under Starting Grant 258713 (to A.C.C.), by the Simons Collaborative Grant under Award No. 318454 (to A.C.C.), by the National Science Foundation under agreement CNS-0831004 (to A.S.), by the National Science Foundation MSPRF 0802915 (to K.E.S.), by the Natural Sciences and Engineering Research Council of Canada PDF 373333 (to K.E.S.), and by the National Security Agency under Grant H98230-14-1-0106 (to K.E.S.). The United States Government is authorized to reproduce and distribute reprints notwithstanding any copyright notation herein.
Acknowledgments
The authors thank Jeff Achter, Alina Bucur, Hao Chen, Francesc Fité, Nathan Jones, Kiran Kedlaya, Emmanuel Kowalski, Antonella Perucca, Karl Rubin, Jean-Pierre Serre, Drew Sutherland, Cassie Williams, and Jonathan Wise for valuable discussions related to this paper. They are grateful for the hospitality of the Banff International Research Station, Alberta, Canada, and the support of the organizers Chantal David, Matilde Lalín, and Michelle Manes of the conference Women in Numbers 2 (2011), where this work was initiated. The last three authors thank the first author for her leadership and hard work on this project.
Appendix 1. Letter by J-P. Serre on Dimension of Conjugacy Classes in Symplectic Groups
Paris, May 8, 2015
Dear professor Cojocaru,
In case you want optimal estimates for the dimensions of conjugacy classes in |$\operatorname{Sp}$| and |$\operatorname{GSp}$|, here is what one can say:
Let us consider the algebraic groups |$G = \operatorname{GSp}_{2n}$| or |$\operatorname{Sp}_{2n}$| over a field |$k$| of characteristic |$0$| (there are some small changes in char. |$p > 0$|). Assume |$n > 1$|, since the case of |$\mathbb GL_2$| and |$\operatorname{SL}_2$| is obvious. If |$g \in G(k)$|, let |$d(g)$| be the dimension of the conjugacy class of |$g$|, viewed as an algebraic subvariety of |$G$|; we have |$d(g) = \dim G - \dim Z_G(g)$|, where |$Z_G(g)$| is the centralizer of |$g$| in |$G$|.
Assume that |$g$| is not of the form |$cu$|, where |$c$| is in the centre of |$G$| and |$u$| is unipotent. Then |$d(g) \geqslant 4n-4$|. If moreover |$\operatorname{Tr}(g) = 0$| and |$n > 2$|, we have |$d(g) \geqslant 4n-2$|. □
A few remarks before giving the proof:
(1) This is a ‘geometric’ statement: we may assume that the ground field is algebraically closed.
(2) We may assume that |$G = \operatorname{Sp}_{2n}$|; the case of |$\operatorname{GSp}_{2n}$| follows by writing |$g$| as product of a scalar and an element of |$\operatorname{Sp}_{2n}$|; the dimension of the conjugacy class is the same.
(3) If |$\operatorname{Tr}(g) = 0$|, then the condition ‘|$g \neq cu$|’ is satisfied, thanks to the fact that the characteristic does not divide |$2n.$|
(4) The bounds are optimal. One realizes them by using the obvious embedding |$\iota : \operatorname{SL}_2 \to G$|, fixing a non-degenerate subspace of codimension 2. If one chooses |$g = \iota(-1)$|, the centralizer of |$g$| in |$\operatorname{Sp}_{2n}$| is |$\operatorname{Sp}_{2n-2} \times \operatorname{SL}_2$|; its dimension is |$2(n-1)^2 + n-1 + 3 = 2n^2 -3n +4$|; hence the dimension of the conjugacy class of |$g$| is |$\dim G - (2n^2-3n+4) = 2n^2 + n - (2n^2-3n+4) = 4n-4$|. If one chooses |$g = \iota(x)$|, where |$x \in \operatorname{SL}_2$| is such that |$\operatorname{Tr}(x) = 2-2n$| (this is always possible and gives a non-central element because |$2-2n \neq \pm 2$|), one gets an element of trace 0 with centralizer the product of |$\operatorname{Sp}_{2n-2} $| by a group of dimension 1; its dimension is |$2n^2 -3n +2$|, and the dimension of its conjugacy class is |$4n-2$|.
(5) In the |$\ell$|-adic application, one needs the fact that, if |$g \in \operatorname{GSp}_{2n}(\mathbb Z_\ell)$|, the dimension (as an |$\ell$|-adic manifold) of the conjugacy class of |$g$| is the same as its dimension in the sense of algebraic geometry.
For a given |$t \in \mathbb Z$|, with |$t \neq \pm 2n$|, one needs to choose |$\ell$| so that no element of trace |$t$| of |$\operatorname{GSp}_{2n}(\mathbb Z_\ell)$| can be of the forbidden shape |$cu$| ; as you explain in your paper, this is done by taking |$\ell$| such that |$v_\ell(t/2n) \neq 0$|, which is always possible.
Proof of|$d(g) \geqslant 4n-4$|.
We may assume that |$g$| is semisimple. Indeed, if we decompose |$g$| in Jordan form, as |$g = su = us$|, where |$s$| is semisimple and |$u$| is unipotent, the centralizer of |$g$| is contained in the centralizer of |$s$|, hence |$d(g) \geqslant d(s)$|.
Let us decompose the vector space |$V = k^{2n}$| (with its chosen non-degenerate alternating form) as a direct sum of eigenspaces of |$g$|, say |$V = \oplus V_{\lambda}$|. These spaces have the following properties:
(a) |$V_1$| and |$V_{-1}$| are non-degenerate, hence of even dimension;
(b) If |$\lambda \neq 1,-1$|, then |$V_\lambda$| is totally isotropic and in duality with |$V_{\lambda^{-1}}$|.
(It might be more efficient to use the orthogonal decomposition of |$V$| given by the eigenspaces of |$g + g^{-1}$|.)
Consider first the case |$z=0$| (i.e., the case where |$g$| as order 2). In that case, (A.1) shows that |$ \dim Z_G(g) = 2x^2+x+2y^2+y = 2n^2+n - 4xy$|. We have |$xy \neq 0$|, otherwise |$g$| would be central; hence |$x$| and |$y$| run between |$1$| and |$n-1$|. In that range the product |$xy$| is minimum when either |$x$| or |$y$| is equal to 1, in which case its value is |$n-1$|. Hence |$ \dim Z_G(g) \leqslant 2n^2 + n - 4(n-1)$|, i.e., |$d(g) \geqslant 4(n-1)$|, as wanted.
Suppose |$z \geqslant 1$|; we have |$x,y \geqslant 0$|. With the relation (A.2'), this shows that the point |$(x,y,z) \in \mathbb R^3$| belongs to the triangle with vertices the three points |$(0,0,n), (0,n-1,1), (n-1,0,1).$| Since the function |$2x^2+x+2y^2+y+z^2$| is convex, it attains its maximum at one of the vertices [6, Chapter II, Section 7.1, Proposition 1]; its values there are |$n^2, 2n^2-3n+2, 2n^2-3n+2$|. Since |$n^2 \leqslant 2n^2-3n+2$| for |$n \geqslant 2$|, this shows that |$ \dim Z_G(g) \leqslant 2n^2-3n+2$|, hence |$d(g) \geqslant 4n-2$|, and fortiori|$d(g) \geqslant 4n-4$|, as wanted.
(The proof also shows that |$d(g) = 4n-4$| is only possible when |$g$| is an involution of type |$\pm \ \iota(-1)$|, as in Remark 4.)
Proof of|$d(g) \geqslant 4n-2$| when |$\operatorname{Tr}(g)=0$|.
We use the same notation as in the above proof. The case |$z=0$| is possible only if |$n$| is even, with |$x=y = n/2$|. This gives a centralizer of dimension |$n^2+n$|, hence |$d(g) = n^2$|, which is |$> 4n-2$| when |$n \geqslant 4$|. Hence |$z \geqslant 1$|, in which case the computation given above shows that |$d(g) \geqslant 4n-2$|.
Best wishes
J-P. Serre
Appendix 2. Letter by J-P. Serre on the Continuity of the Density Function
Paris, April 12, 2015
Dear Katz,
Thank you very much for your letter about the density problem for |$\operatorname{USp}(2n)$|.
After writing to you I found a different way of getting the same result, based on an integration formula which is a combination of Weyl’s formula and a formula of Steinberg [46, Lemma 8.2].
This formula of Steinberg gives an integration formula over any local field. Here I shall stick to |$\mathbb R$| but I have no doubt that the |$p$|-adic case should be useful, too. To simplify matters, I shall suppose that -1 is in the Weyl group |$W$|.
Let now call |$UG$| the compact form of |$G$|, and |$UT$| the corresponding torus. The roots |$\alpha$|, and the characters |$\omega_i$| are now viewed as functions on |$UT$| with complex values of absolute value 1. The |$\chi_i$| are real-valued functions (because of my assumption on the Weyl group); let me call them |$x_i$|; they give a map |$x: UG \to \mathbb R^n$| which is well known (since Elie Cartan [9, pp. 803–804]) to have the following properties:
(a) It gives a homeomorphism of the space |$Cl(UG)$| on to a compact subset |$C$| of |$\mathbb R^n$|.(When |$G$| has type |$G_2$|, the set |$C$| is the one I asked you to draw for me.)
(b) Let |$C_T$| be the standard fundamental domain of |$W$| (in the tangent space, it corresponds to the fundamental alcove); the map |$C_T \to C$| is a homeomorphism; the boundary of |$C$| corresponds to the singular classes. (I see that by using topological arguments.)
(c) The function |$D(x_1,\ldots,x_n)$| (where |$D$| is as above) is a polynomial whose restriction to |$C$| is zero on the boundary and nowhere else.
By combining this with H. Weyl’s integration formula, one finds:
The image by |$x : UG \to \mathbb R^n$| of the normalized Haar measure of |$G$| has a continuous density (with respect to the standard measure |$dx_1\cdots dx_n$|), namely the function |$\varphi(x_1,\ldots,x_n)$| which is equal to |$0$| outside |$C$| and to |$ (2\pi)^{-n} |D(x_1,\ldots,x_n)|^{1/2}$| on |$C$|. □
The equidistribution measure associated with a fundamental character of |$G$| has a continuous density. □
More precisely, the density at a number |$c$| of the fundamental character |$\chi_1$| is equal to |$\int \varphi(c,x_2,\ldots,x_n) dx_2 \cdots dx_n.$|
Curiously, this point of view does not seem to give the fact that such densities are real analytic outside a finite number of values (namely, those taken by the character at the points of finite order of |$G$| corresponding to the vertices of the alcove, i.e. the points of |$G$| of order 1 or 2 when |$G = \operatorname{Sp}_{2n}$|).
One can also say when the density is not 0; for instance, for the trace when |$G= \operatorname{Sp}_{2n}$| the density is nonzero when the trace |$c$| is such that |$-2n < c < 2n$|.
When -1 is not in the Weyl group, some fundamental characters come in pairs of conjugate ones and instead of |$\mathbb R^n$| one should take a product of copies of |$\mathbb R$| and |$ \mathbb C$|. The case of |$\operatorname{SL}_3$| is especially nice; the compact |$C$| lies inside |$\mathbb C$| and is the interior (+ boundary) of a ‘hypocycloid with 3 cusps’ (hypocycloïde á trois rebroussements—as I learned when preparing the ENS competition in 1944-45).
Best wishes
J-P. Serre
PS—The explicit formula for |$\varphi$| in the case of |$\operatorname{Sp}_4$| is given in the aritcle of Fité et al. [21]; see Table 5, last line. Their |$a_1$| is my |$x_1$| and their |$a_2$| is my |$x_2+1$|.