Prescribing coefficients of invariant irreducible polynomials
Giorgos Kapetanakis 1
Faculty of Engineering and Natural Sciences, Sabancı ¨ Universitesi. Ortha Mahalle, Tuzla 34956, ˙Istanbul, Turkey
Abstract
Let F q be the finite field of q elements. We define an action of PGL(2, q) on F q [X]
and study the distribution of the irreducible polynomials that remain invariant under this action for lower-triangular matrices. As a result, we describe the possible values of the coefficients of such polynomials and prove that, with a small finite number of possible exceptions, there exist polynomials of given degree with prescribed high-degree coefficients.
Keywords: Hansen-Mullen conjecture, Finite fields, Character sums 2010 MSC: 11T06, 11T23
1. Introduction
Let q be a power of the prime number p. By F q we denote the finite field of q elements. Let A = a b c d ∈ GL(2, q) and F ∈ F q [X]. Following previous works [10, 22, 24], define
A ◦ F = (bX + d) deg(F ) F aX + c bX + d
. (1)
It is clear that the above defines an action of GL(2, q) on F q [X].
Recall the usual equivalence relation in GL(2, q), namely for A, B ∈ GL(2, q), A ∼ B : ⇐⇒ ∃ C ∈ GL(2, q) such that A = C −1 BC.
Further, define the following equivalence relations for A, B ∈ GL(2, q) and F, G ∈ F q [X].
A ∼ q B : ⇐⇒ A = λB, for some λ ∈ F ∗ q and F ∼ q G : ⇐⇒ F = λG, for some λ ∈ F ∗ q
Email address: gnkapet@gmail.com (Giorgos Kapetanakis)
It follows that, for F ∈ F q [X] the equivalence class [F ] := {G ∈ F q [X] | G ∼ q F } consists of polynomials of the same degree with F that are all either irreducible or reducible and every such class contains exactly one monic polynomial. Fur- ther, the action defined in (1) also induces an action of PGL(2, q) = GL(2, q)/ ∼ q
on F q [X]/ ∼ q , see [24]. For A ∈ GL(2, q) and n ∈ N, we define I A n := {P ∈ I n | [A ◦ P ] = [P ]},
where I n stands for the set of monic irreducible polynomials of degree n over F q . Recently, the estimation of the cardinality of I A n has gained attention [10, 22, 24]. In a similar manner, we introduce a natural notation abuse for [A], [B] ∈ PGL(2, q), i.e.
[A] ∼ [B] : ⇐⇒ ∃ [C] ∈ PGL(2, q) such that [A] = [C −1 BC].
We note that throughout this paper, we will denote polynomials with capital latin letters and their coefficients with their corresponding lowercase ones with appropriate indices. In particular, if F ∈ F q [X] is of degree n, then F (X) = P n
i=0 f i X i , in other words, f i will stand for the i-th coefficient of F . Two well- known results in the study of the distribution of polynomials over F q are the following.
Theorem 1.1 (Hansen-Mullen Irreducibility Conjecture). Let a ∈ F q , n ≥ 2 and fix 0 ≤ j < n. There exists an irreducible polynomial P (X) = X n + P n−1
k=0 p k X k ∈ F q [X] with p j = a, except when 1. j = a = 0 or
2. q is even, n = 2, j = 1, and a = 0.
Theorem 1.2 (Hansen-Mullen Primitivity Conjecture). Let a ∈ F q , n ≥ 2 and fix 0 ≤ j < n. There exists a primitive polynomial P (X) = X n + P n−1
k=0 p k X k ∈ F q [X] with p j = a, unless one of the following holds.
1. j = 0 and (−1) n a is non-primitive.
2. n = 2, j = 1 and a = 0.
3. (q, n, j, a) = (4, 3, 2, 0), (4, 3, 1, 0) or (2, 4, 2, 1).
Both results had been conjectured by Hansen and Mullen [16]. Theorem 1.1 was initially proved for q > 19 or n ≥ 36 by Wan [26], while Han and Mullen [15]
verified the remaining cases by computer search. Several extensions to these results have been obtained [9, 20], while most authors use a variation of Wan’s approach [26]. Recently new methods have emerged [14, 21, 25]. The second result was partially settled by Fan and Han [7, 8] and Cohen [4], while the proof was completed by Cohen and Pre˘ sern [5, 6].
One special class of polynomials are self-reciprocal polynomials, that is poly-
nomials such that F R := ( 0 1 1 0 ) ◦ F = F , where F R is called the reciprocal of
F . The problem of prescribing coefficients of such irreducible polynomials has
been investigated in [11, 12, 13].
Nonetheless, a description of the coefficient of the polynomials of I A n has not yet been investigated for arbitrary A. In Table 1, we present the results of a quick experiment regarding the distribution of the monic irreducible polynomials of degree 6 of F 3 that remain invariant under A, where A is chosen to be ( 1 0 2 1 ), ( 2 0 1 1 ) and ( 2 0 0 1 ). We see that the last two columns have the same number of entries, that in any case the coefficient of X 5 is always zero as well as some other coefficients, that in the first column, the coefficient of X 4 is always equal to 1 etc., while on the other hand some coefficients seem to take multiple values.
A =
1 02 1A =
2 01 1A =
2 00 1X
6+ X
4+ X
3+ X
2+ 2X + 2 X
6+ 2X
3+ 2X
2+ X + 1 X
6+ 2X
2+ 1 X
6+ X
4+ 2X
3+ X
2+ X + 2 X
6+ X
4+ 2X
2+ 2X + 2 X
6+ X
4+ 2X
2+ 1
X
6+ 2X
4+ X
3+ 2X + 1 X
6+ 2X
4+ 1 X
6+ 2X
4+ X
3+ X
2+ X + 2 X
6+ 2X
4+ X
2+ 1
Table 1: Monic irreducible polynomials of F
3of degree 6 such that F = A ◦ F .
In this work, we explain these observations. We confine ourselves to the case when A ∈ GL(2, q) is lower-triangular and wonder whether a monic irreducible polynomial over F q of specified degree whose class remains invariant under this action can have a prescribed coefficient. In Section 2, we deal with the case when A ∈ GL(2, q) is a lower-triangular matrix that has one eigenvalue and in Section 3 we deal with the case that A has two eigenvalues. The conditions, whether a certain coefficient of some F ∈ I A n can or cannot take any value in F q are provided. For the former case we adopt Wan’s method [26] and prove sufficient conditions for the existence of polynomials of I A n that indeed have these coefficients.
These results give rise to Theorems 2.8 and 3.4, where it is roughly shown that the high-degree coefficients of an irreducible monic polynomial invariant under A either take specific values or can be arbitrarily prescribed, with a small finite number of possible exceptions.
We note that from now on, without any special mention, A will always denote a lower-triangular matrix, so the eigenvalues of A are the elements of its diagonal.
2. The case of a single eigenvalue If A has a single eigenvalue, then
[A] =
( [( 1 0 0 1 )] , or
[( α 1 1 0 )] , for some α ∈ F ∗ q .
The first situation is settled by Theorem 1.1. For the second case, we have that
that A ◦ F ∼ q F ⇐⇒ F (X) ∼ q F (X + α) ⇐⇒ F (X) = F (X + α). The
polynomials with this property are called periodic. The following characterizes
those polynomials explicitly.
Lemma 2.1. Let α ∈ F ∗ q . Some F ∈ F q [X] satisfies F (X) = F (X + α) if and only if there exist some G ∈ F q [X] such that F (X) = G(X p − α p−1 X).
Proof. Let A = ( α 1 1 0 ). Since ord(A) = p, it follows from [24, Theorem 4.5], that if the degree of an irreducible such polynomial is ≥ 3, then it is pn, for some n. A direct computation reveals that there are no periodic polynomials of degree 1 and the existence of such polynomials of degree 2 requires p = 2. It follows that the degree of an irreducible periodic polynomial is a multiple of p, hence the irreducible factors of F are either of degree pn for some n, or they come in p-tuples of irreducible factors of the same degree, thus all polynomials with this property (irreducible or not) have degree pn for some n.
The left direction of the statement is clear. For the right direction, let F (X) = G(X)(X p − α p−1 X) + H(X)
where deg(H) < p. Also,
F (X) = F (X + α) = G(X + α)(X p − α p−1 X) + H(X + α).
The last two equations imply H(X) ≡ H(X +α) (mod (X p −α p−1 X)) and since deg(H) < p, this means H(X) = H(X + α) which in turn yields deg(H) = 0.
Also, since H(0) = F (0), we conclude that H = f 0 , that is (X p − α p−1 X) | (F − f 0 ).
Next, let pn be the degree of F . We show the desired result by induction on n. The case n = 0 is trivial. Now, assume that G = H(X p − α p−1 X) for all G ∈ F q [X] such that G(X) = G(X + α) and deg(G) = (k − 1)p. Let n = k. We have that (X p − α p−1 X) | (F − f 0 ), hence F = (X p − α p−1 X)G + f 0 , for some G ∈ F q [X] with deg(G) = (k − 1)p. Also, we have that G(X) = G(X + α), so from the induction hypothesis G = Z(X p − α p−1 X), for some Z ∈ F q [X]. The
result follows.
It is now clear that we need the following theorem of [1], also see [19, Theo- rem 3.3.3].
Theorem 2.2 (Agou). Let q be a power of the prime p, α ∈ F q and P ∈ I n . The composition P (X p − α p−1 X) is irreducible if and only if Tr(p n−1 /α p ) 6= 0, where Tr stands for the trace function F q → F p .
So, the monic irreducible periodic polynomials are those of the form Q(X) = P (X p − α p−1 X), where P ∈ I n such that Tr(P n−1 /α p ) 6= 0. Moreover,
Q(X) =
n
X
i=0
p i (X p − α p−1 X) i =
n
X
i=0 i
X
k=0
i k
(−α) (p−1)(i−k) p i X pk+i−k .
It follows that the m-th coefficient of Q, where 0 ≤ m ≤ pn, is
q m = X
dm/pe≤i≤min (m,n) i≡m (mod (p−1))
i
m−i p−1
(−α) pi−m p i = X
max (0,n−m)≤i≤n−dm/pe i≡m−n (mod (p−1))
γ i p R i ,
where
γ i :=
( n−i
m−n+i p−1
(−α) p−n+i , if i ≡ m − n (mod (p − 1))
0, otherwise.
In other words, it is a linear expression of some of the µ+1 low-degree coefficients of the reciprocal of P , i.e. P R := X deg(P ) P (1/X), where µ is the largest number such that γ µ 6= 0. First, we observe that it is possible for such µ to not exist (for example when m = np − 1 and p > 2) and, secondly, we observe that if µ = 0 or 1, then the value of q m has to be a given combination of p 0 and p 1 , but since neither of them is chosen arbitrarily, it can only take certain values.
So, from now on we assume that µ exists and µ ≥ 2. This leads us define to the following map
σ : G µ → F q , H 7→ X
max (0,n−m)≤i≤µ i≡m−n (mod (p−1))
γ i h i ,
where G µ := {f ∈ F q [X] | deg(f ) ≤ µ, f 0 = 1}. Also, it is clear that if P ∈ I n , then P R ∈ J n , where J n := {P ∈ F q [X] | P R ∈ I n }. Furthermore, it is now evident that we will need to correlate the inverse image of σ with a set that is easier to handle. The following proposition, see [12, Proposition 2.5], serves that purpose.
Proposition 2.3. Let κ ∈ F q . Set F ∈ G µ with f i := γ i−1 γ µ −1 for 0 < i < µ and f µ := γ µ −1 (γ 0 − κ). The map
τ : G µ−1 → σ −1 (κ), H 7→ HF −1 (mod X µ+1 ) is a bijection.
The following summarizes our observations.
Proposition 2.4. Let κ ∈ F q and 0 ≤ m ≤ (p−1)n. If m, n and p are such that there exist some i with dm/pe ≤ i ≤ min (m, n − 1) and i ≡ m (mod (p − 1)) and there exists some P ∈ J n such that Tr(p 1 /α p−1 ) 6= 0 and P ≡ HF −1 (mod X µ+1 ) for some H ∈ G µ−1 , then there exists some Q ∈ I pn , such that Q(X) = Q(X + α) and q m = κ.
Let U := (F q [X]/X µ+1 F q [X]) ∗ . Furthermore, set
ψ : U → C ∗ , F 7→ exp(2πi Tr(f 1 /(f 0 α p ))/p)
and notice that for P ∈ J n , Tr(p 1 /α p ) = 0 ⇐⇒ ψ(P ) 6= 1. Additionally, let
Λ(H) :=
( deg(P ), if H is a power of a single irreducible P, 0, otherwise,
be the von Mangoldt function on F q [X]. We define the following weighted sum
w := X
H∈G
µ−1Λ(H) X
P ∈J
n, ψ(P )6=1 P ≡HF
−1(mod X
µ+1)
1,
where F is the polynomial defined in Proposition 2.3. Clearly, if w 6= 0 we have our desired result.
In order to proceed, we will have to introduce the concept of Dirichlet char- acters. Let M be a polynomial of F q of degree ≥ 1. The characters of the group (F q [X]/M F q [X]) ∗ , extended to zero with the rule χ(F ) = 0 ⇐⇒ gcd(F, M ) 6=
0, are called Dirichlet characters modulo M . If χ is a Dirichlet character modulo M , we define
c n (χ) = X
d|n
n d
X
P ∈I
n/dχ(P ) d .
Weil’s theorem of the Riemann hypothesis for function fields implies the follow- ing theorem, see [26] and the references therein.
Theorem 2.5 (Weil). Let χ be a non-trivial Dirichlet character modulo M , then
|c n (χ)| ≤ (deg(M ) − 1)q
n2.
For a detailed account of the above well-known facts, see [23, Chapter 4], while the following can be deduced, see [26, Corollary 2.8].
Proposition 2.6. Let χ be a non-trivial Dirichlet character modulo M such that χ(F q
∗) = 1. Then
X
P ∈I
nχ(P )
≤ 1
n (deg(M )q n/2 + 1).
Further, notice that ψ is a group homomorphism, hence a Dirichlet character modulo X µ+1 , while it is clear that ord(ψ) = p. We deduce the following bounds.
Corollary 2.7. Let χ and ψ be Dirichlet characters modulo M , such that ord(ψ) = p and χ(F ∗ q ) = 1.
1. If χ 6∈ hψi, then
X
P ∈I
n, ψ(P )6=1
χ(P )
≤ 2(p − 1)
pn · (deg(M )q n/2 + 1), 2. If χ ∈ hψi \ {χ 0 }, then
X
P ∈I
n, ψ(P )6=1
χ(P )
≤ π q (n)
p + 2p − 3
pn · (deg(M )q n/2 + 1).
3. If χ = χ 0 , then
X
P ∈I
n, ψ(P )6=1
χ(P )
≥ (p − 1)π q (n)
p − p − 1
pn · (deg(M )q n/2 + 1).
Proof. We utilize the orthogonality relations for the group hψi and conclude
X
P ∈I
n, ψ(P )6=1
χ(P ) = 1 p
X
P ∈I
nχ(P )
(p − 1) −
p−1
X
j=1
ψ j (P )
= p − 1 p
X
P ∈I
nχ(P ) − 1 p
p−1
X
j=1
X
P ∈I
nχψ j (P ).
All three results follow directly from the above and Proposition 2.6. With the orthogonality relations in mind, we define V := {χ ∈ b U | χ(F ∗ q ) = 1}, check that V is a subgroup of b U and then rewrite w as follows:
w = 1
|V | X
H∈G
µ−1Λ(H) X
P ∈J
n, ψ(P )6=1
X
χ∈V
χ(P ) ¯ χ(HF −1 )
= 1
|V | X
χ∈V
χ(F ) X
H∈G
µ−1Λ(H) ¯ χ(H) X
P ∈J
n, ψ(P )6=1
χ(P ).
We separate the term that corresponds to χ = χ 0 and call it A ψ , then the one that corresponds to χ ∈ hψi \ {χ 0 } and call it B ψ and finally C ψ will stand for the term that corresponds to χ / ∈ hψi. Hence w = A ψ + B ψ + C ψ . For C ψ , we have
|C ψ | ≤ 1
|V | X
χ∈V \hφi
X
H∈G
µ−1Λ(H) ¯ χ(H)
X
P ∈J
n, ψ(P )6=1
χ(P ) .
Afterwards, we observe that any character sum that runs through J n that in- volves a character that is trivial on F ∗ q has the same absolute value as if it would run through I n . Also, for those characters we have that
X
H∈G
µ−1Λ(H)χ(H) = X
deg(H)≤µ−1, H monic
Λ(H)χ(H) =
µ−1
X
j=0
c µ−1 (χ).
Now, by taking into account the above, Theorem 2.5 and Corollary 2.7 yield
|C ψ | ≤ |V | − p
|V |
µ−1
X
j=0
µq j/2
· 2(p − 1)
p · µ
n · q n/2
≤ q µ − p
q µ · µ · q µ/2 − 1
q 1/2 − 1 · 2(p − 1) p · µ
n · q n/2
≤ 4µ 2
n · q (n+µ−1)/2 .
Similarly, for B ψ we notice that ψ ∈ V , i.e. hψi \ {χ 0 } ⊆ V , hence we get
|B ψ | ≤ p − 1
q µ · µ · q µ/2 − 1
q 1/2 − 1 · π q (n)
p + 2p − 3 p · µ
n · q n/2
≤ 2µ
q (µ+1)/2 · π q (n) + 4µ 2
n · q (n−µ−1)/2 . Finally, for A ψ , we notice that
X
H∈G
µ−1Λ(H) =
µ−1
X
m=0
X
deg(H)=m h
0=1
Λ(H) =
µ−1
X
m=0
q m = q µ − 1
q − 1 , (2)
thus
|A ψ | ≥ 1
|V | · q µ − 1 q − 1
(p − 1)π q (n)
p − p − 1
p · µ n · q n/2
≥ 1 2q
π q (n) − µ n · q n/2
.
Since w = A ψ +B ψ +C ψ , it follows that w 6= 0 provided that |A ψ | > |B ψ |+|C ψ |.
This implies the following condition for w > 0:
q (µ−1)/2 − 4µ
2q (µ+1)/2 · π q (n) ≥ µ n ·
4µ + 1
2q µ/2 + 4µ q µ
· q (n+µ−1)/2 . (3)
Further, it is well-known, see [18, Theorem 3.25], that π q (n) = 1
n X
d|n
µ(d)q n/d ,
where µ(·) stands for the M¨ obius function. It follows that
π q (n) ≥ 1 n
q n − q · q n/2 − 1 q − 1
. (4)
The combination of the above and Eq. (3) yields another sufficient condition, namely
q n/2 (q (µ−1)/2 − 4µ) + 4µ q − 1 ≥
2µq µ
4µ + 1
2q µ/2 + 4µ
q µ + 1
2µq (µ+1)/2 (q − 1)
. (5)
The above is satisfied for q ≥ 67 for all 2 ≤ µ ≤ n/2. It is also satisfied for
n ≥ 26 for all q and 2 ≤ µ ≤ n/2. In particular, for 2 ≤ q ≤ 64, Table 2
illustrates the values of n such that the Eq. (5) holds for all 2 ≤ µ ≤ n/2. All
in all, in this section we have proved the following theorem.
q = 2, n ≥ 26 q = 3, n ≥ 16 q = 4, n ≥ 12 q = 5, n ≥ 10 7 ≤ q ≤ 11, n ≥ 8 13 ≤ q ≤ 27, n ≥ 6
Table 2: Pairs (q, n) such that Eq. (5) holds for all 2 ≤ µ ≤ n/2.
Theorem 2.8. Let q be a power of the prime p, [A] = [( α 1 1 0 )] ∈ PGL(2, q) and n 0 ∈ Z >0 . If α = 0, then I A n
0= I n
0. If α 6= 0, then I A n
0= ∅ ⇐⇒ p - n 0 . Suppose p | n 0 and write n 0 = pn. Further, fix some 0 ≤ m ≤ pn and for all max(0, n − m) ≤ i ≤ n − dm/pe set
γ i :=
( n−i
m−n+i p−1