Elementary counting formula for the number of $2k$-separated prime pair averages in an interval.

Proof of 2k-separated prime pairs in an interval, counting formula

Let all intervals be of integers. Fix a natural number k1k \geq 1. Primorial pn#p_n\# is defined as usual to be p1p2pnp_1 p_2 \cdots p_n. Let an undeclared pp or qq just mean any prime number.

Lemma 1. The number of x[0,y]x \in [0,y] such that x2=k2(modpi)x^2 = k^2 \pmod {p_i} for no i=1..ni = 1..n is given by:
f(y)=dpn#, dy(1)ω(d)cd, gcd(c,2k)=1y(zc,d)(d)d(0) f(y) = \sum_{d \mid p_n\#, \ \\ d \leq y } (-1)^{\omega(d)}\sum_{c \mid d, \ \\ \gcd(c,2k) = 1} \lfloor \dfrac{y - (z_{c,d})_{(d)}}{d}\rfloor \tag{0}

Where (zc,d)(d)(z_{c,d})_{(d)} is the unique least non-negative residue in Z\Bbb{Z} modulo dd such that zc,d=k(modc), zc,d=k(moddc)z_{c,d} = k \pmod c, \ z_{c,d} = -k \pmod {\frac{d}{c}}. Then condition (zc,d)(d)<dy(z_{c,d})_{(d)} \lt d \leq y ensures that there exists at least one such x=(zc,d)(d)x = (z_{c,d})_{(d)} in the interval [0,y][0, y]. Otherwise this term needs to equal zero as it counts the number of such solutions in the interval.

Proof. Define Ai={x[0,y]:x2=k2(modpi)}A_i = \{ x \in [0, y] : x^2 = k^2 \pmod {p_i}\} where pip_i is the iith prime number.
Then the expression we’re after is yA1A2Any - |A_1 \cup A_2 \cup \dots \cup A_n|. We can compute this via inclusion-exclusion to be:

y+1<dpn#, dy(1)ω(d)pdAπ(p)(1) y +\sum_{1 \lt d \mid p_n\#, \ \\ d \leq y} (-1)^{\omega(d)}|\bigcap_{p \mid d} A_{\pi(p)}| \tag{1}

The size of each intersection term counts the number x[0,y]x \in [0, y] such that x2=k2(modp)x^2 = k^2 \pmod {p} for all prime pdp \mid d. But modulo each pp, we have that x2=k2(modp)    x=kx^2 = k^2 \pmod p \iff x = k or x=k(modp)x = -k \pmod p. So what we can do is split up dd into c,dcc, \dfrac{d}{c} and sum up the count for each special case x=k(modc)x = k \pmod c (equivalently modulo each pcp \mid c) and x=k(moddc)x = -k \pmod {\frac{d}{c}} (equivalently modulo each pdcp \mid \frac{d}{c}).

Except if gcd(c,2k)1\gcd(c, 2k) \neq 1 in which case we have that there exist a prime pcp \mid c such that p(k(k))=2kp \mid (k - (-k)) = 2k. Which means the two possibilities x=kx = k or x=kx = -k modulo pp are equivalent. What happens is a double counting of pdAπ(p)|\bigcap_{p \mid d} A_{\pi(p)}| in the case that gcd(c,2k)1\gcd(c, 2k) \neq 1. Thus we only consider when x=k(mod2)x = -k \pmod 2 if we remove that case from the final summation.

Now focusing on a single special case x=k(modc),x=k(modd),gcd(c,2k)=1x = k \pmod c, x = -k \pmod d, \gcd(c, 2k) = 1. Let (zc,d)(d)=x(z_{c,d})_{(d)} = x. We know that we want to count all solutions to:
0dz+(zc,d)(d)y(2) 0 \leq dz +(z_{c,d})_{(d)} \leq y \tag{2}

So simply rearrange this:

    0(zc,d)(d)dzy(zc,d)(d)     (zc,d)(d)dzy(zc,d)(d)d, but zZ,    (zc,d)(d)dzy(zc,d)(d)d \iff 0 - (z_{c,d})_{(d)} \leq dz \leq y - (z_{c,d})_{(d)} \\ \ \\ \iff \dfrac{-(z_{c,d})_{(d)}}{d} \leq z \leq \dfrac{y - (z_{c,d})_{(d)}}{d}, \text{ but } z \in \Bbb{Z}, \\ \iff \lceil \dfrac{-(z_{c,d})_{(d)}}{d} \rceil \leq z \leq \lfloor\dfrac{y - (z_{c,d})_{(d)}}{d} \rfloor

The expression on the left is just zero, by definition of ()(d)(\cdot)_{(d)} taking things to the least non-negative residue (modd)\pmod d in Z\Bbb{Z}. Thus for this special case, the count is:

y(zc,d)(d)d(3) \lfloor \dfrac{ y - (z_{c,d})_{(d)}}{d} \rfloor \tag{3}

The reason we can (and may have to, in summation bound) take dyd \leq y is that if d>yxd \gt y \geq x, then dz+(zc,d)(d)y<d    dzy(zc,d)(d)<dd z + (z_{c,d})_{(d)} \leq y \lt d \implies dz \leq y - (z_{c,d})_{(d)} \lt d so that zy(zc,d)(d)d<dd=1z \leq \dfrac{y - (z_{c,d})_{(d)}}{d} \lt \dfrac{d}{d} = 1, which means z=0z = 0

All together and referring back to formula (2) we have that we must sum up these special case counts over all possible choices of cd,gcd(c,2k)=1c \mid d, \gcd(c, 2k) = 1:

f(y)=y+1<dpn#, dy(1)ω(d)cd,gcd(c,2k)=1y(zc,d)(d)d f(y) = y + \sum_{1 \lt d \mid p_n\#, \ \\ d \leq y } (-1)^{\omega(d)}\sum_{c \mid d, \\ \gcd(c,2k) = 1} \lfloor \dfrac{y - (z_{c,d})_{(d)}}{d}\rfloor

Now notice that we rewrite it in our goal form (0) by noticing that z1,1=0z_{1, 1} = 0 and so we can allow in d=1d = 1 and when you evaluate at d=1d = 1 the inner sum, you get yy itself. \blacksquare

Corollary 1. With y=pn+122,nNy = p_{n+1}^2 - 2, n \in \Bbb{N}. The 2k2k-separated prime pair conjecture is proven if f(y)f(y) is not eventually 00. For example if f(y)f(y) is always growing as nn \to \infty.

Proof. If x[0,y]x \in [0, y] is such that x2k2x^2 \neq k^2 modulo any pi,i=1..np_i, i = 1..n (that’s what f(y)f(y) counts !), then by Sieve-of-Eratosthenes, we have that x2k2x^2 -k^2 is a product of primes or that xx is a 2k2k-separated prime average. So f(y)f(y) essentially counts all 2k2k-separated prime pair averages in the interval [0,y][0, y] whenever ypn+122y \leq p_{n+1}^2 - 2. That is kind of a strict maximum because if you let in y=pn+121y = p_{n+1}^2 - 1 then y+1=pn+12y + 1 = p_{n+1}^2 is also coprime to all pi,i=1..np_i, i=1..n and yet it’s not a prime number, and so it would get falsely counted. \blacksquare

Corollary 2. The formula is true for any ypn+122y \leq p_{n+1}^2 -2, that is f(y)f(y) counts the number of 2k2k-separated prime pair averages in that interval, as long as of course y>pn+1y \gt p_n + 1.

Proof. It could turn out that pnp_n is a twin prime first and so pn+1p_n + 1 is a twin prime average, but it would not be “picked up by the count” because pnp_n is part of the set pi,i=1..np_i, i=1..n which has canceled out that average. The first statement follows from the discussion in the proof of Corollary 1. \blacksquare

Comments

Popular posts from this blog

Every Map $X \xrightarrow{f} \Omega$ is the Characteristic Map of a Monomorphism

90% visual proof of Contravariant Yoneda Lemma