Rational points on a unit circle
There are some events that happen in the course of ones life that might be considered historical or world-changing. One such event from our lifetime is the proving of the Last Theorem of Fermat (FLT). While the proof of the FLT is inaccessible to persons with meager mathematical understanding like us, its algebraic form can be understood by anyone with a very limited knowledge of mathematics. Its geometric form is deeply linked to a basic aspect of Euclidean space, and it was this form that amazed us as a kid. This can be presented as a very simple problem: given the curve (Figure 1), are there rational points that lie on the curve? A rational point is a point
where both
and
are rational numbers.
These curves may be divided two topologically distinct groups based on the parity of the exponent . If
then the curve is bilaterally symmetric about
and cuts the x-axis and the y-axis once at
and
respectively (Figure 1). If instead
then the curve is closed, symmetric about both axes and the lines
, and cuts the the x-axis and the y-axis twice at
and
respectively (Figure 1). It is easy to see that the case where
has infinite rational points. The case for
defines a circle and that too has infinite rational points on it (see below). Remarkably, the FLT states that there are no rational points for any of the curves with
, other than of course the trivial points where the curve cuts the axes. This is a rather profound statement regarding the geometry of Euclidean space as it means that, other than the line
and the unit circle, none of other members of this family of unit curves can ever pass through non-trivial rational points.
Now, what are the rational points on a circle? These are linked to the defining theorem of Euclidean space — the theorem of the right triangle or the bhujā-koṭi-karṇa-nyāya: we have for a right triangle with the three sides in order of length,
Dividing the equation by we get:
Writing and
we get the equation of the unit circle:
Therefore, the primitive right-triangle triples (because they are
triangles), where
define the rational points on a unit circle of the form
. It has been known since antiquity that the right triangle triples have a deep connection to the greatest common divisor of two numbers. Thus, given two positive integers
, if
and
, we can generate triples using the below formulae (something one learns by secondary school):
For example, yield
, the most primitive of all
triples. It figures in many ancient holy objects like the Śrīcakra or in the layout of the city of Ayodhyā. However, if both
are odd then
,
and
will all be even. Hence, they cannot constitute a primitive
triple: e.g.
will yield
which is simply the
triangle magnified 2 times. Thus, we need an additional condition so that we consider only those triples generated by the above formulae which themselves have
. Thus, we eliminate all
sharing an odd parity. Given that the above procedure ensures that
will be positive, if we further ensure that the sides are arranged by increasing order of length then the minimal set of primitive triples,
, will necessarily occupy only the octant of a circle between
(Figure 2). This set of primitive triples
can then be used to get rational points on the entire perimeter of the circle by placing
before
and changing their signs one at a time.
Figure 2.
The above procedure shows that there are infinite triples (Figure 2 shows those with
) and thus there are infinite rational points that lie on a unit circle. However, for the past 80 or so years it has been known that there is a hierarchy to the
triples with the
triple being the mother of all of them. We represent it as the below matrix:
We then define 3 transformation matrices:
Then we get three further generation-1 triples from thus:
.
These represent the next level in the hierarchy of primitive triples. From those 3 we get a further generations of 9 and then 27 and so on by multiplying each triple of the previous generation by each of the transformation matrices. What serial multiplication by
does is to create right triangles approaching the half-square or
or
triangle. Thus, in the lineage of multiplication by
we get triangles such that
For example, the 2nd generation triple in the
lineage yields
, which approximates
correctly to 4 decimal places. In contrast, the matrices
and
drive the triangles towards the state where one of the sides and the hypotenuse are nearly equal. When we plot the rational points generated by the
triples on the unit circle we get a fractal structure covering the primary octant of the said circle (Figure 3).
Figure 3. Rational points on the unit circle derived from
triples with those derived from the primordial triple
and 3 subsequent generations of triples indicated by disks of decreasing size.
This structure is characterized by “zones of exclusion” defined by the high-ranked triples. The biggest such zone is defined by the primordial triple, followed by those of generation-1 and so on (Figure 3). However, if one zooms in on the smaller zones of exclusion one sees a fractal structure mirroring the larger zones.
If we order the triples by the ascending value of then we get a remarkable relationship for the count of triples with
below or equal to a certain number
:
This famous result of the senior Derrick Lehmer, which can be easily numerically visualized (Figure 4), has links to Leonhard Euler’s discovery of the function.
Its formal proof is complicated (for the mathematical layman) and yields an additional logarithmic term. However, the basic concept leading to to Lehmer’s asymptotic result is readily understood. Euler had famously shown that the probability of two randomly chosen positive integers having
, i.e. being mutually prime, is
. In the above formula for obtaining
triples we start with two mutually prime integers
. Thus, the probability of getting two such mutually prime integers is defined by this formula of Euler. The hypotenuse
; hence, we can see the points defined by
for all
lying inside or on a circle with radius
and area
. However, we do not consider the entire circle; as we saw above, because of the ordering and sign considerations we only account for one octant of a circle, i.e.
of its total area. Hence, we get the number of points defined by the mutually prime
such that
to be
. However, not all these points correspond to valid triples. As we saw above, we have to exclude triples emerging from mutually prime
which are both odd. This approximately eliminates
of the possible mutually prime couplings (e.g. for 1, 2, 3, we have 1, 2 and 2, 3 as valid couplings but 1,3 is not). Therefore, we have multiply the value
by a correction factor of
, leading to the above asymptotic formula of Lehmer.
and
triangles and the rational points on
ellipses
Our exploration of this special class of rational points on an ellipse and its links to integer triangles began in our youth when we added an important lesson to our very rudimentary knowledge of trigonometry. It stemmed from a simple question that anyone with limited amount of geometric knowledge can ask: Are there any other triangles with an angle defined by a rational sector of a circle (i.e. the angle is rational in degrees) that have a regular formula (like the right triangle) relating their sides? In course of our compass and ruler constructions on paper, we stumbled across the triple which defines a
triangle (Figure 5).
Figure 5. defines a
triangle with sides obeying the formula
Since at that point we knew of the properties of the famous Platonic triangle, we were able to prove using the bhujā-koṭi-karṇa-nyāya that if a triangle’s sides are 3, 5, 7 then indeed it has an angle of
. While doing so it struck us that
and we were able to visualize the same geometrically (Figure 5). We then became curious if we could find other integer
triangles to see if this relationship held. Having drawn a
angle on paper, using a sliding ruler we soon obtained another
triangle defined by the triple
and found that here again
. This sparked our curiosity and we looked up one of our father’s mathematics books to see if this was recorded. While we did not find it there, we learned of the generalization of the bhujā-koṭi-karṇa-nyāya, the cosine rule,
. We were able to grasp its implications right away because our father had introduced us to the trigonometric functions early in life. Thus, it became clear that
was indeed a property of
triangles stemming directly from the cosine rule given that
.
Further, this implied that we can have such regular formulae only for one other rational sector triangle, i.e. the triangle. Only the rational sectors corresponding to
have rational cosines. Of these only
can form non-degenerate triangles. These correspond to the formulae:
While for the and the
case the additive formulae imply that that the side
is the largest side, the negative term in the
case implies that
is the middle length side of the triangle.
Figure 6. Difference of cubes relationship for a
triangle
While the FLT shows that there can be no relationship between 3 rational cubes like for squares, the and the
provide a 3-dimensional relationship to the difference and summation of volumes of cubes and a square-faced cuboid.
Thus, the difference of the volumes of 2 cubes with edges equal to each of the two short sides of a triangle
is equal to a cuboid with a base equal to a square erected on the long side of the said triangle
and height equal to the difference of the other two sides
.
Thus, the sum of the volumes of 2 cubes with edges respectively equal to the shortest and longest sides of a triangle
is equal to a cuboid with a base equal to a square erected on the middle side of the said triangle
and height equal to the sum of the other two sides
.
Further, we observed that by dividing the above equations by and writing
we get equations of the ellipses
corresponding to the
triangle and
corresponding to the
triangle (Figure 7).
Figure 7. The ellipses defined by the rational triangle triples
The above implies that the primitive integer triples corresponding to the triangle,
, define rational points that lie on the ellipse
and those corresponding to the
triples,
, define the rational points on the ellipse
(Figure 7). These are special ellipses whose eccentricity and semi-minor axis are equal to the same value,
. Further, they have an aspect ratio of
. We shall see below that certain numbers related to parameters of these ellipses play a role in certain in the counts of primitive
and
integer triangles.
Further, by means of a geometric construction we can show that for each there 2 corresponding
that can be derived from it, which share the lengths of 2 sides with the
triangle (Figure 8). This construction stems from the dissection of an equilateral triangle into 3 triangles such that one of them will again be equilateral (Figure 8). Thus, the primitive equilateral triangle
defines a distinct primitive
.
Figure 8. 2 ;
can be derived from a
As with the family of triangle triples we can derive a formula for constructing the
triples using two mutually prime integers
:
Since these are necessarily scalene triangles, we order the triple such that elements are in ascending order and then term them
. Further, in order to get primitive triples, we have to again ensure that the
. For example,
while mutually prime and yield the triple
which has
. Thus, it is merely the first triple
magnified 3 times. Hence, we have to drop such cases. Once we perform this operation on the result of above formula we can get primitive
triples. The first few are shown in Table 1.
If we plot the independent values of the
triples then they occupy a sector of the ellipse whose shape is the same as the ellipse equation
(Figure 9, panel 1). Further, the corresponding plot of the
points shows them forming subtle circular patterns within the sector which relate to their defining rational points on circles (Figure 2). Here, we see the
points form ellipses with a shape and inclination corresponding to the
ellipse on which they define rational points (Figure 9, panel 1).
Figure 9. The ordered primitive
triples (panel 1) sorted by
. The ordered primitive
triples sorted by middle side
(panel 2). The ordered primitive
triples sorted by the longest side
(panel 3).
Because of the defining relationship for the
triples contains a central term we were able to derive only 1 transformation matrix
for these triples. Unlike the trifurcating matrices for the right triangle triples this one merely converts one primitive triple in to another with no particular hierarchy:
If
We observe that the multiplication of by
drives the triples towards the sides of a
triangle. Thus, for the triples generated by this multiplication we have an arithmetic mean relationship comparable to the
triples:
In terms of the location of the rational points corresponding to the triples we see a fractal structure of “zones of inhibition” just as with the right-triangle triples. However, there is no obvious pattern beyond the higher ranked triples being associated with stronger zones of inhibition (Figure 10).
Figure 10. Location of the rational points corresponding to the high-ranked triples.
As with the right triangles we can ask what will be counts of triples with
? We empirically ascertained it to be (Figure 11):
Following the idea behind Lehmer’s proof for the triples, we can see that just as in that case the bounding curve of the triple-generating coprime
is a circle of radius
, for
triples it is an ellipse of the form
. One can see that this has a shape corresponding to
(Figure 7, 9, panel 2). The area of this ellipse is
(Figure 7). When the triples are in ascending order they only map to the sector of the ellipse from
. The explicit equation of the ellipse is
. Hence, we can find the area occupied by the
corresponding to the ordered
triples thus:
This is just the total area of the ellipse. Further, we have to apply a correction factor as we are only considering primitive triples; some of the coprime
yield the same triple when reduced. If we take first 4 coprime pairs (2,1); (3,1); (3,2); (4,1) get the 4 triples (3,5,7); (7,8,13); (5, 16, 19); (9, 15, 21). Of these the last is not primitive being equivalent to the first. Thus, we keep approximately only
of the triples coming from coprime
. Thus, the approximate number of primitive
triples such that
, which leads to the above result for the count of
triples.
We saw above that each triple generates 2
triples. Thus, the asymptotic count of
triples would be twice that of the
triples. Further, given the specific geometric relationship between the
triangle and its two
triangle children, the longest side
of the
triangle (Figure 8) is the middle side
of the two
children triangles if we consider their sides
in the ascending order. Hence, to observe the this relationship in the asymptotic counts we need to sort the
triples by the middle side
which is the equivalent of
of the
triples (Figure 12). This is also apparent from the sector of the ellipse
occupied by the coprime
giving rise to the
triples (Figure 7, 9). This corresponds to the sector of the ellipse defined by
. By integrating between those limits one can see that the sector has twice the area of sector occupied by the
generating
.
Figure 12. Asymptotic counts of
triangles sorted by the middle side
If we instead order the triples by their longest sides
then we get the below relationship for their asymptotic counts:
This can be empirically verified (Figure 13) and explained thus: If is the longest side then the relationship between the individual elements of a
triples is given as
. Dividing this equation by
and writing
we get the hyperbola
. This hyperbola will determine the shape of the bounding curve of the coprime
generating
triples with
(Figure 9, panel 3). The actual equation of the bounding curve will be
. Thus, the generating
will be contained in an area corresponding to the sector of this hyperbola from
(Figure 9, panel 3). We can get that area thus:
Hence, in this case we get the approximate number of primitive triples such that
, which is the above result.
Figure 13. Asymptotic counts of
triangles sorted by the longest side
Divisibility and composition of triples
The triples illustrate a deep link between geometry and arithmetic in terms of their divisibility and composition. Some of these facts have probably been known since antiquity for the triples. One trivial feature is that all the odd numbers will be represented among the shortest sides for both the
and
triples. Since adjacent odd and even numbers are coprime, and thus valid
for generating the shortest side
, they give rise to the odd numbers because the difference of the squares of two adjacent numbers is an odd number
. Less trivially, in the case of the
triples we also get the multiples of 4 starting from 8 onward as the shortest side
. This happens when we have even numbers couple with 1 (being naturally coprime), such as
. Here
and
; similarly,
would yield
and so on. In the case of the
triples we similarly have in addition to the odd numbers, all multiples of 8 from 16 onward. These come from consecutive odd numbers, which being coprime can be valid
to generate
triples. Thus,
yields
, which from the
onward will yield 16, 24…
A further relationship is related to the hypotenuse of a primitive
triple: it is always an integer of the form
. This can be understood thusly: Given
as the coprime generators of a primitive triple let them be respectively odd and even coprime numbers. We can write
and
.
. Thus, the hypotenuse will be of the form
where
. A similar relationship exists for
triples where every longest side
is always of the form
. This is harder to demonstrate generally than for the
example. But we can do it informally by breaking this down to multiple cases. First, consider a coprime
which are successive odd and even numbers. We can write them as
. We then have
which is an integer of the form
. Second, if we a consecutive odd coprime numbers, e.g. 1, 3 they will be of the form
then
, which is an integer of the form
. Now, if we instead consider a coprime pair of the form
then
. This number will not be of the form
but will be eliminated from the list because it is not a primitive triple being divisible by 3. For example, as we saw above while applying the correction factor for the counts,
or
are such eliminated pairs. By extending this argument we can see that the only
that remain will be of the form
because the other coprime
pairs will generate non-primitive triples.
For the triples there is another interesting pattern: The most primitive triple is
. At least 1 of the 3 sides of every subsequent primitive triple will be divisible by 3, 4 and 5. For example: sorted by the hypotenuse
the 29th triple
has the 1st side divisible by 3, the 2nd by 4 and the 3rd by 5. The 30th triple
has the 1st side divisible by 4, the 2nd by 3, and the 3rd by 5. In other cases, 2 of these might divide the same side: e.g.
, where 5 divides the 1st side and 3, 4 divide the second. Sometimes all 3 numbers might divide the same side, e.g.
where the 2nd side is divisible by 3, 4, 5. A comparable relationship exists for
triples: at least 1 of the 3 sides of the triple will be divided by at least 2 of the following numbers: 3, 5, 7, 8. Of these 3, 5, and 8 divide at least 1 side of around
of the
triples and 7 divides at least one side of around
of the
triples.
Finally, in passing, one may also note that it has been known for a while that the terms meru-średhi (known in the occident as the Fibonacci sequence) can be used to generate particular triples, though not all of them are primitive.
Let be the średhi, then a
triple is defined by:
For example, if we take , we get the
triple: 105, 208, 233
Inspired this, we can derive a similar formula of the triples (again all will not be primitive):
For example if we take , we get the
triple: 105, 272, 337