Science, culture, complexity

Tag: Millennium Prize

  • The Birch and Swinnerton-Dyer conjecture

    On Monday night, I kid you not, I dreamt of the Birch and Swinnerton-Dyer conjecture. It was only by name, a fleeting mention in a heated conversation I was having with a friend. I’m not sure who spoke it or why.

    When I woke up, I looked it up, and found that it’s one of the Millennium Prize problems — one of seven unsolved mathematical problems for each of whose correct solutions the Clay Mathematical Institute offers an award of $1 million.

    I’m vaguely familiar with these problems’ names, and the substance of only three, so after the dream, I resolved to understand the conjecture and why it remains unsolved. Here goes.

    Let’s start at high-school maths.

    The equation y = 2x + 1 is a straight line on a graph.

    For any given value of x, there’s only one corresponding value for y.

    Similarly, in high school, you’d have learnt that the equation for a circle is: x2 + y2 = 1.

    If you look for points on this circle where x and y are fractions, i.e. where they’re rational, you’ll find plenty.

    For example, (⅗, ⅘) is such a point on the circle because (⅗)2 + (⅘)2 = 1.

    The Birch and Swinnerton-Dyer conjecture is about elliptic curves rather than circles.

    Despite the name, these curves aren’t ellipses. An elliptic curve is defined by an equation that looks like this:

    y2 = x3 + Ax + B

    Let’s say A = -1 and B = 1. The equation becomes: y2 = x3x + 1

    If you plot this equation on a graph, you’ll get a smooth, flowing curve.

    Mathematicians are obsessed with finding the rational points on these curves, i.e. points where both x and y are fractions.

    For some elliptic curves, there are only a few rational points. For other elliptic curves, there are infinitely many.

    The question is: how can we tell, just by looking at the equation, how many rational points it has?

    A fascinating property of elliptic curves is that you can add points together.

    If you take two rational points on the curve, called P and Q, draw a line through them, and see where that line hits the curve a third time, that third point — after reflecting it across the x-axis — will also be a rational point.

    Mathematicians call this point P + Q.

    This is how elliptic curves have a ‘rank’.

    If a curve has rank = 0, there are only a finite number of rational points on the curve. You can add them all day but you’ll keep finding the same few spots again and again.

    If a curve has rank ≥ 1, it has infinitely many rational points. You can generate them by adding the rational points together to travel all over the curve.

    The Birch and Swinnerton-Dyer conjecture an attempt to calculate this rank using a completely different part of maths.

    To solve a difficult problem, mathematicians often try a simpler version first.

    For example, in order to calculate the rank of an elliptic curve, mathematicians looked for solutions in modular arithmetic.

    Consider a clock, whose numbers are modulo 12. In normal counting, 10 + 5 = 15. But on a clock, 10 + 5 = 3. This is because once the count hits 12, it resets. Since 10 + 5 = 10 + 2 + 3 = 12 + 3, you’re left with 3.

    This is what modulo 12 means.

    You can do the same thing with an elliptic curve equation.

    You pick a prime number p (like 2, 3, 5, 7, 11…) and ask: how many integer solutions are there if we only care about the remainder when divided by p?

    For instance, let’s use the elliptic curve y2 = x3x + 1 with p = 5.

    We want to find all solutions (x, y) where the values of x are picked from the set {0, 1, 2, 3, 4} — since these are the possible remainders when divided by 5 — and the equation holds modulo 5.

    This means:

    1. Pick a value of x from {0, 1, 2, 3, 4}

    2. Calculate y2 = x3x + 1 using normal arithmetic

    3. Find the remainder when you divide that result (y2) by 5

    4. Now find a y from {0, 1, 2, 3, 4} such that y2 has that same remainder when divided by 5

    So let’s check each possible value of x:

    • x = 0 so y2 = 1. Is there a y in {0, 1, 2, 3, 4} whose square equals 1 mod 5? 1 or 4
    • x = 1 so y2 = 1. Is there a y in {0, 1, 2, 3, 4} whose square equals 1 mod 5? 1 or 4
    • x = 2 so y2 = 7. Is there a y in {0, 1, 2, 3, 4} whose square equals 7 mod 5? None.
    • x = 3 so y2 = 0. Is there a y in {0, 1, 2, 3, 4} whose square equals 0 mod 5? 0
    • x = 4 so y2 = 61. Is there a y in {0, 1, 2, 3, 4} whose square equals 61 mod 5? 1 or 4

    So when p = 5, the elliptic curve y2 = x3x + 1 had seven solutions.

    Now, let Np be the number of solutions for a specific prime p. Because there are only p possible values for x and y in this scenario, finding Np is easy.

    Let’s use the same example.

    Since we’re working with modulo 5, both x and y can only be from {0, 1, 2, 3, 4}. That’s only five possible values each.

    And for each x, we only had to check at most five values of y. That’s at most 25 checks in all — which is very easy for a computer.

    Studying the curve modulo p, for many different values of p, yields information about the original curve over the rational numbers.

    Specifically,finding all the rational points on the curve y2 = x3x + 1, e.g. (0,1), (1,1), (-1,-1), etc., is extremely difficult. There could be infinitely many and they could involve large numerators and denominators.

    But for each prime p, counting how many solutions exist modulo p is easy: you just need to check all p2 possibilities.

    Notice also how for any given p, there are also around a p number of solutions on average.

    The number of solutions per possibility contains information about the rank of the elliptic curve.

    This connection happens via the L-function.

    In the 1960s, Bryan Birch and Peter Swinnerton-Dyer had a radical idea. They wondered if the number of solutions Np for various values of p could reveal the rank of the curve.

    They created the L-function to hold this information, written L(E, s). This is a complex function built using all the Np values for every prime number p.

    If a curve has many rational points, i.e. a high rank, we’d expect it to also have a high value of Np. If the curve has few rational points, Np should also be low.

    L(E, s) is a function of the variable s.

    Birch and Swinnerton-Dyer used a computer — then a room-sized machine called EDSAC 2 at the University of Cambridge — to calculate these values.

    They noticed a stunning pattern.

    Recall that for a given p, there are around a p number of solutions on average.

    If Np ​> p, the curve was said to have more solutions than average for that prime.

    If Np < p, the curve was said to have fewer solutions than average for that prime.

    Birch and Swinnerton-Dyer checked what happened when they multiplied these results together for thousands of primes. Their product looked like this:

    pXNpp\prod_{p \leq X} \frac{N_p}{p}

    In words, this formula asks: across all the prime numbers up to a certain limit X, is the elliptic curve consistently producing more solutions than average or fewer?

    When they plotted this formula on a graph, they noticed a clear divergence based on the rank of the curve.

    If a curve had only a finite number of rational points, the product fluctuated a bit but remained relatively small and stable.

    If the curve had infinite rational points, the product started to grow. The more primes they included in the calculation, the larger the product became.

    Here’s a visual.

    In the top graph, the blue curve has rank 0, so you see the product fluctuate but stay relatively small and bounded. The red curve has rank 1, so the product grows significantly larger.

    The bottom graph shows the same curves on a logarithmic scale, revealing the pattern over a larger range of values. The blue curve stays relatively flat with small oscillations while the red curve continues to surge upwards.

    Overall, Birch and Swinnerton-Dyer noticed that curves with finite rational points, i.e. rank 0, had a relatively bounded product. And curves with infinite rational points, i.e. rank ≥ 1, had a boundless product.

    Ergo, higher rank means faster growth.

    The product that Birch and Swinnerton-Dyer computed is closely related to the L-function.

    How?

    For each prime number p, they defined a variable ap = p + 1 – Np

    ap measures how Np differs from the expected value p + 1.

    If Np = p + 1, then ap = 0, i.e. it’s exactly average.

    If Np > p + 1, then ap < 0, i.e. there are more solutions than average.

    If Np < p + 1, then ap > 0, i.e. there are fewer solutions than average.

    The L-function makes use of the ap value thus:

    L(E,s)=p(1apps+p12s)1L(E, s) = \prod_p (1 – a_p \cdot p^{-s} + p^{1-2s})^{-1}

    In sum, the behaviour of the product as X grows is mathematically related to whether L(E, s) has a zero at s = 1.

    If you plug s = 1 into the L-function and get 0, the corresponding elliptic curve E has at least some infinite points.

    But if the L-function hugs the zero very closely, the rank of the elliptic curve E is higher.

    Thus, Birch and Swinnerton-Dyer conjectured: the rank of an elliptic curve is equal to the order of the zero of its L-function at s = 1.

    When a function equals zero at some point, the ‘order’ says how strongly it touches zero.

    If the order is 0, the function doesn’t actually equal 0 at that point. If the order is 1, the function crosses through 0 normally. If the order is 2, the function touches 0 and bounces back (e.g. y = x2 when x = 0). If the order is 3 or more, the function hugs zero closely before leaving.

    If the function L(E, s) has a zero of order r at s = 1, it means:

    • L(1) = 0
    • L‘(1) = 0 (the first derivative is also zero)
    • L”(1) = 0 (the second derivative is also zero)
    • … continuing through the (r-1)th derivative
    • But Lr(1) ≠ 0 (the r-th derivative is not zero)

    The conjecture states that this order r equals the rank of the elliptic curve.

    So if the L-function has a zero of order 2 at s = 1, the curve should have rank 2 — meaning it has infinitely many rational points that can be generated from 2 independent base points (like P and Q earlier).

    While the rank is generally the most interesting part of the conjecture, the full version goes further to provide an exact formula for how the function behaves when s = 1.

    Here’s the conjecture in mathematical terms:

    L(r)(E,1)r!=ΩEReg(E)#Ш(E)cp(#Etor)2\frac{L^{(r)}(E, 1)}{r!} = \frac{\Omega_E \cdot \text{Reg}(E) \cdot \#\text{Ш}(E) \cdot \prod c_p}{(\#E_{\text{tor}})^2}

    The terms on the right side represent different properties of the curve:

    Reg(E)\text{Reg}(E)

    — called the regulator, it measures how spread out the rational points are

    #Ш(E)\#\text{Ш}(E)

    — the Shafarevich-Tate group, which measures how much the curve ‘cheats’ by having solutions that look real but aren’t (this is a very hard part to calculate)

    ΩE and cp\Omega_E \text{ and } c_p

    — factors related to the shape and size of the curve.

    In effect, the right side of the conjecture is about analysis because it’s concerned with the analytic property of the L-function at s = 1.

    The left side of the conjecture is about algebra and geometry because it depicts the rank of the elliptic curve.

    Mathematically, these are such different types of objects that proving they’re always equal is extraordinarily difficult.

    There’s currently no algorithm that’s guaranteed to find the rank of an arbitrary elliptic curve.

    Mathematicians can find some rational points and make educated guesses but proving “that’s all of the points” or that “these points will generate all the rest” is very difficult.

    The L-function is defined as an infinite product over all prime numbers.

    Proving that it even converges to a particular value or that it behaves in a predictable way requires some heavy-duty mathematics.

    While mathematicians know that counting the number of solutions an elliptic curve equation has modulo p can determine the structure of rational solutions, they don’t know why.

    This is called the local to global principle and it’s an unsolved problem in its own right.

    Mathematicians have proven the conjecture for specific families of elliptic curves — but proving it for all possible elliptic curves requires many techniques that mathematicians don’t even possess.

    It’s like finding that the number of ways you can rearrange furniture in your house is secretly determined by the prime factorisation of your door number. You could check millions of houses and see the pattern holds, but why would such different things be related?

    And how do you prove that this must always be true?

    This is why the Birch and Swinnerton-Dyer conjecture remains unsolved.

    Bryan Birch (left) and Peter Swinnerton-Dyer. Credit: William Stein and Renate Schmid

    Elliptic curves are a backbone of modern security. They’re used to secure websites, cryptocurrency transactions, app-based messaging, and so forth.

    Remember that ‘adding’ two rational points P and Q could lead you to a third rational point R? Elliptic curve cryptography exploits this fact.

    Choose a public elliptic curve, i.e. an elliptic curve whose equation is public, and a point G on it.

    Pick a random secret number k — your private key.

    Compute k.G, i.e. add G to itself k number of times. Let’s call the result Q. This is your public key.

    As with all cryptography, you can share the public key (Q) but you must protect the private key (k).

    Given G and Q, the task of finding k is called the elliptic curve discrete logarithm problem.

    Even extremely powerful computers struggle to crack it. There’s no known efficient algorithm to solve it.

    This is why understanding the distribution of rational points on elliptic curves is the foundation of how we’re keeping secrets in the digital age.

    The same difficulty that makes the conjecture so hard to solve is what makes elliptic curve cryptography secure.

    Mathematicians have proven the conjecture for when the rank is 0 or 1 and only for certain curves. For rank 2 or higher and for all curves, the Birch and Swinnerton-Dyer conjecture remains one of the greatest unsolved problems in mathematics.

  • Appa Rao Podile made fellow of science academy that published his problem paper – some questions

    Appa Rao Podile, the former vice-chancellor of the University of Hyderabad, has been elected a fellow of the Indian National Science Academy (INSA) in spite of one of his three papers – which The Wire had identified in April 2016 as containing evidence of plagiarism – having been published by the academy. According to the citation, he “has made important contributions in the field of plant-microbe interactions. His work on chitinases has enabled the development of alternatives to toxic antifungal compounds for plant protection.”

    INSA is one of India’s three science academies. The other two are the National Academy of Sciences and the Indian Academy of Sciences. Between them, they’ve formally divvied up an agenda of three portfolios. The National Academy of Sciences handles women in science; the Indian Academy of Sciences handles science education. And INSA, ironically, handles ethics.

    The paper Appa Rao had coauthored (and for which he also the lead author) and published by the journal Proceedings of the INSA in 2014 was titled ‘Root Colonisation and Quorum Sensing are the Driving Forces of Plant Growth Promoting Rhizobacteria (PGPR) for Growth Promotion’. It contained six instances of plagiarism – the most among the three papers. After The Wire had reported on the offence, Appa Rao assumed complete responsibility and apologised for his mistakes. Proceedings of the INSA also issued a clarification accompanying the paper.

    Two scientists I spoke to said on condition of anonymity that Appa Rao Podile’s election only damaged the credibility of the academy. Om Prasad, a history student at JNU, added, “He cannot be a role model for any aspiring researcher in the sciences or in academia in general” for having handled the Rohith Vemula suicide and protests the way he did (almost completely devoid of dignity) and for his plagiarism in various papers.

    This is an issue I’d explored in January this year, when Appa Rao had been awarded the ‘Millennium Plaque of Honour’ by the Indian Science Congress (ISC). The plaque is awarded every year by the congress’s organisers to ’eminent’ scientists. In a time when the ISC’s credibility has been flagging, and considered by many scientists to be a waste of time, it is odd that the award would be given to someone whose administrative and academic credentials are in question. I expected the INSA also would’ve had similar considerations – but no.

    I’d asked A.K. Sood (INSA president), Subhash Lakhotia (senior scientist at the academy) and Lahiri Majumdar (plant sciences editor of the Proceedings of the INSA) about these issues. In response, I got a carefully worded statement from Alok K. Moitra, the secretary of fellowships at the academy. I’ve pasted the bulk of it below; only one paragraph has been left out because it discussed a set of emails exchanged between INSA members and me last year.

    The question of plagiarism in an article published by him and his colleagues in one of the issues of the Proceedings of the INSA was thoroughly examined by the editorial office of the journal immediately following the allegation made by you in April 2016. The examination revealed that although there were instance of similarities in five-six isolated sentences with some earlier publications, none of them would qualify for typical plagiarism since these did not pertain to someone else’s data. These were general statements, some of which may not need any specific citation as such. Being general in nature, they are also likely to share variable strings of words. Nevertheless, the authors did publish a note of apology in a later issue of our journal for inadvertent identity/similarity of a few isolated sentences in the published paper with those in some other papers.

    The INSA Council while discussing the election of Professor Appa Rao Podile to fellow of INSA considered this allegation and decided that the allegation of plagiarism was without merit. His election to the Fellowship of INSA is based on his scholastic research contributions.

    §

    Based on these facts, I have a few questions. But before that, a short note (just in case for some idiotic readers who comment on a story without reading it first): I’m not saying at all that we forgive Appa Rao Podile for the way he dealt with the students and faculty at the University of Hyderabad campus (under political pressure to boot) as well as for the way he conducted himself when a police inquiry was initiated against him.

    1. Appa Rao admitted to his mistake and issued a correction and an apology (subsequently publicised by the journal). His misconduct wasn’t in the experiment but in the descriptive part of the paper. Prasad argued that none of this exculpates him – but this is quite in opposition to what former UGC chairman Praveen Chaddah had written in 2014: that entire papers shouldn’t be retracted or dumped when misconduct like plagiarism is confined to the paper’s descriptive parts and doesn’t spillover into the data or experiment itself. I don’t know where I myself stand, but I think there’s some introspection to be done here about whether we’re being too strict apropos Appa Rao’s plagiarism infraction because of his role in the University of Hyderabad protests, violence, etc.

    2. An obvious follow-up question arises: when we’re felicitating a scientist for his scientific accomplishments and electing him as a fellow of a reputed science academy, are we allowed to pull up the academy for not having considered his non-scientific work as well? (I realise this is a loaded question because it suggests that I’m not going to be happy with the academy until it recants its fellowship offer, but no – I’m actually curious.)

    3. Are we paying attention to the academy itself only because it has elected a controversial fellow? I know my answer is ‘yes’. India has three science academies and they rarely ever feature in public conversations about science in India, so it feels somewhat embarrassing to suddenly consider the INSA to be important. And part two: do we expect all the fellows at India’s science academies to be role models? If we’re going after Appa Rao now because he’s not been a model citizen, shouldn’t we be asking such questions of all the fellows of the three academies?

    4. Should our consternation at Appa Rao’s election be directed towards Appa Rao or towards INSA? Common sense would dictate that we divert our scrutiny towards INSA. And we immediately realise that as much as Appa Rao had erred in plagiarising in his paper, INSA had also erred in publishing the document without checking it for plagiarism first. We find further that the INSA guidelines for the election of new fellows is insipid, making no room to consider the possibility that some scientists may be great with the science but jerks at other things. There are also no guidelines for what actions it would take against a fellow should he be implicated for some offence in the future (and gradations therein). What happens when the fellow of a science academy commits murder? (Can you imagine anyone rushing to find out what INSA/IAS/NAS is saying?)

    Update: I’d had a follow up question for Moitra, to which I received a reply late yesterday.

    Q: Apart from Appa Rao’s academic credentials, did INSA consider his administrative track record at the University of Hyderabad? Did it consider the fact that a fact-finding team (of three well-regarded academics) concluded that Appa Rao had acted unethically and in a way damaging to the reputation of the University during his term as VC? Wouldn’t Appa Rao’s election to the academy thus seem as if – as long as a scientist does good science, his other transgressions can be ignored?

    A.K.M.: In our earlier response, we did state that the election was based on scholastic achievement. Administrative failures/successes can be subjective impressions depending upon from which angle one looks at it. Election to fellowship is essentially on the basis of scientific contributions. However, only if there are established cases of wrong-doing as judged by the judiciary system of the country, the election would not be made in spite of scholastic achievements.

    Featured image: Appa Rao Podile. Credit: YouTube.

  • Assuming this universe…

    Accomplished physicists I have met or spoken with in the last four months professed little agreement over which parts of physics were set-in-stone and which parts simply largely-corroborated hypotheses. Here are some of them, with a short description of the dispute.

    1. Bosons – Could be an emergent phenomenon arising out of fermion-fermion interaction; current definition could be a local encapsulation of special fermionic properties
    2. Colour-confinement – ‘Tis held that gluons, mediators of the colour force, cannot exist in isolation nor outside the hadrons (that are composed of quarks held together by gluons); while experimental proof of the energy required to pull a quark free being much greater than the energy to pull a quark-antiquark pair out of vacuum exists, denial of confinement hasn’t yet been conclusively refuted (ref: lattice formulation of string theory)
    3. Massive gluons – A Millennium Prize problem
    4. Gravity – Again, could be an emergent phenomenon arising out of energy-corrections of hidden, underlying quantum fields
    5. Compactified extra-dimensions & string theory – There are still many who dispute the “magical” mathematical framework that string theory provides because it is a perturbative theory (i.e., background-dependent); a non-perturbative definition would make its currently divergent approximations convergent

    If you ever get the opportunity to listen to a physicist ruminate on the philosophy of nature, don’t miss it. What lay-people would daily dispute is the macro-physical implications of a quantum world; the result is the all-important subjective clarification that lets us think better. What physicists dispute is the constitution of the quantum world itself; the result is the more objective phenomenological implications for everyone everywhere. We could use both debates.