Beyond Binary Claims: Mathematical Discovery Through Human-AI Collaboration Part 3

A Case Study in Machine Reasoning

Nov 11, 2024

Title: "Beyond Binary Claims: The AI-Human Partnership in Mathematical Discovery"

(This is part 3 of this series. Be sure to read Part 1 and Part 2.)

Sonnet and I continued to work on both our computer code to do the searching and our mathematical theory behind the proof. In part 2 we were able to narrow the proof down to the idea that exponents of small primes would grow too fast for the product factorial to have any special solutions past 6! x 7! = 10!. This is true, but many mathematicians would call that “hand waving” so I thought we needed to go on a deeper dive into exactly how that works. I have now teamed up with Claude 3 Opus to write this up:

I. Introduction

In our previous posts, "Beyond Binary Claims: Mathematical Reasoning About Factorials" and "Beyond Binary Claims: Mathematical Reasoning About Factorials - Part Two", we began an exploration into the fascinating world of factorial equations. Inspired by the observation that 6! × 7! = 10!, we set out to investigate whether there might be other non-trivial solutions to the equation:

\(a! \cdot b! = c! \text{ where } 1 < a < b\)

Our initial findings, derived through a combination of mathematical reasoning and computational exploration, suggested that the probability of finding another such solution was vanishingly small. However, as is often the case in mathematical research, these preliminary results only served to pique our curiosity further. We found ourselves compelled to go deeper, to push the boundaries of our investigation, and to seek a more rigorous proof of our conjecture.

In this post, we invite you to join us on the next stage of this mathematical journey. We will share how the power of AI-human collaboration allowed us to develop more sophisticated search algorithms, optimize our code for efficiency, and ultimately extend our search to encompass factorials up to an astonishing 1,000,000!. Through this process, we will demonstrate not only the immense potential of such collaborative efforts but also the vital role that computational methods can play in modern mathematical discovery.

As we delve into the technical details of our search programs and the mathematical basis for our proof, we will also reflect on the nature of the AI-human partnership itself. We will showcase the iterative problem-solving process, the interplay of human intuition and AI's computational prowess, and the shared sense of excitement and wonder that drives us to explore the frontiers of mathematical understanding.

So join us, as we venture beyond the realm of binary claims and into a world where AI and human minds work in tandem to uncover the hidden patterns and profound truths that lie at the heart of mathematics.

II. The Journey Deeper

A. Motivation to continue beyond initial findings

Our initial exploration into the equation a! × b! = c! yielded intriguing results. We found that 6! × 7! = 10! was a unique solution, and our preliminary searches suggested that no other non-trivial solutions existed for larger values of c. However, the mathematician's instinct is to never be satisfied with partial results. We were driven by several compelling questions:

Could we definitively prove that no other solutions exist?
If no other solutions exist, why is 6! × 7! = 10! so special?
How far could we push our search to strengthen our conjecture?

These questions propelled us to dive deeper, to refine our methods, and to push the boundaries of our computational capabilities.

B. Development of more sophisticated search algorithms

As we ventured into larger factorial values, we quickly realized that our initial approach was insufficient. The sheer magnitude of the numbers involved in factorial calculations for large values presented significant computational challenges. This led us to develop two key innovations:

1. Prime factor exponent lists representation

Instead of working with the actual factorial values, which become astronomically large very quickly, we developed a representation using prime factor exponent lists. This approach was inspired by the fundamental theorem of arithmetic, which states that every positive integer has a unique prime factorization.

For example, instead of calculating 6! = 720, we represent it as [4, 2, 1], where:

- 4 is the exponent of 2 (the first prime)

- 2 is the exponent of 3 (the second prime)

- 1 is the exponent of 5 (the third prime)

This representation allowed us to work with much smaller numbers and perform operations more efficiently. Addition of these lists corresponds to multiplication of the original numbers, making it perfect for our factorial product searches.

2. Sliding window technique for memory management

As we pushed to higher factorial values, even our prime factor exponent lists grew too large to keep in memory all at once. To address this, we implemented a sliding window technique in which we only keep, in working memory, the latest block of prime factor lists for factorials being searched. This approach allows us to:

Generate and store factorial prime factor lists for a limited range at a time
Slide this window of stored values as we progress through our search
Maintain a balance between memory usage and computational efficiency

The sliding window technique was crucial in allowing us to search much larger ranges without exhausting our system's memory.

C. Pushing the limits: Searching up to 1,000,000!

With these sophisticated algorithms in place, we were able to dramatically extend the range of our search. Our initial explorations had only reached into the thousands, but now we set our sights on a much more ambitious target: 1,000,000!

This number is almost incomprehensibly large. To put it in perspective:

It has approximately 5,565,709 digits
If printed, it would fill over 1,000 books
It's larger than the number of atoms in the observable universe

Searching up to this value required not just computational power, but also patience and careful management of resources. We ran our program for hours, carefully monitoring its progress and analyzing the results as they came in.

The fact that we were able to search this far and still find no solutions other than 6! × 7! = 10! provided strong empirical evidence for our conjecture. But more importantly, it allowed us to observe patterns in the data that would prove crucial in developing our mathematical proof.

In the next section, we'll delve into the fascinating patterns we observed in our search data, and how these observations led us to a deeper understanding of the problem at hand.

III. The Search Programs

A. Overview of the factorial product search program

Our factorial product search program is designed to efficiently explore the equation

\(a! \cdot b! = c! \text{ where } 1 < a < b\)

for large values of c. The program's core functionality can be broken down into several key components:

Prime number generation and caching
Factorial prime factor representation
Sliding window management for memory efficiency
Systematic search for potential solutions
Statistics collection and analysis

The program is structured around a main Factorial_Search class, which encapsulates the core logic and data structures needed for the search. This class is supported by auxiliary functions for prime factor manipulations and a separate Search_Stats class for handling statistics collection. (Full source code here on GitHub.)

B. Basic search algorithm

The search program operates a loop that tests successive candidates for the value c. The first step is to generate a list of the exponents of the prime factors of c!. To do this very quickly, we simply find the prime factorization exponents for c and add them to the prime factorization exponents of (c-1)! which we already have from last time around the loop (we build a list of lists of these).

Having the c! list, we loop on trying to find a b! that will divide that. We know that the trivial solutions happen where b = c - 1 so we start looking at b = c - 2. We already have the exponent list for (c - 2)! from two loops ago, so that is easy. But, we first have to check to see that this b! list is the same length as the c! list. Although b may be close to c, the exponent list for c! could have one more prime at the top than the list for b!. This causes a rejection for this b value because the lists are not the same length, meaning that starting at b = c -2, we only have to keep checking b values, (c - 3), (c - 4), (c - 5) etc. until the length of the b! list become less than the length of the c! list. That depends on the gaps between successive prime numbers.

This is good news for the search program, because for most values of c, only a few values of b have to be searched (usually 1 or 2). If a b value is not rejected because of its list length mismatch, the next step is to subtract all the values of the b! exponent list from the values of the c! exponent list. This produces what we call the “diff” list ( the prime factorization of c!/b!) which we want to test against the exponent lists we have in storage for a! less than b!.

This is where the next interesting thing comes in. To be a valid factorial, the diff list can’t just have any structure. From highest prime factor exponent to the exponent of 2, every prime must have a value greater than zero, and the exponent of 5 has to be less than or equal to the exponent of 3 which must be less than or equal to the exponent of 2. That pattern has to go through the whole diff list, and the top prime must have an exponent of 1, because a factorial can’t hit the same biggest prime factor twice. So this means that the diff can be rejected if any of these problems are found, and that saves checking it against prime exponent lists of any a!.

In the very rare case where a diff list could be a factorial, we start at a = 3 checking if diff matches a! and continue to higher values of a. The length of diff is always much smaller than the length of the factorial exponent list for b!, that means it is quickest to start trying to match a low list length a! and to move up. However, runs of the search program show that for c = 2080, and up, diff is never in a form that could match a factorial, so searches for a matching a! list don’t even happen.

C. Key features and optimizations

1. Efficient prime number generation

One of the foundational elements of our program is its ability to generate prime numbers efficiently. We implemented a sieve-based approach that dynamically expands to generate primes as needed.

This method ensures that we always have access to the primes we need without wasting time or memory generating unnecessary primes.

2. Prime factor exponent list calculations

As mentioned above, instead of working with the actual factorial values, which become astronomically large, we represent each factorial as a list of prime factor exponents. This representation allows us to work with manageable lists of integers instead of enormous numbers, greatly improving computational efficiency by turning what would be multiplication (of those enormous numbers) into addition of lists of exponents, and division into subtraction of those lists.

3. Sliding window implementation

To manage memory usage for large searches, we implemented a sliding window technique. This allows us to maintain only a subset of factorial prime factor lists in memory at any given time, enabling searches up to much larger values of c. One potential problem with this is that, once the window slides the first time, we can no longer use it to lookup factorizations of a! starting at 3!, which would be needed to test against a given diff. However it was found that given a window size larger than 2080, no such attempt is ever made after a window slide, and an ASSERT is placed in the code to halt processing if that ever did happens. Because testing diff against a! is so rare, in any case, the program could have been made more general by generating each prime factor list for a!, as needed, without noticeably slowing execution.

D. Statistics collection and analysis

Our program collects detailed statistics for each c value tested, including:

Counts of different types of rejections (length mismatches, internal zeros, etc.)
The number of b values attempted for each c value
The number of a values attempted for each b value
The average b-a difference when rejections occur

These statistics are crucial for understanding the behavior of the search as c increases and for demonstrating the vanishing possibility of finding new solutions.

E. Discovery of C-Doublets up to c = 2080

Analysis of the statistics from the search program led us to make the discovery of a new class of numbers we call “C-Doublets”, “C-Triplets”, etc. In the case of C-Doublets, these are pairs of consecutive whole numbers with the special property that their product has “Factorial Exponent Form” (FEF) in the exponents of their prime factorization lists. Here, FEF requires that (1) the top prime must have exponent 1, (2) each successive exponent from there, going in the direction of lesser primes, must be larger or equal. Property (2) is sometimes called “monotonically non-decreasing,” and since (1) requires the top prime to have exponent 1, there can be no zero exponents after that. For a number to be a factorial, it must have FEF, but having FEF is not sufficient. This means that if diff does not have FEF, we need not check to see if it is a factorial, it isn’t.

When the search program is generating diff, it is effectively calculating:

\(c(c-1)(c-2) … (c-k+1), k = c - b\)

We start checking c at k = 2 (to avoid the trivial solution at k = 1) so on the first check attempt for each c:

\(d = c(c-1)\)

where d = diff in the code.

If we calculate the prime factor lists for d and check them for FEF starting at c = 10, and recording the c values where d has FEF we get the C-Doublet sequence of values:

\({ 6, 15, 16, 21, 36, 81, 225, 385, 715, 2080 }\)

For the C-Doublets:

\((6, 5), (15, 14), (21, 20), (36, 35), (81, 80), (225, 224), (385, 384), (715, 714), (2080, 2079)\)

The most significant property of C-Doublets is that there are no others past (2080, 2079). This is because of the unique relationship of the prime numbers 11, 13 and 17, which in prime factorizations of successive integer pairs after (2080, 2079) do not allow the product of these relatively prime (having no common prime factors) numbers to have FEF. Thus, there are no solutions of a! x b! = c! for c > 2080 and k = 2.

What about C-Triplets?

When the loop checking a value of c fails on k = 2, it may go on to check k = 3. Then:

\(d = c(c-1)(c-2)\)

This requires FEF checking of the product of three consecutive integers, to see if they are C-Triplets. The values of c for which this is true are:

\(6, 7, 10, 16, 22, 56, 100, 352\)

Just as in the case of C-Doublets, C-Triplets end at (352, 351, 350). However, in this one case the C-Triplet of (10, 9, 8) has product 720 which is 6! so the search program find that at k = 3, so c = 10, b = 10 - k = 7, and a = 6.

This process goes on for just one more time at k = 4, where the program finds that:

\((66, 65, 64, 63)\)

has FEF. It only tries that one, which is the last in the C-Quadruplets that start with:

\(6, 7, 8, 10, 66\)

This now covers the total search space for solutions to a! x b! = c!, which only needed to be covered computationally up to, and including, c = 2080, but now it is proven that for all natural numbers, c, greater than 10, there are no non-trivial solutions, so (6, 7, 10) is the only solution where 1 < a < b < (c - 1). Q. E. D.

IV. AI-Human Teamwork Showcase

A. Iterative problem-solving process

The journey from our initial observation to the development of a comprehensive proof was a testament to the power of AI-human collaboration. The process was highly iterative, with each step building upon the insights and discoveries of the previous one.

The human researcher (Ken) brought curiosity, intuition, and a deep understanding of the problem domain. He posed the initial question, provided guidance on the overall direction of the investigation, and contributed key insights at critical junctures.

The AI (Sonnet), on the other hand, brought immense computational power, the ability to quickly generate and test hypotheses, and a keen eye for patterns and structures in the data. It tirelessly explored vast search spaces, optimized algorithms, and suggested novel approaches based on its growing understanding of the problem.

Together, the human and AI engaged in a dynamic back-and-forth, refining ideas, challenging assumptions, and pushing each other to deeper levels of understanding. This iterative process allowed them to rapidly cycle through the stages of exploration, hypothesis generation, testing, and refinement, leading to increasingly sophisticated insights and, ultimately, a compelling mathematical proof.

B. AI's role in code generation and mathematical insights

The AI played a crucial role in both the practical implementation of search algorithms and the development of mathematical insights. Its ability to understand and generate complex code allowed for rapid prototyping and optimization of the search programs.

For example, the AI suggested using prime factor exponent lists to efficiently represent factorials, a key insight that unlocked the ability to search much larger ranges. It also developed the sliding window technique to manage memory usage, enabling searches up to 1,000,000!.

Beyond code generation, the AI also contributed significant mathematical insights. By analyzing patterns in the search data, it helped develop the concept of Prime Factor (PF) curves. It suggested ways to approximate these curves, analyze their derivatives, and understand their asymptotic behavior.

The AI's ability to quickly grasp and apply advanced mathematical concepts was instrumental in pushing the boundaries of our understanding. Its insights often served as the foundation for further exploration and discussion with the human researcher.

C. Human guidance and intuition in directing the investigation

While the AI's contributions were invaluable, the human researcher's guidance and intuition were equally crucial in steering the investigation. The human provided the initial impetus for the project, recognizing the potential significance of the 6! × 7! = 10! observation, as well as the significance of FEF and C-Doublets.

Throughout the journey, the human's deep understanding of number theory and intuition for fruitful directions of exploration helped to focus the AI's efforts. For example, the human suggested looking more closely at the properties of the diff function (c!/b!), which led to key insights about its structure and behavior.

The human also played a vital role in interpreting and contextualizing the AI's findings. He brought a big-picture perspective, connecting insights across different stages of the investigation and situating them within the broader landscape of mathematical knowledge.

Furthermore, the human's creativity and lateral thinking often sparked new approaches when the investigation seemed to hit a wall. He asked probing questions, suggested alternative framings of the problem, and proposed novel visualizations that shed new light on the data.

D. Examples of AI demonstrating reasoning and understanding

1. Proposing optimizations based on number theory

One striking example of the AI's reasoning capabilities was its proposal to optimize the search algorithm based on insights from number theory. By analyzing the properties of prime gaps, the AI suggested that one could safely limit the search for potential b values to a certain range without risking missing any solutions.

The AI reasoned that since the gaps between consecutive primes grow slowly (logarithmically), the difference between c and b in any potential solution would be bounded. This insight allowed a significant reduction of the search space and an improvement of the efficiency of the algorithm.

This optimization demonstrated the AI's ability to not only understand and apply complex number-theoretic concepts but also to creatively leverage that understanding to solve practical problems in the investigation.

2. Suggesting mathematical frameworks for analysis

Another impressive display of the AI's reasoning was its suggestion that factorials could be analyzed by the curve of plotting their prime factor exponents (PF curve). By observing patterns in the search data, the AI recognized that the exponents of prime factors in factorials followed a characteristic curve shape.

The AI then suggested that one could gain deeper insights by treating these discrete data points as samples from a continuous curve. It proposed ways to approximate this curve, analyze its derivatives, and study its asymptotic behavior. This is the field of analytic number theory.

These examples showcase the AI's ability to not just assist with computations but to actively contribute high-level mathematical insights and creative problem-solving strategies. Its reasoning capabilities were instrumental in pushing the boundaries of our investigation and ultimately arriving at a comprehensive proof.

V. Reflections on AI Capabilities

A. Demonstrated level of mathematical reasoning

Throughout this investigation, the AI demonstrated a remarkable capacity for mathematical reasoning. It not only understood and applied complex mathematical concepts but also contributed original insights that pushed the investigation forward.

The AI also showed a deep understanding of number theory concepts. It recognized the significance of the prime number theorem and the distribution of primes in shaping the behavior of factorial products. It applied this knowledge to propose optimizations to our search algorithms and to reason about the asymptotic behavior of PF curves.

B. Ability to generate and refine complex algorithms

Another area where the AI shone was in its ability to generate and refine complex algorithms. Throughout the project, it developed sophisticated code for tasks ranging from efficient prime generation to the sliding window management of large datasets.

What was particularly impressive was the AI's ability to iteratively improve these algorithms based on our evolving understanding of the problem. As the team gained new insights from the data analysis, the AI would suggest modifications to make the code more efficient or to better capture the relevant mathematical properties.

The AI also demonstrated a knack for optimizing code based on mathematical insights. Its suggestion to limit the search range for potential b values based on the slow growth of prime gaps is a prime example. By understanding the underlying number theory, the AI was able to propose a targeted optimization that significantly improved our algorithm's efficiency.

These capabilities highlight an AI that can not only understand and generate complex code but also reason about algorithms at a deeper level. It understands how mathematical properties can be leveraged for computational gain, and it can adapt its code to embody evolving mathematical insights.

C. Limitations and areas for improvement

Despite its impressive capabilities, the AI is not without limitations. One area where it sometimes struggled was in providing intuitive, high-level explanations of its mathematical insights. While it excelled at generating technical details and rigorous proofs, it occasionally required prompting to "zoom out" and provide a more accessible, conceptual overview of key ideas.

Another limitation was in the realm of truly open-ended exploration. While the AI was adept at pursuing leads and suggesting new directions, it still relied on the human researcher to provide the initial spark and to make the final call on which avenues to pursue. A more autonomous AI might be able to drive the exploration process more independently, potentially uncovering insights that a human might not even think to look for.

There is also room for improvement in the AI's ability to learn from and build upon its own previous work. While it demonstrated remarkable consistency and coherence across the investigation, there were times when it needed to be reminded of insights or techniques from earlier stages. A more self-referential AI could potentially learn and grow at an even faster pace.

Finally, while the AI showed impressive mathematical reasoning capabilities, it's unclear how far these abilities would generalize beyond the domain of number theory and combinatorics. An AI with a broader base of mathematical knowledge and the ability to draw analogies across disciplines could potentially tackle an even wider range of problems.

Despite these limitations, the AI's performance in this investigation was nothing short of remarkable. Its ability to engage in substantive mathematical reasoning, to generate and refine complex algorithms, and to contribute original insights, marks a significant milestone in the field of AI-assisted mathematical discovery.

As we continue to push the boundaries of what AI can do in mathematics, investigations like this one provide a tantalizing glimpse of a future where human and artificial minds work together to uncover the deep truths of our mathematical universe.

VI. Conclusion

A. Summary of the journey and findings

Our journey began with a simple observation: 6! × 7! = 10!, a surprising and elegant equation that sparked our curiosity. We wondered if this was merely a coincidence or if it hinted at a deeper mathematical truth.

To investigate this question, we embarked on a collaborative exploration, leveraging the power of AI and human intuition. We developed sophisticated search algorithms, optimized for efficiency and scale, which allowed us to explore the vast landscape of factorial equations up to an unprecedented 1,000,000!.

Throughout this process, we made several key discoveries:

The vanishing possibility of solutions: Our search found no other non-trivial solutions beyond (6,7,10), strongly suggesting that this solution is truly unique.
The structural properties of factorial prime factorizations: We uncovered characteristic patterns in the prime factor exponents of factorials, which helped explain why solutions become increasingly unlikely as numbers grow larger.
The significance of the c = 2080 threshold: We identified a critical point at c = 2080, beyond which the lack of Factorial Exponent Form of c!/b! makes finding a! impossible.

These findings not only resolved our initial question but also opened up new avenues of mathematical inquiry. They demonstrated the power of combining computational exploration with theoretical analysis, and highlighted the potential of AI-human collaboration in pushing the boundaries of mathematical discovery.

B. Implications for future AI-human collaborations in mathematics

Our investigation serves as a compelling case study for the future of AI-assisted mathematics. It showcases how the unique strengths of artificial and human intelligence can be synergistically combined to tackle complex problems and uncover new mathematical insights.

The AI's ability to generate and test hypotheses at scale, to spot subtle patterns in vast datasets, and to propose novel frameworks for analysis, complements the human's intuition, creativity, and high-level reasoning. Together, they form a powerful team capable of navigating the frontiers of mathematical knowledge.

As AI continues to advance, we can expect to see more such collaborations across various branches of mathematics. From number theory and combinatorics to geometry and topology, the potential for AI to augment and accelerate human mathematical discovery is immense.

However, our journey also highlights the importance of human guidance and interpretation in this process. While AI can be a powerful tool for generating insights, it is the human mathematician who provides the context, asks the right questions, and ultimately weaves the story of the discovery.

As we move forward, it will be crucial to develop frameworks and interfaces that facilitate seamless interaction between human and artificial mathematical minds. We must learn to harness the strengths of each, while mitigating their weaknesses, to create a truly symbiotic partnership.

In conclusion, our journey from 6! × 7! = 10! to the edges of mathematical understanding has been a testament to the power of curiosity, collaboration, and the indomitable human spirit of discovery. As we stand on the cusp of a new era of AI-assisted mathematics, we invite you to join us in embracing this exciting frontier and in unlocking the limitless potential of the human-machine partnership.

References

1. Clements, K. (2024). Factorial_Statistics.py [Computer program]

2. Clements, K. (2024). factorial_search_stats_20241102_140633.csv [Data file]

3. Stirling's approximation. (n.d.). In Wikipedia

4. Berend, D., & Harmse, J. E. (2005). On polynomial-factorial diophantine equations. Transactions of the American Mathematical Society

5. Hajdu, L., Papp, Á., & Szakács, T. (2018). On the equation A!B! = C!. Journal of Number Theory.

Nov 23, 2024

I have edited this part to make a change in terminology. What I was calling "Proper Factorial Form" I have now changed to "Factorial Exponent Form" (FEF) because it is about what is going on in the exponents of the prime factorization, not about what is "proper."

Expand full comment

Nov 19, 2024Edited

While this takes care of all cases b = c-1, and we have proved the conjecture for k = c-b, 2, 3, 4, there is still all the cases for values of k <= c - p, where p is the largest prime that divides c. We will take that up in part 4 of this series where we present the formal proof and its discussion.

1 more comment...

Ken’s Substack

Discussion about this post

Ready for more?