Unique Prime Factors in a uint(32) or even a uint(64)

damianmoz · September 22, 2023, 2:45pm

I am trying to port a Fortran routine to Chapel. I probably only need the answer for a uint(32). It is for the number of points in a mixed radix FFT.

To simplify the coding, it would be advantageous to know the maximum number of unique prime factors in a uint(32). With at most 26 significant bits in a uint(32)., the number is 8 if you ignore the fact that 1 is a prime factor of every number.

Where would I look for this answer?

Thanks.

mppf · September 22, 2023, 5:26pm

Hi Damian -

To my understanding, you are asking a question about number theory. If you know a number theorist, it might be time to ask them.

Anyway, I'm not a number theorist, but with some searching I have identified some material that might help you:

the number of distinct prime factors of a number is maximized when multiplying all the distinct primes up to some number (comment in https://mathoverflow.net/questions/23867/bound-on-the-number-of-prime-factors-of-logarithmically-rough-numbers )
the product of the first n distinct primes is called the primorial (note that it considers 2 the first prime number) Primorial -- from Wolfram MathWorld
so, we can compute a number n, where if we take the first n primes and multiply them, the result is just bigger than what is representable in a uint(64): log2(primorial(16)) - Wolfram|Alpha
so, as far as I know, the maximum number of distinct prime factors for a number up to 2^64 (plus some change) is 16, based on the above Wolfram Alpha query.

As I said, I'm not a number theorist, so there very well might be something wrong in this reasoning, but hopefully it gives you enough to go on.

damianmoz · September 23, 2023, 1:57am

Thanks for the tips. I do not know any number theorists.

Thanks for the advice.

I have learnt more about prime numbers in the last week than I ever really wanted to know. I learnt a lot more about prime gaps and how to compute them optimally.

My 30 year old C code to factor N did not port so well so I looked for a better algorithm. When I looked at that code with hindsight, I (thankfully) realized that my programming skills had improved over that time because I looked at that old code somewhat disapprovingly.

I have run my code to factorize every integer up to about 2^31 and it works and your formula confirms those tests of mine. That will suit the client's limit on the number N which they need to be able to factorize. But I would like the code to be totally general, especially as Chapel's default integer is 64 bits and maybe the code needs to last another 30 years. A scary thought.

The Chapel code returns a 2D array [1..n,1..2]. When I rewrote it in C, that new C code is way cleaner than what I started with. So Chapel encourages or (I would almost say) mandates a better programming style. Also, I note than CLANG 15 runs nearly twice as fast as GCC 11 for this purely integer algorithm.

Thanks again for your help.

damianmoz · September 23, 2023, 12:19pm

Your reasoning certainly seems to be on the right path. Given a number X between 1 and 2^n such that

primorial(p) < 2^n < primorial(p+1)

then testing for n in 1..14 shows that X has at most p unique prime factors. I will run the cases of 15, 16 and 17 and 18 overnight to confirm but it seems like the above holds. Note that either or both of those inequalities might be the looser less than or equals to. I might even try 19, 20, 21 and 22 but it certainly looks like your formula has the answer.

Thanks again.

Yes, I could save the unique primes in a list and then allocate the size of the array at the end to capture the unique primes and the number of times each such prime occurs in the factors of the number in question, but I want something a little more predictable than that. Also, programming it is C with an array only uses 133 lines of assembler with GCC11 or 141 lines with CLANG15. As soon as you use a linked list, it blows out the lines of assembler code and slows it down slightly. Besides, clients who long ago were Fortran programmers are most happy with an array.

Your search skills are certainly superior to mine. When I went fishing for clues, the only useful thing I learnt is that there is a relationship which the gap between primes satisfies. Not a predictable formula, but a relationship which means you do not have to go looking for a prime when is just the next odd number. Fascinating. Mind you, I think I will stick to numerical analysis. Number theory seems a bit too esoteric for me.

Now I can get onto the main game which is to port the C++ code of an FFT for an arbitrary number of points into Chapel.

damianmoz · September 26, 2023, 5:14am

Slightly more robust reporting of results to date - 3 larger numbers are running. I factorize every number up to N^2-1 and the number in the #F column is the most factors I found. I note the primorial bounds between which N^2-1 lays.

              N  #F             N^2-1
            1024 (7): P(7) <   1048575 < P(8)
            2048 (7): P(7) <   4194303 < P(8)
            4196 (8): P(8) <  17606415 < P(9)
            8192 (8): P(8) <  67108863 < P(9)
           16385 (9): P(9) < 268468224 < P(10)
           32769 (9): P(9) < 1073807360 < P(10)
           65537 (9): P(9) < 4295098368 < P(10)

Topic		Replies	Views
Allocating a nested array of arrays Users	23	252	October 2, 2023
Naming enum variable for `bigint.probab_prime_p` method Users	2	253	December 15, 2021
[design] proposed change to small int -> real/complex conversions Users	0	177	May 4, 2022
Qualified (Restricted Range) Floating Point Numbers - Long Term Users	2	142	November 16, 2022
[Design] Make unary negation of unsigned integers an error? Users	0	137	October 10, 2022

Unique Prime Factors in a uint(32) or even a uint(64)

Related Topics