Easy interview question got harder: given numbers 1..100, find the missing number(s) given exactly k are missing

Question

Here's a summary of Dimitris Andreou's link.

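Remember the sum of the i-th powers of the given numbers, for i = 1, 2, ..., k. Subtracting each from the corresponding sum over all of 1..n gives the power sums b_i of the missing numbers a₁, ..., a_k. This reduces the problem to solving the system of equations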

a₁ + a₂ + ... + a_k = b₁

a₁² + a₂² + ... + a_k² = b₂

...

a₁^k + a₂^k + ... + a_k^k = b_k
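As a minimal sketch of how the right-hand sides b_i could be obtained, the function below subtracts the power sums of the numbers that are present from the power sums of the full range 1..n. The name `missing_power_sums` and its parameters are made up for this illustration.

```python
def missing_power_sums(present, n, k):
    """Return [b_1, ..., b_k], the power sums of the k numbers missing from 1..n."""
    sums = []
    for i in range(1, k + 1):
        full = sum(j ** i for j in range(1, n + 1))  # power sum over all of 1..n
        seen = sum(x ** i for x in present)          # power sum over the numbers we were given
        sums.append(full - seen)
    return sums


present = [x for x in range(1, 101) if x not in (7, 42)]
print(missing_power_sums(present, 100, 2))  # [49, 1813]
```

Here 49 = 7 + 42 and 1813 = 7² + 42², so the two sums depend only on the missing numbers.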

Using Newton's identities, knowing the b_i allows us to compute

c₁ = a₁ + a₂ + ... + a_k

c₂ = a₁a₂ + a₁a₃ + ... + a_{k-1}a_k

...

c_k = a₁a₂ ... a_k
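The step from the b_i to the c_i can be sketched as follows, using Newton's identities in the form m·c_m = c_{m-1}b₁ - c_{m-2}b₂ + ... ± c₀b_m with c₀ = 1. The function name is hypothetical, and the sketch works over plain integers, where the division by m is always exact.

```python
def elementary_from_power_sums(b):
    """Given b = [b_1, ..., b_k], return [c_1, ..., c_k] via Newton's identities."""
    k = len(b)
    c = [1]  # c_0 = 1 by convention
    for m in range(1, k + 1):
        acc = 0
        for i in range(1, m + 1):
            sign = 1 if i % 2 == 1 else -1   # signs alternate +, -, +, ...
            acc += sign * c[m - i] * b[i - 1]
        c.append(acc // m)  # exact over the integers
    return c[1:]


print(elementary_from_power_sums([49, 1813]))     # [49, 294]
print(elementary_from_power_sums([10, 38, 160]))  # [10, 31, 30]
```

The first call corresponds to the missing numbers 7 and 42 (7·42 = 294), the second to 2, 3 and 5.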

If you expand the polynomial (x-a₁)...(x-a_k), the coefficients will be exactly c₁, ..., c_k up to alternating signs (see Viète's formulas). Since every polynomial factors uniquely (the ring of polynomials over a field is a Euclidean domain), this means the a_i are uniquely determined, up to permutation.
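To see the relation between the roots and the coefficients concretely, here is a small sketch that expands (x-a₁)...(x-a_k) by repeated multiplication. The helper name `expand_monic` is invented for this example.

```python
def expand_monic(roots):
    """Coefficients of the product of (x - r) over all roots, highest degree first."""
    coeffs = [1]
    for r in roots:
        # Multiply the current polynomial by (x - r).
        nxt = coeffs + [0]
        for i in range(len(coeffs)):
            nxt[i + 1] -= r * coeffs[i]
        coeffs = nxt
    return coeffs


print(expand_monic([2, 3, 5]))  # [1, -10, 31, -30]
```

For the roots 2, 3, 5 we have c₁ = 10, c₂ = 31, c₃ = 30, and the expansion gives 1, -c₁, +c₂, -c₃.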

This completes the proof that remembering the power sums is enough to recover the numbers. For constant k, this is a good approach.

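However, when k is varying, the direct approach of computing c₁, ..., c_k is prohibitively expensive, since e.g. c_k is the product of all missing numbers, with magnitude up to n!/(n-k)!. To overcome this, perform the computations in the field Z_q, where q is a prime such that n <= q < 2n; it exists by Bertrand's postulate. The proof doesn't need to be changed, since the formulas still hold and factorization of polynomials is still unique. You also need an algorithm for factorization over finite fields, for example the one by Berlekamp or Cantor-Zassenhaus.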
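A prime in the range n <= q < 2n can be found by simply scanning the range. The sketch below uses plain trial division as a stand-in for a faster primality test, and the function name `bertrand_prime` is an assumption of this example. It requires n >= 2.

```python
def bertrand_prime(n):
    """Smallest prime q with n <= q < 2n (requires n >= 2)."""
    def is_prime(m):
        if m < 2:
            return False
        d = 2
        while d * d <= m:  # trial division up to sqrt(m)
            if m % d == 0:
                return False
            d += 1
        return True

    for q in range(n, 2 * n):
        if is_prime(q):
            return q
    raise ValueError("no prime in [n, 2n); n must be at least 2")


print(bertrand_prime(100))  # 101
```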

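High-level pseudocode for constant k:

1. Compute the sums of the i-th powers of the given numbers, for i = 1, ..., k.
2. Subtract to get the sums of the i-th powers of the unknown numbers. Call the sums b_i.
3. Use Newton's identities to compute the coefficients from the b_i; call them c_i. Basically, c₁ = b₁ and c₂ = (c₁b₁ - b₂)/2; see Wikipedia for the exact formulas.
4. Factor the polynomial x^k - c₁x^(k-1) + c₂x^(k-2) - ... + (-1)^k c_k.
5. The roots of the polynomial are the needed numbers a₁, ..., a_k.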
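The steps above can be sketched end to end for small constant k. This is an illustration under stated assumptions, not a tuned implementation: the name `find_missing` is made up, arithmetic is done over plain integers, and instead of a real factoring algorithm the roots are found by evaluating the polynomial at every candidate 1..n.

```python
def find_missing(present, n, k):
    """Return the k numbers from 1..n that do not occur in `present`."""
    # Step 1: one pass over the input, keeping k running power sums.
    seen = [0] * k
    for x in present:
        p = 1
        for i in range(k):
            p *= x
            seen[i] += p
    # Step 2: subtract from the power sums of the full range 1..n.
    b = [sum(j ** (i + 1) for j in range(1, n + 1)) - seen[i] for i in range(k)]
    # Step 3: Newton's identities, with c[0] = 1.
    c = [1]
    for m in range(1, k + 1):
        acc = sum((-1) ** (i - 1) * c[m - i] * b[i - 1] for i in range(1, m + 1))
        c.append(acc // m)  # exact over the integers

    # Steps 4-5: find roots of x^k - c1 x^(k-1) + ... + (-1)^k c_k by brute force.
    def poly(x):
        value = 0
        for i, ci in enumerate(c):  # Horner's rule, highest degree first
            value = value * x + (-1) ** i * ci
        return value

    return [x for x in range(1, n + 1) if poly(x) == 0]


present = [x for x in range(1, 101) if x not in (7, 42, 99)]
print(find_missing(present, 100, 3))  # [7, 42, 99]
```

The input is read only once and only k accumulators are kept, which is the point of the power-sum approach.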

For varying k, find a prime n <= q < 2n using e.g. Miller-Rabin, and perform the steps with all numbers reduced modulo q.
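A sketch of the modular variant follows. Everything here is illustrative: the function names are invented, the Miller-Rabin test uses a fixed set of bases (which makes it deterministic for 64-bit inputs), division by m is done with the modular inverse `pow(m, -1, q)` (Python 3.8+), and the roots are again found by trying every candidate 1..n rather than by Berlekamp or Cantor-Zassenhaus. It assumes n >= 2 and k < q.

```python
def is_prime_mr(q, bases=(2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37)):
    """Miller-Rabin primality test with fixed bases."""
    if q < 2:
        return False
    for p in bases:
        if q % p == 0:
            return q == p
    d, s = q - 1, 0
    while d % 2 == 0:
        d //= 2
        s += 1
    for a in bases:
        x = pow(a, d, q)
        if x in (1, q - 1):
            continue
        for _ in range(s - 1):
            x = x * x % q
            if x == q - 1:
                break
        else:
            return False  # a is a witness that q is composite
    return True


def find_missing_mod(present, n, k):
    """Like the integer version, but with all arithmetic reduced modulo a prime q."""
    q = next(p for p in range(n, 2 * n) if is_prime_mr(p))  # Bertrand's postulate
    assert k < q, "division by 1..k must be invertible modulo q"
    seen = [0] * k
    for x in present:
        p = 1
        for i in range(k):
            p = p * x % q
            seen[i] = (seen[i] + p) % q
    b = [(sum(pow(j, i + 1, q) for j in range(1, n + 1)) - seen[i]) % q
         for i in range(k)]
    c = [1]
    for m in range(1, k + 1):
        acc = sum((-1) ** (i - 1) * c[m - i] * b[i - 1] for i in range(1, m + 1))
        c.append(acc * pow(m, -1, q) % q)  # divide by m in Z_q

    def poly(x):
        value = 0
        for i, ci in enumerate(c):
            value = (value * x + (-1) ** i * ci) % q
        return value

    return [x for x in range(1, n + 1) if poly(x) == 0]


present = [x for x in range(1, 101) if x not in (7, 42, 99)]
print(find_missing_mod(present, 100, 3))  # [7, 42, 99]
```

All intermediate values stay below q², so the size of the c_i no longer grows with k. The numbers 1..n remain distinct modulo q because q >= n.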

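EDIT: A previous version of this answer stated that instead of Z_q, where q is prime, it is possible to use a finite field of characteristic 2 (q = 2^(log n)). This is not the case, since Newton's formulas require division by numbers up to k.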
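One way to see the problem with characteristic 2 concretely (an illustration added here, not taken from the linked material): in such a field 1 + 1 = 0, so the division by 2 in c₂ = (c₁b₁ - b₂)/2 is a division by zero, and b₂ always equals b₁², so it carries no extra information. The sketch below uses GF(2^8) with the reduction polynomial x^8 + x^4 + x^3 + x + 1 purely as an example field; the function names are hypothetical.

```python
def gf256_mul(a, b, poly=0x11B):
    """Multiply two elements of GF(2^8) = GF(2)[x] / (x^8 + x^4 + x^3 + x + 1)."""
    result = 0
    while b:
        if b & 1:
            result ^= a      # addition in characteristic 2 is XOR
        a <<= 1
        if a & 0x100:
            a ^= poly        # reduce modulo the field polynomial
        b >>= 1
    return result


def power_sums_char2(values):
    """Return (b_1, b_2) of `values`, computed in GF(2^8)."""
    b1 = b2 = 0
    for v in values:
        b1 ^= v
        b2 ^= gf256_mul(v, v)
    return b1, b2


print(power_sums_char2([1, 2]))  # (3, 5)
print(power_sums_char2([4, 7]))  # (3, 5)
```

Two different pairs of missing numbers produce the same b₁ and b₂, so they cannot be told apart from these sums.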

Answer 1