Beckmann Sample, near normal, slope_x

Percentage Accurate: 57.1% → 99.1%
Time: 12.4s
Alternatives: 16
Speedup: 11.6×

Specification

?
\[\left(\left(cosTheta\_i > 0.9999 \land cosTheta\_i \leq 1\right) \land \left(2.328306437 \cdot 10^{-10} \leq u1 \land u1 \leq 1\right)\right) \land \left(2.328306437 \cdot 10^{-10} \leq u2 \land u2 \leq 1\right)\]
\[\begin{array}{l} \\ \sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (* (sqrt (- (log (- 1.0 u1)))) (cos (* (* 2.0 PI) u2))))
float code(float cosTheta_i, float u1, float u2) {
	return sqrtf(-logf((1.0f - u1))) * cosf(((2.0f * ((float) M_PI)) * u2));
}
function code(cosTheta_i, u1, u2)
	return Float32(sqrt(Float32(-log(Float32(Float32(1.0) - u1)))) * cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)))
end
function tmp = code(cosTheta_i, u1, u2)
	tmp = sqrt(-log((single(1.0) - u1))) * cos(((single(2.0) * single(pi)) * u2));
end
\begin{array}{l}

\\
\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)
\end{array}

Sampling outcomes in binary32 precision:

Local Percentage Accuracy vs ?

The average percentage accuracy by input value. Horizontal axis shows value of an input variable; the variable is choosen in the title. Vertical axis is accuracy; higher is better. Red represent the original program, while blue represents Herbie's suggestion. These can be toggled with buttons below the plot. The line is an average while dots represent individual samples.

Accuracy vs Speed?

Herbie found 16 alternatives:

AlternativeAccuracySpeedup
The accuracy (vertical axis) and speed (horizontal axis) of each alternatives. Up and to the right is better. The red square shows the initial program, and each blue circle shows an alternative.The line shows the best available speed-accuracy tradeoffs.

Initial Program: 57.1% accurate, 1.0× speedup?

\[\begin{array}{l} \\ \sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (* (sqrt (- (log (- 1.0 u1)))) (cos (* (* 2.0 PI) u2))))
float code(float cosTheta_i, float u1, float u2) {
	return sqrtf(-logf((1.0f - u1))) * cosf(((2.0f * ((float) M_PI)) * u2));
}
function code(cosTheta_i, u1, u2)
	return Float32(sqrt(Float32(-log(Float32(Float32(1.0) - u1)))) * cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)))
end
function tmp = code(cosTheta_i, u1, u2)
	tmp = sqrt(-log((single(1.0) - u1))) * cos(((single(2.0) * single(pi)) * u2));
end
\begin{array}{l}

\\
\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)
\end{array}

Alternative 1: 99.1% accurate, 1.0× speedup?

\[\begin{array}{l} \\ \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (* (sqrt (- (log1p (- u1)))) (cos (* (* 2.0 PI) u2))))
float code(float cosTheta_i, float u1, float u2) {
	return sqrtf(-log1pf(-u1)) * cosf(((2.0f * ((float) M_PI)) * u2));
}
function code(cosTheta_i, u1, u2)
	return Float32(sqrt(Float32(-log1p(Float32(-u1)))) * cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)))
end
\begin{array}{l}

\\
\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)
\end{array}
Derivation
  1. Initial program 62.6%

    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  2. Add Preprocessing
  3. Step-by-step derivation
    1. lift-log.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    2. lift--.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    3. sub-negN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    4. lower-log1p.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    5. lower-neg.f3299.0

      \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  4. Applied rewrites99.0%

    \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  5. Add Preprocessing

Alternative 2: 97.9% accurate, 0.5× speedup?

\[\begin{array}{l} \\ \begin{array}{l} t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\ t_1 := \left(u1 \cdot u1\right) \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.25, -0.3333333333333333\right), -0.5\right)\\ \mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.20000000298023224:\\ \;\;\;\;t\_0 \cdot \sqrt{-\frac{t\_1 \cdot t\_1 - u1 \cdot u1}{u1 + t\_1}}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)\\ \end{array} \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (let* ((t_0 (cos (* (* 2.0 PI) u2)))
        (t_1 (* (* u1 u1) (fma u1 (fma u1 -0.25 -0.3333333333333333) -0.5))))
   (if (<= (* t_0 (sqrt (- (log (- 1.0 u1))))) 0.20000000298023224)
     (* t_0 (sqrt (- (/ (- (* t_1 t_1) (* u1 u1)) (+ u1 t_1)))))
     (* (sqrt (- (log1p (- u1)))) (fma -2.0 (* PI (* PI (* u2 u2))) 1.0)))))
float code(float cosTheta_i, float u1, float u2) {
	float t_0 = cosf(((2.0f * ((float) M_PI)) * u2));
	float t_1 = (u1 * u1) * fmaf(u1, fmaf(u1, -0.25f, -0.3333333333333333f), -0.5f);
	float tmp;
	if ((t_0 * sqrtf(-logf((1.0f - u1)))) <= 0.20000000298023224f) {
		tmp = t_0 * sqrtf(-(((t_1 * t_1) - (u1 * u1)) / (u1 + t_1)));
	} else {
		tmp = sqrtf(-log1pf(-u1)) * fmaf(-2.0f, (((float) M_PI) * (((float) M_PI) * (u2 * u2))), 1.0f);
	}
	return tmp;
}
function code(cosTheta_i, u1, u2)
	t_0 = cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2))
	t_1 = Float32(Float32(u1 * u1) * fma(u1, fma(u1, Float32(-0.25), Float32(-0.3333333333333333)), Float32(-0.5)))
	tmp = Float32(0.0)
	if (Float32(t_0 * sqrt(Float32(-log(Float32(Float32(1.0) - u1))))) <= Float32(0.20000000298023224))
		tmp = Float32(t_0 * sqrt(Float32(-Float32(Float32(Float32(t_1 * t_1) - Float32(u1 * u1)) / Float32(u1 + t_1)))));
	else
		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * fma(Float32(-2.0), Float32(Float32(pi) * Float32(Float32(pi) * Float32(u2 * u2))), Float32(1.0)));
	end
	return tmp
end
\begin{array}{l}

\\
\begin{array}{l}
t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\
t_1 := \left(u1 \cdot u1\right) \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.25, -0.3333333333333333\right), -0.5\right)\\
\mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.20000000298023224:\\
\;\;\;\;t\_0 \cdot \sqrt{-\frac{t\_1 \cdot t\_1 - u1 \cdot u1}{u1 + t\_1}}\\

\mathbf{else}:\\
\;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)\\


\end{array}
\end{array}
Derivation
  1. Split input into 2 regimes
  2. if (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))) < 0.200000003

    1. Initial program 55.2%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Applied rewrites88.4%

      \[\leadsto \sqrt{-\color{blue}{\left(\log \left(-\left(-1 + u1 \cdot u1\right)\right) - \mathsf{log1p}\left(u1\right)\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    4. Taylor expanded in u1 around 0

      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(u1 \cdot \left(\frac{-1}{4} \cdot u1 - \frac{1}{3}\right) - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    5. Step-by-step derivation
      1. lower-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(u1 \cdot \left(\frac{-1}{4} \cdot u1 - \frac{1}{3}\right) - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. sub-negN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\left(u1 \cdot \left(u1 \cdot \left(\frac{-1}{4} \cdot u1 - \frac{1}{3}\right) - \frac{1}{2}\right) + \left(\mathsf{neg}\left(1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. metadata-evalN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \left(u1 \cdot \left(u1 \cdot \left(\frac{-1}{4} \cdot u1 - \frac{1}{3}\right) - \frac{1}{2}\right) + \color{blue}{-1}\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. lower-fma.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\mathsf{fma}\left(u1, u1 \cdot \left(\frac{-1}{4} \cdot u1 - \frac{1}{3}\right) - \frac{1}{2}, -1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      5. sub-negN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \left(\frac{-1}{4} \cdot u1 - \frac{1}{3}\right) + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right)}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      6. metadata-evalN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, u1 \cdot \left(\frac{-1}{4} \cdot u1 - \frac{1}{3}\right) + \color{blue}{\frac{-1}{2}}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      7. lower-fma.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, \frac{-1}{4} \cdot u1 - \frac{1}{3}, \frac{-1}{2}\right)}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      8. sub-negN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \color{blue}{\frac{-1}{4} \cdot u1 + \left(\mathsf{neg}\left(\frac{1}{3}\right)\right)}, \frac{-1}{2}\right), -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      9. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{-1}{4}} + \left(\mathsf{neg}\left(\frac{1}{3}\right)\right), \frac{-1}{2}\right), -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      10. metadata-evalN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, u1 \cdot \frac{-1}{4} + \color{blue}{\frac{-1}{3}}, \frac{-1}{2}\right), -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      11. lower-fma.f3298.3

        \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, -0.25, -0.3333333333333333\right)}, -0.5\right), -1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    6. Applied rewrites98.3%

      \[\leadsto \sqrt{-\color{blue}{u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.25, -0.3333333333333333\right), -0.5\right), -1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    7. Step-by-step derivation
      1. Applied rewrites98.4%

        \[\leadsto \sqrt{-\frac{\left(\left(u1 \cdot u1\right) \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.25, -0.3333333333333333\right), -0.5\right)\right) \cdot \left(\left(u1 \cdot u1\right) \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.25, -0.3333333333333333\right), -0.5\right)\right) - u1 \cdot u1}{\color{blue}{\left(u1 \cdot u1\right) \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.25, -0.3333333333333333\right), -0.5\right) - \left(-u1\right)}}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]

      if 0.200000003 < (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)))

      1. Initial program 98.1%

        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      2. Add Preprocessing
      3. Step-by-step derivation
        1. lift-log.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        2. lift--.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        3. sub-negN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        4. lower-log1p.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        5. lower-neg.f3299.1

          \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      4. Applied rewrites99.1%

        \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      5. Taylor expanded in u2 around 0

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
      6. Step-by-step derivation
        1. +-commutativeN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
        2. lower-fma.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left(-2, {u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}, 1\right)} \]
        3. *-commutativeN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}}, 1\right) \]
        4. unpow2N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)} \cdot {u2}^{2}, 1\right) \]
        5. associate-*l*N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
        6. lower-*.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
        7. lower-PI.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right)} \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right), 1\right) \]
        8. lower-*.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
        9. lower-PI.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot {u2}^{2}\right), 1\right) \]
        10. unpow2N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot \color{blue}{\left(u2 \cdot u2\right)}\right), 1\right) \]
        11. lower-*.f3296.2

          \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \color{blue}{\left(u2 \cdot u2\right)}\right), 1\right) \]
      7. Applied rewrites96.2%

        \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)} \]
    8. Recombined 2 regimes into one program.
    9. Final simplification98.0%

      \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.20000000298023224:\\ \;\;\;\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\frac{\left(\left(u1 \cdot u1\right) \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.25, -0.3333333333333333\right), -0.5\right)\right) \cdot \left(\left(u1 \cdot u1\right) \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.25, -0.3333333333333333\right), -0.5\right)\right) - u1 \cdot u1}{u1 + \left(u1 \cdot u1\right) \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.25, -0.3333333333333333\right), -0.5\right)}}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)\\ \end{array} \]
    10. Add Preprocessing

    Alternative 3: 98.0% accurate, 0.6× speedup?

    \[\begin{array}{l} \\ \begin{array}{l} t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\ \mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.20000000298023224:\\ \;\;\;\;t\_0 \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)\\ \end{array} \end{array} \]
    (FPCore (cosTheta_i u1 u2)
     :precision binary32
     (let* ((t_0 (cos (* (* 2.0 PI) u2))))
       (if (<= (* t_0 (sqrt (- (log (- 1.0 u1))))) 0.20000000298023224)
         (*
          t_0
          (sqrt (fma (* u1 u1) (fma u1 (fma u1 0.25 0.3333333333333333) 0.5) u1)))
         (* (sqrt (- (log1p (- u1)))) (fma -2.0 (* PI (* PI (* u2 u2))) 1.0)))))
    float code(float cosTheta_i, float u1, float u2) {
    	float t_0 = cosf(((2.0f * ((float) M_PI)) * u2));
    	float tmp;
    	if ((t_0 * sqrtf(-logf((1.0f - u1)))) <= 0.20000000298023224f) {
    		tmp = t_0 * sqrtf(fmaf((u1 * u1), fmaf(u1, fmaf(u1, 0.25f, 0.3333333333333333f), 0.5f), u1));
    	} else {
    		tmp = sqrtf(-log1pf(-u1)) * fmaf(-2.0f, (((float) M_PI) * (((float) M_PI) * (u2 * u2))), 1.0f);
    	}
    	return tmp;
    }
    
    function code(cosTheta_i, u1, u2)
    	t_0 = cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2))
    	tmp = Float32(0.0)
    	if (Float32(t_0 * sqrt(Float32(-log(Float32(Float32(1.0) - u1))))) <= Float32(0.20000000298023224))
    		tmp = Float32(t_0 * sqrt(fma(Float32(u1 * u1), fma(u1, fma(u1, Float32(0.25), Float32(0.3333333333333333)), Float32(0.5)), u1)));
    	else
    		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * fma(Float32(-2.0), Float32(Float32(pi) * Float32(Float32(pi) * Float32(u2 * u2))), Float32(1.0)));
    	end
    	return tmp
    end
    
    \begin{array}{l}
    
    \\
    \begin{array}{l}
    t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\
    \mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.20000000298023224:\\
    \;\;\;\;t\_0 \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}\\
    
    \mathbf{else}:\\
    \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)\\
    
    
    \end{array}
    \end{array}
    
    Derivation
    1. Split input into 2 regimes
    2. if (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))) < 0.200000003

      1. Initial program 55.2%

        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      2. Add Preprocessing
      3. Taylor expanded in u1 around 0

        \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. Step-by-step derivation
        1. +-commutativeN/A

          \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        2. distribute-lft-inN/A

          \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        3. associate-*r*N/A

          \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)} + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        4. unpow2N/A

          \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        5. *-rgt-identityN/A

          \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        6. lower-fma.f32N/A

          \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        7. unpow2N/A

          \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        8. lower-*.f32N/A

          \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        9. +-commutativeN/A

          \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right) + \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        10. lower-fma.f32N/A

          \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, \frac{1}{3} + \frac{1}{4} \cdot u1, \frac{1}{2}\right)}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        11. +-commutativeN/A

          \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\frac{1}{4} \cdot u1 + \frac{1}{3}}, \frac{1}{2}\right), u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        12. *-commutativeN/A

          \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{4}} + \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        13. lower-fma.f3298.4

          \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right)}, 0.5\right), u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      5. Applied rewrites98.4%

        \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]

      if 0.200000003 < (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)))

      1. Initial program 98.1%

        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      2. Add Preprocessing
      3. Step-by-step derivation
        1. lift-log.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        2. lift--.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        3. sub-negN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        4. lower-log1p.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        5. lower-neg.f3299.1

          \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      4. Applied rewrites99.1%

        \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      5. Taylor expanded in u2 around 0

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
      6. Step-by-step derivation
        1. +-commutativeN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
        2. lower-fma.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left(-2, {u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}, 1\right)} \]
        3. *-commutativeN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}}, 1\right) \]
        4. unpow2N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)} \cdot {u2}^{2}, 1\right) \]
        5. associate-*l*N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
        6. lower-*.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
        7. lower-PI.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right)} \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right), 1\right) \]
        8. lower-*.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
        9. lower-PI.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot {u2}^{2}\right), 1\right) \]
        10. unpow2N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot \color{blue}{\left(u2 \cdot u2\right)}\right), 1\right) \]
        11. lower-*.f3296.2

          \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \color{blue}{\left(u2 \cdot u2\right)}\right), 1\right) \]
      7. Applied rewrites96.2%

        \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)} \]
    3. Recombined 2 regimes into one program.
    4. Final simplification98.0%

      \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.20000000298023224:\\ \;\;\;\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)\\ \end{array} \]
    5. Add Preprocessing

    Alternative 4: 97.6% accurate, 0.6× speedup?

    \[\begin{array}{l} \\ \begin{array}{l} t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\ \mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.11999999731779099:\\ \;\;\;\;t\_0 \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \end{array} \end{array} \]
    (FPCore (cosTheta_i u1 u2)
     :precision binary32
     (let* ((t_0 (cos (* (* 2.0 PI) u2))))
       (if (<= (* t_0 (sqrt (- (log (- 1.0 u1))))) 0.11999999731779099)
         (* t_0 (sqrt (fma (* u1 u1) (fma u1 0.3333333333333333 0.5) u1)))
         (* (sqrt (- (log1p (- u1)))) (fma (* PI PI) (* -2.0 (* u2 u2)) 1.0)))))
    float code(float cosTheta_i, float u1, float u2) {
    	float t_0 = cosf(((2.0f * ((float) M_PI)) * u2));
    	float tmp;
    	if ((t_0 * sqrtf(-logf((1.0f - u1)))) <= 0.11999999731779099f) {
    		tmp = t_0 * sqrtf(fmaf((u1 * u1), fmaf(u1, 0.3333333333333333f, 0.5f), u1));
    	} else {
    		tmp = sqrtf(-log1pf(-u1)) * fmaf((((float) M_PI) * ((float) M_PI)), (-2.0f * (u2 * u2)), 1.0f);
    	}
    	return tmp;
    }
    
    function code(cosTheta_i, u1, u2)
    	t_0 = cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2))
    	tmp = Float32(0.0)
    	if (Float32(t_0 * sqrt(Float32(-log(Float32(Float32(1.0) - u1))))) <= Float32(0.11999999731779099))
    		tmp = Float32(t_0 * sqrt(fma(Float32(u1 * u1), fma(u1, Float32(0.3333333333333333), Float32(0.5)), u1)));
    	else
    		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * fma(Float32(Float32(pi) * Float32(pi)), Float32(Float32(-2.0) * Float32(u2 * u2)), Float32(1.0)));
    	end
    	return tmp
    end
    
    \begin{array}{l}
    
    \\
    \begin{array}{l}
    t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\
    \mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.11999999731779099:\\
    \;\;\;\;t\_0 \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\
    
    \mathbf{else}:\\
    \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\
    
    
    \end{array}
    \end{array}
    
    Derivation
    1. Split input into 2 regimes
    2. if (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))) < 0.119999997

      1. Initial program 52.5%

        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      2. Add Preprocessing
      3. Taylor expanded in u1 around 0

        \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. Step-by-step derivation
        1. +-commutativeN/A

          \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        2. distribute-lft-inN/A

          \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        3. associate-*r*N/A

          \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)} + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        4. unpow2N/A

          \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        5. *-rgt-identityN/A

          \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        6. lower-fma.f32N/A

          \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        7. unpow2N/A

          \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        8. lower-*.f32N/A

          \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        9. +-commutativeN/A

          \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\frac{1}{3} \cdot u1 + \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        10. *-commutativeN/A

          \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \frac{1}{3}} + \frac{1}{2}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        11. lower-fma.f3297.9

          \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      5. Applied rewrites97.9%

        \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]

      if 0.119999997 < (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)))

      1. Initial program 97.1%

        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      2. Add Preprocessing
      3. Step-by-step derivation
        1. lift-log.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        2. lift--.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        3. sub-negN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        4. lower-log1p.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        5. lower-neg.f3299.2

          \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      4. Applied rewrites99.2%

        \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      5. Step-by-step derivation
        1. lift-cos.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right)} \]
        2. lift-*.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \color{blue}{\left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right)} \]
        3. lift-*.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \left(\color{blue}{\left(2 \cdot \mathsf{PI}\left(\right)\right)} \cdot u2\right) \]
        4. lift-PI.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \left(\left(2 \cdot \color{blue}{\mathsf{PI}\left(\right)}\right) \cdot u2\right) \]
        5. lift-PI.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \left(\left(2 \cdot \color{blue}{\mathsf{PI}\left(\right)}\right) \cdot u2\right) \]
        6. associate-*l*N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \color{blue}{\left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
        7. cos-2N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(\cos \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \cos \left(\mathsf{PI}\left(\right) \cdot u2\right) - \sin \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \sin \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
        8. lower--.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(\cos \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \cos \left(\mathsf{PI}\left(\right) \cdot u2\right) - \sin \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \sin \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
      6. Applied rewrites99.1%

        \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right) - \left(0.5 - 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right)\right)} \]
      7. Taylor expanded in u2 around 0

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
      8. Step-by-step derivation
        1. +-commutativeN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
        2. *-commutativeN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(-2 \cdot \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}\right)} + 1\right) \]
        3. associate-*r*N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}} + 1\right) \]
        4. *-commutativeN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} \cdot {u2}^{2} + 1\right) \]
        5. associate-*l*N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot \left(-2 \cdot {u2}^{2}\right)} + 1\right) \]
        6. lower-fma.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left({\mathsf{PI}\left(\right)}^{2}, -2 \cdot {u2}^{2}, 1\right)} \]
        7. unpow2N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
        8. lower-*.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
        9. lower-PI.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right), -2 \cdot {u2}^{2}, 1\right) \]
        10. lower-PI.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \color{blue}{\mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
        11. lower-*.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), \color{blue}{-2 \cdot {u2}^{2}}, 1\right) \]
        12. unpow2N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
        13. lower-*.f3296.1

          \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
      9. Applied rewrites96.1%

        \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)} \]
    3. Recombined 2 regimes into one program.
    4. Final simplification97.5%

      \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.11999999731779099:\\ \;\;\;\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \end{array} \]
    5. Add Preprocessing

    Alternative 5: 97.5% accurate, 0.6× speedup?

    \[\begin{array}{l} \\ \begin{array}{l} t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\ \mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.11999999731779099:\\ \;\;\;\;t\_0 \cdot \sqrt{u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), 1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \end{array} \end{array} \]
    (FPCore (cosTheta_i u1 u2)
     :precision binary32
     (let* ((t_0 (cos (* (* 2.0 PI) u2))))
       (if (<= (* t_0 (sqrt (- (log (- 1.0 u1))))) 0.11999999731779099)
         (* t_0 (sqrt (* u1 (fma u1 (fma u1 0.3333333333333333 0.5) 1.0))))
         (* (sqrt (- (log1p (- u1)))) (fma (* PI PI) (* -2.0 (* u2 u2)) 1.0)))))
    float code(float cosTheta_i, float u1, float u2) {
    	float t_0 = cosf(((2.0f * ((float) M_PI)) * u2));
    	float tmp;
    	if ((t_0 * sqrtf(-logf((1.0f - u1)))) <= 0.11999999731779099f) {
    		tmp = t_0 * sqrtf((u1 * fmaf(u1, fmaf(u1, 0.3333333333333333f, 0.5f), 1.0f)));
    	} else {
    		tmp = sqrtf(-log1pf(-u1)) * fmaf((((float) M_PI) * ((float) M_PI)), (-2.0f * (u2 * u2)), 1.0f);
    	}
    	return tmp;
    }
    
    function code(cosTheta_i, u1, u2)
    	t_0 = cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2))
    	tmp = Float32(0.0)
    	if (Float32(t_0 * sqrt(Float32(-log(Float32(Float32(1.0) - u1))))) <= Float32(0.11999999731779099))
    		tmp = Float32(t_0 * sqrt(Float32(u1 * fma(u1, fma(u1, Float32(0.3333333333333333), Float32(0.5)), Float32(1.0)))));
    	else
    		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * fma(Float32(Float32(pi) * Float32(pi)), Float32(Float32(-2.0) * Float32(u2 * u2)), Float32(1.0)));
    	end
    	return tmp
    end
    
    \begin{array}{l}
    
    \\
    \begin{array}{l}
    t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\
    \mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.11999999731779099:\\
    \;\;\;\;t\_0 \cdot \sqrt{u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), 1\right)}\\
    
    \mathbf{else}:\\
    \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\
    
    
    \end{array}
    \end{array}
    
    Derivation
    1. Split input into 2 regimes
    2. if (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))) < 0.119999997

      1. Initial program 52.5%

        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      2. Add Preprocessing
      3. Taylor expanded in u1 around 0

        \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. Step-by-step derivation
        1. +-commutativeN/A

          \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        2. distribute-lft-inN/A

          \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        3. associate-*r*N/A

          \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)} + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        4. unpow2N/A

          \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        5. *-rgt-identityN/A

          \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        6. lower-fma.f32N/A

          \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        7. unpow2N/A

          \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        8. lower-*.f32N/A

          \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        9. +-commutativeN/A

          \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\frac{1}{3} \cdot u1 + \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        10. *-commutativeN/A

          \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \frac{1}{3}} + \frac{1}{2}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        11. lower-fma.f3297.9

          \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      5. Applied rewrites97.9%

        \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      6. Step-by-step derivation
        1. Applied rewrites97.5%

          \[\leadsto \sqrt{\frac{1}{\color{blue}{\frac{\mathsf{fma}\left(u1, u1, u1 \cdot \left(\left(u1 \cdot \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)\right) \cdot \left(u1 \cdot \left(u1 \cdot \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)\right)\right)\right) - \left(u1 \cdot u1\right) \cdot \left(u1 \cdot \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)\right)\right)}{\mathsf{fma}\left(u1 \cdot \left(u1 \cdot \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)\right), u1 \cdot \left(\left(u1 \cdot \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)\right) \cdot \left(u1 \cdot \left(u1 \cdot \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)\right)\right)\right), u1 \cdot \left(u1 \cdot u1\right)\right)}}}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
        2. Step-by-step derivation
          1. Applied rewrites97.7%

            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), 1\right) \cdot \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]

          if 0.119999997 < (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)))

          1. Initial program 97.1%

            \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          2. Add Preprocessing
          3. Step-by-step derivation
            1. lift-log.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            2. lift--.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            3. sub-negN/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            4. lower-log1p.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            5. lower-neg.f3299.2

              \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          4. Applied rewrites99.2%

            \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          5. Step-by-step derivation
            1. lift-cos.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right)} \]
            2. lift-*.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \color{blue}{\left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right)} \]
            3. lift-*.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \left(\color{blue}{\left(2 \cdot \mathsf{PI}\left(\right)\right)} \cdot u2\right) \]
            4. lift-PI.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \left(\left(2 \cdot \color{blue}{\mathsf{PI}\left(\right)}\right) \cdot u2\right) \]
            5. lift-PI.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \left(\left(2 \cdot \color{blue}{\mathsf{PI}\left(\right)}\right) \cdot u2\right) \]
            6. associate-*l*N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \color{blue}{\left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
            7. cos-2N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(\cos \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \cos \left(\mathsf{PI}\left(\right) \cdot u2\right) - \sin \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \sin \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
            8. lower--.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(\cos \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \cos \left(\mathsf{PI}\left(\right) \cdot u2\right) - \sin \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \sin \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
          6. Applied rewrites99.1%

            \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right) - \left(0.5 - 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right)\right)} \]
          7. Taylor expanded in u2 around 0

            \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
          8. Step-by-step derivation
            1. +-commutativeN/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
            2. *-commutativeN/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(-2 \cdot \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}\right)} + 1\right) \]
            3. associate-*r*N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}} + 1\right) \]
            4. *-commutativeN/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} \cdot {u2}^{2} + 1\right) \]
            5. associate-*l*N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot \left(-2 \cdot {u2}^{2}\right)} + 1\right) \]
            6. lower-fma.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left({\mathsf{PI}\left(\right)}^{2}, -2 \cdot {u2}^{2}, 1\right)} \]
            7. unpow2N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
            8. lower-*.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
            9. lower-PI.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right), -2 \cdot {u2}^{2}, 1\right) \]
            10. lower-PI.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \color{blue}{\mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
            11. lower-*.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), \color{blue}{-2 \cdot {u2}^{2}}, 1\right) \]
            12. unpow2N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
            13. lower-*.f3296.1

              \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
          9. Applied rewrites96.1%

            \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)} \]
        3. Recombined 2 regimes into one program.
        4. Final simplification97.3%

          \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.11999999731779099:\\ \;\;\;\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), 1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \end{array} \]
        5. Add Preprocessing

        Alternative 6: 94.5% accurate, 0.6× speedup?

        \[\begin{array}{l} \\ \begin{array}{l} t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\ \mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.10499999672174454:\\ \;\;\;\;t\_0 \cdot \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\ \end{array} \end{array} \]
        (FPCore (cosTheta_i u1 u2)
         :precision binary32
         (let* ((t_0 (cos (* (* 2.0 PI) u2))))
           (if (<= (* t_0 (sqrt (- (log (- 1.0 u1))))) 0.10499999672174454)
             (* t_0 (sqrt (fma u1 (* u1 0.5) u1)))
             (* (sqrt (- (log1p (- u1)))) 1.0))))
        float code(float cosTheta_i, float u1, float u2) {
        	float t_0 = cosf(((2.0f * ((float) M_PI)) * u2));
        	float tmp;
        	if ((t_0 * sqrtf(-logf((1.0f - u1)))) <= 0.10499999672174454f) {
        		tmp = t_0 * sqrtf(fmaf(u1, (u1 * 0.5f), u1));
        	} else {
        		tmp = sqrtf(-log1pf(-u1)) * 1.0f;
        	}
        	return tmp;
        }
        
        function code(cosTheta_i, u1, u2)
        	t_0 = cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2))
        	tmp = Float32(0.0)
        	if (Float32(t_0 * sqrt(Float32(-log(Float32(Float32(1.0) - u1))))) <= Float32(0.10499999672174454))
        		tmp = Float32(t_0 * sqrt(fma(u1, Float32(u1 * Float32(0.5)), u1)));
        	else
        		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * Float32(1.0));
        	end
        	return tmp
        end
        
        \begin{array}{l}
        
        \\
        \begin{array}{l}
        t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\
        \mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.10499999672174454:\\
        \;\;\;\;t\_0 \cdot \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}\\
        
        \mathbf{else}:\\
        \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\
        
        
        \end{array}
        \end{array}
        
        Derivation
        1. Split input into 2 regimes
        2. if (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))) < 0.104999997

          1. Initial program 51.2%

            \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          2. Add Preprocessing
          3. Taylor expanded in u1 around 0

            \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + \frac{1}{2} \cdot u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
          4. Step-by-step derivation
            1. +-commutativeN/A

              \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1 + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            2. distribute-lft-inN/A

              \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            3. *-rgt-identityN/A

              \[\leadsto \sqrt{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            4. lower-fma.f32N/A

              \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            5. *-commutativeN/A

              \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            6. lower-*.f3296.2

              \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          5. Applied rewrites96.2%

            \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]

          if 0.104999997 < (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)))

          1. Initial program 96.7%

            \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          2. Add Preprocessing
          3. Step-by-step derivation
            1. lift-log.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            2. lift--.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            3. sub-negN/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            4. lower-log1p.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            5. lower-neg.f3299.2

              \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          4. Applied rewrites99.2%

            \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          5. Taylor expanded in u2 around 0

            \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{1} \]
          6. Step-by-step derivation
            1. Applied rewrites85.1%

              \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{1} \]
          7. Recombined 2 regimes into one program.
          8. Final simplification93.4%

            \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.10499999672174454:\\ \;\;\;\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\ \end{array} \]
          9. Add Preprocessing

          Alternative 7: 87.3% accurate, 0.6× speedup?

          \[\begin{array}{l} \\ \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.10999999940395355:\\ \;\;\;\;\mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\ \end{array} \end{array} \]
          (FPCore (cosTheta_i u1 u2)
           :precision binary32
           (if (<=
                (* (cos (* (* 2.0 PI) u2)) (sqrt (- (log (- 1.0 u1)))))
                0.10999999940395355)
             (*
              (fma -2.0 (* PI (* PI (* u2 u2))) 1.0)
              (sqrt (fma (* u1 u1) (fma u1 0.3333333333333333 0.5) u1)))
             (* (sqrt (- (log1p (- u1)))) 1.0)))
          float code(float cosTheta_i, float u1, float u2) {
          	float tmp;
          	if ((cosf(((2.0f * ((float) M_PI)) * u2)) * sqrtf(-logf((1.0f - u1)))) <= 0.10999999940395355f) {
          		tmp = fmaf(-2.0f, (((float) M_PI) * (((float) M_PI) * (u2 * u2))), 1.0f) * sqrtf(fmaf((u1 * u1), fmaf(u1, 0.3333333333333333f, 0.5f), u1));
          	} else {
          		tmp = sqrtf(-log1pf(-u1)) * 1.0f;
          	}
          	return tmp;
          }
          
          function code(cosTheta_i, u1, u2)
          	tmp = Float32(0.0)
          	if (Float32(cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)) * sqrt(Float32(-log(Float32(Float32(1.0) - u1))))) <= Float32(0.10999999940395355))
          		tmp = Float32(fma(Float32(-2.0), Float32(Float32(pi) * Float32(Float32(pi) * Float32(u2 * u2))), Float32(1.0)) * sqrt(fma(Float32(u1 * u1), fma(u1, Float32(0.3333333333333333), Float32(0.5)), u1)));
          	else
          		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * Float32(1.0));
          	end
          	return tmp
          end
          
          \begin{array}{l}
          
          \\
          \begin{array}{l}
          \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.10999999940395355:\\
          \;\;\;\;\mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\
          
          \mathbf{else}:\\
          \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\
          
          
          \end{array}
          \end{array}
          
          Derivation
          1. Split input into 2 regimes
          2. if (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))) < 0.109999999

            1. Initial program 51.8%

              \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            2. Add Preprocessing
            3. Taylor expanded in u1 around 0

              \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            4. Step-by-step derivation
              1. +-commutativeN/A

                \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              2. distribute-lft-inN/A

                \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              3. associate-*r*N/A

                \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)} + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              4. unpow2N/A

                \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              5. *-rgt-identityN/A

                \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              6. lower-fma.f32N/A

                \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              7. unpow2N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              8. lower-*.f32N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              9. +-commutativeN/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\frac{1}{3} \cdot u1 + \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              10. *-commutativeN/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \frac{1}{3}} + \frac{1}{2}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              11. lower-fma.f3298.0

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            5. Applied rewrites98.0%

              \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            6. Taylor expanded in u2 around 0

              \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
            7. Step-by-step derivation
              1. +-commutativeN/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
              2. lower-fma.f32N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{\mathsf{fma}\left(-2, {u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}, 1\right)} \]
              3. *-commutativeN/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}}, 1\right) \]
              4. unpow2N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)} \cdot {u2}^{2}, 1\right) \]
              5. associate-*l*N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
              6. lower-*.f32N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
              7. lower-PI.f32N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right)} \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right), 1\right) \]
              8. lower-*.f32N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
              9. lower-PI.f32N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot {u2}^{2}\right), 1\right) \]
              10. unpow2N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot \color{blue}{\left(u2 \cdot u2\right)}\right), 1\right) \]
              11. lower-*.f3284.7

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \color{blue}{\left(u2 \cdot u2\right)}\right), 1\right) \]
            8. Applied rewrites84.7%

              \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot \color{blue}{\mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)} \]

            if 0.109999999 < (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)))

            1. Initial program 97.0%

              \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            2. Add Preprocessing
            3. Step-by-step derivation
              1. lift-log.f32N/A

                \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              2. lift--.f32N/A

                \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              3. sub-negN/A

                \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              4. lower-log1p.f32N/A

                \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              5. lower-neg.f3299.2

                \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            4. Applied rewrites99.2%

              \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            5. Taylor expanded in u2 around 0

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{1} \]
            6. Step-by-step derivation
              1. Applied rewrites84.8%

                \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{1} \]
            7. Recombined 2 regimes into one program.
            8. Final simplification84.7%

              \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.10999999940395355:\\ \;\;\;\;\mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\ \end{array} \]
            9. Add Preprocessing

            Alternative 8: 80.8% accurate, 0.8× speedup?

            \[\begin{array}{l} \\ \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.0006905319751240313:\\ \;\;\;\;\sqrt{-\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \mathbf{else}:\\ \;\;\;\;\sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)} \cdot 1\\ \end{array} \end{array} \]
            (FPCore (cosTheta_i u1 u2)
             :precision binary32
             (if (<=
                  (* (cos (* (* 2.0 PI) u2)) (sqrt (- (log (- 1.0 u1)))))
                  0.0006905319751240313)
               (* (sqrt (- (- u1))) (fma (* PI PI) (* -2.0 (* u2 u2)) 1.0))
               (*
                (sqrt (fma (* u1 u1) (fma u1 (fma u1 0.25 0.3333333333333333) 0.5) u1))
                1.0)))
            float code(float cosTheta_i, float u1, float u2) {
            	float tmp;
            	if ((cosf(((2.0f * ((float) M_PI)) * u2)) * sqrtf(-logf((1.0f - u1)))) <= 0.0006905319751240313f) {
            		tmp = sqrtf(-(-u1)) * fmaf((((float) M_PI) * ((float) M_PI)), (-2.0f * (u2 * u2)), 1.0f);
            	} else {
            		tmp = sqrtf(fmaf((u1 * u1), fmaf(u1, fmaf(u1, 0.25f, 0.3333333333333333f), 0.5f), u1)) * 1.0f;
            	}
            	return tmp;
            }
            
            function code(cosTheta_i, u1, u2)
            	tmp = Float32(0.0)
            	if (Float32(cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)) * sqrt(Float32(-log(Float32(Float32(1.0) - u1))))) <= Float32(0.0006905319751240313))
            		tmp = Float32(sqrt(Float32(-Float32(-u1))) * fma(Float32(Float32(pi) * Float32(pi)), Float32(Float32(-2.0) * Float32(u2 * u2)), Float32(1.0)));
            	else
            		tmp = Float32(sqrt(fma(Float32(u1 * u1), fma(u1, fma(u1, Float32(0.25), Float32(0.3333333333333333)), Float32(0.5)), u1)) * Float32(1.0));
            	end
            	return tmp
            end
            
            \begin{array}{l}
            
            \\
            \begin{array}{l}
            \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.0006905319751240313:\\
            \;\;\;\;\sqrt{-\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\
            
            \mathbf{else}:\\
            \;\;\;\;\sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)} \cdot 1\\
            
            
            \end{array}
            \end{array}
            
            Derivation
            1. Split input into 2 regimes
            2. if (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))) < 6.90531975e-4

              1. Initial program 29.0%

                \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
              2. Add Preprocessing
              3. Taylor expanded in u2 around 0

                \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
              4. Step-by-step derivation
                1. Applied rewrites17.3%

                  \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
                2. Taylor expanded in u1 around 0

                  \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{-1 \cdot u1}\right)} \cdot 1 \]
                3. Step-by-step derivation
                  1. mul-1-negN/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot 1 \]
                  2. lower-neg.f3268.2

                    \[\leadsto \sqrt{-\color{blue}{\left(-u1\right)}} \cdot 1 \]
                4. Applied rewrites68.2%

                  \[\leadsto \sqrt{-\color{blue}{\left(-u1\right)}} \cdot 1 \]
                5. Taylor expanded in u2 around 0

                  \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
                6. Step-by-step derivation
                  1. +-commutativeN/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
                  2. *-commutativeN/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(-2 \cdot \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}\right)} + 1\right) \]
                  3. associate-*r*N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}} + 1\right) \]
                  4. *-commutativeN/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} \cdot {u2}^{2} + 1\right) \]
                  5. associate-*l*N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot \left(-2 \cdot {u2}^{2}\right)} + 1\right) \]
                  6. lower-fma.f32N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left({\mathsf{PI}\left(\right)}^{2}, -2 \cdot {u2}^{2}, 1\right)} \]
                  7. unpow2N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
                  8. lower-*.f32N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
                  9. lower-PI.f32N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right), -2 \cdot {u2}^{2}, 1\right) \]
                  10. lower-PI.f32N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \color{blue}{\mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
                  11. lower-*.f32N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), \color{blue}{-2 \cdot {u2}^{2}}, 1\right) \]
                  12. unpow2N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
                  13. lower-*.f3279.4

                    \[\leadsto \sqrt{-\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
                7. Applied rewrites79.4%

                  \[\leadsto \sqrt{-\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)} \]

                if 6.90531975e-4 < (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)))

                1. Initial program 78.4%

                  \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                2. Add Preprocessing
                3. Taylor expanded in u2 around 0

                  \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
                4. Step-by-step derivation
                  1. Applied rewrites68.3%

                    \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
                  2. Taylor expanded in u1 around 0

                    \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right)}} \cdot 1 \]
                  3. Step-by-step derivation
                    1. +-commutativeN/A

                      \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + 1\right)}} \cdot 1 \]
                    2. distribute-lft-inN/A

                      \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right) + u1 \cdot 1}} \cdot 1 \]
                    3. associate-*r*N/A

                      \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)} + u1 \cdot 1} \cdot 1 \]
                    4. unpow2N/A

                      \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + u1 \cdot 1} \cdot 1 \]
                    5. *-rgt-identityN/A

                      \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + \color{blue}{u1}} \cdot 1 \]
                    6. lower-fma.f32N/A

                      \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)}} \cdot 1 \]
                    7. unpow2N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot 1 \]
                    8. lower-*.f32N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot 1 \]
                    9. +-commutativeN/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right) + \frac{1}{2}}, u1\right)} \cdot 1 \]
                    10. lower-fma.f32N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, \frac{1}{3} + \frac{1}{4} \cdot u1, \frac{1}{2}\right)}, u1\right)} \cdot 1 \]
                    11. +-commutativeN/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\frac{1}{4} \cdot u1 + \frac{1}{3}}, \frac{1}{2}\right), u1\right)} \cdot 1 \]
                    12. *-commutativeN/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{4}} + \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot 1 \]
                    13. lower-fma.f3277.1

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right)}, 0.5\right), u1\right)} \cdot 1 \]
                  4. Applied rewrites77.1%

                    \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}} \cdot 1 \]
                5. Recombined 2 regimes into one program.
                6. Final simplification77.8%

                  \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.0006905319751240313:\\ \;\;\;\;\sqrt{-\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \mathbf{else}:\\ \;\;\;\;\sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)} \cdot 1\\ \end{array} \]
                7. Add Preprocessing

                Alternative 9: 96.9% accurate, 1.5× speedup?

                \[\begin{array}{l} \\ \begin{array}{l} t_0 := \left(2 \cdot \pi\right) \cdot u2\\ \mathbf{if}\;t\_0 \leq 0.054999999701976776:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)\\ \mathbf{else}:\\ \;\;\;\;\cos t\_0 \cdot \sqrt{-u1 \cdot \mathsf{fma}\left(u1, -0.5, -1\right)}\\ \end{array} \end{array} \]
                (FPCore (cosTheta_i u1 u2)
                 :precision binary32
                 (let* ((t_0 (* (* 2.0 PI) u2)))
                   (if (<= t_0 0.054999999701976776)
                     (* (sqrt (- (log1p (- u1)))) (fma -2.0 (* PI (* PI (* u2 u2))) 1.0))
                     (* (cos t_0) (sqrt (- (* u1 (fma u1 -0.5 -1.0))))))))
                float code(float cosTheta_i, float u1, float u2) {
                	float t_0 = (2.0f * ((float) M_PI)) * u2;
                	float tmp;
                	if (t_0 <= 0.054999999701976776f) {
                		tmp = sqrtf(-log1pf(-u1)) * fmaf(-2.0f, (((float) M_PI) * (((float) M_PI) * (u2 * u2))), 1.0f);
                	} else {
                		tmp = cosf(t_0) * sqrtf(-(u1 * fmaf(u1, -0.5f, -1.0f)));
                	}
                	return tmp;
                }
                
                function code(cosTheta_i, u1, u2)
                	t_0 = Float32(Float32(Float32(2.0) * Float32(pi)) * u2)
                	tmp = Float32(0.0)
                	if (t_0 <= Float32(0.054999999701976776))
                		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * fma(Float32(-2.0), Float32(Float32(pi) * Float32(Float32(pi) * Float32(u2 * u2))), Float32(1.0)));
                	else
                		tmp = Float32(cos(t_0) * sqrt(Float32(-Float32(u1 * fma(u1, Float32(-0.5), Float32(-1.0))))));
                	end
                	return tmp
                end
                
                \begin{array}{l}
                
                \\
                \begin{array}{l}
                t_0 := \left(2 \cdot \pi\right) \cdot u2\\
                \mathbf{if}\;t\_0 \leq 0.054999999701976776:\\
                \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)\\
                
                \mathbf{else}:\\
                \;\;\;\;\cos t\_0 \cdot \sqrt{-u1 \cdot \mathsf{fma}\left(u1, -0.5, -1\right)}\\
                
                
                \end{array}
                \end{array}
                
                Derivation
                1. Split input into 2 regimes
                2. if (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2) < 0.0549999997

                  1. Initial program 62.5%

                    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  2. Add Preprocessing
                  3. Step-by-step derivation
                    1. lift-log.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    2. lift--.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    3. sub-negN/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    4. lower-log1p.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    5. lower-neg.f3299.4

                      \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  4. Applied rewrites99.4%

                    \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  5. Taylor expanded in u2 around 0

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
                  6. Step-by-step derivation
                    1. +-commutativeN/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
                    2. lower-fma.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left(-2, {u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}, 1\right)} \]
                    3. *-commutativeN/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}}, 1\right) \]
                    4. unpow2N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)} \cdot {u2}^{2}, 1\right) \]
                    5. associate-*l*N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
                    6. lower-*.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
                    7. lower-PI.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right)} \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right), 1\right) \]
                    8. lower-*.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
                    9. lower-PI.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot {u2}^{2}\right), 1\right) \]
                    10. unpow2N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot \color{blue}{\left(u2 \cdot u2\right)}\right), 1\right) \]
                    11. lower-*.f3299.3

                      \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \color{blue}{\left(u2 \cdot u2\right)}\right), 1\right) \]
                  7. Applied rewrites99.3%

                    \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)} \]

                  if 0.0549999997 < (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)

                  1. Initial program 63.0%

                    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  2. Add Preprocessing
                  3. Taylor expanded in u1 around 0

                    \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(\frac{-1}{2} \cdot u1 - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  4. Step-by-step derivation
                    1. lower-*.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(\frac{-1}{2} \cdot u1 - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    2. sub-negN/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\left(\frac{-1}{2} \cdot u1 + \left(\mathsf{neg}\left(1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    3. *-commutativeN/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \left(\color{blue}{u1 \cdot \frac{-1}{2}} + \left(\mathsf{neg}\left(1\right)\right)\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    4. metadata-evalN/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \left(u1 \cdot \frac{-1}{2} + \color{blue}{-1}\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    5. lower-fma.f3286.8

                      \[\leadsto \sqrt{-u1 \cdot \color{blue}{\mathsf{fma}\left(u1, -0.5, -1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  5. Applied rewrites86.8%

                    \[\leadsto \sqrt{-\color{blue}{u1 \cdot \mathsf{fma}\left(u1, -0.5, -1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                3. Recombined 2 regimes into one program.
                4. Final simplification96.4%

                  \[\leadsto \begin{array}{l} \mathbf{if}\;\left(2 \cdot \pi\right) \cdot u2 \leq 0.054999999701976776:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)\\ \mathbf{else}:\\ \;\;\;\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-u1 \cdot \mathsf{fma}\left(u1, -0.5, -1\right)}\\ \end{array} \]
                5. Add Preprocessing

                Alternative 10: 90.4% accurate, 1.6× speedup?

                \[\begin{array}{l} \\ \begin{array}{l} t_0 := \left(2 \cdot \pi\right) \cdot u2\\ \mathbf{if}\;t\_0 \leq 0.03200000151991844:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\ \mathbf{else}:\\ \;\;\;\;\cos t\_0 \cdot \sqrt{u1}\\ \end{array} \end{array} \]
                (FPCore (cosTheta_i u1 u2)
                 :precision binary32
                 (let* ((t_0 (* (* 2.0 PI) u2)))
                   (if (<= t_0 0.03200000151991844)
                     (* (sqrt (- (log1p (- u1)))) 1.0)
                     (* (cos t_0) (sqrt u1)))))
                float code(float cosTheta_i, float u1, float u2) {
                	float t_0 = (2.0f * ((float) M_PI)) * u2;
                	float tmp;
                	if (t_0 <= 0.03200000151991844f) {
                		tmp = sqrtf(-log1pf(-u1)) * 1.0f;
                	} else {
                		tmp = cosf(t_0) * sqrtf(u1);
                	}
                	return tmp;
                }
                
                function code(cosTheta_i, u1, u2)
                	t_0 = Float32(Float32(Float32(2.0) * Float32(pi)) * u2)
                	tmp = Float32(0.0)
                	if (t_0 <= Float32(0.03200000151991844))
                		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * Float32(1.0));
                	else
                		tmp = Float32(cos(t_0) * sqrt(u1));
                	end
                	return tmp
                end
                
                \begin{array}{l}
                
                \\
                \begin{array}{l}
                t_0 := \left(2 \cdot \pi\right) \cdot u2\\
                \mathbf{if}\;t\_0 \leq 0.03200000151991844:\\
                \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\
                
                \mathbf{else}:\\
                \;\;\;\;\cos t\_0 \cdot \sqrt{u1}\\
                
                
                \end{array}
                \end{array}
                
                Derivation
                1. Split input into 2 regimes
                2. if (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2) < 0.0320000015

                  1. Initial program 63.0%

                    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  2. Add Preprocessing
                  3. Step-by-step derivation
                    1. lift-log.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    2. lift--.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    3. sub-negN/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    4. lower-log1p.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    5. lower-neg.f3299.4

                      \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  4. Applied rewrites99.4%

                    \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  5. Taylor expanded in u2 around 0

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{1} \]
                  6. Step-by-step derivation
                    1. Applied rewrites93.2%

                      \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{1} \]

                    if 0.0320000015 < (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)

                    1. Initial program 61.4%

                      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                    2. Add Preprocessing
                    3. Applied rewrites61.2%

                      \[\leadsto \sqrt{-\color{blue}{\left(\log \left(\left(1 - u1 \cdot \left(u1 \cdot \left(u1 \cdot u1\right)\right)\right) \cdot \frac{1}{1 + u1}\right) - \mathsf{log1p}\left(u1 \cdot u1\right)\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                    4. Taylor expanded in u1 around 0

                      \[\leadsto \color{blue}{\sqrt{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    5. Step-by-step derivation
                      1. lower-sqrt.f3273.6

                        \[\leadsto \color{blue}{\sqrt{u1}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                    6. Applied rewrites73.6%

                      \[\leadsto \color{blue}{\sqrt{u1}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  7. Recombined 2 regimes into one program.
                  8. Final simplification88.4%

                    \[\leadsto \begin{array}{l} \mathbf{if}\;\left(2 \cdot \pi\right) \cdot u2 \leq 0.03200000151991844:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\ \mathbf{else}:\\ \;\;\;\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{u1}\\ \end{array} \]
                  9. Add Preprocessing

                  Alternative 11: 83.2% accurate, 4.3× speedup?

                  \[\begin{array}{l} \\ \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \end{array} \]
                  (FPCore (cosTheta_i u1 u2)
                   :precision binary32
                   (*
                    (fma -2.0 (* PI (* PI (* u2 u2))) 1.0)
                    (sqrt (fma (* u1 u1) (fma u1 0.3333333333333333 0.5) u1))))
                  float code(float cosTheta_i, float u1, float u2) {
                  	return fmaf(-2.0f, (((float) M_PI) * (((float) M_PI) * (u2 * u2))), 1.0f) * sqrtf(fmaf((u1 * u1), fmaf(u1, 0.3333333333333333f, 0.5f), u1));
                  }
                  
                  function code(cosTheta_i, u1, u2)
                  	return Float32(fma(Float32(-2.0), Float32(Float32(pi) * Float32(Float32(pi) * Float32(u2 * u2))), Float32(1.0)) * sqrt(fma(Float32(u1 * u1), fma(u1, Float32(0.3333333333333333), Float32(0.5)), u1)))
                  end
                  
                  \begin{array}{l}
                  
                  \\
                  \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}
                  \end{array}
                  
                  Derivation
                  1. Initial program 62.6%

                    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  2. Add Preprocessing
                  3. Taylor expanded in u1 around 0

                    \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  4. Step-by-step derivation
                    1. +-commutativeN/A

                      \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    2. distribute-lft-inN/A

                      \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    3. associate-*r*N/A

                      \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)} + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    4. unpow2N/A

                      \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    5. *-rgt-identityN/A

                      \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    6. lower-fma.f32N/A

                      \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    7. unpow2N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    8. lower-*.f32N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    9. +-commutativeN/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\frac{1}{3} \cdot u1 + \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    10. *-commutativeN/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \frac{1}{3}} + \frac{1}{2}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    11. lower-fma.f3290.0

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  5. Applied rewrites90.0%

                    \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  6. Taylor expanded in u2 around 0

                    \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
                  7. Step-by-step derivation
                    1. +-commutativeN/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
                    2. lower-fma.f32N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{\mathsf{fma}\left(-2, {u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}, 1\right)} \]
                    3. *-commutativeN/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}}, 1\right) \]
                    4. unpow2N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)} \cdot {u2}^{2}, 1\right) \]
                    5. associate-*l*N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
                    6. lower-*.f32N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
                    7. lower-PI.f32N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \color{blue}{\mathsf{PI}\left(\right)} \cdot \left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right), 1\right) \]
                    8. lower-*.f32N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot {u2}^{2}\right)}, 1\right) \]
                    9. lower-PI.f32N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot {u2}^{2}\right), 1\right) \]
                    10. unpow2N/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(-2, \mathsf{PI}\left(\right) \cdot \left(\mathsf{PI}\left(\right) \cdot \color{blue}{\left(u2 \cdot u2\right)}\right), 1\right) \]
                    11. lower-*.f3279.5

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \color{blue}{\left(u2 \cdot u2\right)}\right), 1\right) \]
                  8. Applied rewrites79.5%

                    \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot \color{blue}{\mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right)} \]
                  9. Final simplification79.5%

                    \[\leadsto \mathsf{fma}\left(-2, \pi \cdot \left(\pi \cdot \left(u2 \cdot u2\right)\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \]
                  10. Add Preprocessing

                  Alternative 12: 77.2% accurate, 5.9× speedup?

                  \[\begin{array}{l} \\ \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)} \cdot 1 \end{array} \]
                  (FPCore (cosTheta_i u1 u2)
                   :precision binary32
                   (*
                    (sqrt (fma (* u1 u1) (fma u1 (fma u1 0.25 0.3333333333333333) 0.5) u1))
                    1.0))
                  float code(float cosTheta_i, float u1, float u2) {
                  	return sqrtf(fmaf((u1 * u1), fmaf(u1, fmaf(u1, 0.25f, 0.3333333333333333f), 0.5f), u1)) * 1.0f;
                  }
                  
                  function code(cosTheta_i, u1, u2)
                  	return Float32(sqrt(fma(Float32(u1 * u1), fma(u1, fma(u1, Float32(0.25), Float32(0.3333333333333333)), Float32(0.5)), u1)) * Float32(1.0))
                  end
                  
                  \begin{array}{l}
                  
                  \\
                  \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)} \cdot 1
                  \end{array}
                  
                  Derivation
                  1. Initial program 62.6%

                    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  2. Add Preprocessing
                  3. Taylor expanded in u2 around 0

                    \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
                  4. Step-by-step derivation
                    1. Applied rewrites52.0%

                      \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
                    2. Taylor expanded in u1 around 0

                      \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right)}} \cdot 1 \]
                    3. Step-by-step derivation
                      1. +-commutativeN/A

                        \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + 1\right)}} \cdot 1 \]
                      2. distribute-lft-inN/A

                        \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right) + u1 \cdot 1}} \cdot 1 \]
                      3. associate-*r*N/A

                        \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)} + u1 \cdot 1} \cdot 1 \]
                      4. unpow2N/A

                        \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + u1 \cdot 1} \cdot 1 \]
                      5. *-rgt-identityN/A

                        \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + \color{blue}{u1}} \cdot 1 \]
                      6. lower-fma.f32N/A

                        \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)}} \cdot 1 \]
                      7. unpow2N/A

                        \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot 1 \]
                      8. lower-*.f32N/A

                        \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot 1 \]
                      9. +-commutativeN/A

                        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right) + \frac{1}{2}}, u1\right)} \cdot 1 \]
                      10. lower-fma.f32N/A

                        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, \frac{1}{3} + \frac{1}{4} \cdot u1, \frac{1}{2}\right)}, u1\right)} \cdot 1 \]
                      11. +-commutativeN/A

                        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\frac{1}{4} \cdot u1 + \frac{1}{3}}, \frac{1}{2}\right), u1\right)} \cdot 1 \]
                      12. *-commutativeN/A

                        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{4}} + \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot 1 \]
                      13. lower-fma.f3274.3

                        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right)}, 0.5\right), u1\right)} \cdot 1 \]
                    4. Applied rewrites74.3%

                      \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}} \cdot 1 \]
                    5. Add Preprocessing

                    Alternative 13: 76.0% accurate, 7.0× speedup?

                    \[\begin{array}{l} \\ \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot 1 \end{array} \]
                    (FPCore (cosTheta_i u1 u2)
                     :precision binary32
                     (* (sqrt (fma (* u1 u1) (fma u1 0.3333333333333333 0.5) u1)) 1.0))
                    float code(float cosTheta_i, float u1, float u2) {
                    	return sqrtf(fmaf((u1 * u1), fmaf(u1, 0.3333333333333333f, 0.5f), u1)) * 1.0f;
                    }
                    
                    function code(cosTheta_i, u1, u2)
                    	return Float32(sqrt(fma(Float32(u1 * u1), fma(u1, Float32(0.3333333333333333), Float32(0.5)), u1)) * Float32(1.0))
                    end
                    
                    \begin{array}{l}
                    
                    \\
                    \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot 1
                    \end{array}
                    
                    Derivation
                    1. Initial program 62.6%

                      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                    2. Add Preprocessing
                    3. Taylor expanded in u1 around 0

                      \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    4. Step-by-step derivation
                      1. +-commutativeN/A

                        \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      2. distribute-lft-inN/A

                        \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      3. associate-*r*N/A

                        \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)} + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      4. unpow2N/A

                        \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      5. *-rgt-identityN/A

                        \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      6. lower-fma.f32N/A

                        \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      7. unpow2N/A

                        \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      8. lower-*.f32N/A

                        \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      9. +-commutativeN/A

                        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\frac{1}{3} \cdot u1 + \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      10. *-commutativeN/A

                        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \frac{1}{3}} + \frac{1}{2}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      11. lower-fma.f3290.0

                        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                    5. Applied rewrites90.0%

                      \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                    6. Taylor expanded in u2 around 0

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{1} \]
                    7. Step-by-step derivation
                      1. Applied rewrites72.8%

                        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot \color{blue}{1} \]
                      2. Add Preprocessing

                      Alternative 14: 73.4% accurate, 8.6× speedup?

                      \[\begin{array}{l} \\ \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)} \cdot 1 \end{array} \]
                      (FPCore (cosTheta_i u1 u2)
                       :precision binary32
                       (* (sqrt (fma u1 (* u1 0.5) u1)) 1.0))
                      float code(float cosTheta_i, float u1, float u2) {
                      	return sqrtf(fmaf(u1, (u1 * 0.5f), u1)) * 1.0f;
                      }
                      
                      function code(cosTheta_i, u1, u2)
                      	return Float32(sqrt(fma(u1, Float32(u1 * Float32(0.5)), u1)) * Float32(1.0))
                      end
                      
                      \begin{array}{l}
                      
                      \\
                      \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)} \cdot 1
                      \end{array}
                      
                      Derivation
                      1. Initial program 62.6%

                        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      2. Add Preprocessing
                      3. Taylor expanded in u2 around 0

                        \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
                      4. Step-by-step derivation
                        1. Applied rewrites52.0%

                          \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
                        2. Taylor expanded in u1 around 0

                          \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + \frac{1}{2} \cdot u1\right)}} \cdot 1 \]
                        3. Step-by-step derivation
                          1. +-commutativeN/A

                            \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1 + 1\right)}} \cdot 1 \]
                          2. distribute-lft-inN/A

                            \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + u1 \cdot 1}} \cdot 1 \]
                          3. *-rgt-identityN/A

                            \[\leadsto \sqrt{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + \color{blue}{u1}} \cdot 1 \]
                          4. lower-fma.f32N/A

                            \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \cdot 1 \]
                          5. *-commutativeN/A

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \cdot 1 \]
                          6. lower-*.f3269.9

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \cdot 1 \]
                        4. Applied rewrites69.9%

                          \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \cdot 1 \]
                        5. Add Preprocessing

                        Alternative 15: 65.4% accurate, 11.6× speedup?

                        \[\begin{array}{l} \\ 1 \cdot \sqrt{-\left(-u1\right)} \end{array} \]
                        (FPCore (cosTheta_i u1 u2) :precision binary32 (* 1.0 (sqrt (- (- u1)))))
                        float code(float cosTheta_i, float u1, float u2) {
                        	return 1.0f * sqrtf(-(-u1));
                        }
                        
                        real(4) function code(costheta_i, u1, u2)
                            real(4), intent (in) :: costheta_i
                            real(4), intent (in) :: u1
                            real(4), intent (in) :: u2
                            code = 1.0e0 * sqrt(-(-u1))
                        end function
                        
                        function code(cosTheta_i, u1, u2)
                        	return Float32(Float32(1.0) * sqrt(Float32(-Float32(-u1))))
                        end
                        
                        function tmp = code(cosTheta_i, u1, u2)
                        	tmp = single(1.0) * sqrt(-(-u1));
                        end
                        
                        \begin{array}{l}
                        
                        \\
                        1 \cdot \sqrt{-\left(-u1\right)}
                        \end{array}
                        
                        Derivation
                        1. Initial program 62.6%

                          \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        2. Add Preprocessing
                        3. Taylor expanded in u2 around 0

                          \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
                        4. Step-by-step derivation
                          1. Applied rewrites52.0%

                            \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
                          2. Taylor expanded in u1 around 0

                            \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{-1 \cdot u1}\right)} \cdot 1 \]
                          3. Step-by-step derivation
                            1. mul-1-negN/A

                              \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot 1 \]
                            2. lower-neg.f3261.9

                              \[\leadsto \sqrt{-\color{blue}{\left(-u1\right)}} \cdot 1 \]
                          4. Applied rewrites61.9%

                            \[\leadsto \sqrt{-\color{blue}{\left(-u1\right)}} \cdot 1 \]
                          5. Final simplification61.9%

                            \[\leadsto 1 \cdot \sqrt{-\left(-u1\right)} \]
                          6. Add Preprocessing

                          Alternative 16: 4.9% accurate, 12.8× speedup?

                          \[\begin{array}{l} \\ 1 \cdot \left(-\sqrt{u1}\right) \end{array} \]
                          (FPCore (cosTheta_i u1 u2) :precision binary32 (* 1.0 (- (sqrt u1))))
                          float code(float cosTheta_i, float u1, float u2) {
                          	return 1.0f * -sqrtf(u1);
                          }
                          
                          real(4) function code(costheta_i, u1, u2)
                              real(4), intent (in) :: costheta_i
                              real(4), intent (in) :: u1
                              real(4), intent (in) :: u2
                              code = 1.0e0 * -sqrt(u1)
                          end function
                          
                          function code(cosTheta_i, u1, u2)
                          	return Float32(Float32(1.0) * Float32(-sqrt(u1)))
                          end
                          
                          function tmp = code(cosTheta_i, u1, u2)
                          	tmp = single(1.0) * -sqrt(u1);
                          end
                          
                          \begin{array}{l}
                          
                          \\
                          1 \cdot \left(-\sqrt{u1}\right)
                          \end{array}
                          
                          Derivation
                          1. Initial program 62.6%

                            \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                          2. Add Preprocessing
                          3. Taylor expanded in u2 around 0

                            \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
                          4. Step-by-step derivation
                            1. Applied rewrites52.0%

                              \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
                            2. Taylor expanded in u1 around 0

                              \[\leadsto \color{blue}{\left(\sqrt{u1} \cdot {\left(\sqrt{-1}\right)}^{2}\right)} \cdot 1 \]
                            3. Step-by-step derivation
                              1. *-commutativeN/A

                                \[\leadsto \color{blue}{\left({\left(\sqrt{-1}\right)}^{2} \cdot \sqrt{u1}\right)} \cdot 1 \]
                              2. unpow2N/A

                                \[\leadsto \left(\color{blue}{\left(\sqrt{-1} \cdot \sqrt{-1}\right)} \cdot \sqrt{u1}\right) \cdot 1 \]
                              3. rem-square-sqrtN/A

                                \[\leadsto \left(\color{blue}{-1} \cdot \sqrt{u1}\right) \cdot 1 \]
                              4. mul-1-negN/A

                                \[\leadsto \color{blue}{\left(\mathsf{neg}\left(\sqrt{u1}\right)\right)} \cdot 1 \]
                              5. lower-neg.f32N/A

                                \[\leadsto \color{blue}{\left(\mathsf{neg}\left(\sqrt{u1}\right)\right)} \cdot 1 \]
                              6. lower-sqrt.f324.9

                                \[\leadsto \left(-\color{blue}{\sqrt{u1}}\right) \cdot 1 \]
                            4. Applied rewrites4.9%

                              \[\leadsto \color{blue}{\left(-\sqrt{u1}\right)} \cdot 1 \]
                            5. Final simplification4.9%

                              \[\leadsto 1 \cdot \left(-\sqrt{u1}\right) \]
                            6. Add Preprocessing

                            Reproduce

                            ?
                            herbie shell --seed 2024226 
                            (FPCore (cosTheta_i u1 u2)
                              :name "Beckmann Sample, near normal, slope_x"
                              :precision binary32
                              :pre (and (and (and (> cosTheta_i 0.9999) (<= cosTheta_i 1.0)) (and (<= 2.328306437e-10 u1) (<= u1 1.0))) (and (<= 2.328306437e-10 u2) (<= u2 1.0)))
                              (* (sqrt (- (log (- 1.0 u1)))) (cos (* (* 2.0 PI) u2))))