Beckmann Sample, near normal, slope_x

Percentage Accurate: 57.8% → 99.0%
Time: 13.0s
Alternatives: 17
Speedup: 21.0×

Specification

?
\[\left(\left(cosTheta\_i > 0.9999 \land cosTheta\_i \leq 1\right) \land \left(2.328306437 \cdot 10^{-10} \leq u1 \land u1 \leq 1\right)\right) \land \left(2.328306437 \cdot 10^{-10} \leq u2 \land u2 \leq 1\right)\]
\[\begin{array}{l} \\ \sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (* (sqrt (- (log (- 1.0 u1)))) (cos (* (* 2.0 PI) u2))))
float code(float cosTheta_i, float u1, float u2) {
	return sqrtf(-logf((1.0f - u1))) * cosf(((2.0f * ((float) M_PI)) * u2));
}
function code(cosTheta_i, u1, u2)
	return Float32(sqrt(Float32(-log(Float32(Float32(1.0) - u1)))) * cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)))
end
function tmp = code(cosTheta_i, u1, u2)
	tmp = sqrt(-log((single(1.0) - u1))) * cos(((single(2.0) * single(pi)) * u2));
end
\begin{array}{l}

\\
\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)
\end{array}

Sampling outcomes in binary32 precision:

Local Percentage Accuracy vs ?

The average percentage accuracy by input value. Horizontal axis shows value of an input variable; the variable is choosen in the title. Vertical axis is accuracy; higher is better. Red represent the original program, while blue represents Herbie's suggestion. These can be toggled with buttons below the plot. The line is an average while dots represent individual samples.

Accuracy vs Speed?

Herbie found 17 alternatives:

AlternativeAccuracySpeedup
The accuracy (vertical axis) and speed (horizontal axis) of each alternatives. Up and to the right is better. The red square shows the initial program, and each blue circle shows an alternative.The line shows the best available speed-accuracy tradeoffs.

Initial Program: 57.8% accurate, 1.0× speedup?

\[\begin{array}{l} \\ \sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (* (sqrt (- (log (- 1.0 u1)))) (cos (* (* 2.0 PI) u2))))
float code(float cosTheta_i, float u1, float u2) {
	return sqrtf(-logf((1.0f - u1))) * cosf(((2.0f * ((float) M_PI)) * u2));
}
function code(cosTheta_i, u1, u2)
	return Float32(sqrt(Float32(-log(Float32(Float32(1.0) - u1)))) * cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)))
end
function tmp = code(cosTheta_i, u1, u2)
	tmp = sqrt(-log((single(1.0) - u1))) * cos(((single(2.0) * single(pi)) * u2));
end
\begin{array}{l}

\\
\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)
\end{array}

Alternative 1: 99.0% accurate, 1.0× speedup?

\[\begin{array}{l} \\ \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\mathsf{log1p}\left(-u1\right)} \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (* (cos (* (* 2.0 PI) u2)) (sqrt (- (log1p (- u1))))))
float code(float cosTheta_i, float u1, float u2) {
	return cosf(((2.0f * ((float) M_PI)) * u2)) * sqrtf(-log1pf(-u1));
}
function code(cosTheta_i, u1, u2)
	return Float32(cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)) * sqrt(Float32(-log1p(Float32(-u1)))))
end
\begin{array}{l}

\\
\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\mathsf{log1p}\left(-u1\right)}
\end{array}
Derivation
  1. Initial program 57.5%

    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  2. Add Preprocessing
  3. Step-by-step derivation
    1. sub-negN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    2. accelerator-lowering-log1p.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    3. neg-lowering-neg.f3299.4

      \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  4. Applied egg-rr99.4%

    \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  5. Final simplification99.4%

    \[\leadsto \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\mathsf{log1p}\left(-u1\right)} \]
  6. Add Preprocessing

Alternative 2: 97.8% accurate, 0.6× speedup?

\[\begin{array}{l} \\ \begin{array}{l} t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\ \mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.13199999928474426:\\ \;\;\;\;t\_0 \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-\mathsf{log1p}\left(-u1\right)}\\ \end{array} \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (let* ((t_0 (cos (* (* 2.0 PI) u2))))
   (if (<= (* t_0 (sqrt (- (log (- 1.0 u1))))) 0.13199999928474426)
     (*
      t_0
      (sqrt (fma (* u1 u1) (fma u1 (fma u1 0.25 0.3333333333333333) 0.5) u1)))
     (* (fma u2 (* u2 (* -2.0 (* PI PI))) 1.0) (sqrt (- (log1p (- u1))))))))
float code(float cosTheta_i, float u1, float u2) {
	float t_0 = cosf(((2.0f * ((float) M_PI)) * u2));
	float tmp;
	if ((t_0 * sqrtf(-logf((1.0f - u1)))) <= 0.13199999928474426f) {
		tmp = t_0 * sqrtf(fmaf((u1 * u1), fmaf(u1, fmaf(u1, 0.25f, 0.3333333333333333f), 0.5f), u1));
	} else {
		tmp = fmaf(u2, (u2 * (-2.0f * (((float) M_PI) * ((float) M_PI)))), 1.0f) * sqrtf(-log1pf(-u1));
	}
	return tmp;
}
function code(cosTheta_i, u1, u2)
	t_0 = cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2))
	tmp = Float32(0.0)
	if (Float32(t_0 * sqrt(Float32(-log(Float32(Float32(1.0) - u1))))) <= Float32(0.13199999928474426))
		tmp = Float32(t_0 * sqrt(fma(Float32(u1 * u1), fma(u1, fma(u1, Float32(0.25), Float32(0.3333333333333333)), Float32(0.5)), u1)));
	else
		tmp = Float32(fma(u2, Float32(u2 * Float32(Float32(-2.0) * Float32(Float32(pi) * Float32(pi)))), Float32(1.0)) * sqrt(Float32(-log1p(Float32(-u1)))));
	end
	return tmp
end
\begin{array}{l}

\\
\begin{array}{l}
t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\
\mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.13199999928474426:\\
\;\;\;\;t\_0 \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}\\

\mathbf{else}:\\
\;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-\mathsf{log1p}\left(-u1\right)}\\


\end{array}
\end{array}
Derivation
  1. Split input into 2 regimes
  2. if (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))) < 0.131999999

    1. Initial program 47.5%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Taylor expanded in u1 around 0

      \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    4. Step-by-step derivation
      1. +-commutativeN/A

        \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. distribute-lft-inN/A

        \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. associate-*r*N/A

        \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)} + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. unpow2N/A

        \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      5. *-rgt-identityN/A

        \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      6. accelerator-lowering-fma.f32N/A

        \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      7. unpow2N/A

        \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      8. *-lowering-*.f32N/A

        \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      9. +-commutativeN/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right) + \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      10. accelerator-lowering-fma.f32N/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, \frac{1}{3} + \frac{1}{4} \cdot u1, \frac{1}{2}\right)}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      11. +-commutativeN/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\frac{1}{4} \cdot u1 + \frac{1}{3}}, \frac{1}{2}\right), u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      12. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{4}} + \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      13. accelerator-lowering-fma.f3299.3

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right)}, 0.5\right), u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    5. Simplified99.3%

      \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]

    if 0.131999999 < (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)))

    1. Initial program 97.4%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Step-by-step derivation
      1. sub-negN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. accelerator-lowering-log1p.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. neg-lowering-neg.f3299.7

        \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    4. Applied egg-rr99.7%

      \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    5. Taylor expanded in u2 around 0

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
    6. Step-by-step derivation
      1. +-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
      2. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot -2} + 1\right) \]
      3. associate-*r*N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{{u2}^{2} \cdot \left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} + 1\right) \]
      4. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left({u2}^{2} \cdot \color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)} + 1\right) \]
      5. unpow2N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left(u2 \cdot u2\right)} \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right) \]
      6. associate-*l*N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{u2 \cdot \left(u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} + 1\right) \]
      7. accelerator-lowering-fma.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right), 1\right)} \]
      8. *-lowering-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, \color{blue}{u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)}, 1\right) \]
      9. *-lowering-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)}, 1\right) \]
      10. unpow2N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)}\right), 1\right) \]
      11. *-lowering-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)}\right), 1\right) \]
      12. PI-lowering-PI.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right)\right)\right), 1\right) \]
      13. PI-lowering-PI.f3297.8

        \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \color{blue}{\pi}\right)\right), 1\right) \]
    7. Simplified97.8%

      \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right)} \]
  3. Recombined 2 regimes into one program.
  4. Final simplification99.0%

    \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.13199999928474426:\\ \;\;\;\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-\mathsf{log1p}\left(-u1\right)}\\ \end{array} \]
  5. Add Preprocessing

Alternative 3: 97.5% accurate, 0.6× speedup?

\[\begin{array}{l} \\ \begin{array}{l} t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\ \mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.13199999928474426:\\ \;\;\;\;t\_0 \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-\mathsf{log1p}\left(-u1\right)}\\ \end{array} \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (let* ((t_0 (cos (* (* 2.0 PI) u2))))
   (if (<= (* t_0 (sqrt (- (log (- 1.0 u1))))) 0.13199999928474426)
     (* t_0 (sqrt (fma (* u1 u1) (fma u1 0.3333333333333333 0.5) u1)))
     (* (fma u2 (* u2 (* -2.0 (* PI PI))) 1.0) (sqrt (- (log1p (- u1))))))))
float code(float cosTheta_i, float u1, float u2) {
	float t_0 = cosf(((2.0f * ((float) M_PI)) * u2));
	float tmp;
	if ((t_0 * sqrtf(-logf((1.0f - u1)))) <= 0.13199999928474426f) {
		tmp = t_0 * sqrtf(fmaf((u1 * u1), fmaf(u1, 0.3333333333333333f, 0.5f), u1));
	} else {
		tmp = fmaf(u2, (u2 * (-2.0f * (((float) M_PI) * ((float) M_PI)))), 1.0f) * sqrtf(-log1pf(-u1));
	}
	return tmp;
}
function code(cosTheta_i, u1, u2)
	t_0 = cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2))
	tmp = Float32(0.0)
	if (Float32(t_0 * sqrt(Float32(-log(Float32(Float32(1.0) - u1))))) <= Float32(0.13199999928474426))
		tmp = Float32(t_0 * sqrt(fma(Float32(u1 * u1), fma(u1, Float32(0.3333333333333333), Float32(0.5)), u1)));
	else
		tmp = Float32(fma(u2, Float32(u2 * Float32(Float32(-2.0) * Float32(Float32(pi) * Float32(pi)))), Float32(1.0)) * sqrt(Float32(-log1p(Float32(-u1)))));
	end
	return tmp
end
\begin{array}{l}

\\
\begin{array}{l}
t_0 := \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)\\
\mathbf{if}\;t\_0 \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.13199999928474426:\\
\;\;\;\;t\_0 \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\

\mathbf{else}:\\
\;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-\mathsf{log1p}\left(-u1\right)}\\


\end{array}
\end{array}
Derivation
  1. Split input into 2 regimes
  2. if (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))) < 0.131999999

    1. Initial program 47.5%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Taylor expanded in u1 around 0

      \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    4. Step-by-step derivation
      1. +-commutativeN/A

        \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. distribute-lft-inN/A

        \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. associate-*r*N/A

        \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)} + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. unpow2N/A

        \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      5. *-rgt-identityN/A

        \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      6. accelerator-lowering-fma.f32N/A

        \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      7. unpow2N/A

        \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      8. *-lowering-*.f32N/A

        \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      9. +-commutativeN/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\frac{1}{3} \cdot u1 + \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      10. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \frac{1}{3}} + \frac{1}{2}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      11. accelerator-lowering-fma.f3298.9

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    5. Simplified98.9%

      \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]

    if 0.131999999 < (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)))

    1. Initial program 97.4%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Step-by-step derivation
      1. sub-negN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. accelerator-lowering-log1p.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. neg-lowering-neg.f3299.7

        \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    4. Applied egg-rr99.7%

      \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    5. Taylor expanded in u2 around 0

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
    6. Step-by-step derivation
      1. +-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
      2. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot -2} + 1\right) \]
      3. associate-*r*N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{{u2}^{2} \cdot \left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} + 1\right) \]
      4. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left({u2}^{2} \cdot \color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)} + 1\right) \]
      5. unpow2N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left(u2 \cdot u2\right)} \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right) \]
      6. associate-*l*N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{u2 \cdot \left(u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} + 1\right) \]
      7. accelerator-lowering-fma.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right), 1\right)} \]
      8. *-lowering-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, \color{blue}{u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)}, 1\right) \]
      9. *-lowering-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)}, 1\right) \]
      10. unpow2N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)}\right), 1\right) \]
      11. *-lowering-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)}\right), 1\right) \]
      12. PI-lowering-PI.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right)\right)\right), 1\right) \]
      13. PI-lowering-PI.f3297.8

        \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \color{blue}{\pi}\right)\right), 1\right) \]
    7. Simplified97.8%

      \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right)} \]
  3. Recombined 2 regimes into one program.
  4. Final simplification98.7%

    \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.13199999928474426:\\ \;\;\;\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-\mathsf{log1p}\left(-u1\right)}\\ \end{array} \]
  5. Add Preprocessing

Alternative 4: 78.8% accurate, 0.8× speedup?

\[\begin{array}{l} \\ \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.002199999988079071:\\ \;\;\;\;\mathsf{fma}\left(\sqrt{u1}, \left(\pi \cdot \pi\right) \cdot \left(-2 \cdot \left(u2 \cdot u2\right)\right), \sqrt{u1}\right)\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\ \end{array} \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (if (<=
      (* (cos (* (* 2.0 PI) u2)) (sqrt (- (log (- 1.0 u1)))))
      0.002199999988079071)
   (fma (sqrt u1) (* (* PI PI) (* -2.0 (* u2 u2))) (sqrt u1))
   (sqrt (- (* u1 (fma u1 (fma u1 -0.3333333333333333 -0.5) -1.0))))))
float code(float cosTheta_i, float u1, float u2) {
	float tmp;
	if ((cosf(((2.0f * ((float) M_PI)) * u2)) * sqrtf(-logf((1.0f - u1)))) <= 0.002199999988079071f) {
		tmp = fmaf(sqrtf(u1), ((((float) M_PI) * ((float) M_PI)) * (-2.0f * (u2 * u2))), sqrtf(u1));
	} else {
		tmp = sqrtf(-(u1 * fmaf(u1, fmaf(u1, -0.3333333333333333f, -0.5f), -1.0f)));
	}
	return tmp;
}
function code(cosTheta_i, u1, u2)
	tmp = Float32(0.0)
	if (Float32(cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)) * sqrt(Float32(-log(Float32(Float32(1.0) - u1))))) <= Float32(0.002199999988079071))
		tmp = fma(sqrt(u1), Float32(Float32(Float32(pi) * Float32(pi)) * Float32(Float32(-2.0) * Float32(u2 * u2))), sqrt(u1));
	else
		tmp = sqrt(Float32(-Float32(u1 * fma(u1, fma(u1, Float32(-0.3333333333333333), Float32(-0.5)), Float32(-1.0)))));
	end
	return tmp
end
\begin{array}{l}

\\
\begin{array}{l}
\mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.002199999988079071:\\
\;\;\;\;\mathsf{fma}\left(\sqrt{u1}, \left(\pi \cdot \pi\right) \cdot \left(-2 \cdot \left(u2 \cdot u2\right)\right), \sqrt{u1}\right)\\

\mathbf{else}:\\
\;\;\;\;\sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\


\end{array}
\end{array}
Derivation
  1. Split input into 2 regimes
  2. if (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))) < 0.0022

    1. Initial program 27.5%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Taylor expanded in u1 around 0

      \[\leadsto \sqrt{\color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    4. Step-by-step derivation
      1. Simplified96.5%

        \[\leadsto \sqrt{\color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      2. Step-by-step derivation
        1. associate-*l*N/A

          \[\leadsto \sqrt{u1} \cdot \cos \color{blue}{\left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
        2. cos-2N/A

          \[\leadsto \sqrt{u1} \cdot \color{blue}{\left(\cos \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \cos \left(\mathsf{PI}\left(\right) \cdot u2\right) - \sin \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \sin \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
        3. --lowering--.f32N/A

          \[\leadsto \sqrt{u1} \cdot \color{blue}{\left(\cos \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \cos \left(\mathsf{PI}\left(\right) \cdot u2\right) - \sin \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \sin \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
      3. Applied egg-rr96.4%

        \[\leadsto \sqrt{u1} \cdot \color{blue}{\left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right) - \left(0.5 - 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right)\right)} \]
      4. Taylor expanded in u2 around 0

        \[\leadsto \color{blue}{\sqrt{u1} + -2 \cdot \left(\sqrt{u1} \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
      5. Step-by-step derivation
        1. +-commutativeN/A

          \[\leadsto \color{blue}{-2 \cdot \left(\sqrt{u1} \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right) + \sqrt{u1}} \]
        2. *-commutativeN/A

          \[\leadsto \color{blue}{\left(\sqrt{u1} \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right) \cdot -2} + \sqrt{u1} \]
        3. associate-*l*N/A

          \[\leadsto \color{blue}{\sqrt{u1} \cdot \left(\left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot -2\right)} + \sqrt{u1} \]
        4. *-commutativeN/A

          \[\leadsto \sqrt{u1} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} + \sqrt{u1} \]
        5. *-commutativeN/A

          \[\leadsto \sqrt{u1} \cdot \left(-2 \cdot \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}\right)}\right) + \sqrt{u1} \]
        6. associate-*r*N/A

          \[\leadsto \sqrt{u1} \cdot \color{blue}{\left(\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}\right)} + \sqrt{u1} \]
        7. accelerator-lowering-fma.f32N/A

          \[\leadsto \color{blue}{\mathsf{fma}\left(\sqrt{u1}, \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}, \sqrt{u1}\right)} \]
        8. sqrt-lowering-sqrt.f32N/A

          \[\leadsto \mathsf{fma}\left(\color{blue}{\sqrt{u1}}, \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}, \sqrt{u1}\right) \]
        9. *-commutativeN/A

          \[\leadsto \mathsf{fma}\left(\sqrt{u1}, \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} \cdot {u2}^{2}, \sqrt{u1}\right) \]
        10. associate-*l*N/A

          \[\leadsto \mathsf{fma}\left(\sqrt{u1}, \color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot \left(-2 \cdot {u2}^{2}\right)}, \sqrt{u1}\right) \]
        11. *-lowering-*.f32N/A

          \[\leadsto \mathsf{fma}\left(\sqrt{u1}, \color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot \left(-2 \cdot {u2}^{2}\right)}, \sqrt{u1}\right) \]
        12. unpow2N/A

          \[\leadsto \mathsf{fma}\left(\sqrt{u1}, \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)} \cdot \left(-2 \cdot {u2}^{2}\right), \sqrt{u1}\right) \]
        13. *-lowering-*.f32N/A

          \[\leadsto \mathsf{fma}\left(\sqrt{u1}, \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)} \cdot \left(-2 \cdot {u2}^{2}\right), \sqrt{u1}\right) \]
        14. PI-lowering-PI.f32N/A

          \[\leadsto \mathsf{fma}\left(\sqrt{u1}, \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right)\right) \cdot \left(-2 \cdot {u2}^{2}\right), \sqrt{u1}\right) \]
        15. PI-lowering-PI.f32N/A

          \[\leadsto \mathsf{fma}\left(\sqrt{u1}, \left(\mathsf{PI}\left(\right) \cdot \color{blue}{\mathsf{PI}\left(\right)}\right) \cdot \left(-2 \cdot {u2}^{2}\right), \sqrt{u1}\right) \]
        16. *-lowering-*.f32N/A

          \[\leadsto \mathsf{fma}\left(\sqrt{u1}, \left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right) \cdot \color{blue}{\left(-2 \cdot {u2}^{2}\right)}, \sqrt{u1}\right) \]
        17. unpow2N/A

          \[\leadsto \mathsf{fma}\left(\sqrt{u1}, \left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right) \cdot \left(-2 \cdot \color{blue}{\left(u2 \cdot u2\right)}\right), \sqrt{u1}\right) \]
        18. *-lowering-*.f32N/A

          \[\leadsto \mathsf{fma}\left(\sqrt{u1}, \left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right) \cdot \left(-2 \cdot \color{blue}{\left(u2 \cdot u2\right)}\right), \sqrt{u1}\right) \]
        19. sqrt-lowering-sqrt.f3285.0

          \[\leadsto \mathsf{fma}\left(\sqrt{u1}, \left(\pi \cdot \pi\right) \cdot \left(-2 \cdot \left(u2 \cdot u2\right)\right), \color{blue}{\sqrt{u1}}\right) \]
      6. Simplified85.0%

        \[\leadsto \color{blue}{\mathsf{fma}\left(\sqrt{u1}, \left(\pi \cdot \pi\right) \cdot \left(-2 \cdot \left(u2 \cdot u2\right)\right), \sqrt{u1}\right)} \]

      if 0.0022 < (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)))

      1. Initial program 83.9%

        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      2. Add Preprocessing
      3. Taylor expanded in u1 around 0

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. Step-by-step derivation
        1. *-lowering-*.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        2. sub-negN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \left(\mathsf{neg}\left(1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        3. metadata-evalN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \color{blue}{-1}\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        4. accelerator-lowering-fma.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\mathsf{fma}\left(u1, \frac{-1}{3} \cdot u1 - \frac{1}{2}, -1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        5. sub-negN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\frac{-1}{3} \cdot u1 + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right)}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        6. *-commutativeN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{-1}{3}} + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right), -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        7. metadata-evalN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, u1 \cdot \frac{-1}{3} + \color{blue}{\frac{-1}{2}}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        8. accelerator-lowering-fma.f3285.0

          \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right)}, -1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      5. Simplified85.0%

        \[\leadsto \sqrt{-\color{blue}{u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      6. Taylor expanded in u2 around 0

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \color{blue}{1} \]
      7. Step-by-step derivation
        1. Simplified76.8%

          \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)} \cdot \color{blue}{1} \]
      8. Recombined 2 regimes into one program.
      9. Final simplification80.7%

        \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.002199999988079071:\\ \;\;\;\;\mathsf{fma}\left(\sqrt{u1}, \left(\pi \cdot \pi\right) \cdot \left(-2 \cdot \left(u2 \cdot u2\right)\right), \sqrt{u1}\right)\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\ \end{array} \]
      10. Add Preprocessing

      Alternative 5: 78.8% accurate, 0.8× speedup?

      \[\begin{array}{l} \\ \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.002199999988079071:\\ \;\;\;\;\mathsf{fma}\left(\sqrt{u1}, u2 \cdot \left(u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right)\right), \sqrt{u1}\right)\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\ \end{array} \end{array} \]
      (FPCore (cosTheta_i u1 u2)
       :precision binary32
       (if (<=
            (* (cos (* (* 2.0 PI) u2)) (sqrt (- (log (- 1.0 u1)))))
            0.002199999988079071)
         (fma (sqrt u1) (* u2 (* u2 (* -2.0 (* PI PI)))) (sqrt u1))
         (sqrt (- (* u1 (fma u1 (fma u1 -0.3333333333333333 -0.5) -1.0))))))
      float code(float cosTheta_i, float u1, float u2) {
      	float tmp;
      	if ((cosf(((2.0f * ((float) M_PI)) * u2)) * sqrtf(-logf((1.0f - u1)))) <= 0.002199999988079071f) {
      		tmp = fmaf(sqrtf(u1), (u2 * (u2 * (-2.0f * (((float) M_PI) * ((float) M_PI))))), sqrtf(u1));
      	} else {
      		tmp = sqrtf(-(u1 * fmaf(u1, fmaf(u1, -0.3333333333333333f, -0.5f), -1.0f)));
      	}
      	return tmp;
      }
      
      function code(cosTheta_i, u1, u2)
      	tmp = Float32(0.0)
      	if (Float32(cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)) * sqrt(Float32(-log(Float32(Float32(1.0) - u1))))) <= Float32(0.002199999988079071))
      		tmp = fma(sqrt(u1), Float32(u2 * Float32(u2 * Float32(Float32(-2.0) * Float32(Float32(pi) * Float32(pi))))), sqrt(u1));
      	else
      		tmp = sqrt(Float32(-Float32(u1 * fma(u1, fma(u1, Float32(-0.3333333333333333), Float32(-0.5)), Float32(-1.0)))));
      	end
      	return tmp
      end
      
      \begin{array}{l}
      
      \\
      \begin{array}{l}
      \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.002199999988079071:\\
      \;\;\;\;\mathsf{fma}\left(\sqrt{u1}, u2 \cdot \left(u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right)\right), \sqrt{u1}\right)\\
      
      \mathbf{else}:\\
      \;\;\;\;\sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\
      
      
      \end{array}
      \end{array}
      
      Derivation
      1. Split input into 2 regimes
      2. if (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))) < 0.0022

        1. Initial program 27.5%

          \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
        2. Add Preprocessing
        3. Taylor expanded in u1 around 0

          \[\leadsto \sqrt{\color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
        4. Step-by-step derivation
          1. Simplified96.5%

            \[\leadsto \sqrt{\color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          2. Taylor expanded in u2 around 0

            \[\leadsto \color{blue}{\sqrt{u1} + -2 \cdot \left(\sqrt{u1} \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
          3. Step-by-step derivation
            1. +-commutativeN/A

              \[\leadsto \color{blue}{-2 \cdot \left(\sqrt{u1} \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right) + \sqrt{u1}} \]
            2. *-commutativeN/A

              \[\leadsto \color{blue}{\left(\sqrt{u1} \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right) \cdot -2} + \sqrt{u1} \]
            3. associate-*l*N/A

              \[\leadsto \color{blue}{\sqrt{u1} \cdot \left(\left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot -2\right)} + \sqrt{u1} \]
            4. *-commutativeN/A

              \[\leadsto \sqrt{u1} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} + \sqrt{u1} \]
            5. *-commutativeN/A

              \[\leadsto \sqrt{u1} \cdot \left(-2 \cdot \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}\right)}\right) + \sqrt{u1} \]
            6. associate-*r*N/A

              \[\leadsto \sqrt{u1} \cdot \color{blue}{\left(\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}\right)} + \sqrt{u1} \]
            7. accelerator-lowering-fma.f32N/A

              \[\leadsto \color{blue}{\mathsf{fma}\left(\sqrt{u1}, \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}, \sqrt{u1}\right)} \]
          4. Simplified85.0%

            \[\leadsto \color{blue}{\mathsf{fma}\left(\sqrt{u1}, u2 \cdot \left(u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right)\right), \sqrt{u1}\right)} \]

          if 0.0022 < (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)))

          1. Initial program 83.9%

            \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          2. Add Preprocessing
          3. Taylor expanded in u1 around 0

            \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
          4. Step-by-step derivation
            1. *-lowering-*.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            2. sub-negN/A

              \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \left(\mathsf{neg}\left(1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            3. metadata-evalN/A

              \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \color{blue}{-1}\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            4. accelerator-lowering-fma.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\mathsf{fma}\left(u1, \frac{-1}{3} \cdot u1 - \frac{1}{2}, -1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            5. sub-negN/A

              \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\frac{-1}{3} \cdot u1 + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right)}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            6. *-commutativeN/A

              \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{-1}{3}} + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right), -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            7. metadata-evalN/A

              \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, u1 \cdot \frac{-1}{3} + \color{blue}{\frac{-1}{2}}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            8. accelerator-lowering-fma.f3285.0

              \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right)}, -1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          5. Simplified85.0%

            \[\leadsto \sqrt{-\color{blue}{u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          6. Taylor expanded in u2 around 0

            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \color{blue}{1} \]
          7. Step-by-step derivation
            1. Simplified76.8%

              \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)} \cdot \color{blue}{1} \]
          8. Recombined 2 regimes into one program.
          9. Final simplification80.7%

            \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.002199999988079071:\\ \;\;\;\;\mathsf{fma}\left(\sqrt{u1}, u2 \cdot \left(u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right)\right), \sqrt{u1}\right)\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\ \end{array} \]
          10. Add Preprocessing

          Alternative 6: 78.8% accurate, 0.8× speedup?

          \[\begin{array}{l} \\ \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.002199999988079071:\\ \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{u1}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\ \end{array} \end{array} \]
          (FPCore (cosTheta_i u1 u2)
           :precision binary32
           (if (<=
                (* (cos (* (* 2.0 PI) u2)) (sqrt (- (log (- 1.0 u1)))))
                0.002199999988079071)
             (* (fma u2 (* u2 (* -2.0 (* PI PI))) 1.0) (sqrt u1))
             (sqrt (- (* u1 (fma u1 (fma u1 -0.3333333333333333 -0.5) -1.0))))))
          float code(float cosTheta_i, float u1, float u2) {
          	float tmp;
          	if ((cosf(((2.0f * ((float) M_PI)) * u2)) * sqrtf(-logf((1.0f - u1)))) <= 0.002199999988079071f) {
          		tmp = fmaf(u2, (u2 * (-2.0f * (((float) M_PI) * ((float) M_PI)))), 1.0f) * sqrtf(u1);
          	} else {
          		tmp = sqrtf(-(u1 * fmaf(u1, fmaf(u1, -0.3333333333333333f, -0.5f), -1.0f)));
          	}
          	return tmp;
          }
          
          function code(cosTheta_i, u1, u2)
          	tmp = Float32(0.0)
          	if (Float32(cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)) * sqrt(Float32(-log(Float32(Float32(1.0) - u1))))) <= Float32(0.002199999988079071))
          		tmp = Float32(fma(u2, Float32(u2 * Float32(Float32(-2.0) * Float32(Float32(pi) * Float32(pi)))), Float32(1.0)) * sqrt(u1));
          	else
          		tmp = sqrt(Float32(-Float32(u1 * fma(u1, fma(u1, Float32(-0.3333333333333333), Float32(-0.5)), Float32(-1.0)))));
          	end
          	return tmp
          end
          
          \begin{array}{l}
          
          \\
          \begin{array}{l}
          \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.002199999988079071:\\
          \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{u1}\\
          
          \mathbf{else}:\\
          \;\;\;\;\sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\
          
          
          \end{array}
          \end{array}
          
          Derivation
          1. Split input into 2 regimes
          2. if (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))) < 0.0022

            1. Initial program 27.5%

              \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            2. Add Preprocessing
            3. Taylor expanded in u1 around 0

              \[\leadsto \sqrt{\color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            4. Step-by-step derivation
              1. Simplified96.5%

                \[\leadsto \sqrt{\color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
              2. Taylor expanded in u2 around 0

                \[\leadsto \sqrt{u1} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
              3. Step-by-step derivation
                1. +-commutativeN/A

                  \[\leadsto \sqrt{u1} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
                2. *-commutativeN/A

                  \[\leadsto \sqrt{u1} \cdot \left(\color{blue}{\left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot -2} + 1\right) \]
                3. associate-*r*N/A

                  \[\leadsto \sqrt{u1} \cdot \left(\color{blue}{{u2}^{2} \cdot \left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} + 1\right) \]
                4. *-commutativeN/A

                  \[\leadsto \sqrt{u1} \cdot \left({u2}^{2} \cdot \color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)} + 1\right) \]
                5. unpow2N/A

                  \[\leadsto \sqrt{u1} \cdot \left(\color{blue}{\left(u2 \cdot u2\right)} \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right) \]
                6. associate-*l*N/A

                  \[\leadsto \sqrt{u1} \cdot \left(\color{blue}{u2 \cdot \left(u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} + 1\right) \]
                7. accelerator-lowering-fma.f32N/A

                  \[\leadsto \sqrt{u1} \cdot \color{blue}{\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right), 1\right)} \]
                8. *-lowering-*.f32N/A

                  \[\leadsto \sqrt{u1} \cdot \mathsf{fma}\left(u2, \color{blue}{u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)}, 1\right) \]
                9. *-lowering-*.f32N/A

                  \[\leadsto \sqrt{u1} \cdot \mathsf{fma}\left(u2, u2 \cdot \color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)}, 1\right) \]
                10. unpow2N/A

                  \[\leadsto \sqrt{u1} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)}\right), 1\right) \]
                11. *-lowering-*.f32N/A

                  \[\leadsto \sqrt{u1} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)}\right), 1\right) \]
                12. PI-lowering-PI.f32N/A

                  \[\leadsto \sqrt{u1} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right)\right)\right), 1\right) \]
                13. PI-lowering-PI.f3284.9

                  \[\leadsto \sqrt{u1} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \color{blue}{\pi}\right)\right), 1\right) \]
              4. Simplified84.9%

                \[\leadsto \sqrt{u1} \cdot \color{blue}{\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right)} \]

              if 0.0022 < (*.f32 (sqrt.f32 (neg.f32 (log.f32 (-.f32 #s(literal 1 binary32) u1)))) (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)))

              1. Initial program 83.9%

                \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
              2. Add Preprocessing
              3. Taylor expanded in u1 around 0

                \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              4. Step-by-step derivation
                1. *-lowering-*.f32N/A

                  \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                2. sub-negN/A

                  \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \left(\mathsf{neg}\left(1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                3. metadata-evalN/A

                  \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \color{blue}{-1}\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                4. accelerator-lowering-fma.f32N/A

                  \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\mathsf{fma}\left(u1, \frac{-1}{3} \cdot u1 - \frac{1}{2}, -1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                5. sub-negN/A

                  \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\frac{-1}{3} \cdot u1 + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right)}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                6. *-commutativeN/A

                  \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{-1}{3}} + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right), -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                7. metadata-evalN/A

                  \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, u1 \cdot \frac{-1}{3} + \color{blue}{\frac{-1}{2}}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                8. accelerator-lowering-fma.f3285.0

                  \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right)}, -1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
              5. Simplified85.0%

                \[\leadsto \sqrt{-\color{blue}{u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
              6. Taylor expanded in u2 around 0

                \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \color{blue}{1} \]
              7. Step-by-step derivation
                1. Simplified76.8%

                  \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)} \cdot \color{blue}{1} \]
              8. Recombined 2 regimes into one program.
              9. Final simplification80.6%

                \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{-\log \left(1 - u1\right)} \leq 0.002199999988079071:\\ \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{u1}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\ \end{array} \]
              10. Add Preprocessing

              Alternative 7: 96.7% accurate, 1.5× speedup?

              \[\begin{array}{l} \\ \begin{array}{l} t_0 := \left(2 \cdot \pi\right) \cdot u2\\ \mathbf{if}\;t\_0 \leq 0.04800000041723251:\\ \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-\mathsf{log1p}\left(-u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\cos t\_0 \cdot \sqrt{u1 \cdot \mathsf{fma}\left(u1, 0.5, 1\right)}\\ \end{array} \end{array} \]
              (FPCore (cosTheta_i u1 u2)
               :precision binary32
               (let* ((t_0 (* (* 2.0 PI) u2)))
                 (if (<= t_0 0.04800000041723251)
                   (* (fma u2 (* u2 (* -2.0 (* PI PI))) 1.0) (sqrt (- (log1p (- u1)))))
                   (* (cos t_0) (sqrt (* u1 (fma u1 0.5 1.0)))))))
              float code(float cosTheta_i, float u1, float u2) {
              	float t_0 = (2.0f * ((float) M_PI)) * u2;
              	float tmp;
              	if (t_0 <= 0.04800000041723251f) {
              		tmp = fmaf(u2, (u2 * (-2.0f * (((float) M_PI) * ((float) M_PI)))), 1.0f) * sqrtf(-log1pf(-u1));
              	} else {
              		tmp = cosf(t_0) * sqrtf((u1 * fmaf(u1, 0.5f, 1.0f)));
              	}
              	return tmp;
              }
              
              function code(cosTheta_i, u1, u2)
              	t_0 = Float32(Float32(Float32(2.0) * Float32(pi)) * u2)
              	tmp = Float32(0.0)
              	if (t_0 <= Float32(0.04800000041723251))
              		tmp = Float32(fma(u2, Float32(u2 * Float32(Float32(-2.0) * Float32(Float32(pi) * Float32(pi)))), Float32(1.0)) * sqrt(Float32(-log1p(Float32(-u1)))));
              	else
              		tmp = Float32(cos(t_0) * sqrt(Float32(u1 * fma(u1, Float32(0.5), Float32(1.0)))));
              	end
              	return tmp
              end
              
              \begin{array}{l}
              
              \\
              \begin{array}{l}
              t_0 := \left(2 \cdot \pi\right) \cdot u2\\
              \mathbf{if}\;t\_0 \leq 0.04800000041723251:\\
              \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-\mathsf{log1p}\left(-u1\right)}\\
              
              \mathbf{else}:\\
              \;\;\;\;\cos t\_0 \cdot \sqrt{u1 \cdot \mathsf{fma}\left(u1, 0.5, 1\right)}\\
              
              
              \end{array}
              \end{array}
              
              Derivation
              1. Split input into 2 regimes
              2. if (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2) < 0.0480000004

                1. Initial program 59.3%

                  \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                2. Add Preprocessing
                3. Step-by-step derivation
                  1. sub-negN/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  2. accelerator-lowering-log1p.f32N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  3. neg-lowering-neg.f3299.6

                    \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                4. Applied egg-rr99.6%

                  \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                5. Taylor expanded in u2 around 0

                  \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
                6. Step-by-step derivation
                  1. +-commutativeN/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
                  2. *-commutativeN/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot -2} + 1\right) \]
                  3. associate-*r*N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{{u2}^{2} \cdot \left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} + 1\right) \]
                  4. *-commutativeN/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left({u2}^{2} \cdot \color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)} + 1\right) \]
                  5. unpow2N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left(u2 \cdot u2\right)} \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right) \]
                  6. associate-*l*N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{u2 \cdot \left(u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} + 1\right) \]
                  7. accelerator-lowering-fma.f32N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right), 1\right)} \]
                  8. *-lowering-*.f32N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, \color{blue}{u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)}, 1\right) \]
                  9. *-lowering-*.f32N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)}, 1\right) \]
                  10. unpow2N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)}\right), 1\right) \]
                  11. *-lowering-*.f32N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)}\right), 1\right) \]
                  12. PI-lowering-PI.f32N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right)\right)\right), 1\right) \]
                  13. PI-lowering-PI.f3299.5

                    \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \color{blue}{\pi}\right)\right), 1\right) \]
                7. Simplified99.5%

                  \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right)} \]

                if 0.0480000004 < (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)

                1. Initial program 49.8%

                  \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                2. Add Preprocessing
                3. Taylor expanded in u1 around 0

                  \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + \frac{1}{2} \cdot u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                4. Step-by-step derivation
                  1. +-commutativeN/A

                    \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1 + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  2. distribute-lft-inN/A

                    \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  3. *-rgt-identityN/A

                    \[\leadsto \sqrt{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  4. accelerator-lowering-fma.f32N/A

                    \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  5. *-commutativeN/A

                    \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  6. *-lowering-*.f3290.9

                    \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                5. Simplified90.9%

                  \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                6. Step-by-step derivation
                  1. *-commutativeN/A

                    \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot \frac{1}{2}\right) \cdot u1} + u1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  2. distribute-lft1-inN/A

                    \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot \frac{1}{2} + 1\right) \cdot u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  3. *-lowering-*.f32N/A

                    \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot \frac{1}{2} + 1\right) \cdot u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  4. accelerator-lowering-fma.f3290.9

                    \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, 0.5, 1\right)} \cdot u1} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                7. Applied egg-rr90.9%

                  \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, 0.5, 1\right) \cdot u1}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
              3. Recombined 2 regimes into one program.
              4. Final simplification97.8%

                \[\leadsto \begin{array}{l} \mathbf{if}\;\left(2 \cdot \pi\right) \cdot u2 \leq 0.04800000041723251:\\ \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-\mathsf{log1p}\left(-u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{u1 \cdot \mathsf{fma}\left(u1, 0.5, 1\right)}\\ \end{array} \]
              5. Add Preprocessing

              Alternative 8: 94.5% accurate, 1.5× speedup?

              \[\begin{array}{l} \\ \begin{array}{l} t_0 := \left(2 \cdot \pi\right) \cdot u2\\ \mathbf{if}\;t\_0 \leq 0.0011599999852478504:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\cos t\_0 \cdot \sqrt{u1 \cdot \mathsf{fma}\left(u1, 0.5, 1\right)}\\ \end{array} \end{array} \]
              (FPCore (cosTheta_i u1 u2)
               :precision binary32
               (let* ((t_0 (* (* 2.0 PI) u2)))
                 (if (<= t_0 0.0011599999852478504)
                   (sqrt (- (log1p (- u1))))
                   (* (cos t_0) (sqrt (* u1 (fma u1 0.5 1.0)))))))
              float code(float cosTheta_i, float u1, float u2) {
              	float t_0 = (2.0f * ((float) M_PI)) * u2;
              	float tmp;
              	if (t_0 <= 0.0011599999852478504f) {
              		tmp = sqrtf(-log1pf(-u1));
              	} else {
              		tmp = cosf(t_0) * sqrtf((u1 * fmaf(u1, 0.5f, 1.0f)));
              	}
              	return tmp;
              }
              
              function code(cosTheta_i, u1, u2)
              	t_0 = Float32(Float32(Float32(2.0) * Float32(pi)) * u2)
              	tmp = Float32(0.0)
              	if (t_0 <= Float32(0.0011599999852478504))
              		tmp = sqrt(Float32(-log1p(Float32(-u1))));
              	else
              		tmp = Float32(cos(t_0) * sqrt(Float32(u1 * fma(u1, Float32(0.5), Float32(1.0)))));
              	end
              	return tmp
              end
              
              \begin{array}{l}
              
              \\
              \begin{array}{l}
              t_0 := \left(2 \cdot \pi\right) \cdot u2\\
              \mathbf{if}\;t\_0 \leq 0.0011599999852478504:\\
              \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)}\\
              
              \mathbf{else}:\\
              \;\;\;\;\cos t\_0 \cdot \sqrt{u1 \cdot \mathsf{fma}\left(u1, 0.5, 1\right)}\\
              
              
              \end{array}
              \end{array}
              
              Derivation
              1. Split input into 2 regimes
              2. if (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2) < 0.00115999999

                1. Initial program 60.9%

                  \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                2. Add Preprocessing
                3. Step-by-step derivation
                  1. sub-negN/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  2. accelerator-lowering-log1p.f32N/A

                    \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  3. neg-lowering-neg.f3299.6

                    \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                4. Applied egg-rr99.6%

                  \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                5. Taylor expanded in u2 around 0

                  \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{1} \]
                6. Step-by-step derivation
                  1. Simplified99.3%

                    \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{1} \]

                  if 0.00115999999 < (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)

                  1. Initial program 51.2%

                    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  2. Add Preprocessing
                  3. Taylor expanded in u1 around 0

                    \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + \frac{1}{2} \cdot u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                  4. Step-by-step derivation
                    1. +-commutativeN/A

                      \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1 + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    2. distribute-lft-inN/A

                      \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    3. *-rgt-identityN/A

                      \[\leadsto \sqrt{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    4. accelerator-lowering-fma.f32N/A

                      \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    5. *-commutativeN/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    6. *-lowering-*.f3289.8

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  5. Simplified89.8%

                    \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  6. Step-by-step derivation
                    1. *-commutativeN/A

                      \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot \frac{1}{2}\right) \cdot u1} + u1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    2. distribute-lft1-inN/A

                      \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot \frac{1}{2} + 1\right) \cdot u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    3. *-lowering-*.f32N/A

                      \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot \frac{1}{2} + 1\right) \cdot u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    4. accelerator-lowering-fma.f3289.8

                      \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, 0.5, 1\right)} \cdot u1} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  7. Applied egg-rr89.8%

                    \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, 0.5, 1\right) \cdot u1}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                7. Recombined 2 regimes into one program.
                8. Final simplification95.9%

                  \[\leadsto \begin{array}{l} \mathbf{if}\;\left(2 \cdot \pi\right) \cdot u2 \leq 0.0011599999852478504:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{u1 \cdot \mathsf{fma}\left(u1, 0.5, 1\right)}\\ \end{array} \]
                9. Add Preprocessing

                Alternative 9: 90.2% accurate, 1.6× speedup?

                \[\begin{array}{l} \\ \begin{array}{l} t_0 := \left(2 \cdot \pi\right) \cdot u2\\ \mathbf{if}\;t\_0 \leq 0.0017999999690800905:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\cos t\_0 \cdot \sqrt{u1}\\ \end{array} \end{array} \]
                (FPCore (cosTheta_i u1 u2)
                 :precision binary32
                 (let* ((t_0 (* (* 2.0 PI) u2)))
                   (if (<= t_0 0.0017999999690800905)
                     (sqrt (- (log1p (- u1))))
                     (* (cos t_0) (sqrt u1)))))
                float code(float cosTheta_i, float u1, float u2) {
                	float t_0 = (2.0f * ((float) M_PI)) * u2;
                	float tmp;
                	if (t_0 <= 0.0017999999690800905f) {
                		tmp = sqrtf(-log1pf(-u1));
                	} else {
                		tmp = cosf(t_0) * sqrtf(u1);
                	}
                	return tmp;
                }
                
                function code(cosTheta_i, u1, u2)
                	t_0 = Float32(Float32(Float32(2.0) * Float32(pi)) * u2)
                	tmp = Float32(0.0)
                	if (t_0 <= Float32(0.0017999999690800905))
                		tmp = sqrt(Float32(-log1p(Float32(-u1))));
                	else
                		tmp = Float32(cos(t_0) * sqrt(u1));
                	end
                	return tmp
                end
                
                \begin{array}{l}
                
                \\
                \begin{array}{l}
                t_0 := \left(2 \cdot \pi\right) \cdot u2\\
                \mathbf{if}\;t\_0 \leq 0.0017999999690800905:\\
                \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)}\\
                
                \mathbf{else}:\\
                \;\;\;\;\cos t\_0 \cdot \sqrt{u1}\\
                
                
                \end{array}
                \end{array}
                
                Derivation
                1. Split input into 2 regimes
                2. if (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2) < 0.00179999997

                  1. Initial program 60.8%

                    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  2. Add Preprocessing
                  3. Step-by-step derivation
                    1. sub-negN/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    2. accelerator-lowering-log1p.f32N/A

                      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    3. neg-lowering-neg.f3299.6

                      \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  4. Applied egg-rr99.6%

                    \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  5. Taylor expanded in u2 around 0

                    \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{1} \]
                  6. Step-by-step derivation
                    1. Simplified98.9%

                      \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{1} \]

                    if 0.00179999997 < (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)

                    1. Initial program 50.6%

                      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                    2. Add Preprocessing
                    3. Taylor expanded in u1 around 0

                      \[\leadsto \sqrt{\color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                    4. Step-by-step derivation
                      1. Simplified80.1%

                        \[\leadsto \sqrt{\color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                    5. Recombined 2 regimes into one program.
                    6. Final simplification92.7%

                      \[\leadsto \begin{array}{l} \mathbf{if}\;\left(2 \cdot \pi\right) \cdot u2 \leq 0.0017999999690800905:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \cdot \sqrt{u1}\\ \end{array} \]
                    7. Add Preprocessing

                    Alternative 10: 86.2% accurate, 1.9× speedup?

                    \[\begin{array}{l} \\ \begin{array}{l} \mathbf{if}\;1 - u1 \leq 0.9965000152587891:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\ \end{array} \end{array} \]
                    (FPCore (cosTheta_i u1 u2)
                     :precision binary32
                     (if (<= (- 1.0 u1) 0.9965000152587891)
                       (sqrt (- (log1p (- u1))))
                       (*
                        (fma u2 (* u2 (* -2.0 (* PI PI))) 1.0)
                        (sqrt (- (* u1 (fma u1 (fma u1 -0.3333333333333333 -0.5) -1.0)))))))
                    float code(float cosTheta_i, float u1, float u2) {
                    	float tmp;
                    	if ((1.0f - u1) <= 0.9965000152587891f) {
                    		tmp = sqrtf(-log1pf(-u1));
                    	} else {
                    		tmp = fmaf(u2, (u2 * (-2.0f * (((float) M_PI) * ((float) M_PI)))), 1.0f) * sqrtf(-(u1 * fmaf(u1, fmaf(u1, -0.3333333333333333f, -0.5f), -1.0f)));
                    	}
                    	return tmp;
                    }
                    
                    function code(cosTheta_i, u1, u2)
                    	tmp = Float32(0.0)
                    	if (Float32(Float32(1.0) - u1) <= Float32(0.9965000152587891))
                    		tmp = sqrt(Float32(-log1p(Float32(-u1))));
                    	else
                    		tmp = Float32(fma(u2, Float32(u2 * Float32(Float32(-2.0) * Float32(Float32(pi) * Float32(pi)))), Float32(1.0)) * sqrt(Float32(-Float32(u1 * fma(u1, fma(u1, Float32(-0.3333333333333333), Float32(-0.5)), Float32(-1.0))))));
                    	end
                    	return tmp
                    end
                    
                    \begin{array}{l}
                    
                    \\
                    \begin{array}{l}
                    \mathbf{if}\;1 - u1 \leq 0.9965000152587891:\\
                    \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)}\\
                    
                    \mathbf{else}:\\
                    \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\
                    
                    
                    \end{array}
                    \end{array}
                    
                    Derivation
                    1. Split input into 2 regimes
                    2. if (-.f32 #s(literal 1 binary32) u1) < 0.99650002

                      1. Initial program 96.1%

                        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      2. Add Preprocessing
                      3. Step-by-step derivation
                        1. sub-negN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        2. accelerator-lowering-log1p.f32N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        3. neg-lowering-neg.f3299.7

                          \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      4. Applied egg-rr99.7%

                        \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      5. Taylor expanded in u2 around 0

                        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{1} \]
                      6. Step-by-step derivation
                        1. Simplified86.0%

                          \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{1} \]

                        if 0.99650002 < (-.f32 #s(literal 1 binary32) u1)

                        1. Initial program 43.8%

                          \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        2. Add Preprocessing
                        3. Taylor expanded in u1 around 0

                          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        4. Step-by-step derivation
                          1. *-lowering-*.f32N/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          2. sub-negN/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \left(\mathsf{neg}\left(1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          3. metadata-evalN/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \color{blue}{-1}\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          4. accelerator-lowering-fma.f32N/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\mathsf{fma}\left(u1, \frac{-1}{3} \cdot u1 - \frac{1}{2}, -1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          5. sub-negN/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\frac{-1}{3} \cdot u1 + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right)}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          6. *-commutativeN/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{-1}{3}} + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right), -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          7. metadata-evalN/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, u1 \cdot \frac{-1}{3} + \color{blue}{\frac{-1}{2}}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          8. accelerator-lowering-fma.f3299.2

                            \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right)}, -1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        5. Simplified99.2%

                          \[\leadsto \sqrt{-\color{blue}{u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        6. Taylor expanded in u2 around 0

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
                        7. Step-by-step derivation
                          1. +-commutativeN/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
                          2. *-commutativeN/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \left(\color{blue}{\left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot -2} + 1\right) \]
                          3. associate-*r*N/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \left(\color{blue}{{u2}^{2} \cdot \left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} + 1\right) \]
                          4. *-commutativeN/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \left({u2}^{2} \cdot \color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)} + 1\right) \]
                          5. unpow2N/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \left(\color{blue}{\left(u2 \cdot u2\right)} \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right) \]
                          6. associate-*l*N/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \left(\color{blue}{u2 \cdot \left(u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} + 1\right) \]
                          7. accelerator-lowering-fma.f32N/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \color{blue}{\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right), 1\right)} \]
                          8. *-lowering-*.f32N/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \mathsf{fma}\left(u2, \color{blue}{u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)}, 1\right) \]
                          9. *-lowering-*.f32N/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)}, 1\right) \]
                          10. unpow2N/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)}\right), 1\right) \]
                          11. *-lowering-*.f32N/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)}\right), 1\right) \]
                          12. PI-lowering-PI.f32N/A

                            \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right)\right)\right), 1\right) \]
                          13. PI-lowering-PI.f3289.4

                            \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \color{blue}{\pi}\right)\right), 1\right) \]
                        8. Simplified89.4%

                          \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)} \cdot \color{blue}{\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right)} \]
                      7. Recombined 2 regimes into one program.
                      8. Final simplification88.5%

                        \[\leadsto \begin{array}{l} \mathbf{if}\;1 - u1 \leq 0.9965000152587891:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\ \end{array} \]
                      9. Add Preprocessing

                      Alternative 11: 82.5% accurate, 4.1× speedup?

                      \[\begin{array}{l} \\ \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)} \end{array} \]
                      (FPCore (cosTheta_i u1 u2)
                       :precision binary32
                       (*
                        (fma u2 (* u2 (* -2.0 (* PI PI))) 1.0)
                        (sqrt (- (* u1 (fma u1 (fma u1 -0.3333333333333333 -0.5) -1.0))))))
                      float code(float cosTheta_i, float u1, float u2) {
                      	return fmaf(u2, (u2 * (-2.0f * (((float) M_PI) * ((float) M_PI)))), 1.0f) * sqrtf(-(u1 * fmaf(u1, fmaf(u1, -0.3333333333333333f, -0.5f), -1.0f)));
                      }
                      
                      function code(cosTheta_i, u1, u2)
                      	return Float32(fma(u2, Float32(u2 * Float32(Float32(-2.0) * Float32(Float32(pi) * Float32(pi)))), Float32(1.0)) * sqrt(Float32(-Float32(u1 * fma(u1, fma(u1, Float32(-0.3333333333333333), Float32(-0.5)), Float32(-1.0))))))
                      end
                      
                      \begin{array}{l}
                      
                      \\
                      \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}
                      \end{array}
                      
                      Derivation
                      1. Initial program 57.5%

                        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      2. Add Preprocessing
                      3. Taylor expanded in u1 around 0

                        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      4. Step-by-step derivation
                        1. *-lowering-*.f32N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        2. sub-negN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \left(\mathsf{neg}\left(1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        3. metadata-evalN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \color{blue}{-1}\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        4. accelerator-lowering-fma.f32N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\mathsf{fma}\left(u1, \frac{-1}{3} \cdot u1 - \frac{1}{2}, -1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        5. sub-negN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\frac{-1}{3} \cdot u1 + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right)}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        6. *-commutativeN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{-1}{3}} + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right), -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        7. metadata-evalN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, u1 \cdot \frac{-1}{3} + \color{blue}{\frac{-1}{2}}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        8. accelerator-lowering-fma.f3291.6

                          \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right)}, -1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      5. Simplified91.6%

                        \[\leadsto \sqrt{-\color{blue}{u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      6. Taylor expanded in u2 around 0

                        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
                      7. Step-by-step derivation
                        1. +-commutativeN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
                        2. *-commutativeN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \left(\color{blue}{\left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot -2} + 1\right) \]
                        3. associate-*r*N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \left(\color{blue}{{u2}^{2} \cdot \left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} + 1\right) \]
                        4. *-commutativeN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \left({u2}^{2} \cdot \color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)} + 1\right) \]
                        5. unpow2N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \left(\color{blue}{\left(u2 \cdot u2\right)} \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right) \]
                        6. associate-*l*N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \left(\color{blue}{u2 \cdot \left(u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} + 1\right) \]
                        7. accelerator-lowering-fma.f32N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \color{blue}{\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right), 1\right)} \]
                        8. *-lowering-*.f32N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \mathsf{fma}\left(u2, \color{blue}{u2 \cdot \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)}, 1\right) \]
                        9. *-lowering-*.f32N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right)}, 1\right) \]
                        10. unpow2N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)}\right), 1\right) \]
                        11. *-lowering-*.f32N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \color{blue}{\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)\right)}\right), 1\right) \]
                        12. PI-lowering-PI.f32N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right)\right)\right), 1\right) \]
                        13. PI-lowering-PI.f3282.6

                          \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)} \cdot \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \color{blue}{\pi}\right)\right), 1\right) \]
                      8. Simplified82.6%

                        \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)} \cdot \color{blue}{\mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right)} \]
                      9. Final simplification82.6%

                        \[\leadsto \mathsf{fma}\left(u2, u2 \cdot \left(-2 \cdot \left(\pi \cdot \pi\right)\right), 1\right) \cdot \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)} \]
                      10. Add Preprocessing

                      Alternative 12: 79.4% accurate, 4.8× speedup?

                      \[\begin{array}{l} \\ \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)} \end{array} \]
                      (FPCore (cosTheta_i u1 u2)
                       :precision binary32
                       (* (fma (* PI PI) (* -2.0 (* u2 u2)) 1.0) (sqrt (fma u1 (* u1 0.5) u1))))
                      float code(float cosTheta_i, float u1, float u2) {
                      	return fmaf((((float) M_PI) * ((float) M_PI)), (-2.0f * (u2 * u2)), 1.0f) * sqrtf(fmaf(u1, (u1 * 0.5f), u1));
                      }
                      
                      function code(cosTheta_i, u1, u2)
                      	return Float32(fma(Float32(Float32(pi) * Float32(pi)), Float32(Float32(-2.0) * Float32(u2 * u2)), Float32(1.0)) * sqrt(fma(u1, Float32(u1 * Float32(0.5)), u1)))
                      end
                      
                      \begin{array}{l}
                      
                      \\
                      \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}
                      \end{array}
                      
                      Derivation
                      1. Initial program 57.5%

                        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      2. Add Preprocessing
                      3. Taylor expanded in u1 around 0

                        \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + \frac{1}{2} \cdot u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      4. Step-by-step derivation
                        1. +-commutativeN/A

                          \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1 + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        2. distribute-lft-inN/A

                          \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        3. *-rgt-identityN/A

                          \[\leadsto \sqrt{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        4. accelerator-lowering-fma.f32N/A

                          \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        5. *-commutativeN/A

                          \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        6. *-lowering-*.f3287.7

                          \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      5. Simplified87.7%

                        \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      6. Taylor expanded in u2 around 0

                        \[\leadsto \color{blue}{\sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}} + -2 \cdot \left(\left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot \sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}}\right)} \]
                      7. Step-by-step derivation
                        1. associate-*r*N/A

                          \[\leadsto \sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}} + \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right) \cdot \sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}}} \]
                        2. distribute-rgt1-inN/A

                          \[\leadsto \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right) \cdot \sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}}} \]
                        3. *-commutativeN/A

                          \[\leadsto \left(-2 \cdot \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}\right)} + 1\right) \cdot \sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}} \]
                        4. associate-*r*N/A

                          \[\leadsto \left(\color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}} + 1\right) \cdot \sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}} \]
                        5. +-commutativeN/A

                          \[\leadsto \color{blue}{\left(1 + \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}\right)} \cdot \sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}} \]
                        6. *-lowering-*.f32N/A

                          \[\leadsto \color{blue}{\left(1 + \left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}\right) \cdot \sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}}} \]
                      8. Simplified79.4%

                        \[\leadsto \color{blue}{\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \]
                      9. Add Preprocessing

                      Alternative 13: 75.3% accurate, 7.7× speedup?

                      \[\begin{array}{l} \\ \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)} \end{array} \]
                      (FPCore (cosTheta_i u1 u2)
                       :precision binary32
                       (sqrt (- (* u1 (fma u1 (fma u1 -0.3333333333333333 -0.5) -1.0)))))
                      float code(float cosTheta_i, float u1, float u2) {
                      	return sqrtf(-(u1 * fmaf(u1, fmaf(u1, -0.3333333333333333f, -0.5f), -1.0f)));
                      }
                      
                      function code(cosTheta_i, u1, u2)
                      	return sqrt(Float32(-Float32(u1 * fma(u1, fma(u1, Float32(-0.3333333333333333), Float32(-0.5)), Float32(-1.0)))))
                      end
                      
                      \begin{array}{l}
                      
                      \\
                      \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}
                      \end{array}
                      
                      Derivation
                      1. Initial program 57.5%

                        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      2. Add Preprocessing
                      3. Taylor expanded in u1 around 0

                        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                      4. Step-by-step derivation
                        1. *-lowering-*.f32N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        2. sub-negN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \left(\mathsf{neg}\left(1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        3. metadata-evalN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \color{blue}{-1}\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        4. accelerator-lowering-fma.f32N/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\mathsf{fma}\left(u1, \frac{-1}{3} \cdot u1 - \frac{1}{2}, -1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        5. sub-negN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\frac{-1}{3} \cdot u1 + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right)}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        6. *-commutativeN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{-1}{3}} + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right), -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        7. metadata-evalN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, u1 \cdot \frac{-1}{3} + \color{blue}{\frac{-1}{2}}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        8. accelerator-lowering-fma.f3291.6

                          \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right)}, -1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      5. Simplified91.6%

                        \[\leadsto \sqrt{-\color{blue}{u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                      6. Taylor expanded in u2 around 0

                        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, \frac{-1}{3}, \frac{-1}{2}\right), -1\right)\right)} \cdot \color{blue}{1} \]
                      7. Step-by-step derivation
                        1. Simplified77.1%

                          \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)} \cdot \color{blue}{1} \]
                        2. Final simplification77.1%

                          \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)} \]
                        3. Add Preprocessing

                        Alternative 14: 72.8% accurate, 10.5× speedup?

                        \[\begin{array}{l} \\ \sqrt{\mathsf{fma}\left(u1 \cdot u1, 0.5, u1\right)} \end{array} \]
                        (FPCore (cosTheta_i u1 u2) :precision binary32 (sqrt (fma (* u1 u1) 0.5 u1)))
                        float code(float cosTheta_i, float u1, float u2) {
                        	return sqrtf(fmaf((u1 * u1), 0.5f, u1));
                        }
                        
                        function code(cosTheta_i, u1, u2)
                        	return sqrt(fma(Float32(u1 * u1), Float32(0.5), u1))
                        end
                        
                        \begin{array}{l}
                        
                        \\
                        \sqrt{\mathsf{fma}\left(u1 \cdot u1, 0.5, u1\right)}
                        \end{array}
                        
                        Derivation
                        1. Initial program 57.5%

                          \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        2. Add Preprocessing
                        3. Taylor expanded in u1 around 0

                          \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + \frac{1}{2} \cdot u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        4. Step-by-step derivation
                          1. +-commutativeN/A

                            \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1 + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          2. distribute-lft-inN/A

                            \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          3. *-rgt-identityN/A

                            \[\leadsto \sqrt{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          4. accelerator-lowering-fma.f32N/A

                            \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          5. *-commutativeN/A

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          6. *-lowering-*.f3287.7

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        5. Simplified87.7%

                          \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        6. Taylor expanded in u2 around 0

                          \[\leadsto \color{blue}{\sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}}} \]
                        7. Step-by-step derivation
                          1. sqrt-lowering-sqrt.f32N/A

                            \[\leadsto \color{blue}{\sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}}} \]
                          2. +-commutativeN/A

                            \[\leadsto \sqrt{\color{blue}{\frac{1}{2} \cdot {u1}^{2} + u1}} \]
                          3. *-commutativeN/A

                            \[\leadsto \sqrt{\color{blue}{{u1}^{2} \cdot \frac{1}{2}} + u1} \]
                          4. unpow2N/A

                            \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right)} \cdot \frac{1}{2} + u1} \]
                          5. associate-*r*N/A

                            \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \frac{1}{2}\right)} + u1} \]
                          6. *-commutativeN/A

                            \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1\right)} + u1} \]
                          7. accelerator-lowering-fma.f32N/A

                            \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \]
                          8. *-commutativeN/A

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \]
                          9. *-lowering-*.f3274.1

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \]
                        8. Simplified74.1%

                          \[\leadsto \color{blue}{\sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \]
                        9. Step-by-step derivation
                          1. associate-*r*N/A

                            \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \frac{1}{2}} + u1} \]
                          2. accelerator-lowering-fma.f32N/A

                            \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \frac{1}{2}, u1\right)}} \]
                          3. *-lowering-*.f3274.1

                            \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, 0.5, u1\right)} \]
                        10. Applied egg-rr74.1%

                          \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, 0.5, u1\right)}} \]
                        11. Add Preprocessing

                        Alternative 15: 72.8% accurate, 10.5× speedup?

                        \[\begin{array}{l} \\ \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)} \end{array} \]
                        (FPCore (cosTheta_i u1 u2) :precision binary32 (sqrt (fma u1 (* u1 0.5) u1)))
                        float code(float cosTheta_i, float u1, float u2) {
                        	return sqrtf(fmaf(u1, (u1 * 0.5f), u1));
                        }
                        
                        function code(cosTheta_i, u1, u2)
                        	return sqrt(fma(u1, Float32(u1 * Float32(0.5)), u1))
                        end
                        
                        \begin{array}{l}
                        
                        \\
                        \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}
                        \end{array}
                        
                        Derivation
                        1. Initial program 57.5%

                          \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        2. Add Preprocessing
                        3. Taylor expanded in u1 around 0

                          \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + \frac{1}{2} \cdot u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        4. Step-by-step derivation
                          1. +-commutativeN/A

                            \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1 + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          2. distribute-lft-inN/A

                            \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          3. *-rgt-identityN/A

                            \[\leadsto \sqrt{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          4. accelerator-lowering-fma.f32N/A

                            \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          5. *-commutativeN/A

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          6. *-lowering-*.f3287.7

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        5. Simplified87.7%

                          \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        6. Taylor expanded in u2 around 0

                          \[\leadsto \color{blue}{\sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}}} \]
                        7. Step-by-step derivation
                          1. sqrt-lowering-sqrt.f32N/A

                            \[\leadsto \color{blue}{\sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}}} \]
                          2. +-commutativeN/A

                            \[\leadsto \sqrt{\color{blue}{\frac{1}{2} \cdot {u1}^{2} + u1}} \]
                          3. *-commutativeN/A

                            \[\leadsto \sqrt{\color{blue}{{u1}^{2} \cdot \frac{1}{2}} + u1} \]
                          4. unpow2N/A

                            \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right)} \cdot \frac{1}{2} + u1} \]
                          5. associate-*r*N/A

                            \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \frac{1}{2}\right)} + u1} \]
                          6. *-commutativeN/A

                            \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1\right)} + u1} \]
                          7. accelerator-lowering-fma.f32N/A

                            \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \]
                          8. *-commutativeN/A

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \]
                          9. *-lowering-*.f3274.1

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \]
                        8. Simplified74.1%

                          \[\leadsto \color{blue}{\sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \]
                        9. Add Preprocessing

                        Alternative 16: 72.8% accurate, 10.5× speedup?

                        \[\begin{array}{l} \\ \sqrt{u1 \cdot \mathsf{fma}\left(u1, 0.5, 1\right)} \end{array} \]
                        (FPCore (cosTheta_i u1 u2) :precision binary32 (sqrt (* u1 (fma u1 0.5 1.0))))
                        float code(float cosTheta_i, float u1, float u2) {
                        	return sqrtf((u1 * fmaf(u1, 0.5f, 1.0f)));
                        }
                        
                        function code(cosTheta_i, u1, u2)
                        	return sqrt(Float32(u1 * fma(u1, Float32(0.5), Float32(1.0))))
                        end
                        
                        \begin{array}{l}
                        
                        \\
                        \sqrt{u1 \cdot \mathsf{fma}\left(u1, 0.5, 1\right)}
                        \end{array}
                        
                        Derivation
                        1. Initial program 57.5%

                          \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        2. Add Preprocessing
                        3. Taylor expanded in u1 around 0

                          \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + \frac{1}{2} \cdot u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        4. Step-by-step derivation
                          1. +-commutativeN/A

                            \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1 + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          2. distribute-lft-inN/A

                            \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          3. *-rgt-identityN/A

                            \[\leadsto \sqrt{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          4. accelerator-lowering-fma.f32N/A

                            \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          5. *-commutativeN/A

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                          6. *-lowering-*.f3287.7

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        5. Simplified87.7%

                          \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        6. Taylor expanded in u2 around 0

                          \[\leadsto \color{blue}{\sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}}} \]
                        7. Step-by-step derivation
                          1. sqrt-lowering-sqrt.f32N/A

                            \[\leadsto \color{blue}{\sqrt{u1 + \frac{1}{2} \cdot {u1}^{2}}} \]
                          2. +-commutativeN/A

                            \[\leadsto \sqrt{\color{blue}{\frac{1}{2} \cdot {u1}^{2} + u1}} \]
                          3. *-commutativeN/A

                            \[\leadsto \sqrt{\color{blue}{{u1}^{2} \cdot \frac{1}{2}} + u1} \]
                          4. unpow2N/A

                            \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right)} \cdot \frac{1}{2} + u1} \]
                          5. associate-*r*N/A

                            \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \frac{1}{2}\right)} + u1} \]
                          6. *-commutativeN/A

                            \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1\right)} + u1} \]
                          7. accelerator-lowering-fma.f32N/A

                            \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \]
                          8. *-commutativeN/A

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \]
                          9. *-lowering-*.f3274.1

                            \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \]
                        8. Simplified74.1%

                          \[\leadsto \color{blue}{\sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \]
                        9. Step-by-step derivation
                          1. *-commutativeN/A

                            \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot \frac{1}{2}\right) \cdot u1} + u1} \]
                          2. distribute-lft1-inN/A

                            \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot \frac{1}{2} + 1\right) \cdot u1}} \]
                          3. *-lowering-*.f32N/A

                            \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot \frac{1}{2} + 1\right) \cdot u1}} \]
                          4. accelerator-lowering-fma.f3274.0

                            \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, 0.5, 1\right)} \cdot u1} \]
                        10. Applied egg-rr74.0%

                          \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, 0.5, 1\right) \cdot u1}} \]
                        11. Final simplification74.0%

                          \[\leadsto \sqrt{u1 \cdot \mathsf{fma}\left(u1, 0.5, 1\right)} \]
                        12. Add Preprocessing

                        Alternative 17: 65.1% accurate, 21.0× speedup?

                        \[\begin{array}{l} \\ \sqrt{u1} \end{array} \]
                        (FPCore (cosTheta_i u1 u2) :precision binary32 (sqrt u1))
                        float code(float cosTheta_i, float u1, float u2) {
                        	return sqrtf(u1);
                        }
                        
                        real(4) function code(costheta_i, u1, u2)
                            real(4), intent (in) :: costheta_i
                            real(4), intent (in) :: u1
                            real(4), intent (in) :: u2
                            code = sqrt(u1)
                        end function
                        
                        function code(cosTheta_i, u1, u2)
                        	return sqrt(u1)
                        end
                        
                        function tmp = code(cosTheta_i, u1, u2)
                        	tmp = sqrt(u1);
                        end
                        
                        \begin{array}{l}
                        
                        \\
                        \sqrt{u1}
                        \end{array}
                        
                        Derivation
                        1. Initial program 57.5%

                          \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                        2. Add Preprocessing
                        3. Taylor expanded in u1 around 0

                          \[\leadsto \sqrt{\color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                        4. Step-by-step derivation
                          1. Simplified76.5%

                            \[\leadsto \sqrt{\color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                          2. Taylor expanded in u2 around 0

                            \[\leadsto \color{blue}{\sqrt{u1}} \]
                          3. Step-by-step derivation
                            1. sqrt-lowering-sqrt.f3265.6

                              \[\leadsto \color{blue}{\sqrt{u1}} \]
                          4. Simplified65.6%

                            \[\leadsto \color{blue}{\sqrt{u1}} \]
                          5. Add Preprocessing

                          Reproduce

                          ?
                          herbie shell --seed 2024204 
                          (FPCore (cosTheta_i u1 u2)
                            :name "Beckmann Sample, near normal, slope_x"
                            :precision binary32
                            :pre (and (and (and (> cosTheta_i 0.9999) (<= cosTheta_i 1.0)) (and (<= 2.328306437e-10 u1) (<= u1 1.0))) (and (<= 2.328306437e-10 u2) (<= u2 1.0)))
                            (* (sqrt (- (log (- 1.0 u1)))) (cos (* (* 2.0 PI) u2))))