Beckmann Sample, near normal, slope_x

Percentage Accurate: 57.1% → 99.0%
Time: 13.9s
Alternatives: 17
Speedup: 11.6×

Specification

?
\[\left(\left(cosTheta\_i > 0.9999 \land cosTheta\_i \leq 1\right) \land \left(2.328306437 \cdot 10^{-10} \leq u1 \land u1 \leq 1\right)\right) \land \left(2.328306437 \cdot 10^{-10} \leq u2 \land u2 \leq 1\right)\]
\[\begin{array}{l} \\ \sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (* (sqrt (- (log (- 1.0 u1)))) (cos (* (* 2.0 PI) u2))))
float code(float cosTheta_i, float u1, float u2) {
	return sqrtf(-logf((1.0f - u1))) * cosf(((2.0f * ((float) M_PI)) * u2));
}
function code(cosTheta_i, u1, u2)
	return Float32(sqrt(Float32(-log(Float32(Float32(1.0) - u1)))) * cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)))
end
function tmp = code(cosTheta_i, u1, u2)
	tmp = sqrt(-log((single(1.0) - u1))) * cos(((single(2.0) * single(pi)) * u2));
end
\begin{array}{l}

\\
\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)
\end{array}

Sampling outcomes in binary32 precision:

Local Percentage Accuracy vs ?

The average percentage accuracy by input value. Horizontal axis shows value of an input variable; the variable is choosen in the title. Vertical axis is accuracy; higher is better. Red represent the original program, while blue represents Herbie's suggestion. These can be toggled with buttons below the plot. The line is an average while dots represent individual samples.

Accuracy vs Speed?

Herbie found 17 alternatives:

AlternativeAccuracySpeedup
The accuracy (vertical axis) and speed (horizontal axis) of each alternatives. Up and to the right is better. The red square shows the initial program, and each blue circle shows an alternative.The line shows the best available speed-accuracy tradeoffs.

Initial Program: 57.1% accurate, 1.0× speedup?

\[\begin{array}{l} \\ \sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (* (sqrt (- (log (- 1.0 u1)))) (cos (* (* 2.0 PI) u2))))
float code(float cosTheta_i, float u1, float u2) {
	return sqrtf(-logf((1.0f - u1))) * cosf(((2.0f * ((float) M_PI)) * u2));
}
function code(cosTheta_i, u1, u2)
	return Float32(sqrt(Float32(-log(Float32(Float32(1.0) - u1)))) * cos(Float32(Float32(Float32(2.0) * Float32(pi)) * u2)))
end
function tmp = code(cosTheta_i, u1, u2)
	tmp = sqrt(-log((single(1.0) - u1))) * cos(((single(2.0) * single(pi)) * u2));
end
\begin{array}{l}

\\
\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right)
\end{array}

Alternative 1: 99.0% accurate, 0.3× speedup?

\[\begin{array}{l} \\ \begin{array}{l} t_0 := \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\\ \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\left(\pi \cdot \log e\right) \cdot u2\right)\right)\right) - \frac{0.125 + {\left(-0.5 \cdot t\_0\right)}^{3}}{\mathsf{fma}\left(0.5 \cdot t\_0, \mathsf{fma}\left(0.5, t\_0, 0.5\right), 0.25\right)}\right) \end{array} \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (let* ((t_0 (cos (* 2.0 (* PI u2)))))
   (*
    (sqrt (- (log1p (- u1))))
    (-
     (+ 0.5 (* 0.5 (cos (* 2.0 (* (* PI (log E)) u2)))))
     (/
      (+ 0.125 (pow (* -0.5 t_0) 3.0))
      (fma (* 0.5 t_0) (fma 0.5 t_0 0.5) 0.25))))))
float code(float cosTheta_i, float u1, float u2) {
	float t_0 = cosf((2.0f * (((float) M_PI) * u2)));
	return sqrtf(-log1pf(-u1)) * ((0.5f + (0.5f * cosf((2.0f * ((((float) M_PI) * logf(((float) M_E))) * u2))))) - ((0.125f + powf((-0.5f * t_0), 3.0f)) / fmaf((0.5f * t_0), fmaf(0.5f, t_0, 0.5f), 0.25f)));
}
function code(cosTheta_i, u1, u2)
	t_0 = cos(Float32(Float32(2.0) * Float32(Float32(pi) * u2)))
	return Float32(sqrt(Float32(-log1p(Float32(-u1)))) * Float32(Float32(Float32(0.5) + Float32(Float32(0.5) * cos(Float32(Float32(2.0) * Float32(Float32(Float32(pi) * log(Float32(exp(1)))) * u2))))) - Float32(Float32(Float32(0.125) + (Float32(Float32(-0.5) * t_0) ^ Float32(3.0))) / fma(Float32(Float32(0.5) * t_0), fma(Float32(0.5), t_0, Float32(0.5)), Float32(0.25)))))
end
\begin{array}{l}

\\
\begin{array}{l}
t_0 := \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\\
\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\left(\pi \cdot \log e\right) \cdot u2\right)\right)\right) - \frac{0.125 + {\left(-0.5 \cdot t\_0\right)}^{3}}{\mathsf{fma}\left(0.5 \cdot t\_0, \mathsf{fma}\left(0.5, t\_0, 0.5\right), 0.25\right)}\right)
\end{array}
\end{array}
Derivation
  1. Initial program 56.4%

    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  2. Add Preprocessing
  3. Step-by-step derivation
    1. lift-log.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    2. lift--.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    3. sub-negN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    4. lower-log1p.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    5. lower-neg.f3298.9

      \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  4. Applied rewrites98.9%

    \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  5. Step-by-step derivation
    1. lift-cos.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right)} \]
    2. lift-*.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \color{blue}{\left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right)} \]
    3. lift-*.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \left(\color{blue}{\left(2 \cdot \mathsf{PI}\left(\right)\right)} \cdot u2\right) \]
    4. lift-PI.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \left(\left(2 \cdot \color{blue}{\mathsf{PI}\left(\right)}\right) \cdot u2\right) \]
    5. lift-PI.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \left(\left(2 \cdot \color{blue}{\mathsf{PI}\left(\right)}\right) \cdot u2\right) \]
    6. associate-*l*N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \color{blue}{\left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
    7. cos-2N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(\cos \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \cos \left(\mathsf{PI}\left(\right) \cdot u2\right) - \sin \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \sin \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
    8. lower--.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(\cos \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \cos \left(\mathsf{PI}\left(\right) \cdot u2\right) - \sin \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \sin \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
  6. Applied rewrites98.7%

    \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right) - \left(0.5 - 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right)\right)} \]
  7. Step-by-step derivation
    1. lift-PI.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    2. add-log-expN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\color{blue}{\log \left(e^{\mathsf{PI}\left(\right)}\right)} \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    3. *-un-lft-identityN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\log \left(e^{\color{blue}{1 \cdot \mathsf{PI}\left(\right)}}\right) \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    4. lift-PI.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\log \left(e^{1 \cdot \color{blue}{\mathsf{PI}\left(\right)}}\right) \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    5. exp-prodN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\log \color{blue}{\left({\left(e^{1}\right)}^{\mathsf{PI}\left(\right)}\right)} \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    6. log-powN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\color{blue}{\left(\mathsf{PI}\left(\right) \cdot \log \left(e^{1}\right)\right)} \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    7. lower-*.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\color{blue}{\left(\mathsf{PI}\left(\right) \cdot \log \left(e^{1}\right)\right)} \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    8. lower-log.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\left(\mathsf{PI}\left(\right) \cdot \color{blue}{\log \left(e^{1}\right)}\right) \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    9. exp-1-eN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\left(\mathsf{PI}\left(\right) \cdot \log \color{blue}{\mathsf{E}\left(\right)}\right) \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    10. lower-E.f3298.8

      \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\left(\pi \cdot \log \color{blue}{e}\right) \cdot u2\right)\right)\right) - \left(0.5 - 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right)\right) \]
  8. Applied rewrites98.8%

    \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\color{blue}{\left(\pi \cdot \log e\right)} \cdot u2\right)\right)\right) - \left(0.5 - 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right)\right) \]
  9. Applied rewrites99.0%

    \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\left(\pi \cdot \log e\right) \cdot u2\right)\right)\right) - \color{blue}{\frac{0.125 + {\left(-0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right)}^{3}}{\mathsf{fma}\left(0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right), \mathsf{fma}\left(0.5, \cos \left(2 \cdot \left(\pi \cdot u2\right)\right), 0.5\right), 0.25\right)}}\right) \]
  10. Add Preprocessing

Alternative 2: 99.0% accurate, 0.4× speedup?

\[\begin{array}{l} \\ \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\left(\pi \cdot \log e\right) \cdot u2\right)\right)\right) - {\sin \left(\pi \cdot u2\right)}^{2}\right) \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (*
  (sqrt (- (log1p (- u1))))
  (-
   (+ 0.5 (* 0.5 (cos (* 2.0 (* (* PI (log E)) u2)))))
   (pow (sin (* PI u2)) 2.0))))
float code(float cosTheta_i, float u1, float u2) {
	return sqrtf(-log1pf(-u1)) * ((0.5f + (0.5f * cosf((2.0f * ((((float) M_PI) * logf(((float) M_E))) * u2))))) - powf(sinf((((float) M_PI) * u2)), 2.0f));
}
function code(cosTheta_i, u1, u2)
	return Float32(sqrt(Float32(-log1p(Float32(-u1)))) * Float32(Float32(Float32(0.5) + Float32(Float32(0.5) * cos(Float32(Float32(2.0) * Float32(Float32(Float32(pi) * log(Float32(exp(1)))) * u2))))) - (sin(Float32(Float32(pi) * u2)) ^ Float32(2.0))))
end
\begin{array}{l}

\\
\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\left(\pi \cdot \log e\right) \cdot u2\right)\right)\right) - {\sin \left(\pi \cdot u2\right)}^{2}\right)
\end{array}
Derivation
  1. Initial program 56.4%

    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  2. Add Preprocessing
  3. Step-by-step derivation
    1. lift-log.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    2. lift--.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    3. sub-negN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    4. lower-log1p.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    5. lower-neg.f3298.9

      \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  4. Applied rewrites98.9%

    \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  5. Step-by-step derivation
    1. lift-cos.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right)} \]
    2. lift-*.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \color{blue}{\left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right)} \]
    3. lift-*.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \left(\color{blue}{\left(2 \cdot \mathsf{PI}\left(\right)\right)} \cdot u2\right) \]
    4. lift-PI.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \left(\left(2 \cdot \color{blue}{\mathsf{PI}\left(\right)}\right) \cdot u2\right) \]
    5. lift-PI.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \left(\left(2 \cdot \color{blue}{\mathsf{PI}\left(\right)}\right) \cdot u2\right) \]
    6. associate-*l*N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \cos \color{blue}{\left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
    7. cos-2N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(\cos \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \cos \left(\mathsf{PI}\left(\right) \cdot u2\right) - \sin \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \sin \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
    8. lower--.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(\cos \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \cos \left(\mathsf{PI}\left(\right) \cdot u2\right) - \sin \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \sin \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)} \]
  6. Applied rewrites98.7%

    \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right) - \left(0.5 - 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right)\right)} \]
  7. Step-by-step derivation
    1. lift-PI.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\color{blue}{\mathsf{PI}\left(\right)} \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    2. add-log-expN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\color{blue}{\log \left(e^{\mathsf{PI}\left(\right)}\right)} \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    3. *-un-lft-identityN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\log \left(e^{\color{blue}{1 \cdot \mathsf{PI}\left(\right)}}\right) \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    4. lift-PI.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\log \left(e^{1 \cdot \color{blue}{\mathsf{PI}\left(\right)}}\right) \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    5. exp-prodN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\log \color{blue}{\left({\left(e^{1}\right)}^{\mathsf{PI}\left(\right)}\right)} \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    6. log-powN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\color{blue}{\left(\mathsf{PI}\left(\right) \cdot \log \left(e^{1}\right)\right)} \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    7. lower-*.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\color{blue}{\left(\mathsf{PI}\left(\right) \cdot \log \left(e^{1}\right)\right)} \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    8. lower-log.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\left(\mathsf{PI}\left(\right) \cdot \color{blue}{\log \left(e^{1}\right)}\right) \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    9. exp-1-eN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\left(\mathsf{PI}\left(\right) \cdot \log \color{blue}{\mathsf{E}\left(\right)}\right) \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)\right) \]
    10. lower-E.f3298.8

      \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\left(\pi \cdot \log \color{blue}{e}\right) \cdot u2\right)\right)\right) - \left(0.5 - 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right)\right) \]
  8. Applied rewrites98.8%

    \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\color{blue}{\left(\pi \cdot \log e\right)} \cdot u2\right)\right)\right) - \left(0.5 - 0.5 \cdot \cos \left(2 \cdot \left(\pi \cdot u2\right)\right)\right)\right) \]
  9. Step-by-step derivation
    1. lift--.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\left(\mathsf{PI}\left(\right) \cdot \log \mathsf{E}\left(\right)\right) \cdot u2\right)\right)\right) - \color{blue}{\left(\frac{1}{2} - \frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)\right)}\right) \]
    2. lift-*.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\left(\mathsf{PI}\left(\right) \cdot \log \mathsf{E}\left(\right)\right) \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \color{blue}{\frac{1}{2} \cdot \cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)}\right)\right) \]
    3. lift-cos.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\left(\mathsf{PI}\left(\right) \cdot \log \mathsf{E}\left(\right)\right) \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \color{blue}{\cos \left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)}\right)\right) \]
    4. lift-*.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\left(\mathsf{PI}\left(\right) \cdot \log \mathsf{E}\left(\right)\right) \cdot u2\right)\right)\right) - \left(\frac{1}{2} - \frac{1}{2} \cdot \cos \color{blue}{\left(2 \cdot \left(\mathsf{PI}\left(\right) \cdot u2\right)\right)}\right)\right) \]
    5. sqr-sin-aN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\left(\mathsf{PI}\left(\right) \cdot \log \mathsf{E}\left(\right)\right) \cdot u2\right)\right)\right) - \color{blue}{\sin \left(\mathsf{PI}\left(\right) \cdot u2\right) \cdot \sin \left(\mathsf{PI}\left(\right) \cdot u2\right)}\right) \]
    6. pow2N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\left(\mathsf{PI}\left(\right) \cdot \log \mathsf{E}\left(\right)\right) \cdot u2\right)\right)\right) - \color{blue}{{\sin \left(\mathsf{PI}\left(\right) \cdot u2\right)}^{2}}\right) \]
    7. lower-pow.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\left(\frac{1}{2} + \frac{1}{2} \cdot \cos \left(2 \cdot \left(\left(\mathsf{PI}\left(\right) \cdot \log \mathsf{E}\left(\right)\right) \cdot u2\right)\right)\right) - \color{blue}{{\sin \left(\mathsf{PI}\left(\right) \cdot u2\right)}^{2}}\right) \]
    8. lower-sin.f3298.9

      \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\left(\pi \cdot \log e\right) \cdot u2\right)\right)\right) - {\color{blue}{\sin \left(\pi \cdot u2\right)}}^{2}\right) \]
  10. Applied rewrites98.9%

    \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \left(\left(0.5 + 0.5 \cdot \cos \left(2 \cdot \left(\left(\pi \cdot \log e\right) \cdot u2\right)\right)\right) - \color{blue}{{\sin \left(\pi \cdot u2\right)}^{2}}\right) \]
  11. Add Preprocessing

Alternative 3: 97.5% accurate, 0.9× speedup?

\[\begin{array}{l} \\ \begin{array}{l} t_0 := \cos \left(u2 \cdot \left(2 \cdot \pi\right)\right)\\ \mathbf{if}\;t\_0 \leq 0.9994999766349792:\\ \;\;\;\;t\_0 \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \end{array} \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (let* ((t_0 (cos (* u2 (* 2.0 PI)))))
   (if (<= t_0 0.9994999766349792)
     (* t_0 (sqrt (fma (* u1 u1) (fma u1 0.3333333333333333 0.5) u1)))
     (* (sqrt (- (log1p (- u1)))) (fma (* PI PI) (* -2.0 (* u2 u2)) 1.0)))))
float code(float cosTheta_i, float u1, float u2) {
	float t_0 = cosf((u2 * (2.0f * ((float) M_PI))));
	float tmp;
	if (t_0 <= 0.9994999766349792f) {
		tmp = t_0 * sqrtf(fmaf((u1 * u1), fmaf(u1, 0.3333333333333333f, 0.5f), u1));
	} else {
		tmp = sqrtf(-log1pf(-u1)) * fmaf((((float) M_PI) * ((float) M_PI)), (-2.0f * (u2 * u2)), 1.0f);
	}
	return tmp;
}
function code(cosTheta_i, u1, u2)
	t_0 = cos(Float32(u2 * Float32(Float32(2.0) * Float32(pi))))
	tmp = Float32(0.0)
	if (t_0 <= Float32(0.9994999766349792))
		tmp = Float32(t_0 * sqrt(fma(Float32(u1 * u1), fma(u1, Float32(0.3333333333333333), Float32(0.5)), u1)));
	else
		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * fma(Float32(Float32(pi) * Float32(pi)), Float32(Float32(-2.0) * Float32(u2 * u2)), Float32(1.0)));
	end
	return tmp
end
\begin{array}{l}

\\
\begin{array}{l}
t_0 := \cos \left(u2 \cdot \left(2 \cdot \pi\right)\right)\\
\mathbf{if}\;t\_0 \leq 0.9994999766349792:\\
\;\;\;\;t\_0 \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\

\mathbf{else}:\\
\;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\


\end{array}
\end{array}
Derivation
  1. Split input into 2 regimes
  2. if (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)) < 0.999499977

    1. Initial program 56.9%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Taylor expanded in u1 around 0

      \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    4. Step-by-step derivation
      1. +-commutativeN/A

        \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. distribute-lft-inN/A

        \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. associate-*r*N/A

        \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)} + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. unpow2N/A

        \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      5. *-rgt-identityN/A

        \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      6. lower-fma.f32N/A

        \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      7. unpow2N/A

        \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      8. lower-*.f32N/A

        \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      9. +-commutativeN/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\frac{1}{3} \cdot u1 + \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      10. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \frac{1}{3}} + \frac{1}{2}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      11. lower-fma.f3291.3

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    5. Applied rewrites91.3%

      \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]

    if 0.999499977 < (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))

    1. Initial program 56.3%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Step-by-step derivation
      1. lift-log.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. lift--.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. sub-negN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. lower-log1p.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      5. lower-neg.f3299.5

        \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    4. Applied rewrites99.5%

      \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    5. Taylor expanded in u2 around 0

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
    6. Step-by-step derivation
      1. +-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
      2. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(-2 \cdot \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}\right)} + 1\right) \]
      3. associate-*r*N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}} + 1\right) \]
      4. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} \cdot {u2}^{2} + 1\right) \]
      5. associate-*l*N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot \left(-2 \cdot {u2}^{2}\right)} + 1\right) \]
      6. lower-fma.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left({\mathsf{PI}\left(\right)}^{2}, -2 \cdot {u2}^{2}, 1\right)} \]
      7. unpow2N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
      8. lower-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
      9. lower-PI.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right), -2 \cdot {u2}^{2}, 1\right) \]
      10. lower-PI.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \color{blue}{\mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
      11. lower-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), \color{blue}{-2 \cdot {u2}^{2}}, 1\right) \]
      12. unpow2N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
      13. lower-*.f3299.5

        \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
    7. Applied rewrites99.5%

      \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)} \]
  3. Recombined 2 regimes into one program.
  4. Final simplification97.2%

    \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(u2 \cdot \left(2 \cdot \pi\right)\right) \leq 0.9994999766349792:\\ \;\;\;\;\cos \left(u2 \cdot \left(2 \cdot \pi\right)\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \end{array} \]
  5. Add Preprocessing

Alternative 4: 99.0% accurate, 1.0× speedup?

\[\begin{array}{l} \\ \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \cos \left(u2 \cdot \left(2 \cdot \pi\right)\right) \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (* (sqrt (- (log1p (- u1)))) (cos (* u2 (* 2.0 PI)))))
float code(float cosTheta_i, float u1, float u2) {
	return sqrtf(-log1pf(-u1)) * cosf((u2 * (2.0f * ((float) M_PI))));
}
function code(cosTheta_i, u1, u2)
	return Float32(sqrt(Float32(-log1p(Float32(-u1)))) * cos(Float32(u2 * Float32(Float32(2.0) * Float32(pi)))))
end
\begin{array}{l}

\\
\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \cos \left(u2 \cdot \left(2 \cdot \pi\right)\right)
\end{array}
Derivation
  1. Initial program 56.4%

    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  2. Add Preprocessing
  3. Step-by-step derivation
    1. lift-log.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    2. lift--.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    3. sub-negN/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    4. lower-log1p.f32N/A

      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    5. lower-neg.f3298.9

      \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  4. Applied rewrites98.9%

    \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  5. Final simplification98.9%

    \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \cos \left(u2 \cdot \left(2 \cdot \pi\right)\right) \]
  6. Add Preprocessing

Alternative 5: 97.9% accurate, 1.4× speedup?

\[\begin{array}{l} \\ \begin{array}{l} t_0 := u2 \cdot \left(2 \cdot \pi\right)\\ \mathbf{if}\;t\_0 \leq 0.02199999988079071:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \mathbf{else}:\\ \;\;\;\;\cos t\_0 \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}\\ \end{array} \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (let* ((t_0 (* u2 (* 2.0 PI))))
   (if (<= t_0 0.02199999988079071)
     (* (sqrt (- (log1p (- u1)))) (fma (* PI PI) (* -2.0 (* u2 u2)) 1.0))
     (*
      (cos t_0)
      (sqrt
       (fma (* u1 u1) (fma u1 (fma u1 0.25 0.3333333333333333) 0.5) u1))))))
float code(float cosTheta_i, float u1, float u2) {
	float t_0 = u2 * (2.0f * ((float) M_PI));
	float tmp;
	if (t_0 <= 0.02199999988079071f) {
		tmp = sqrtf(-log1pf(-u1)) * fmaf((((float) M_PI) * ((float) M_PI)), (-2.0f * (u2 * u2)), 1.0f);
	} else {
		tmp = cosf(t_0) * sqrtf(fmaf((u1 * u1), fmaf(u1, fmaf(u1, 0.25f, 0.3333333333333333f), 0.5f), u1));
	}
	return tmp;
}
function code(cosTheta_i, u1, u2)
	t_0 = Float32(u2 * Float32(Float32(2.0) * Float32(pi)))
	tmp = Float32(0.0)
	if (t_0 <= Float32(0.02199999988079071))
		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * fma(Float32(Float32(pi) * Float32(pi)), Float32(Float32(-2.0) * Float32(u2 * u2)), Float32(1.0)));
	else
		tmp = Float32(cos(t_0) * sqrt(fma(Float32(u1 * u1), fma(u1, fma(u1, Float32(0.25), Float32(0.3333333333333333)), Float32(0.5)), u1)));
	end
	return tmp
end
\begin{array}{l}

\\
\begin{array}{l}
t_0 := u2 \cdot \left(2 \cdot \pi\right)\\
\mathbf{if}\;t\_0 \leq 0.02199999988079071:\\
\;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\

\mathbf{else}:\\
\;\;\;\;\cos t\_0 \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}\\


\end{array}
\end{array}
Derivation
  1. Split input into 2 regimes
  2. if (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2) < 0.0219999999

    1. Initial program 56.0%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Step-by-step derivation
      1. lift-log.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. lift--.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. sub-negN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. lower-log1p.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      5. lower-neg.f3299.5

        \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    4. Applied rewrites99.5%

      \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    5. Taylor expanded in u2 around 0

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
    6. Step-by-step derivation
      1. +-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
      2. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(-2 \cdot \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}\right)} + 1\right) \]
      3. associate-*r*N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}} + 1\right) \]
      4. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} \cdot {u2}^{2} + 1\right) \]
      5. associate-*l*N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot \left(-2 \cdot {u2}^{2}\right)} + 1\right) \]
      6. lower-fma.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left({\mathsf{PI}\left(\right)}^{2}, -2 \cdot {u2}^{2}, 1\right)} \]
      7. unpow2N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
      8. lower-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
      9. lower-PI.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right), -2 \cdot {u2}^{2}, 1\right) \]
      10. lower-PI.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \color{blue}{\mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
      11. lower-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), \color{blue}{-2 \cdot {u2}^{2}}, 1\right) \]
      12. unpow2N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
      13. lower-*.f3299.5

        \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
    7. Applied rewrites99.5%

      \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)} \]

    if 0.0219999999 < (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)

    1. Initial program 57.4%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Taylor expanded in u1 around 0

      \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    4. Step-by-step derivation
      1. +-commutativeN/A

        \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. distribute-lft-inN/A

        \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. associate-*r*N/A

        \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)} + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. unpow2N/A

        \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      5. *-rgt-identityN/A

        \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      6. lower-fma.f32N/A

        \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      7. unpow2N/A

        \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      8. lower-*.f32N/A

        \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      9. +-commutativeN/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right) + \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      10. lower-fma.f32N/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, \frac{1}{3} + \frac{1}{4} \cdot u1, \frac{1}{2}\right)}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      11. +-commutativeN/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\frac{1}{4} \cdot u1 + \frac{1}{3}}, \frac{1}{2}\right), u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      12. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{4}} + \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      13. lower-fma.f3293.5

        \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right)}, 0.5\right), u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    5. Applied rewrites93.5%

      \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  3. Recombined 2 regimes into one program.
  4. Final simplification97.7%

    \[\leadsto \begin{array}{l} \mathbf{if}\;u2 \cdot \left(2 \cdot \pi\right) \leq 0.02199999988079071:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \mathbf{else}:\\ \;\;\;\;\cos \left(u2 \cdot \left(2 \cdot \pi\right)\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}\\ \end{array} \]
  5. Add Preprocessing

Alternative 6: 97.5% accurate, 1.4× speedup?

\[\begin{array}{l} \\ \begin{array}{l} t_0 := u2 \cdot \left(2 \cdot \pi\right)\\ \mathbf{if}\;t\_0 \leq 0.05999999865889549:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \mathbf{else}:\\ \;\;\;\;\cos t\_0 \cdot \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\ \end{array} \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (let* ((t_0 (* u2 (* 2.0 PI))))
   (if (<= t_0 0.05999999865889549)
     (* (sqrt (- (log1p (- u1)))) (fma (* PI PI) (* -2.0 (* u2 u2)) 1.0))
     (*
      (cos t_0)
      (sqrt (- (* u1 (fma u1 (fma u1 -0.3333333333333333 -0.5) -1.0))))))))
float code(float cosTheta_i, float u1, float u2) {
	float t_0 = u2 * (2.0f * ((float) M_PI));
	float tmp;
	if (t_0 <= 0.05999999865889549f) {
		tmp = sqrtf(-log1pf(-u1)) * fmaf((((float) M_PI) * ((float) M_PI)), (-2.0f * (u2 * u2)), 1.0f);
	} else {
		tmp = cosf(t_0) * sqrtf(-(u1 * fmaf(u1, fmaf(u1, -0.3333333333333333f, -0.5f), -1.0f)));
	}
	return tmp;
}
function code(cosTheta_i, u1, u2)
	t_0 = Float32(u2 * Float32(Float32(2.0) * Float32(pi)))
	tmp = Float32(0.0)
	if (t_0 <= Float32(0.05999999865889549))
		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * fma(Float32(Float32(pi) * Float32(pi)), Float32(Float32(-2.0) * Float32(u2 * u2)), Float32(1.0)));
	else
		tmp = Float32(cos(t_0) * sqrt(Float32(-Float32(u1 * fma(u1, fma(u1, Float32(-0.3333333333333333), Float32(-0.5)), Float32(-1.0))))));
	end
	return tmp
end
\begin{array}{l}

\\
\begin{array}{l}
t_0 := u2 \cdot \left(2 \cdot \pi\right)\\
\mathbf{if}\;t\_0 \leq 0.05999999865889549:\\
\;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\

\mathbf{else}:\\
\;\;\;\;\cos t\_0 \cdot \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\


\end{array}
\end{array}
Derivation
  1. Split input into 2 regimes
  2. if (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2) < 0.0599999987

    1. Initial program 56.6%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Step-by-step derivation
      1. lift-log.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. lift--.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. sub-negN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. lower-log1p.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      5. lower-neg.f3299.5

        \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    4. Applied rewrites99.5%

      \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    5. Taylor expanded in u2 around 0

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
    6. Step-by-step derivation
      1. +-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
      2. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(-2 \cdot \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}\right)} + 1\right) \]
      3. associate-*r*N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}} + 1\right) \]
      4. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} \cdot {u2}^{2} + 1\right) \]
      5. associate-*l*N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot \left(-2 \cdot {u2}^{2}\right)} + 1\right) \]
      6. lower-fma.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left({\mathsf{PI}\left(\right)}^{2}, -2 \cdot {u2}^{2}, 1\right)} \]
      7. unpow2N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
      8. lower-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
      9. lower-PI.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right), -2 \cdot {u2}^{2}, 1\right) \]
      10. lower-PI.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \color{blue}{\mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
      11. lower-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), \color{blue}{-2 \cdot {u2}^{2}}, 1\right) \]
      12. unpow2N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
      13. lower-*.f3299.4

        \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
    7. Applied rewrites99.4%

      \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)} \]

    if 0.0599999987 < (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)

    1. Initial program 56.0%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Taylor expanded in u1 around 0

      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    4. Step-by-step derivation
      1. lower-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) - 1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. sub-negN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \left(\mathsf{neg}\left(1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. metadata-evalN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \left(u1 \cdot \left(\frac{-1}{3} \cdot u1 - \frac{1}{2}\right) + \color{blue}{-1}\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. lower-fma.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \color{blue}{\mathsf{fma}\left(u1, \frac{-1}{3} \cdot u1 - \frac{1}{2}, -1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      5. sub-negN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\frac{-1}{3} \cdot u1 + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right)}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      6. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{-1}{3}} + \left(\mathsf{neg}\left(\frac{1}{2}\right)\right), -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      7. metadata-evalN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(u1 \cdot \mathsf{fma}\left(u1, u1 \cdot \frac{-1}{3} + \color{blue}{\frac{-1}{2}}, -1\right)\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      8. lower-fma.f3291.0

        \[\leadsto \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right)}, -1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    5. Applied rewrites91.0%

      \[\leadsto \sqrt{-\color{blue}{u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  3. Recombined 2 regimes into one program.
  4. Final simplification97.2%

    \[\leadsto \begin{array}{l} \mathbf{if}\;u2 \cdot \left(2 \cdot \pi\right) \leq 0.05999999865889549:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \mathbf{else}:\\ \;\;\;\;\cos \left(u2 \cdot \left(2 \cdot \pi\right)\right) \cdot \sqrt{-u1 \cdot \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, -0.3333333333333333, -0.5\right), -1\right)}\\ \end{array} \]
  5. Add Preprocessing

Alternative 7: 96.7% accurate, 1.5× speedup?

\[\begin{array}{l} \\ \begin{array}{l} t_0 := u2 \cdot \left(2 \cdot \pi\right)\\ \mathbf{if}\;t\_0 \leq 0.05999999865889549:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \mathbf{else}:\\ \;\;\;\;\cos t\_0 \cdot \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}\\ \end{array} \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (let* ((t_0 (* u2 (* 2.0 PI))))
   (if (<= t_0 0.05999999865889549)
     (* (sqrt (- (log1p (- u1)))) (fma (* PI PI) (* -2.0 (* u2 u2)) 1.0))
     (* (cos t_0) (sqrt (fma u1 (* u1 0.5) u1))))))
float code(float cosTheta_i, float u1, float u2) {
	float t_0 = u2 * (2.0f * ((float) M_PI));
	float tmp;
	if (t_0 <= 0.05999999865889549f) {
		tmp = sqrtf(-log1pf(-u1)) * fmaf((((float) M_PI) * ((float) M_PI)), (-2.0f * (u2 * u2)), 1.0f);
	} else {
		tmp = cosf(t_0) * sqrtf(fmaf(u1, (u1 * 0.5f), u1));
	}
	return tmp;
}
function code(cosTheta_i, u1, u2)
	t_0 = Float32(u2 * Float32(Float32(2.0) * Float32(pi)))
	tmp = Float32(0.0)
	if (t_0 <= Float32(0.05999999865889549))
		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * fma(Float32(Float32(pi) * Float32(pi)), Float32(Float32(-2.0) * Float32(u2 * u2)), Float32(1.0)));
	else
		tmp = Float32(cos(t_0) * sqrt(fma(u1, Float32(u1 * Float32(0.5)), u1)));
	end
	return tmp
end
\begin{array}{l}

\\
\begin{array}{l}
t_0 := u2 \cdot \left(2 \cdot \pi\right)\\
\mathbf{if}\;t\_0 \leq 0.05999999865889549:\\
\;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\

\mathbf{else}:\\
\;\;\;\;\cos t\_0 \cdot \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}\\


\end{array}
\end{array}
Derivation
  1. Split input into 2 regimes
  2. if (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2) < 0.0599999987

    1. Initial program 56.6%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Step-by-step derivation
      1. lift-log.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. lift--.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. sub-negN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. lower-log1p.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      5. lower-neg.f3299.5

        \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    4. Applied rewrites99.5%

      \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    5. Taylor expanded in u2 around 0

      \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
    6. Step-by-step derivation
      1. +-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
      2. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(-2 \cdot \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}\right)} + 1\right) \]
      3. associate-*r*N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}} + 1\right) \]
      4. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} \cdot {u2}^{2} + 1\right) \]
      5. associate-*l*N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot \left(-2 \cdot {u2}^{2}\right)} + 1\right) \]
      6. lower-fma.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left({\mathsf{PI}\left(\right)}^{2}, -2 \cdot {u2}^{2}, 1\right)} \]
      7. unpow2N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
      8. lower-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
      9. lower-PI.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right), -2 \cdot {u2}^{2}, 1\right) \]
      10. lower-PI.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \color{blue}{\mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
      11. lower-*.f32N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), \color{blue}{-2 \cdot {u2}^{2}}, 1\right) \]
      12. unpow2N/A

        \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
      13. lower-*.f3299.4

        \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
    7. Applied rewrites99.4%

      \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)} \]

    if 0.0599999987 < (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)

    1. Initial program 56.0%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Taylor expanded in u1 around 0

      \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + \frac{1}{2} \cdot u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
    4. Step-by-step derivation
      1. +-commutativeN/A

        \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1 + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      2. distribute-lft-inN/A

        \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      3. *-rgt-identityN/A

        \[\leadsto \sqrt{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      4. lower-fma.f32N/A

        \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      5. *-commutativeN/A

        \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
      6. lower-*.f3287.1

        \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    5. Applied rewrites87.1%

      \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
  3. Recombined 2 regimes into one program.
  4. Final simplification96.2%

    \[\leadsto \begin{array}{l} \mathbf{if}\;u2 \cdot \left(2 \cdot \pi\right) \leq 0.05999999865889549:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)\\ \mathbf{else}:\\ \;\;\;\;\cos \left(u2 \cdot \left(2 \cdot \pi\right)\right) \cdot \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}\\ \end{array} \]
  5. Add Preprocessing

Alternative 8: 80.2% accurate, 1.5× speedup?

\[\begin{array}{l} \\ \begin{array}{l} \mathbf{if}\;\cos \left(u2 \cdot \left(2 \cdot \pi\right)\right) \leq 0.9999985098838806:\\ \;\;\;\;\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right) \cdot \sqrt{-\left(-u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)} \cdot 1\\ \end{array} \end{array} \]
(FPCore (cosTheta_i u1 u2)
 :precision binary32
 (if (<= (cos (* u2 (* 2.0 PI))) 0.9999985098838806)
   (* (fma (* PI PI) (* -2.0 (* u2 u2)) 1.0) (sqrt (- (- u1))))
   (*
    (sqrt (fma (* u1 u1) (fma u1 (fma u1 0.25 0.3333333333333333) 0.5) u1))
    1.0)))
float code(float cosTheta_i, float u1, float u2) {
	float tmp;
	if (cosf((u2 * (2.0f * ((float) M_PI)))) <= 0.9999985098838806f) {
		tmp = fmaf((((float) M_PI) * ((float) M_PI)), (-2.0f * (u2 * u2)), 1.0f) * sqrtf(-(-u1));
	} else {
		tmp = sqrtf(fmaf((u1 * u1), fmaf(u1, fmaf(u1, 0.25f, 0.3333333333333333f), 0.5f), u1)) * 1.0f;
	}
	return tmp;
}
function code(cosTheta_i, u1, u2)
	tmp = Float32(0.0)
	if (cos(Float32(u2 * Float32(Float32(2.0) * Float32(pi)))) <= Float32(0.9999985098838806))
		tmp = Float32(fma(Float32(Float32(pi) * Float32(pi)), Float32(Float32(-2.0) * Float32(u2 * u2)), Float32(1.0)) * sqrt(Float32(-Float32(-u1))));
	else
		tmp = Float32(sqrt(fma(Float32(u1 * u1), fma(u1, fma(u1, Float32(0.25), Float32(0.3333333333333333)), Float32(0.5)), u1)) * Float32(1.0));
	end
	return tmp
end
\begin{array}{l}

\\
\begin{array}{l}
\mathbf{if}\;\cos \left(u2 \cdot \left(2 \cdot \pi\right)\right) \leq 0.9999985098838806:\\
\;\;\;\;\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right) \cdot \sqrt{-\left(-u1\right)}\\

\mathbf{else}:\\
\;\;\;\;\sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)} \cdot 1\\


\end{array}
\end{array}
Derivation
  1. Split input into 2 regimes
  2. if (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)) < 0.99999851

    1. Initial program 54.4%

      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
    2. Add Preprocessing
    3. Taylor expanded in u2 around 0

      \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
    4. Step-by-step derivation
      1. Applied rewrites33.7%

        \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
      2. Taylor expanded in u1 around 0

        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{-1 \cdot u1}\right)} \cdot 1 \]
      3. Step-by-step derivation
        1. mul-1-negN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot 1 \]
        2. lower-neg.f3245.5

          \[\leadsto \sqrt{-\color{blue}{\left(-u1\right)}} \cdot 1 \]
      4. Applied rewrites45.5%

        \[\leadsto \sqrt{-\color{blue}{\left(-u1\right)}} \cdot 1 \]
      5. Taylor expanded in u2 around 0

        \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
      6. Step-by-step derivation
        1. +-commutativeN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
        2. associate-*r*N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{\left(-2 \cdot {u2}^{2}\right) \cdot {\mathsf{PI}\left(\right)}^{2}} + 1\right) \]
        3. *-commutativeN/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \left(\color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot \left(-2 \cdot {u2}^{2}\right)} + 1\right) \]
        4. lower-fma.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{\mathsf{fma}\left({\mathsf{PI}\left(\right)}^{2}, -2 \cdot {u2}^{2}, 1\right)} \]
        5. unpow2N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
        6. lower-*.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
        7. lower-PI.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right), -2 \cdot {u2}^{2}, 1\right) \]
        8. lower-PI.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \color{blue}{\mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
        9. lower-*.f32N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), \color{blue}{-2 \cdot {u2}^{2}}, 1\right) \]
        10. unpow2N/A

          \[\leadsto \sqrt{\mathsf{neg}\left(\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
        11. lower-*.f3259.2

          \[\leadsto \sqrt{-\left(-u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
      7. Applied rewrites59.2%

        \[\leadsto \sqrt{-\left(-u1\right)} \cdot \color{blue}{\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)} \]

      if 0.99999851 < (cos.f32 (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2))

      1. Initial program 57.9%

        \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
      2. Add Preprocessing
      3. Taylor expanded in u2 around 0

        \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
      4. Step-by-step derivation
        1. Applied rewrites57.9%

          \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
        2. Taylor expanded in u1 around 0

          \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right)}} \cdot 1 \]
        3. Step-by-step derivation
          1. +-commutativeN/A

            \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + 1\right)}} \cdot 1 \]
          2. distribute-lft-inN/A

            \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right) + u1 \cdot 1}} \cdot 1 \]
          3. associate-*r*N/A

            \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)} + u1 \cdot 1} \cdot 1 \]
          4. unpow2N/A

            \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + u1 \cdot 1} \cdot 1 \]
          5. *-rgt-identityN/A

            \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + \color{blue}{u1}} \cdot 1 \]
          6. lower-fma.f32N/A

            \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)}} \cdot 1 \]
          7. unpow2N/A

            \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot 1 \]
          8. lower-*.f32N/A

            \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot 1 \]
          9. +-commutativeN/A

            \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right) + \frac{1}{2}}, u1\right)} \cdot 1 \]
          10. lower-fma.f32N/A

            \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, \frac{1}{3} + \frac{1}{4} \cdot u1, \frac{1}{2}\right)}, u1\right)} \cdot 1 \]
          11. +-commutativeN/A

            \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\frac{1}{4} \cdot u1 + \frac{1}{3}}, \frac{1}{2}\right), u1\right)} \cdot 1 \]
          12. *-commutativeN/A

            \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{4}} + \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot 1 \]
          13. lower-fma.f3293.1

            \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right)}, 0.5\right), u1\right)} \cdot 1 \]
        4. Applied rewrites93.1%

          \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}} \cdot 1 \]
      5. Recombined 2 regimes into one program.
      6. Final simplification78.9%

        \[\leadsto \begin{array}{l} \mathbf{if}\;\cos \left(u2 \cdot \left(2 \cdot \pi\right)\right) \leq 0.9999985098838806:\\ \;\;\;\;\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right) \cdot \sqrt{-\left(-u1\right)}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)} \cdot 1\\ \end{array} \]
      7. Add Preprocessing

      Alternative 9: 94.8% accurate, 1.5× speedup?

      \[\begin{array}{l} \\ \begin{array}{l} t_0 := u2 \cdot \left(2 \cdot \pi\right)\\ \mathbf{if}\;t\_0 \leq 0.0008999999845400453:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\ \mathbf{else}:\\ \;\;\;\;\cos t\_0 \cdot \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}\\ \end{array} \end{array} \]
      (FPCore (cosTheta_i u1 u2)
       :precision binary32
       (let* ((t_0 (* u2 (* 2.0 PI))))
         (if (<= t_0 0.0008999999845400453)
           (* (sqrt (- (log1p (- u1)))) 1.0)
           (* (cos t_0) (sqrt (fma u1 (* u1 0.5) u1))))))
      float code(float cosTheta_i, float u1, float u2) {
      	float t_0 = u2 * (2.0f * ((float) M_PI));
      	float tmp;
      	if (t_0 <= 0.0008999999845400453f) {
      		tmp = sqrtf(-log1pf(-u1)) * 1.0f;
      	} else {
      		tmp = cosf(t_0) * sqrtf(fmaf(u1, (u1 * 0.5f), u1));
      	}
      	return tmp;
      }
      
      function code(cosTheta_i, u1, u2)
      	t_0 = Float32(u2 * Float32(Float32(2.0) * Float32(pi)))
      	tmp = Float32(0.0)
      	if (t_0 <= Float32(0.0008999999845400453))
      		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * Float32(1.0));
      	else
      		tmp = Float32(cos(t_0) * sqrt(fma(u1, Float32(u1 * Float32(0.5)), u1)));
      	end
      	return tmp
      end
      
      \begin{array}{l}
      
      \\
      \begin{array}{l}
      t_0 := u2 \cdot \left(2 \cdot \pi\right)\\
      \mathbf{if}\;t\_0 \leq 0.0008999999845400453:\\
      \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\
      
      \mathbf{else}:\\
      \;\;\;\;\cos t\_0 \cdot \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}\\
      
      
      \end{array}
      \end{array}
      
      Derivation
      1. Split input into 2 regimes
      2. if (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2) < 8.99999985e-4

        1. Initial program 57.5%

          \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
        2. Add Preprocessing
        3. Step-by-step derivation
          1. lift-log.f32N/A

            \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
          2. lift--.f32N/A

            \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
          3. sub-negN/A

            \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
          4. lower-log1p.f32N/A

            \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
          5. lower-neg.f3299.5

            \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
        4. Applied rewrites99.5%

          \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
        5. Taylor expanded in u2 around 0

          \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{1} \]
        6. Step-by-step derivation
          1. Applied rewrites99.2%

            \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{1} \]

          if 8.99999985e-4 < (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)

          1. Initial program 55.1%

            \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          2. Add Preprocessing
          3. Taylor expanded in u1 around 0

            \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + \frac{1}{2} \cdot u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
          4. Step-by-step derivation
            1. +-commutativeN/A

              \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1 + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            2. distribute-lft-inN/A

              \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            3. *-rgt-identityN/A

              \[\leadsto \sqrt{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            4. lower-fma.f32N/A

              \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            5. *-commutativeN/A

              \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            6. lower-*.f3289.9

              \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          5. Applied rewrites89.9%

            \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
        7. Recombined 2 regimes into one program.
        8. Final simplification95.0%

          \[\leadsto \begin{array}{l} \mathbf{if}\;u2 \cdot \left(2 \cdot \pi\right) \leq 0.0008999999845400453:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\ \mathbf{else}:\\ \;\;\;\;\cos \left(u2 \cdot \left(2 \cdot \pi\right)\right) \cdot \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}\\ \end{array} \]
        9. Add Preprocessing

        Alternative 10: 90.5% accurate, 1.6× speedup?

        \[\begin{array}{l} \\ \begin{array}{l} t_0 := u2 \cdot \left(2 \cdot \pi\right)\\ \mathbf{if}\;t\_0 \leq 0.0017999999690800905:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\ \mathbf{else}:\\ \;\;\;\;\cos t\_0 \cdot \sqrt{u1}\\ \end{array} \end{array} \]
        (FPCore (cosTheta_i u1 u2)
         :precision binary32
         (let* ((t_0 (* u2 (* 2.0 PI))))
           (if (<= t_0 0.0017999999690800905)
             (* (sqrt (- (log1p (- u1)))) 1.0)
             (* (cos t_0) (sqrt u1)))))
        float code(float cosTheta_i, float u1, float u2) {
        	float t_0 = u2 * (2.0f * ((float) M_PI));
        	float tmp;
        	if (t_0 <= 0.0017999999690800905f) {
        		tmp = sqrtf(-log1pf(-u1)) * 1.0f;
        	} else {
        		tmp = cosf(t_0) * sqrtf(u1);
        	}
        	return tmp;
        }
        
        function code(cosTheta_i, u1, u2)
        	t_0 = Float32(u2 * Float32(Float32(2.0) * Float32(pi)))
        	tmp = Float32(0.0)
        	if (t_0 <= Float32(0.0017999999690800905))
        		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * Float32(1.0));
        	else
        		tmp = Float32(cos(t_0) * sqrt(u1));
        	end
        	return tmp
        end
        
        \begin{array}{l}
        
        \\
        \begin{array}{l}
        t_0 := u2 \cdot \left(2 \cdot \pi\right)\\
        \mathbf{if}\;t\_0 \leq 0.0017999999690800905:\\
        \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\
        
        \mathbf{else}:\\
        \;\;\;\;\cos t\_0 \cdot \sqrt{u1}\\
        
        
        \end{array}
        \end{array}
        
        Derivation
        1. Split input into 2 regimes
        2. if (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2) < 0.00179999997

          1. Initial program 57.9%

            \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          2. Add Preprocessing
          3. Step-by-step derivation
            1. lift-log.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            2. lift--.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            3. sub-negN/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            4. lower-log1p.f32N/A

              \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            5. lower-neg.f3299.5

              \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          4. Applied rewrites99.5%

            \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          5. Taylor expanded in u2 around 0

            \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{1} \]
          6. Step-by-step derivation
            1. Applied rewrites98.8%

              \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{1} \]

            if 0.00179999997 < (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)

            1. Initial program 54.4%

              \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            2. Add Preprocessing
            3. Applied rewrites76.4%

              \[\leadsto \color{blue}{{\left({\left(\mathsf{log1p}\left(u1\right)\right)}^{0.25}\right)}^{2}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            4. Taylor expanded in u1 around 0

              \[\leadsto \color{blue}{\sqrt{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            5. Step-by-step derivation
              1. lower-sqrt.f3278.3

                \[\leadsto \color{blue}{\sqrt{u1}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            6. Applied rewrites78.3%

              \[\leadsto \color{blue}{\sqrt{u1}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
          7. Recombined 2 regimes into one program.
          8. Final simplification90.2%

            \[\leadsto \begin{array}{l} \mathbf{if}\;u2 \cdot \left(2 \cdot \pi\right) \leq 0.0017999999690800905:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\ \mathbf{else}:\\ \;\;\;\;\cos \left(u2 \cdot \left(2 \cdot \pi\right)\right) \cdot \sqrt{u1}\\ \end{array} \]
          9. Add Preprocessing

          Alternative 11: 86.4% accurate, 1.7× speedup?

          \[\begin{array}{l} \\ \begin{array}{l} \mathbf{if}\;u2 \cdot \left(2 \cdot \pi\right) \leq 0.0007280000136233866:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\ \mathbf{else}:\\ \;\;\;\;\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\ \end{array} \end{array} \]
          (FPCore (cosTheta_i u1 u2)
           :precision binary32
           (if (<= (* u2 (* 2.0 PI)) 0.0007280000136233866)
             (* (sqrt (- (log1p (- u1)))) 1.0)
             (*
              (fma (* PI PI) (* -2.0 (* u2 u2)) 1.0)
              (sqrt (fma (* u1 u1) (fma u1 0.3333333333333333 0.5) u1)))))
          float code(float cosTheta_i, float u1, float u2) {
          	float tmp;
          	if ((u2 * (2.0f * ((float) M_PI))) <= 0.0007280000136233866f) {
          		tmp = sqrtf(-log1pf(-u1)) * 1.0f;
          	} else {
          		tmp = fmaf((((float) M_PI) * ((float) M_PI)), (-2.0f * (u2 * u2)), 1.0f) * sqrtf(fmaf((u1 * u1), fmaf(u1, 0.3333333333333333f, 0.5f), u1));
          	}
          	return tmp;
          }
          
          function code(cosTheta_i, u1, u2)
          	tmp = Float32(0.0)
          	if (Float32(u2 * Float32(Float32(2.0) * Float32(pi))) <= Float32(0.0007280000136233866))
          		tmp = Float32(sqrt(Float32(-log1p(Float32(-u1)))) * Float32(1.0));
          	else
          		tmp = Float32(fma(Float32(Float32(pi) * Float32(pi)), Float32(Float32(-2.0) * Float32(u2 * u2)), Float32(1.0)) * sqrt(fma(Float32(u1 * u1), fma(u1, Float32(0.3333333333333333), Float32(0.5)), u1)));
          	end
          	return tmp
          end
          
          \begin{array}{l}
          
          \\
          \begin{array}{l}
          \mathbf{if}\;u2 \cdot \left(2 \cdot \pi\right) \leq 0.0007280000136233866:\\
          \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\
          
          \mathbf{else}:\\
          \;\;\;\;\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\
          
          
          \end{array}
          \end{array}
          
          Derivation
          1. Split input into 2 regimes
          2. if (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2) < 7.28000014e-4

            1. Initial program 57.8%

              \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            2. Add Preprocessing
            3. Step-by-step derivation
              1. lift-log.f32N/A

                \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\log \left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              2. lift--.f32N/A

                \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 - u1\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              3. sub-negN/A

                \[\leadsto \sqrt{\mathsf{neg}\left(\log \color{blue}{\left(1 + \left(\mathsf{neg}\left(u1\right)\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              4. lower-log1p.f32N/A

                \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              5. lower-neg.f3299.6

                \[\leadsto \sqrt{-\mathsf{log1p}\left(\color{blue}{-u1}\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            4. Applied rewrites99.6%

              \[\leadsto \sqrt{-\color{blue}{\mathsf{log1p}\left(-u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            5. Taylor expanded in u2 around 0

              \[\leadsto \sqrt{\mathsf{neg}\left(\mathsf{log1p}\left(\mathsf{neg}\left(u1\right)\right)\right)} \cdot \color{blue}{1} \]
            6. Step-by-step derivation
              1. Applied rewrites99.4%

                \[\leadsto \sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot \color{blue}{1} \]

              if 7.28000014e-4 < (*.f32 (*.f32 #s(literal 2 binary32) (PI.f32)) u2)

              1. Initial program 54.8%

                \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
              2. Add Preprocessing
              3. Taylor expanded in u1 around 0

                \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              4. Step-by-step derivation
                1. +-commutativeN/A

                  \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                2. distribute-lft-inN/A

                  \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                3. associate-*r*N/A

                  \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)} + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                4. unpow2N/A

                  \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                5. *-rgt-identityN/A

                  \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                6. lower-fma.f32N/A

                  \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                7. unpow2N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                8. lower-*.f32N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                9. +-commutativeN/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\frac{1}{3} \cdot u1 + \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                10. *-commutativeN/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \frac{1}{3}} + \frac{1}{2}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
                11. lower-fma.f3293.1

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
              5. Applied rewrites93.1%

                \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
              6. Taylor expanded in u2 around 0

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
              7. Step-by-step derivation
                1. +-commutativeN/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
                2. *-commutativeN/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \left(-2 \cdot \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}\right)} + 1\right) \]
                3. associate-*r*N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \left(\color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}} + 1\right) \]
                4. *-commutativeN/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \left(\color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} \cdot {u2}^{2} + 1\right) \]
                5. associate-*l*N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \left(\color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot \left(-2 \cdot {u2}^{2}\right)} + 1\right) \]
                6. lower-fma.f32N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{\mathsf{fma}\left({\mathsf{PI}\left(\right)}^{2}, -2 \cdot {u2}^{2}, 1\right)} \]
                7. unpow2N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
                8. lower-*.f32N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
                9. lower-PI.f32N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right), -2 \cdot {u2}^{2}, 1\right) \]
                10. lower-PI.f32N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \color{blue}{\mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
                11. lower-*.f32N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), \color{blue}{-2 \cdot {u2}^{2}}, 1\right) \]
                12. unpow2N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
                13. lower-*.f3269.4

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
              8. Applied rewrites69.4%

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot \color{blue}{\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)} \]
            7. Recombined 2 regimes into one program.
            8. Final simplification85.6%

              \[\leadsto \begin{array}{l} \mathbf{if}\;u2 \cdot \left(2 \cdot \pi\right) \leq 0.0007280000136233866:\\ \;\;\;\;\sqrt{-\mathsf{log1p}\left(-u1\right)} \cdot 1\\ \mathbf{else}:\\ \;\;\;\;\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}\\ \end{array} \]
            9. Add Preprocessing

            Alternative 12: 82.4% accurate, 4.3× speedup?

            \[\begin{array}{l} \\ \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \end{array} \]
            (FPCore (cosTheta_i u1 u2)
             :precision binary32
             (*
              (fma (* PI PI) (* -2.0 (* u2 u2)) 1.0)
              (sqrt (fma (* u1 u1) (fma u1 0.3333333333333333 0.5) u1))))
            float code(float cosTheta_i, float u1, float u2) {
            	return fmaf((((float) M_PI) * ((float) M_PI)), (-2.0f * (u2 * u2)), 1.0f) * sqrtf(fmaf((u1 * u1), fmaf(u1, 0.3333333333333333f, 0.5f), u1));
            }
            
            function code(cosTheta_i, u1, u2)
            	return Float32(fma(Float32(Float32(pi) * Float32(pi)), Float32(Float32(-2.0) * Float32(u2 * u2)), Float32(1.0)) * sqrt(fma(Float32(u1 * u1), fma(u1, Float32(0.3333333333333333), Float32(0.5)), u1)))
            end
            
            \begin{array}{l}
            
            \\
            \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}
            \end{array}
            
            Derivation
            1. Initial program 56.4%

              \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            2. Add Preprocessing
            3. Taylor expanded in u1 around 0

              \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
            4. Step-by-step derivation
              1. +-commutativeN/A

                \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + 1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              2. distribute-lft-inN/A

                \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right) + u1 \cdot 1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              3. associate-*r*N/A

                \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)} + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              4. unpow2N/A

                \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + u1 \cdot 1} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              5. *-rgt-identityN/A

                \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + \color{blue}{u1}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              6. lower-fma.f32N/A

                \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)}} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              7. unpow2N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              8. lower-*.f32N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              9. +-commutativeN/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\frac{1}{3} \cdot u1 + \frac{1}{2}}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              10. *-commutativeN/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \frac{1}{3}} + \frac{1}{2}, u1\right)} \cdot \cos \left(\left(2 \cdot \mathsf{PI}\left(\right)\right) \cdot u2\right) \]
              11. lower-fma.f3292.5

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)}, u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            5. Applied rewrites92.5%

              \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            6. Taylor expanded in u2 around 0

              \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{\left(1 + -2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right)\right)} \]
            7. Step-by-step derivation
              1. +-commutativeN/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{\left(-2 \cdot \left({u2}^{2} \cdot {\mathsf{PI}\left(\right)}^{2}\right) + 1\right)} \]
              2. *-commutativeN/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \left(-2 \cdot \color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot {u2}^{2}\right)} + 1\right) \]
              3. associate-*r*N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \left(\color{blue}{\left(-2 \cdot {\mathsf{PI}\left(\right)}^{2}\right) \cdot {u2}^{2}} + 1\right) \]
              4. *-commutativeN/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \left(\color{blue}{\left({\mathsf{PI}\left(\right)}^{2} \cdot -2\right)} \cdot {u2}^{2} + 1\right) \]
              5. associate-*l*N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \left(\color{blue}{{\mathsf{PI}\left(\right)}^{2} \cdot \left(-2 \cdot {u2}^{2}\right)} + 1\right) \]
              6. lower-fma.f32N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \color{blue}{\mathsf{fma}\left({\mathsf{PI}\left(\right)}^{2}, -2 \cdot {u2}^{2}, 1\right)} \]
              7. unpow2N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
              8. lower-*.f32N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
              9. lower-PI.f32N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(\color{blue}{\mathsf{PI}\left(\right)} \cdot \mathsf{PI}\left(\right), -2 \cdot {u2}^{2}, 1\right) \]
              10. lower-PI.f32N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \color{blue}{\mathsf{PI}\left(\right)}, -2 \cdot {u2}^{2}, 1\right) \]
              11. lower-*.f32N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), \color{blue}{-2 \cdot {u2}^{2}}, 1\right) \]
              12. unpow2N/A

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot \mathsf{fma}\left(\mathsf{PI}\left(\right) \cdot \mathsf{PI}\left(\right), -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
              13. lower-*.f3281.5

                \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \color{blue}{\left(u2 \cdot u2\right)}, 1\right) \]
            8. Applied rewrites81.5%

              \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot \color{blue}{\mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right)} \]
            9. Final simplification81.5%

              \[\leadsto \mathsf{fma}\left(\pi \cdot \pi, -2 \cdot \left(u2 \cdot u2\right), 1\right) \cdot \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \]
            10. Add Preprocessing

            Alternative 13: 76.4% accurate, 5.9× speedup?

            \[\begin{array}{l} \\ \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)} \cdot 1 \end{array} \]
            (FPCore (cosTheta_i u1 u2)
             :precision binary32
             (*
              (sqrt (fma (* u1 u1) (fma u1 (fma u1 0.25 0.3333333333333333) 0.5) u1))
              1.0))
            float code(float cosTheta_i, float u1, float u2) {
            	return sqrtf(fmaf((u1 * u1), fmaf(u1, fmaf(u1, 0.25f, 0.3333333333333333f), 0.5f), u1)) * 1.0f;
            }
            
            function code(cosTheta_i, u1, u2)
            	return Float32(sqrt(fma(Float32(u1 * u1), fma(u1, fma(u1, Float32(0.25), Float32(0.3333333333333333)), Float32(0.5)), u1)) * Float32(1.0))
            end
            
            \begin{array}{l}
            
            \\
            \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)} \cdot 1
            \end{array}
            
            Derivation
            1. Initial program 56.4%

              \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
            2. Add Preprocessing
            3. Taylor expanded in u2 around 0

              \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
            4. Step-by-step derivation
              1. Applied rewrites47.8%

                \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
              2. Taylor expanded in u1 around 0

                \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right)}} \cdot 1 \]
              3. Step-by-step derivation
                1. +-commutativeN/A

                  \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + 1\right)}} \cdot 1 \]
                2. distribute-lft-inN/A

                  \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)\right) + u1 \cdot 1}} \cdot 1 \]
                3. associate-*r*N/A

                  \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right)} + u1 \cdot 1} \cdot 1 \]
                4. unpow2N/A

                  \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + u1 \cdot 1} \cdot 1 \]
                5. *-rgt-identityN/A

                  \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right)\right) + \color{blue}{u1}} \cdot 1 \]
                6. lower-fma.f32N/A

                  \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)}} \cdot 1 \]
                7. unpow2N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot 1 \]
                8. lower-*.f32N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right), u1\right)} \cdot 1 \]
                9. +-commutativeN/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \left(\frac{1}{3} + \frac{1}{4} \cdot u1\right) + \frac{1}{2}}, u1\right)} \cdot 1 \]
                10. lower-fma.f32N/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, \frac{1}{3} + \frac{1}{4} \cdot u1, \frac{1}{2}\right)}, u1\right)} \cdot 1 \]
                11. +-commutativeN/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\frac{1}{4} \cdot u1 + \frac{1}{3}}, \frac{1}{2}\right), u1\right)} \cdot 1 \]
                12. *-commutativeN/A

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{4}} + \frac{1}{3}, \frac{1}{2}\right), u1\right)} \cdot 1 \]
                13. lower-fma.f3273.7

                  \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \color{blue}{\mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right)}, 0.5\right), u1\right)} \cdot 1 \]
              4. Applied rewrites73.7%

                \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, \mathsf{fma}\left(u1, 0.25, 0.3333333333333333\right), 0.5\right), u1\right)}} \cdot 1 \]
              5. Add Preprocessing

              Alternative 14: 75.1% accurate, 7.0× speedup?

              \[\begin{array}{l} \\ \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot 1 \end{array} \]
              (FPCore (cosTheta_i u1 u2)
               :precision binary32
               (* (sqrt (fma (* u1 u1) (fma u1 0.3333333333333333 0.5) u1)) 1.0))
              float code(float cosTheta_i, float u1, float u2) {
              	return sqrtf(fmaf((u1 * u1), fmaf(u1, 0.3333333333333333f, 0.5f), u1)) * 1.0f;
              }
              
              function code(cosTheta_i, u1, u2)
              	return Float32(sqrt(fma(Float32(u1 * u1), fma(u1, Float32(0.3333333333333333), Float32(0.5)), u1)) * Float32(1.0))
              end
              
              \begin{array}{l}
              
              \\
              \sqrt{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)} \cdot 1
              \end{array}
              
              Derivation
              1. Initial program 56.4%

                \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
              2. Add Preprocessing
              3. Taylor expanded in u2 around 0

                \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
              4. Step-by-step derivation
                1. Applied rewrites47.8%

                  \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
                2. Taylor expanded in u1 around 0

                  \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right)}} \cdot 1 \]
                3. Step-by-step derivation
                  1. +-commutativeN/A

                    \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + 1\right)}} \cdot 1 \]
                  2. distribute-lft-inN/A

                    \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(u1 \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)\right) + u1 \cdot 1}} \cdot 1 \]
                  3. associate-*r*N/A

                    \[\leadsto \sqrt{\color{blue}{\left(u1 \cdot u1\right) \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right)} + u1 \cdot 1} \cdot 1 \]
                  4. unpow2N/A

                    \[\leadsto \sqrt{\color{blue}{{u1}^{2}} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + u1 \cdot 1} \cdot 1 \]
                  5. *-rgt-identityN/A

                    \[\leadsto \sqrt{{u1}^{2} \cdot \left(\frac{1}{2} + \frac{1}{3} \cdot u1\right) + \color{blue}{u1}} \cdot 1 \]
                  6. lower-fma.f32N/A

                    \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left({u1}^{2}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)}} \cdot 1 \]
                  7. unpow2N/A

                    \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot 1 \]
                  8. lower-*.f32N/A

                    \[\leadsto \sqrt{\mathsf{fma}\left(\color{blue}{u1 \cdot u1}, \frac{1}{2} + \frac{1}{3} \cdot u1, u1\right)} \cdot 1 \]
                  9. +-commutativeN/A

                    \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\frac{1}{3} \cdot u1 + \frac{1}{2}}, u1\right)} \cdot 1 \]
                  10. *-commutativeN/A

                    \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{u1 \cdot \frac{1}{3}} + \frac{1}{2}, u1\right)} \cdot 1 \]
                  11. lower-fma.f3273.0

                    \[\leadsto \sqrt{\mathsf{fma}\left(u1 \cdot u1, \color{blue}{\mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right)}, u1\right)} \cdot 1 \]
                4. Applied rewrites73.0%

                  \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1 \cdot u1, \mathsf{fma}\left(u1, 0.3333333333333333, 0.5\right), u1\right)}} \cdot 1 \]
                5. Add Preprocessing

                Alternative 15: 72.6% accurate, 8.6× speedup?

                \[\begin{array}{l} \\ \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)} \cdot 1 \end{array} \]
                (FPCore (cosTheta_i u1 u2)
                 :precision binary32
                 (* (sqrt (fma u1 (* u1 0.5) u1)) 1.0))
                float code(float cosTheta_i, float u1, float u2) {
                	return sqrtf(fmaf(u1, (u1 * 0.5f), u1)) * 1.0f;
                }
                
                function code(cosTheta_i, u1, u2)
                	return Float32(sqrt(fma(u1, Float32(u1 * Float32(0.5)), u1)) * Float32(1.0))
                end
                
                \begin{array}{l}
                
                \\
                \sqrt{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)} \cdot 1
                \end{array}
                
                Derivation
                1. Initial program 56.4%

                  \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                2. Add Preprocessing
                3. Taylor expanded in u2 around 0

                  \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
                4. Step-by-step derivation
                  1. Applied rewrites47.8%

                    \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
                  2. Taylor expanded in u1 around 0

                    \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(1 + \frac{1}{2} \cdot u1\right)}} \cdot 1 \]
                  3. Step-by-step derivation
                    1. +-commutativeN/A

                      \[\leadsto \sqrt{u1 \cdot \color{blue}{\left(\frac{1}{2} \cdot u1 + 1\right)}} \cdot 1 \]
                    2. distribute-lft-inN/A

                      \[\leadsto \sqrt{\color{blue}{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + u1 \cdot 1}} \cdot 1 \]
                    3. *-rgt-identityN/A

                      \[\leadsto \sqrt{u1 \cdot \left(\frac{1}{2} \cdot u1\right) + \color{blue}{u1}} \cdot 1 \]
                    4. lower-fma.f32N/A

                      \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, \frac{1}{2} \cdot u1, u1\right)}} \cdot 1 \]
                    5. *-commutativeN/A

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot \frac{1}{2}}, u1\right)} \cdot 1 \]
                    6. lower-*.f3271.1

                      \[\leadsto \sqrt{\mathsf{fma}\left(u1, \color{blue}{u1 \cdot 0.5}, u1\right)} \cdot 1 \]
                  4. Applied rewrites71.1%

                    \[\leadsto \sqrt{\color{blue}{\mathsf{fma}\left(u1, u1 \cdot 0.5, u1\right)}} \cdot 1 \]
                  5. Add Preprocessing

                  Alternative 16: 64.8% accurate, 11.6× speedup?

                  \[\begin{array}{l} \\ 1 \cdot \sqrt{-\left(-u1\right)} \end{array} \]
                  (FPCore (cosTheta_i u1 u2) :precision binary32 (* 1.0 (sqrt (- (- u1)))))
                  float code(float cosTheta_i, float u1, float u2) {
                  	return 1.0f * sqrtf(-(-u1));
                  }
                  
                  real(4) function code(costheta_i, u1, u2)
                      real(4), intent (in) :: costheta_i
                      real(4), intent (in) :: u1
                      real(4), intent (in) :: u2
                      code = 1.0e0 * sqrt(-(-u1))
                  end function
                  
                  function code(cosTheta_i, u1, u2)
                  	return Float32(Float32(1.0) * sqrt(Float32(-Float32(-u1))))
                  end
                  
                  function tmp = code(cosTheta_i, u1, u2)
                  	tmp = single(1.0) * sqrt(-(-u1));
                  end
                  
                  \begin{array}{l}
                  
                  \\
                  1 \cdot \sqrt{-\left(-u1\right)}
                  \end{array}
                  
                  Derivation
                  1. Initial program 56.4%

                    \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                  2. Add Preprocessing
                  3. Taylor expanded in u2 around 0

                    \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
                  4. Step-by-step derivation
                    1. Applied rewrites47.8%

                      \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
                    2. Taylor expanded in u1 around 0

                      \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{-1 \cdot u1}\right)} \cdot 1 \]
                    3. Step-by-step derivation
                      1. mul-1-negN/A

                        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot 1 \]
                      2. lower-neg.f3263.8

                        \[\leadsto \sqrt{-\color{blue}{\left(-u1\right)}} \cdot 1 \]
                    4. Applied rewrites63.8%

                      \[\leadsto \sqrt{-\color{blue}{\left(-u1\right)}} \cdot 1 \]
                    5. Final simplification63.8%

                      \[\leadsto 1 \cdot \sqrt{-\left(-u1\right)} \]
                    6. Add Preprocessing

                    Alternative 17: 4.9% accurate, 12.8× speedup?

                    \[\begin{array}{l} \\ 1 \cdot \left(-\sqrt{u1}\right) \end{array} \]
                    (FPCore (cosTheta_i u1 u2) :precision binary32 (* 1.0 (- (sqrt u1))))
                    float code(float cosTheta_i, float u1, float u2) {
                    	return 1.0f * -sqrtf(u1);
                    }
                    
                    real(4) function code(costheta_i, u1, u2)
                        real(4), intent (in) :: costheta_i
                        real(4), intent (in) :: u1
                        real(4), intent (in) :: u2
                        code = 1.0e0 * -sqrt(u1)
                    end function
                    
                    function code(cosTheta_i, u1, u2)
                    	return Float32(Float32(1.0) * Float32(-sqrt(u1)))
                    end
                    
                    function tmp = code(cosTheta_i, u1, u2)
                    	tmp = single(1.0) * -sqrt(u1);
                    end
                    
                    \begin{array}{l}
                    
                    \\
                    1 \cdot \left(-\sqrt{u1}\right)
                    \end{array}
                    
                    Derivation
                    1. Initial program 56.4%

                      \[\sqrt{-\log \left(1 - u1\right)} \cdot \cos \left(\left(2 \cdot \pi\right) \cdot u2\right) \]
                    2. Add Preprocessing
                    3. Taylor expanded in u2 around 0

                      \[\leadsto \sqrt{\mathsf{neg}\left(\log \left(1 - u1\right)\right)} \cdot \color{blue}{1} \]
                    4. Step-by-step derivation
                      1. Applied rewrites47.8%

                        \[\leadsto \sqrt{-\log \left(1 - u1\right)} \cdot \color{blue}{1} \]
                      2. Taylor expanded in u1 around 0

                        \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{-1 \cdot u1}\right)} \cdot 1 \]
                      3. Step-by-step derivation
                        1. mul-1-negN/A

                          \[\leadsto \sqrt{\mathsf{neg}\left(\color{blue}{\left(\mathsf{neg}\left(u1\right)\right)}\right)} \cdot 1 \]
                        2. lower-neg.f3263.8

                          \[\leadsto \sqrt{-\color{blue}{\left(-u1\right)}} \cdot 1 \]
                      4. Applied rewrites63.8%

                        \[\leadsto \sqrt{-\color{blue}{\left(-u1\right)}} \cdot 1 \]
                      5. Taylor expanded in u1 around 0

                        \[\leadsto \color{blue}{\left(\sqrt{u1} \cdot {\left(\sqrt{-1}\right)}^{2}\right)} \cdot 1 \]
                      6. Step-by-step derivation
                        1. *-commutativeN/A

                          \[\leadsto \color{blue}{\left({\left(\sqrt{-1}\right)}^{2} \cdot \sqrt{u1}\right)} \cdot 1 \]
                        2. unpow2N/A

                          \[\leadsto \left(\color{blue}{\left(\sqrt{-1} \cdot \sqrt{-1}\right)} \cdot \sqrt{u1}\right) \cdot 1 \]
                        3. rem-square-sqrtN/A

                          \[\leadsto \left(\color{blue}{-1} \cdot \sqrt{u1}\right) \cdot 1 \]
                        4. mul-1-negN/A

                          \[\leadsto \color{blue}{\left(\mathsf{neg}\left(\sqrt{u1}\right)\right)} \cdot 1 \]
                        5. lower-neg.f32N/A

                          \[\leadsto \color{blue}{\left(\mathsf{neg}\left(\sqrt{u1}\right)\right)} \cdot 1 \]
                        6. lower-sqrt.f324.7

                          \[\leadsto \left(-\color{blue}{\sqrt{u1}}\right) \cdot 1 \]
                      7. Applied rewrites4.7%

                        \[\leadsto \color{blue}{\left(-\sqrt{u1}\right)} \cdot 1 \]
                      8. Final simplification4.7%

                        \[\leadsto 1 \cdot \left(-\sqrt{u1}\right) \]
                      9. Add Preprocessing

                      Reproduce

                      ?
                      herbie shell --seed 2024232 
                      (FPCore (cosTheta_i u1 u2)
                        :name "Beckmann Sample, near normal, slope_x"
                        :precision binary32
                        :pre (and (and (and (> cosTheta_i 0.9999) (<= cosTheta_i 1.0)) (and (<= 2.328306437e-10 u1) (<= u1 1.0))) (and (<= 2.328306437e-10 u2) (<= u2 1.0)))
                        (* (sqrt (- (log (- 1.0 u1)))) (cos (* (* 2.0 PI) u2))))