Migdal et al, Equation (64)

?

Percentage Accurate: 99.5% → 99.6%
Time: 16.8s
Precision: binary64
Cost: 19776

?

\[ \begin{array}{c}[a1, a2] = \mathsf{sort}([a1, a2])\\ \end{array} \]
\[\frac{\cos th}{\sqrt{2}} \cdot \left(a1 \cdot a1\right) + \frac{\cos th}{\sqrt{2}} \cdot \left(a2 \cdot a2\right) \]
\[\cos th \cdot \frac{\mathsf{fma}\left(a1, a1, a2 \cdot a2\right)}{\sqrt{2}} \]
(FPCore (a1 a2 th)
 :precision binary64
 (+
  (* (/ (cos th) (sqrt 2.0)) (* a1 a1))
  (* (/ (cos th) (sqrt 2.0)) (* a2 a2))))
(FPCore (a1 a2 th)
 :precision binary64
 (* (cos th) (/ (fma a1 a1 (* a2 a2)) (sqrt 2.0))))
double code(double a1, double a2, double th) {
	return ((cos(th) / sqrt(2.0)) * (a1 * a1)) + ((cos(th) / sqrt(2.0)) * (a2 * a2));
}
double code(double a1, double a2, double th) {
	return cos(th) * (fma(a1, a1, (a2 * a2)) / sqrt(2.0));
}
function code(a1, a2, th)
	return Float64(Float64(Float64(cos(th) / sqrt(2.0)) * Float64(a1 * a1)) + Float64(Float64(cos(th) / sqrt(2.0)) * Float64(a2 * a2)))
end
function code(a1, a2, th)
	return Float64(cos(th) * Float64(fma(a1, a1, Float64(a2 * a2)) / sqrt(2.0)))
end
code[a1_, a2_, th_] := N[(N[(N[(N[Cos[th], $MachinePrecision] / N[Sqrt[2.0], $MachinePrecision]), $MachinePrecision] * N[(a1 * a1), $MachinePrecision]), $MachinePrecision] + N[(N[(N[Cos[th], $MachinePrecision] / N[Sqrt[2.0], $MachinePrecision]), $MachinePrecision] * N[(a2 * a2), $MachinePrecision]), $MachinePrecision]), $MachinePrecision]
code[a1_, a2_, th_] := N[(N[Cos[th], $MachinePrecision] * N[(N[(a1 * a1 + N[(a2 * a2), $MachinePrecision]), $MachinePrecision] / N[Sqrt[2.0], $MachinePrecision]), $MachinePrecision]), $MachinePrecision]
\frac{\cos th}{\sqrt{2}} \cdot \left(a1 \cdot a1\right) + \frac{\cos th}{\sqrt{2}} \cdot \left(a2 \cdot a2\right)
\cos th \cdot \frac{\mathsf{fma}\left(a1, a1, a2 \cdot a2\right)}{\sqrt{2}}

Local Percentage Accuracy?

The average percentage accuracy by input value. Horizontal axis shows value of an input variable; the variable is choosen in the title. Vertical axis is accuracy; higher is better. Red represent the original program, while blue represents Herbie's suggestion. These can be toggled with buttons below the plot. The line is an average while dots represent individual samples.

Herbie found 17 alternatives:

AlternativeAccuracySpeedup

Accuracy vs Speed

The accuracy (vertical axis) and speed (horizontal axis) of each of Herbie's proposed alternatives. Up and to the right is better. Each dot represents an alternative program; the red square represents the initial program.

Derivation?

  1. Initial program 99.6%

    \[\frac{\cos th}{\sqrt{2}} \cdot \left(a1 \cdot a1\right) + \frac{\cos th}{\sqrt{2}} \cdot \left(a2 \cdot a2\right) \]
  2. Simplified99.7%

    \[\leadsto \color{blue}{\cos th \cdot \frac{\mathsf{fma}\left(a1, a1, a2 \cdot a2\right)}{\sqrt{2}}} \]
    Step-by-step derivation

    [Start]99.6

    \[ \frac{\cos th}{\sqrt{2}} \cdot \left(a1 \cdot a1\right) + \frac{\cos th}{\sqrt{2}} \cdot \left(a2 \cdot a2\right) \]

    distribute-lft-out [=>]99.6

    \[ \color{blue}{\frac{\cos th}{\sqrt{2}} \cdot \left(a1 \cdot a1 + a2 \cdot a2\right)} \]

    associate-*l/ [=>]99.7

    \[ \color{blue}{\frac{\cos th \cdot \left(a1 \cdot a1 + a2 \cdot a2\right)}{\sqrt{2}}} \]

    associate-*r/ [<=]99.7

    \[ \color{blue}{\cos th \cdot \frac{a1 \cdot a1 + a2 \cdot a2}{\sqrt{2}}} \]

    fma-def [=>]99.7

    \[ \cos th \cdot \frac{\color{blue}{\mathsf{fma}\left(a1, a1, a2 \cdot a2\right)}}{\sqrt{2}} \]
  3. Final simplification99.7%

    \[\leadsto \cos th \cdot \frac{\mathsf{fma}\left(a1, a1, a2 \cdot a2\right)}{\sqrt{2}} \]

Alternatives

Alternative 1
Accuracy79.1%
Cost19780
\[\begin{array}{l} \mathbf{if}\;\cos th \leq 0.6:\\ \;\;\;\;\cos th \cdot \left(\left(a1 \cdot a1\right) \cdot \sqrt{0.5}\right)\\ \mathbf{else}:\\ \;\;\;\;\frac{a2 \cdot a2 + a1 \cdot a1}{\sqrt{2}}\\ \end{array} \]
Alternative 2
Accuracy87.6%
Cost13512
\[\begin{array}{l} \mathbf{if}\;a2 \leq 1.95 \cdot 10^{-154}:\\ \;\;\;\;\cos th \cdot \left(\left(a1 \cdot a1\right) \cdot \sqrt{0.5}\right)\\ \mathbf{elif}\;a2 \leq 1.2 \cdot 10^{-81}:\\ \;\;\;\;\frac{a2 \cdot a2 + a1 \cdot a1}{\sqrt{2}}\\ \mathbf{else}:\\ \;\;\;\;\cos th \cdot \frac{a2}{\frac{\sqrt{2}}{a2}}\\ \end{array} \]
Alternative 3
Accuracy87.6%
Cost13512
\[\begin{array}{l} \mathbf{if}\;a2 \leq 3.7 \cdot 10^{-153}:\\ \;\;\;\;\cos th \cdot \left(\left(a1 \cdot a1\right) \cdot \sqrt{0.5}\right)\\ \mathbf{elif}\;a2 \leq 1.2 \cdot 10^{-81}:\\ \;\;\;\;\frac{a2 \cdot a2 + a1 \cdot a1}{\sqrt{2}}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{0.5} \cdot \left(a2 \cdot \left(\cos th \cdot a2\right)\right)\\ \end{array} \]
Alternative 4
Accuracy87.6%
Cost13512
\[\begin{array}{l} \mathbf{if}\;a2 \leq 1.5 \cdot 10^{-152}:\\ \;\;\;\;\cos th \cdot \frac{a1 \cdot a1}{\sqrt{2}}\\ \mathbf{elif}\;a2 \leq 1.2 \cdot 10^{-81}:\\ \;\;\;\;\frac{a2 \cdot a2 + a1 \cdot a1}{\sqrt{2}}\\ \mathbf{else}:\\ \;\;\;\;\sqrt{0.5} \cdot \left(a2 \cdot \left(\cos th \cdot a2\right)\right)\\ \end{array} \]
Alternative 5
Accuracy99.5%
Cost13504
\[\frac{\cos th}{\sqrt{2}} \cdot \left(a2 \cdot a2 + a1 \cdot a1\right) \]
Alternative 6
Accuracy66.1%
Cost7241
\[\begin{array}{l} \mathbf{if}\;th \leq -6.8 \cdot 10^{+104} \lor \neg \left(th \leq -1.32 \cdot 10^{+51}\right):\\ \;\;\;\;\left(a2 \cdot a2 + a1 \cdot a1\right) \cdot \sqrt{0.5}\\ \mathbf{else}:\\ \;\;\;\;a1 \cdot \sqrt{\frac{a1 \cdot a1}{2}}\\ \end{array} \]
Alternative 7
Accuracy66.1%
Cost7240
\[\begin{array}{l} t_1 := a2 \cdot a2 + a1 \cdot a1\\ \mathbf{if}\;th \leq -6.8 \cdot 10^{+104}:\\ \;\;\;\;t_1 \cdot \sqrt{0.5}\\ \mathbf{elif}\;th \leq -1.32 \cdot 10^{+51}:\\ \;\;\;\;a1 \cdot \sqrt{\frac{a1 \cdot a1}{2}}\\ \mathbf{else}:\\ \;\;\;\;\frac{t_1}{\sqrt{2}}\\ \end{array} \]
Alternative 8
Accuracy58.6%
Cost7117
\[\begin{array}{l} \mathbf{if}\;a2 \leq 9.5 \cdot 10^{-78} \lor \neg \left(a2 \leq 1.26 \cdot 10^{-37}\right) \land a2 \leq 1.1 \cdot 10^{-8}:\\ \;\;\;\;\left(a1 \cdot a1\right) \cdot \sqrt{0.5}\\ \mathbf{else}:\\ \;\;\;\;a2 \cdot \left(a2 \cdot \sqrt{0.5}\right)\\ \end{array} \]
Alternative 9
Accuracy58.6%
Cost7116
\[\begin{array}{l} t_1 := \left(a1 \cdot a1\right) \cdot \sqrt{0.5}\\ \mathbf{if}\;a2 \leq 5 \cdot 10^{-76}:\\ \;\;\;\;t_1\\ \mathbf{elif}\;a2 \leq 9.5 \cdot 10^{-37}:\\ \;\;\;\;a2 \cdot \left(a2 \cdot \sqrt{0.5}\right)\\ \mathbf{elif}\;a2 \leq 1.15 \cdot 10^{-9}:\\ \;\;\;\;t_1\\ \mathbf{else}:\\ \;\;\;\;\left(a2 \cdot a2\right) \cdot \sqrt{0.5}\\ \end{array} \]
Alternative 10
Accuracy58.7%
Cost7116
\[\begin{array}{l} \mathbf{if}\;a2 \leq 4.3 \cdot 10^{-76}:\\ \;\;\;\;a1 \cdot \frac{a1}{\sqrt{2}}\\ \mathbf{elif}\;a2 \leq 6.4 \cdot 10^{-37}:\\ \;\;\;\;a2 \cdot \left(a2 \cdot \sqrt{0.5}\right)\\ \mathbf{elif}\;a2 \leq 8.8 \cdot 10^{-10}:\\ \;\;\;\;\left(a1 \cdot a1\right) \cdot \sqrt{0.5}\\ \mathbf{else}:\\ \;\;\;\;\left(a2 \cdot a2\right) \cdot \sqrt{0.5}\\ \end{array} \]
Alternative 11
Accuracy58.6%
Cost7116
\[\begin{array}{l} \mathbf{if}\;a2 \leq 1.3 \cdot 10^{-79}:\\ \;\;\;\;a1 \cdot \frac{a1}{\sqrt{2}}\\ \mathbf{elif}\;a2 \leq 1.78 \cdot 10^{-37}:\\ \;\;\;\;a2 \cdot \left(a2 \cdot \sqrt{0.5}\right)\\ \mathbf{elif}\;a2 \leq 2.4 \cdot 10^{-10}:\\ \;\;\;\;\left(a1 \cdot a1\right) \cdot \sqrt{0.5}\\ \mathbf{else}:\\ \;\;\;\;\frac{a2}{\frac{\sqrt{2}}{a2}}\\ \end{array} \]
Alternative 12
Accuracy58.6%
Cost7116
\[\begin{array}{l} \mathbf{if}\;a2 \leq 1.12 \cdot 10^{-76}:\\ \;\;\;\;\frac{a1 \cdot a1}{\sqrt{2}}\\ \mathbf{elif}\;a2 \leq 2 \cdot 10^{-37}:\\ \;\;\;\;a2 \cdot \left(a2 \cdot \sqrt{0.5}\right)\\ \mathbf{elif}\;a2 \leq 5.3 \cdot 10^{-8}:\\ \;\;\;\;\left(a1 \cdot a1\right) \cdot \sqrt{0.5}\\ \mathbf{else}:\\ \;\;\;\;\frac{a2}{\frac{\sqrt{2}}{a2}}\\ \end{array} \]
Alternative 13
Accuracy58.6%
Cost7116
\[\begin{array}{l} \mathbf{if}\;a2 \leq 8 \cdot 10^{-79}:\\ \;\;\;\;\frac{a1 \cdot a1}{\sqrt{2}}\\ \mathbf{elif}\;a2 \leq 1.6 \cdot 10^{-37}:\\ \;\;\;\;\frac{a2 \cdot a2}{\sqrt{2}}\\ \mathbf{elif}\;a2 \leq 1.05 \cdot 10^{-7}:\\ \;\;\;\;\left(a1 \cdot a1\right) \cdot \sqrt{0.5}\\ \mathbf{else}:\\ \;\;\;\;\frac{a2}{\frac{\sqrt{2}}{a2}}\\ \end{array} \]
Alternative 14
Accuracy53.8%
Cost6852
\[\begin{array}{l} \mathbf{if}\;a1 \leq -9.5 \cdot 10^{+133}:\\ \;\;\;\;a1 \cdot a1\\ \mathbf{else}:\\ \;\;\;\;a2 \cdot \left(a2 \cdot \sqrt{0.5}\right)\\ \end{array} \]
Alternative 15
Accuracy30.0%
Cost320
\[a1 \cdot \left(a1 \cdot 0.5\right) \]
Alternative 16
Accuracy29.9%
Cost192
\[a1 \cdot a1 \]

Reproduce?

herbie shell --seed 2023160 
(FPCore (a1 a2 th)
  :name "Migdal et al, Equation (64)"
  :precision binary64
  (+ (* (/ (cos th) (sqrt 2.0)) (* a1 a1)) (* (/ (cos th) (sqrt 2.0)) (* a2 a2))))