| Alternative 1 | |
|---|---|
| Accuracy | 99.6% |
| Cost | 19776 |
\[\frac{\mathsf{fma}\left(a2, a2, a1 \cdot a1\right)}{\frac{\sqrt{2}}{\cos th}}
\]

(FPCore (a1 a2 th) :precision binary64 (+ (* (/ (cos th) (sqrt 2.0)) (* a1 a1)) (* (/ (cos th) (sqrt 2.0)) (* a2 a2))))
(FPCore (a1 a2 th) :precision binary64 (/ (fma a2 a2 (* a1 a1)) (/ (sqrt 2.0) (cos th))))
double code(double a1, double a2, double th) {
return ((cos(th) / sqrt(2.0)) * (a1 * a1)) + ((cos(th) / sqrt(2.0)) * (a2 * a2));
}
double code(double a1, double a2, double th) {
return fma(a2, a2, (a1 * a1)) / (sqrt(2.0) / cos(th));
}
function code(a1, a2, th) return Float64(Float64(Float64(cos(th) / sqrt(2.0)) * Float64(a1 * a1)) + Float64(Float64(cos(th) / sqrt(2.0)) * Float64(a2 * a2))) end
function code(a1, a2, th) return Float64(fma(a2, a2, Float64(a1 * a1)) / Float64(sqrt(2.0) / cos(th))) end
code[a1_, a2_, th_] := N[(N[(N[(N[Cos[th], $MachinePrecision] / N[Sqrt[2.0], $MachinePrecision]), $MachinePrecision] * N[(a1 * a1), $MachinePrecision]), $MachinePrecision] + N[(N[(N[Cos[th], $MachinePrecision] / N[Sqrt[2.0], $MachinePrecision]), $MachinePrecision] * N[(a2 * a2), $MachinePrecision]), $MachinePrecision]), $MachinePrecision]
code[a1_, a2_, th_] := N[(N[(a2 * a2 + N[(a1 * a1), $MachinePrecision]), $MachinePrecision] / N[(N[Sqrt[2.0], $MachinePrecision] / N[Cos[th], $MachinePrecision]), $MachinePrecision]), $MachinePrecision]
\frac{\cos th}{\sqrt{2}} \cdot \left(a1 \cdot a1\right) + \frac{\cos th}{\sqrt{2}} \cdot \left(a2 \cdot a2\right)
\frac{\mathsf{fma}\left(a2, a2, a1 \cdot a1\right)}{\frac{\sqrt{2}}{\cos th}}
Herbie found 13 alternatives:
| Alternative | Accuracy | Speedup |
|---|
Initial program 99.6%
Simplified99.7%
[Start]99.6% | \[ \frac{\cos th}{\sqrt{2}} \cdot \left(a1 \cdot a1\right) + \frac{\cos th}{\sqrt{2}} \cdot \left(a2 \cdot a2\right)
\] |
|---|---|
distribute-lft-out [=>]99.6% | \[ \color{blue}{\frac{\cos th}{\sqrt{2}} \cdot \left(a1 \cdot a1 + a2 \cdot a2\right)}
\] |
associate-*l/ [=>]99.7% | \[ \color{blue}{\frac{\cos th \cdot \left(a1 \cdot a1 + a2 \cdot a2\right)}{\sqrt{2}}}
\] |
associate-*r/ [<=]99.6% | \[ \color{blue}{\cos th \cdot \frac{a1 \cdot a1 + a2 \cdot a2}{\sqrt{2}}}
\] |
fma-def [=>]99.7% | \[ \cos th \cdot \frac{\color{blue}{\mathsf{fma}\left(a1, a1, a2 \cdot a2\right)}}{\sqrt{2}}
\] |
Taylor expanded in th around inf 99.7%
Simplified99.7%
[Start]99.7% | \[ \frac{\left({a2}^{2} + {a1}^{2}\right) \cdot \cos th}{\sqrt{2}}
\] |
|---|---|
associate-/l* [=>]99.7% | \[ \color{blue}{\frac{{a2}^{2} + {a1}^{2}}{\frac{\sqrt{2}}{\cos th}}}
\] |
unpow2 [=>]99.7% | \[ \frac{\color{blue}{a2 \cdot a2} + {a1}^{2}}{\frac{\sqrt{2}}{\cos th}}
\] |
unpow2 [=>]99.7% | \[ \frac{a2 \cdot a2 + \color{blue}{a1 \cdot a1}}{\frac{\sqrt{2}}{\cos th}}
\] |
fma-def [=>]99.7% | \[ \frac{\color{blue}{\mathsf{fma}\left(a2, a2, a1 \cdot a1\right)}}{\frac{\sqrt{2}}{\cos th}}
\] |
Final simplification99.7%
| Alternative 1 | |
|---|---|
| Accuracy | 99.6% |
| Cost | 19776 |
| Alternative 2 | |
|---|---|
| Accuracy | 78.7% |
| Cost | 19780 |
| Alternative 3 | |
|---|---|
| Accuracy | 99.6% |
| Cost | 19776 |
| Alternative 4 | |
|---|---|
| Accuracy | 99.6% |
| Cost | 13504 |
| Alternative 5 | |
|---|---|
| Accuracy | 87.8% |
| Cost | 13380 |
| Alternative 6 | |
|---|---|
| Accuracy | 87.7% |
| Cost | 13380 |
| Alternative 7 | |
|---|---|
| Accuracy | 87.7% |
| Cost | 13380 |
| Alternative 8 | |
|---|---|
| Accuracy | 65.8% |
| Cost | 6976 |
| Alternative 9 | |
|---|---|
| Accuracy | 65.8% |
| Cost | 6976 |
| Alternative 10 | |
|---|---|
| Accuracy | 58.7% |
| Cost | 6852 |
| Alternative 11 | |
|---|---|
| Accuracy | 58.7% |
| Cost | 6852 |
| Alternative 12 | |
|---|---|
| Accuracy | 58.7% |
| Cost | 6852 |
| Alternative 13 | |
|---|---|
| Accuracy | 40.0% |
| Cost | 6720 |
herbie shell --seed 2023165
(FPCore (a1 a2 th)
:name "Migdal et al, Equation (64)"
:precision binary64
(+ (* (/ (cos th) (sqrt 2.0)) (* a1 a1)) (* (/ (cos th) (sqrt 2.0)) (* a2 a2))))