| Alternative 1 | |
|---|---|
| Accuracy | 99.9% |
| Cost | 3360 |
\[ \mathsf{fma}\left(ux, maxCos + -1, 1\right) \]

(FPCore (ux uy maxCos) :precision binary32 (+ (- 1.0 ux) (* ux maxCos)))
(FPCore (ux uy maxCos) :precision binary32 (fma ux (+ maxCos -1.0) 1.0))
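#include <math.h>
// Original program
float code(float ux, float uy, float maxCos) {
	return (1.0f - ux) + (ux * maxCos);
}
// Rewritten program using a fused multiply-add (fmaf is declared in math.h)
float code(float ux, float uy, float maxCos) {
	return fmaf(ux, (maxCos + -1.0f), 1.0f);
}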
function code(ux, uy, maxCos) return Float32(Float32(Float32(1.0) - ux) + Float32(ux * maxCos)) end
function code(ux, uy, maxCos) return fma(ux, Float32(maxCos + Float32(-1.0)), Float32(1.0)) end
\[ \left(1 - ux\right) + ux \cdot maxCos \]
\[ \mathsf{fma}\left(ux, maxCos + -1, 1\right) \]
Herbie found 5 alternatives:
Initial program 99.9%
Simplified 100.0%
| Step | Accuracy | Expression |
|---|---|---|
| [Start] | 99.9% | \[ \left(1 - ux\right) + ux \cdot maxCos \] |
| sub-neg [=>] | 99.9% | \[ \color{blue}{\left(1 + \left(-ux\right)\right)} + ux \cdot maxCos \] |
| associate-+l+ [=>] | 100.0% | \[ \color{blue}{1 + \left(\left(-ux\right) + ux \cdot maxCos\right)} \] |
| +-commutative [=>] | 100.0% | \[ \color{blue}{\left(\left(-ux\right) + ux \cdot maxCos\right) + 1} \] |
| neg-mul-1 [=>] | 100.0% | \[ \left(\color{blue}{-1 \cdot ux} + ux \cdot maxCos\right) + 1 \] |
| *-commutative [=>] | 100.0% | \[ \left(-1 \cdot ux + \color{blue}{maxCos \cdot ux}\right) + 1 \] |
| distribute-rgt-out [=>] | 100.0% | \[ \color{blue}{ux \cdot \left(-1 + maxCos\right)} + 1 \] |
| fma-def [=>] | 100.0% | \[ \color{blue}{\mathsf{fma}\left(ux, -1 + maxCos, 1\right)} \] |
| +-commutative [=>] | 100.0% | \[ \mathsf{fma}\left(ux, \color{blue}{maxCos + -1}, 1\right) \] |
Final simplification 100.0%
| Alternative | Accuracy | Cost |
|---|---|---|
| 1 | 99.9% | 3360 |
| 2 | 99.9% | 224 |
| 3 | 99.9% | 224 |
| 4 | 98.1% | 96 |
| 5 | 71.3% | 32 |
herbie shell --seed 2023229
(FPCore (ux uy maxCos)
:name "UniformSampleCone, z"
:precision binary32
:pre (and (and (and (<= 2.328306437e-10 ux) (<= ux 1.0)) (and (<= 2.328306437e-10 uy) (<= uy 1.0))) (and (<= 0.0 maxCos) (<= maxCos 1.0)))
(+ (- 1.0 ux) (* ux maxCos)))
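The `:pre` clause above restricts the inputs that the reported accuracy applies to. A minimal sketch of the same check in C, for callers who want to confirm an input falls in that range, might look like the following. The names `SAMPLE_MIN` and `cone_z_in_domain` are hypothetical; the bounds are copied from the clause, and the lower bound is approximately 2^-32.

```c
#include <stdbool.h>

/* Lower bound on ux and uy, copied from the :pre clause (about 2^-32). */
#define SAMPLE_MIN 2.328306437e-10f

/* Hypothetical helper: true when (ux, uy, maxCos) satisfies the :pre clause. */
static bool cone_z_in_domain(float ux, float uy, float maxCos) {
    return SAMPLE_MIN <= ux && ux <= 1.0f
        && SAMPLE_MIN <= uy && uy <= 1.0f
        && 0.0f <= maxCos && maxCos <= 1.0f;
}
```

Inputs outside this range, such as `ux = 0`, are not covered by the accuracy figures in this report.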