Time bar (total: 18.4s)
| 1× | search |
| True | Other | False | Iter |
|---|---|---|---|
| 0% | 0.6% | 99.4% | 0 |
| 0% | 0.6% | 99.4% | 1 |
| 0% | 0.6% | 99.4% | 2 |
| 0.3% | 0.3% | 99.4% | 3 |
| 0.3% | 0.3% | 99.4% | 4 |
| 0.5% | 0.2% | 99.4% | 5 |
| 0.5% | 0.2% | 99.4% | 6 |
| 0.6% | 0.1% | 99.4% | 7 |
| 0.6% | 0.1% | 99.4% | 8 |
| 0.6% | 0% | 99.4% | 9 |
| 0.6% | 0% | 99.4% | 10 |
| 0.6% | 0% | 99.4% | 11 |
| 0.6% | 0% | 99.4% | 12 |
| 0.6% | 0% | 99.4% | 13 |
| 0.6% | 0% | 99.4% | 14 |
Compiled 41 to 27 computations (34.1% saved)
| 1.7s | 8256× | body | 128 | valid |
Compiled 108 to 70 computations (35.2% saved)
| 1× | egg-herbie |
| 608× | associate-/l*_binary32 |
| 462× | associate-*l*_binary32 |
| 382× | associate-*l/_binary32 |
| 363× | associate-*r*_binary32 |
| 360× | distribute-lft-in_binary32 |
Useful iterations: 1 (0.0ms)
| Iter | Nodes | Cost |
|---|---|---|
| 0 | 14 | 33 |
| 1 | 45 | 31 |
| 2 | 160 | 31 |
| 3 | 661 | 31 |
| 4 | 2791 | 31 |
| 5 | 4866 | 31 |
| 6 | 4978 | 31 |
3 alts after pruning (3 fresh and 0 done)
| Pruned | Kept | Total | |
|---|---|---|---|
| New | 0 | 2 | 2 |
| Fresh | 0 | 1 | 1 |
| Picked | 0 | 0 | 0 |
| Done | 0 | 0 | 0 |
| Total | 0 | 3 | 3 |
| Status | Error | Program |
| ▶ | 0.4b | (/.f32 (-.f32 (*.f32 alpha alpha) 1) (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (+.f32 1 (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta)))) |
| 0.4b | (/.f32 (fma.f32 alpha alpha -1) (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) |
Compiled 145 to 87 computations (40% saved)
Found 4 expressions with local error:
| New | Error | Program |
| ✓ | 0.1b | (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta) |
| ✓ | 0.2b | (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) |
| ✓ | 0.3b | (/.f32 (-.f32 (*.f32 alpha alpha) 1) (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (+.f32 1 (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta)))) |
| ✓ | 0.3b | (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) |
4 calls:
| 57.0ms | (/.f32 (-.f32 (*.f32 alpha alpha) 1) (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (+.f32 1 (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta)))) |
| 28.0ms | (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) |
| 10.0ms | (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta) |
| 9.0ms | (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) |
| 2× | batch-egg-rewrite |
| 266× | log1p-udef_binary32 |
| 265× | expm1-udef_binary32 |
| 154× | add-sqr-sqrt_binary32 |
| 146× | log1p-expm1-u_binary32 |
| 146× | expm1-log1p-u_binary32 |
4 calls:
| 93.0ms | (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta) |
| 93.0ms | (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) |
| 93.0ms | (/.f32 (-.f32 (*.f32 alpha alpha) 1) (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (+.f32 1 (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta)))) |
| 93.0ms | (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) |
Useful iterations: 0 (0.0ms)
| Iter | Nodes | Cost |
|---|---|---|
| 0 | 14 | 62 |
| 1 | 310 | 60 |
| 2 | 4242 | 60 |
| 3 | 4972 | 60 |
| 0 | 0 | 0 |
| 1 | 0 | 0 |
| 1× | egg-herbie |
| 335× | associate-/r*_binary32 |
| 303× | cancel-sign-sub-inv_binary32 |
| 267× | associate-*r*_binary32 |
| 258× | fma-neg_binary32 |
| 223× | sub-neg_binary32 |
Useful iterations: 2 (0.0ms)
| Iter | Nodes | Cost |
|---|---|---|
| 0 | 264 | 3015 |
| 1 | 865 | 2707 |
| 2 | 3733 | 2705 |
| 3 | 4989 | 2705 |
9 alts after pruning (9 fresh and 0 done)
| Pruned | Kept | Total | |
|---|---|---|---|
| New | 156 | 9 | 165 |
| Fresh | 1 | 0 | 1 |
| Picked | 1 | 0 | 1 |
| Done | 0 | 0 | 0 |
| Total | 158 | 9 | 167 |
| Status | Error | Program |
| 0.6b | (pow.f32 (sqrt.f32 (/.f32 (fma.f32 alpha alpha -1) (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) 2) | |
| 0.5b | (*.f32 (/.f32 1 (PI.f32)) (/.f32 (fma.f32 alpha alpha -1) (*.f32 (log.f32 (*.f32 alpha alpha)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) | |
| 0.6b | (*.f32 (/.f32 (+.f32 alpha 1) (PI.f32)) (/.f32 (+.f32 alpha -1) (*.f32 (log.f32 (*.f32 alpha alpha)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) | |
| 0.5b | (/.f32 (-.f32 (*.f32 alpha alpha) 1) (*.f32 (*.f32 -2 (*.f32 (PI.f32) (log.f32 (/.f32 1 alpha)))) (+.f32 1 (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta)))) | |
| 0.6b | (fma.f32 2 (/.f32 (*.f32 (*.f32 cosTheta cosTheta) (*.f32 alpha alpha)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (+.f32 (fma.f32 4 (/.f32 (*.f32 (pow.f32 alpha 6) (pow.f32 cosTheta 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (fma.f32 3 (*.f32 (/.f32 (*.f32 alpha alpha) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (pow.f32 cosTheta 4)) (fma.f32 4 (/.f32 (*.f32 (*.f32 alpha alpha) (pow.f32 cosTheta 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (/.f32 (*.f32 (pow.f32 cosTheta 4) (pow.f32 alpha 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha))))))) (fma.f32 (/.f32 alpha (PI.f32)) (/.f32 alpha (*.f32 2 (log.f32 alpha))) (-.f32 (/.f32 (/.f32 -1/2 (log.f32 alpha)) (PI.f32)) (+.f32 (fma.f32 (/.f32 (pow.f32 cosTheta 6) (*.f32 2 (log.f32 alpha))) (/.f32 (pow.f32 alpha 8) (PI.f32)) (fma.f32 6 (/.f32 (*.f32 (pow.f32 alpha 4) (pow.f32 cosTheta 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (+.f32 (/.f32 (pow.f32 cosTheta 6) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (fma.f32 3 (/.f32 (*.f32 (pow.f32 alpha 4) (pow.f32 cosTheta 4)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (/.f32 (*.f32 cosTheta cosTheta) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))))))) (fma.f32 (/.f32 (*.f32 cosTheta cosTheta) (*.f32 2 (log.f32 alpha))) (/.f32 (pow.f32 alpha 4) (PI.f32)) (/.f32 (pow.f32 cosTheta 4) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))))))))) | |
| ▶ | 0.4b | (/.f32 (-.f32 (*.f32 alpha alpha) 1) (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (+.f32 1 (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta)))) |
| 0.9b | (*.f32 (/.f32 (+.f32 alpha 1) (pow.f32 (cbrt.f32 (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) 2)) (/.f32 (+.f32 alpha -1) (cbrt.f32 (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) | |
| 0.6b | (*.f32 (/.f32 (+.f32 alpha 1) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)) (/.f32 (+.f32 alpha -1) (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))))) | |
| 0.7b | (*.f32 (/.f32 (pow.f32 (cbrt.f32 (fma.f32 alpha alpha -1)) 2) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)) (/.f32 (cbrt.f32 (fma.f32 alpha alpha -1)) (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))))) |
Compiled 6883 to 3845 computations (44.1% saved)
Found 4 expressions with local error:
| New | Error | Program |
| 0.1b | (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta) | |
| 0.2b | (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) | |
| ✓ | 0.3b | (/.f32 (-.f32 (*.f32 alpha alpha) 1) (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (+.f32 1 (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta)))) |
| ✓ | 3.0b | (pow.f32 (*.f32 alpha alpha) (PI.f32)) |
2 calls:
| 71.0ms | (/.f32 (-.f32 (*.f32 alpha alpha) 1) (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (+.f32 1 (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta)))) |
| 24.0ms | (pow.f32 (*.f32 alpha alpha) (PI.f32)) |
| 2× | batch-egg-rewrite |
| 379× | fma-neg_binary32 |
| 266× | log1p-udef_binary32 |
| 265× | expm1-udef_binary32 |
| 156× | add-sqr-sqrt_binary32 |
| 149× | log1p-expm1-u_binary32 |
2 calls:
| 116.0ms | (/.f32 (-.f32 (*.f32 alpha alpha) 1) (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (+.f32 1 (*.f32 (*.f32 (-.f32 (*.f32 alpha alpha) 1) cosTheta) cosTheta)))) |
| 116.0ms | (pow.f32 (*.f32 alpha alpha) (PI.f32)) |
Useful iterations: 0 (0.0ms)
| Iter | Nodes | Cost |
|---|---|---|
| 0 | 14 | 41 |
| 1 | 314 | 39 |
| 2 | 4188 | 39 |
| 3 | 4991 | 39 |
| 4 | 5302 | 39 |
| 0 | 0 | 0 |
| 1 | 0 | 0 |
| 1× | egg-herbie |
| 335× | associate-/r*_binary32 |
| 300× | cancel-sign-sub-inv_binary32 |
| 265× | associate-*r*_binary32 |
| 246× | fma-neg_binary32 |
| 220× | sub-neg_binary32 |
Useful iterations: 2 (0.0ms)
| Iter | Nodes | Cost |
|---|---|---|
| 0 | 259 | 2450 |
| 1 | 856 | 2289 |
| 2 | 3741 | 2243 |
| 3 | 5049 | 2243 |
9 alts after pruning (9 fresh and 0 done)
| Pruned | Kept | Total | |
|---|---|---|---|
| New | 175 | 4 | 179 |
| Fresh | 3 | 5 | 8 |
| Picked | 1 | 0 | 1 |
| Done | 0 | 0 | 0 |
| Total | 179 | 9 | 188 |
| Status | Error | Program |
| 0.6b | (*.f32 (/.f32 (+.f32 alpha 1) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)) (/.f32 (+.f32 alpha -1) (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))))) | |
| 0.6b | (*.f32 (/.f32 (pow.f32 (cbrt.f32 (fma.f32 alpha alpha -1)) 2) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)) (/.f32 (cbrt.f32 (fma.f32 alpha alpha -1)) (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))))) | |
| 0.6b | (*.f32 (+.f32 alpha 1) (*.f32 (+.f32 alpha -1) (/.f32 1 (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) | |
| 0.5b | (*.f32 (fma.f32 alpha alpha -1) (/.f32 1 (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) | |
| 0.6b | (fma.f32 2 (/.f32 (*.f32 (*.f32 cosTheta cosTheta) (*.f32 alpha alpha)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (+.f32 (fma.f32 4 (/.f32 (*.f32 (pow.f32 alpha 6) (pow.f32 cosTheta 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (fma.f32 3 (*.f32 (/.f32 (*.f32 alpha alpha) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (pow.f32 cosTheta 4)) (fma.f32 4 (/.f32 (*.f32 (*.f32 alpha alpha) (pow.f32 cosTheta 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (/.f32 (*.f32 (pow.f32 cosTheta 4) (pow.f32 alpha 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha))))))) (fma.f32 (/.f32 alpha (PI.f32)) (/.f32 alpha (*.f32 2 (log.f32 alpha))) (-.f32 (/.f32 (/.f32 -1/2 (log.f32 alpha)) (PI.f32)) (+.f32 (fma.f32 (/.f32 (pow.f32 cosTheta 6) (*.f32 2 (log.f32 alpha))) (/.f32 (pow.f32 alpha 8) (PI.f32)) (fma.f32 6 (/.f32 (*.f32 (pow.f32 alpha 4) (pow.f32 cosTheta 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (+.f32 (/.f32 (pow.f32 cosTheta 6) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (fma.f32 3 (/.f32 (*.f32 (pow.f32 alpha 4) (pow.f32 cosTheta 4)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (/.f32 (*.f32 cosTheta cosTheta) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))))))) (fma.f32 (/.f32 (*.f32 cosTheta cosTheta) (*.f32 2 (log.f32 alpha))) (/.f32 (pow.f32 alpha 4) (PI.f32)) (/.f32 (pow.f32 cosTheta 4) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))))))))) | |
| 0.6b | (pow.f32 (sqrt.f32 (/.f32 (fma.f32 alpha alpha -1) (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) 2) | |
| ▶ | 0.4b | (pow.f32 (/.f32 (fma.f32 alpha alpha -1) (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) 1) |
| 0.9b | (*.f32 (/.f32 (+.f32 alpha 1) (pow.f32 (cbrt.f32 (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) 2)) (/.f32 (+.f32 alpha -1) (cbrt.f32 (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) | |
| 0.6b | (*.f32 (/.f32 (+.f32 alpha 1) (PI.f32)) (/.f32 (+.f32 alpha -1) (*.f32 (log.f32 (*.f32 alpha alpha)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) |
Compiled 7722 to 4205 computations (45.5% saved)
Found 4 expressions with local error:
| New | Error | Program |
| ✓ | 0.0b | (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) |
| ✓ | 0.0b | (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)) |
| ✓ | 0.3b | (/.f32 (fma.f32 alpha alpha -1) (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) |
| 3.0b | (pow.f32 (*.f32 alpha alpha) (PI.f32)) |
3 calls:
| 358.0ms | (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) |
| 310.0ms | (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)) |
| 66.0ms | (/.f32 (fma.f32 alpha alpha -1) (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) |
| 2× | batch-egg-rewrite |
| 667× | log-prod_binary32 |
| 230× | expm1-udef_binary32 |
| 230× | log1p-udef_binary32 |
| 228× | log-pow_binary32 |
| 134× | add-sqr-sqrt_binary32 |
3 calls:
| 70.0ms | (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) |
| 70.0ms | (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)) |
| 70.0ms | (/.f32 (fma.f32 alpha alpha -1) (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) |
Useful iterations: 0 (0.0ms)
| Iter | Nodes | Cost |
|---|---|---|
| 0 | 14 | 65 |
| 1 | 276 | 65 |
| 2 | 3186 | 65 |
| 3 | 5618 | 65 |
| 0 | 0 | 0 |
| 1 | 0 | 0 |
| 1× | egg-herbie |
| 424× | fma-neg_binary32 |
| 366× | associate-*r*_binary32 |
| 335× | associate-/r*_binary32 |
| 272× | associate-*l*_binary32 |
| 249× | sub-neg_binary32 |
Useful iterations: 2 (0.0ms)
| Iter | Nodes | Cost |
|---|---|---|
| 0 | 289 | 3481 |
| 1 | 967 | 3094 |
| 2 | 4210 | 3085 |
| 3 | 5134 | 3085 |
9 alts after pruning (9 fresh and 0 done)
| Pruned | Kept | Total | |
|---|---|---|---|
| New | 185 | 1 | 186 |
| Fresh | 0 | 8 | 8 |
| Picked | 1 | 0 | 1 |
| Done | 0 | 0 | 0 |
| Total | 186 | 9 | 195 |
| Status | Error | Program |
| 0.6b | (*.f32 (/.f32 (+.f32 alpha 1) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)) (/.f32 (+.f32 alpha -1) (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))))) | |
| 0.6b | (*.f32 (/.f32 (pow.f32 (cbrt.f32 (fma.f32 alpha alpha -1)) 2) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)) (/.f32 (cbrt.f32 (fma.f32 alpha alpha -1)) (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))))) | |
| ▶ | 0.5b | (pow.f32 (/.f32 (fma.f32 alpha alpha -1) (+.f32 (log.f32 (*.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) (log.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))))) 1) |
| 0.6b | (*.f32 (+.f32 alpha 1) (*.f32 (+.f32 alpha -1) (/.f32 1 (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) | |
| 0.5b | (*.f32 (fma.f32 alpha alpha -1) (/.f32 1 (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) | |
| 0.6b | (fma.f32 2 (/.f32 (*.f32 (*.f32 cosTheta cosTheta) (*.f32 alpha alpha)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (+.f32 (fma.f32 4 (/.f32 (*.f32 (pow.f32 alpha 6) (pow.f32 cosTheta 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (fma.f32 3 (*.f32 (/.f32 (*.f32 alpha alpha) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (pow.f32 cosTheta 4)) (fma.f32 4 (/.f32 (*.f32 (*.f32 alpha alpha) (pow.f32 cosTheta 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (/.f32 (*.f32 (pow.f32 cosTheta 4) (pow.f32 alpha 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha))))))) (fma.f32 (/.f32 alpha (PI.f32)) (/.f32 alpha (*.f32 2 (log.f32 alpha))) (-.f32 (/.f32 (/.f32 -1/2 (log.f32 alpha)) (PI.f32)) (+.f32 (fma.f32 (/.f32 (pow.f32 cosTheta 6) (*.f32 2 (log.f32 alpha))) (/.f32 (pow.f32 alpha 8) (PI.f32)) (fma.f32 6 (/.f32 (*.f32 (pow.f32 alpha 4) (pow.f32 cosTheta 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (+.f32 (/.f32 (pow.f32 cosTheta 6) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (fma.f32 3 (/.f32 (*.f32 (pow.f32 alpha 4) (pow.f32 cosTheta 4)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (/.f32 (*.f32 cosTheta cosTheta) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))))))) (fma.f32 (/.f32 (*.f32 cosTheta cosTheta) (*.f32 2 (log.f32 alpha))) (/.f32 (pow.f32 alpha 4) (PI.f32)) (/.f32 (pow.f32 cosTheta 4) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))))))))) | |
| 0.6b | (pow.f32 (sqrt.f32 (/.f32 (fma.f32 alpha alpha -1) (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) 2) | |
| 0.9b | (*.f32 (/.f32 (+.f32 alpha 1) (pow.f32 (cbrt.f32 (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) 2)) (/.f32 (+.f32 alpha -1) (cbrt.f32 (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) | |
| 0.6b | (*.f32 (/.f32 (+.f32 alpha 1) (PI.f32)) (/.f32 (+.f32 alpha -1) (*.f32 (log.f32 (*.f32 alpha alpha)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) |
Compiled 7778 to 4484 computations (42.4% saved)
Found 4 expressions with local error:
| New | Error | Program |
| ✓ | 0.3b | (/.f32 (fma.f32 alpha alpha -1) (+.f32 (log.f32 (*.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) (log.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))))) |
| ✓ | 0.3b | (+.f32 (log.f32 (*.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) (log.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) |
| ✓ | 0.5b | (*.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) |
| 3.0b | (pow.f32 (*.f32 alpha alpha) (PI.f32)) |
3 calls:
| 5.9s | (+.f32 (log.f32 (*.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) (log.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) |
| 2.9s | (/.f32 (fma.f32 alpha alpha -1) (+.f32 (log.f32 (*.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) (log.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))))) |
| 1.1s | (*.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) |
| 2× | batch-egg-rewrite |
| 628× | prod-diff_binary32 |
| 174× | add-sqr-sqrt_binary32 |
| 165× | log1p-expm1-u_binary32 |
| 165× | expm1-log1p-u_binary32 |
| 164× | add-log-exp_binary32 |
3 calls:
| 89.0ms | (/.f32 (fma.f32 alpha alpha -1) (+.f32 (log.f32 (*.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) (log.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))))) |
| 89.0ms | (+.f32 (log.f32 (*.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) (log.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) |
| 89.0ms | (*.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) |
Useful iterations: 0 (0.0ms)
| Iter | Nodes | Cost |
|---|---|---|
| 0 | 18 | 82 |
| 1 | 362 | 79 |
| 2 | 4154 | 79 |
| 3 | 5273 | 79 |
| 0 | 0 | 0 |
| 1 | 0 | 0 |
| 1× | egg-herbie |
| 595× | unswap-sqr_binary32 |
| 462× | associate-*r*_binary32 |
| 369× | associate-*l*_binary32 |
| 214× | *-commutative_binary32 |
| 209× | times-frac_binary32 |
Useful iterations: 2 (0.0ms)
| Iter | Nodes | Cost |
|---|---|---|
| 0 | 276 | 6816 |
| 1 | 825 | 6407 |
| 2 | 4026 | 6194 |
| 3 | 5779 | 6194 |
9 alts after pruning (8 fresh and 1 done)
| Pruned | Kept | Total | |
|---|---|---|---|
| New | 202 | 2 | 204 |
| Fresh | 2 | 6 | 8 |
| Picked | 0 | 1 | 1 |
| Done | 0 | 0 | 0 |
| Total | 204 | 9 | 213 |
| Status | Error | Program |
| 0.6b | (*.f32 (/.f32 (+.f32 alpha 1) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)) (/.f32 (+.f32 alpha -1) (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))))) | |
| 0.6b | (pow.f32 (*.f32 (pow.f32 (cbrt.f32 (fma.f32 alpha alpha -1)) 2) (*.f32 (cbrt.f32 (fma.f32 alpha alpha -1)) (/.f32 1 (log.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))))) 1) | |
| ✓ | 0.5b | (pow.f32 (/.f32 (fma.f32 alpha alpha -1) (+.f32 (log.f32 (*.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) (log.f32 (cbrt.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))))) 1) |
| 0.6b | (*.f32 (+.f32 alpha 1) (*.f32 (+.f32 alpha -1) (/.f32 1 (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))))) | |
| 0.5b | (*.f32 (fma.f32 alpha alpha -1) (/.f32 1 (*.f32 (log.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) | |
| 0.6b | (fma.f32 2 (/.f32 (*.f32 (*.f32 cosTheta cosTheta) (*.f32 alpha alpha)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (+.f32 (fma.f32 4 (/.f32 (*.f32 (pow.f32 alpha 6) (pow.f32 cosTheta 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (fma.f32 3 (*.f32 (/.f32 (*.f32 alpha alpha) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (pow.f32 cosTheta 4)) (fma.f32 4 (/.f32 (*.f32 (*.f32 alpha alpha) (pow.f32 cosTheta 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (/.f32 (*.f32 (pow.f32 cosTheta 4) (pow.f32 alpha 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha))))))) (fma.f32 (/.f32 alpha (PI.f32)) (/.f32 alpha (*.f32 2 (log.f32 alpha))) (-.f32 (/.f32 (/.f32 -1/2 (log.f32 alpha)) (PI.f32)) (+.f32 (fma.f32 (/.f32 (pow.f32 cosTheta 6) (*.f32 2 (log.f32 alpha))) (/.f32 (pow.f32 alpha 8) (PI.f32)) (fma.f32 6 (/.f32 (*.f32 (pow.f32 alpha 4) (pow.f32 cosTheta 6)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (+.f32 (/.f32 (pow.f32 cosTheta 6) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (fma.f32 3 (/.f32 (*.f32 (pow.f32 alpha 4) (pow.f32 cosTheta 4)) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))) (/.f32 (*.f32 cosTheta cosTheta) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))))))) (fma.f32 (/.f32 (*.f32 cosTheta cosTheta) (*.f32 2 (log.f32 alpha))) (/.f32 (pow.f32 alpha 4) (PI.f32)) (/.f32 (pow.f32 cosTheta 4) (*.f32 2 (*.f32 (PI.f32) (log.f32 alpha)))))))))) | |
| 0.6b | (pow.f32 (sqrt.f32 (/.f32 (fma.f32 alpha alpha -1) (*.f32 (*.f32 (PI.f32) (log.f32 (*.f32 alpha alpha))) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) 2) | |
| 0.5b | (pow.f32 (pow.f32 (/.f32 (log.f32 (pow.f32 (pow.f32 (*.f32 alpha alpha) (PI.f32)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1))) (fma.f32 alpha alpha -1)) -1) 1) | |
| 0.6b | (*.f32 (/.f32 (+.f32 alpha 1) (PI.f32)) (/.f32 (+.f32 alpha -1) (*.f32 (log.f32 (*.f32 alpha alpha)) (fma.f32 (fma.f32 alpha alpha -1) (*.f32 cosTheta cosTheta) 1)))) |
Compiled 14602 to 8424 computations (42.3% saved)
Total 0.4b remaining (90.1%)
Threshold costs 0.4b (90.1%)
Compiled 56483 to 37043 computations (34.4% saved)
| 1× | egg-herbie |
| 2× | *-commutative_binary32 |
| 1× | +-commutative_binary32 |
| 1× | distribute-neg-frac_binary32 |
| 1× | sub-neg_binary32 |
| 1× | neg-sub0_binary32 |
Useful iterations: 0 (0.0ms)
| Iter | Nodes | Cost |
|---|---|---|
| 0 | 18 | 60 |
| 1 | 25 | 60 |
| 2 | 27 | 60 |
| 3 | 28 | 60 |
| 4 | 26 | 60 |
Compiled 529 to 321 computations (39.3% saved)
Loading profile data...