Sample trimmed logistic on [-pi, pi]

Percentage Accurate: 98.9% → 98.9%
Time: 5.4s
Alternatives: 9
Speedup: 0.9×

Specification

?
\[\left(2.328306437 \cdot 10^{-10} \leq u \land u \leq 1\right) \land \left(0 \leq s \land s \leq 1.0651631\right)\]
\[\begin{array}{l} \\ \begin{array}{l} t_0 := \frac{1}{1 + e^{\frac{\pi}{s}}}\\ \left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - t\_0\right) + t\_0} - 1\right) \end{array} \end{array} \]
(FPCore (u s)
 :precision binary32
 (let* ((t_0 (/ 1.0 (+ 1.0 (exp (/ PI s))))))
   (*
    (- s)
    (log
     (-
      (/ 1.0 (+ (* u (- (/ 1.0 (+ 1.0 (exp (/ (- PI) s)))) t_0)) t_0))
      1.0)))))
float code(float u, float s) {
	float t_0 = 1.0f / (1.0f + expf((((float) M_PI) / s)));
	return -s * logf(((1.0f / ((u * ((1.0f / (1.0f + expf((-((float) M_PI) / s)))) - t_0)) + t_0)) - 1.0f));
}
function code(u, s)
	t_0 = Float32(Float32(1.0) / Float32(Float32(1.0) + exp(Float32(Float32(pi) / s))))
	return Float32(Float32(-s) * log(Float32(Float32(Float32(1.0) / Float32(Float32(u * Float32(Float32(Float32(1.0) / Float32(Float32(1.0) + exp(Float32(Float32(-Float32(pi)) / s)))) - t_0)) + t_0)) - Float32(1.0))))
end
function tmp = code(u, s)
	t_0 = single(1.0) / (single(1.0) + exp((single(pi) / s)));
	tmp = -s * log(((single(1.0) / ((u * ((single(1.0) / (single(1.0) + exp((-single(pi) / s)))) - t_0)) + t_0)) - single(1.0)));
end
\begin{array}{l}

\\
\begin{array}{l}
t_0 := \frac{1}{1 + e^{\frac{\pi}{s}}}\\
\left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - t\_0\right) + t\_0} - 1\right)
\end{array}
\end{array}

Local Percentage Accuracy vs ?

The average percentage accuracy by input value. Horizontal axis shows value of an input variable; the variable is choosen in the title. Vertical axis is accuracy; higher is better. Red represent the original program, while blue represents Herbie's suggestion. These can be toggled with buttons below the plot. The line is an average while dots represent individual samples.

Accuracy vs Speed?

Herbie found 9 alternatives:

AlternativeAccuracySpeedup
The accuracy (vertical axis) and speed (horizontal axis) of each alternatives. Up and to the right is better. The red square shows the initial program, and each blue circle shows an alternative.The line shows the best available speed-accuracy tradeoffs.

Initial Program: 98.9% accurate, 1.0× speedup?

\[\begin{array}{l} \\ \begin{array}{l} t_0 := \frac{1}{1 + e^{\frac{\pi}{s}}}\\ \left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - t\_0\right) + t\_0} - 1\right) \end{array} \end{array} \]
(FPCore (u s)
 :precision binary32
 (let* ((t_0 (/ 1.0 (+ 1.0 (exp (/ PI s))))))
   (*
    (- s)
    (log
     (-
      (/ 1.0 (+ (* u (- (/ 1.0 (+ 1.0 (exp (/ (- PI) s)))) t_0)) t_0))
      1.0)))))
float code(float u, float s) {
	float t_0 = 1.0f / (1.0f + expf((((float) M_PI) / s)));
	return -s * logf(((1.0f / ((u * ((1.0f / (1.0f + expf((-((float) M_PI) / s)))) - t_0)) + t_0)) - 1.0f));
}
function code(u, s)
	t_0 = Float32(Float32(1.0) / Float32(Float32(1.0) + exp(Float32(Float32(pi) / s))))
	return Float32(Float32(-s) * log(Float32(Float32(Float32(1.0) / Float32(Float32(u * Float32(Float32(Float32(1.0) / Float32(Float32(1.0) + exp(Float32(Float32(-Float32(pi)) / s)))) - t_0)) + t_0)) - Float32(1.0))))
end
function tmp = code(u, s)
	t_0 = single(1.0) / (single(1.0) + exp((single(pi) / s)));
	tmp = -s * log(((single(1.0) / ((u * ((single(1.0) / (single(1.0) + exp((-single(pi) / s)))) - t_0)) + t_0)) - single(1.0)));
end
\begin{array}{l}

\\
\begin{array}{l}
t_0 := \frac{1}{1 + e^{\frac{\pi}{s}}}\\
\left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - t\_0\right) + t\_0} - 1\right)
\end{array}
\end{array}

Alternative 1: 98.9% accurate, 0.9× speedup?

\[\begin{array}{l} \\ \begin{array}{l} t_0 := \frac{1}{e^{\frac{\pi}{s}} + 1}\\ \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - t\_0, u, t\_0\right)\right) \cdot -1\right)\right) \end{array} \end{array} \]
(FPCore (u s)
 :precision binary32
 (let* ((t_0 (/ 1.0 (+ (exp (/ PI s)) 1.0))))
   (*
    (- s)
    (log
     (expm1
      (*
       (log (fma (- (/ 1.0 (+ (exp (/ (- PI) s)) 1.0)) t_0) u t_0))
       -1.0))))))
float code(float u, float s) {
	float t_0 = 1.0f / (expf((((float) M_PI) / s)) + 1.0f);
	return -s * logf(expm1f((logf(fmaf(((1.0f / (expf((-((float) M_PI) / s)) + 1.0f)) - t_0), u, t_0)) * -1.0f)));
}
function code(u, s)
	t_0 = Float32(Float32(1.0) / Float32(exp(Float32(Float32(pi) / s)) + Float32(1.0)))
	return Float32(Float32(-s) * log(expm1(Float32(log(fma(Float32(Float32(Float32(1.0) / Float32(exp(Float32(Float32(-Float32(pi)) / s)) + Float32(1.0))) - t_0), u, t_0)) * Float32(-1.0)))))
end
\begin{array}{l}

\\
\begin{array}{l}
t_0 := \frac{1}{e^{\frac{\pi}{s}} + 1}\\
\left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - t\_0, u, t\_0\right)\right) \cdot -1\right)\right)
\end{array}
\end{array}
Derivation
  1. Initial program 98.9%

    \[\left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right) + \frac{1}{1 + e^{\frac{\pi}{s}}}} - 1\right) \]
  2. Applied rewrites98.8%

    \[\leadsto \left(-s\right) \cdot \log \color{blue}{\left(\mathsf{expm1}\left(\log \left(\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{e^{\frac{\pi}{s}} + 1}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)\right) \cdot -1\right)\right)} \]
  3. Add Preprocessing

Alternative 2: 98.8% accurate, 1.0× speedup?

\[\begin{array}{l} \\ \begin{array}{l} t_0 := \frac{1}{e^{\frac{\pi}{s}} + 1}\\ \left(-s\right) \cdot \log \left(\frac{1}{\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - t\_0, u, t\_0\right)} - 1\right) \end{array} \end{array} \]
(FPCore (u s)
 :precision binary32
 (let* ((t_0 (/ 1.0 (+ (exp (/ PI s)) 1.0))))
   (*
    (- s)
    (log
     (- (/ 1.0 (fma (- (/ 1.0 (+ (exp (/ (- PI) s)) 1.0)) t_0) u t_0)) 1.0)))))
float code(float u, float s) {
	float t_0 = 1.0f / (expf((((float) M_PI) / s)) + 1.0f);
	return -s * logf(((1.0f / fmaf(((1.0f / (expf((-((float) M_PI) / s)) + 1.0f)) - t_0), u, t_0)) - 1.0f));
}
function code(u, s)
	t_0 = Float32(Float32(1.0) / Float32(exp(Float32(Float32(pi) / s)) + Float32(1.0)))
	return Float32(Float32(-s) * log(Float32(Float32(Float32(1.0) / fma(Float32(Float32(Float32(1.0) / Float32(exp(Float32(Float32(-Float32(pi)) / s)) + Float32(1.0))) - t_0), u, t_0)) - Float32(1.0))))
end
\begin{array}{l}

\\
\begin{array}{l}
t_0 := \frac{1}{e^{\frac{\pi}{s}} + 1}\\
\left(-s\right) \cdot \log \left(\frac{1}{\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - t\_0, u, t\_0\right)} - 1\right)
\end{array}
\end{array}
Derivation
  1. Initial program 98.9%

    \[\left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right) + \frac{1}{1 + e^{\frac{\pi}{s}}}} - 1\right) \]
  2. Applied rewrites98.9%

    \[\leadsto \left(-s\right) \cdot \color{blue}{\log \left(\frac{1}{\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{e^{\frac{\pi}{s}} + 1}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)} - 1\right)} \]
  3. Add Preprocessing

Alternative 3: 97.5% accurate, 1.1× speedup?

\[\begin{array}{l} \\ \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right)\right) \cdot -1\right)\right) \end{array} \]
(FPCore (u s)
 :precision binary32
 (*
  (- s)
  (log
   (expm1
    (*
     (log
      (*
       u
       (-
        (/ 1.0 (+ 1.0 (exp (* -1.0 (/ PI s)))))
        (/ 1.0 (+ 1.0 (exp (/ PI s)))))))
     -1.0)))))
float code(float u, float s) {
	return -s * logf(expm1f((logf((u * ((1.0f / (1.0f + expf((-1.0f * (((float) M_PI) / s))))) - (1.0f / (1.0f + expf((((float) M_PI) / s))))))) * -1.0f)));
}
function code(u, s)
	return Float32(Float32(-s) * log(expm1(Float32(log(Float32(u * Float32(Float32(Float32(1.0) / Float32(Float32(1.0) + exp(Float32(Float32(-1.0) * Float32(Float32(pi) / s))))) - Float32(Float32(1.0) / Float32(Float32(1.0) + exp(Float32(Float32(pi) / s))))))) * Float32(-1.0)))))
end
\begin{array}{l}

\\
\left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right)\right) \cdot -1\right)\right)
\end{array}
Derivation
  1. Initial program 98.9%

    \[\left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right) + \frac{1}{1 + e^{\frac{\pi}{s}}}} - 1\right) \]
  2. Applied rewrites98.8%

    \[\leadsto \left(-s\right) \cdot \log \color{blue}{\left(\mathsf{expm1}\left(\log \left(\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{e^{\frac{\pi}{s}} + 1}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)\right) \cdot -1\right)\right)} \]
  3. Taylor expanded in u around inf

    \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \color{blue}{\left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right)} \cdot -1\right)\right) \]
  4. Step-by-step derivation
    1. lower-*.f32N/A

      \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \color{blue}{\left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)}\right) \cdot -1\right)\right) \]
    2. lower--.f32N/A

      \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \color{blue}{\frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}}\right)\right) \cdot -1\right)\right) \]
    3. lower-/.f32N/A

      \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{\color{blue}{1}}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right) \cdot -1\right)\right) \]
    4. lower-+.f32N/A

      \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right) \cdot -1\right)\right) \]
    5. lower-exp.f32N/A

      \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right) \cdot -1\right)\right) \]
    6. lower-*.f32N/A

      \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right) \cdot -1\right)\right) \]
    7. lift-/.f32N/A

      \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right) \cdot -1\right)\right) \]
    8. lift-PI.f32N/A

      \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\pi}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right) \cdot -1\right)\right) \]
  5. Applied rewrites97.4%

    \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \color{blue}{\left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right)\right)} \cdot -1\right)\right) \]
  6. Add Preprocessing

Alternative 4: 97.4% accurate, 1.4× speedup?

\[\begin{array}{l} \\ \left(-s\right) \cdot \log \left(\frac{1}{\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{e^{\frac{\pi}{s}} + 1}\right) \cdot u} - 1\right) \end{array} \]
(FPCore (u s)
 :precision binary32
 (*
  (- s)
  (log
   (-
    (/
     1.0
     (*
      (- (/ 1.0 (+ (exp (/ (- PI) s)) 1.0)) (/ 1.0 (+ (exp (/ PI s)) 1.0)))
      u))
    1.0))))
float code(float u, float s) {
	return -s * logf(((1.0f / (((1.0f / (expf((-((float) M_PI) / s)) + 1.0f)) - (1.0f / (expf((((float) M_PI) / s)) + 1.0f))) * u)) - 1.0f));
}
function code(u, s)
	return Float32(Float32(-s) * log(Float32(Float32(Float32(1.0) / Float32(Float32(Float32(Float32(1.0) / Float32(exp(Float32(Float32(-Float32(pi)) / s)) + Float32(1.0))) - Float32(Float32(1.0) / Float32(exp(Float32(Float32(pi) / s)) + Float32(1.0)))) * u)) - Float32(1.0))))
end
function tmp = code(u, s)
	tmp = -s * log(((single(1.0) / (((single(1.0) / (exp((-single(pi) / s)) + single(1.0))) - (single(1.0) / (exp((single(pi) / s)) + single(1.0)))) * u)) - single(1.0)));
end
\begin{array}{l}

\\
\left(-s\right) \cdot \log \left(\frac{1}{\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{e^{\frac{\pi}{s}} + 1}\right) \cdot u} - 1\right)
\end{array}
Derivation
  1. Initial program 98.9%

    \[\left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right) + \frac{1}{1 + e^{\frac{\pi}{s}}}} - 1\right) \]
  2. Taylor expanded in u around inf

    \[\leadsto \left(-s\right) \cdot \log \left(\frac{1}{\color{blue}{u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)}} - 1\right) \]
  3. Applied rewrites97.5%

    \[\leadsto \left(-s\right) \cdot \log \left(\frac{1}{\color{blue}{\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{e^{\frac{\pi}{s}} + 1}\right) \cdot u}} - 1\right) \]
  4. Add Preprocessing

Alternative 5: 37.8% accurate, 1.6× speedup?

\[\begin{array}{l} \\ \left(-s\right) \cdot \log \left(\frac{1}{\mathsf{fma}\left(0.5 - \frac{1}{2 + \frac{\pi}{s}}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)} - 1\right) \end{array} \]
(FPCore (u s)
 :precision binary32
 (*
  (- s)
  (log
   (-
    (/
     1.0
     (fma (- 0.5 (/ 1.0 (+ 2.0 (/ PI s)))) u (/ 1.0 (+ (exp (/ PI s)) 1.0))))
    1.0))))
float code(float u, float s) {
	return -s * logf(((1.0f / fmaf((0.5f - (1.0f / (2.0f + (((float) M_PI) / s)))), u, (1.0f / (expf((((float) M_PI) / s)) + 1.0f)))) - 1.0f));
}
function code(u, s)
	return Float32(Float32(-s) * log(Float32(Float32(Float32(1.0) / fma(Float32(Float32(0.5) - Float32(Float32(1.0) / Float32(Float32(2.0) + Float32(Float32(pi) / s)))), u, Float32(Float32(1.0) / Float32(exp(Float32(Float32(pi) / s)) + Float32(1.0))))) - Float32(1.0))))
end
\begin{array}{l}

\\
\left(-s\right) \cdot \log \left(\frac{1}{\mathsf{fma}\left(0.5 - \frac{1}{2 + \frac{\pi}{s}}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)} - 1\right)
\end{array}
Derivation
  1. Initial program 98.9%

    \[\left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right) + \frac{1}{1 + e^{\frac{\pi}{s}}}} - 1\right) \]
  2. Applied rewrites98.9%

    \[\leadsto \left(-s\right) \cdot \color{blue}{\log \left(\frac{1}{\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{e^{\frac{\pi}{s}} + 1}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)} - 1\right)} \]
  3. Taylor expanded in s around inf

    \[\leadsto \left(-s\right) \cdot \log \left(\frac{1}{\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{\color{blue}{2 + \frac{\mathsf{PI}\left(\right)}{s}}}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)} - 1\right) \]
  4. Step-by-step derivation
    1. lower-+.f32N/A

      \[\leadsto \left(-s\right) \cdot \log \left(\frac{1}{\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{2 + \color{blue}{\frac{\mathsf{PI}\left(\right)}{s}}}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)} - 1\right) \]
    2. lift-/.f32N/A

      \[\leadsto \left(-s\right) \cdot \log \left(\frac{1}{\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{2 + \frac{\mathsf{PI}\left(\right)}{\color{blue}{s}}}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)} - 1\right) \]
    3. lift-PI.f3295.1

      \[\leadsto \left(-s\right) \cdot \log \left(\frac{1}{\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{2 + \frac{\pi}{s}}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)} - 1\right) \]
  5. Applied rewrites95.1%

    \[\leadsto \left(-s\right) \cdot \log \left(\frac{1}{\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{\color{blue}{2 + \frac{\pi}{s}}}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)} - 1\right) \]
  6. Taylor expanded in s around inf

    \[\leadsto \left(-s\right) \cdot \log \left(\frac{1}{\mathsf{fma}\left(\color{blue}{\frac{1}{2}} - \frac{1}{2 + \frac{\pi}{s}}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)} - 1\right) \]
  7. Step-by-step derivation
    1. Applied rewrites37.8%

      \[\leadsto \left(-s\right) \cdot \log \left(\frac{1}{\mathsf{fma}\left(\color{blue}{0.5} - \frac{1}{2 + \frac{\pi}{s}}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)} - 1\right) \]
    2. Add Preprocessing

    Alternative 6: 37.2% accurate, 1.7× speedup?

    \[\begin{array}{l} \\ \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(0.5 - \frac{1}{1 + e^{\frac{\pi}{s}}}\right)\right) \cdot -1\right)\right) \end{array} \]
    (FPCore (u s)
     :precision binary32
     (*
      (- s)
      (log (expm1 (* (log (* u (- 0.5 (/ 1.0 (+ 1.0 (exp (/ PI s))))))) -1.0)))))
    float code(float u, float s) {
    	return -s * logf(expm1f((logf((u * (0.5f - (1.0f / (1.0f + expf((((float) M_PI) / s))))))) * -1.0f)));
    }
    
    function code(u, s)
    	return Float32(Float32(-s) * log(expm1(Float32(log(Float32(u * Float32(Float32(0.5) - Float32(Float32(1.0) / Float32(Float32(1.0) + exp(Float32(Float32(pi) / s))))))) * Float32(-1.0)))))
    end
    
    \begin{array}{l}
    
    \\
    \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(0.5 - \frac{1}{1 + e^{\frac{\pi}{s}}}\right)\right) \cdot -1\right)\right)
    \end{array}
    
    Derivation
    1. Initial program 98.9%

      \[\left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right) + \frac{1}{1 + e^{\frac{\pi}{s}}}} - 1\right) \]
    2. Applied rewrites98.8%

      \[\leadsto \left(-s\right) \cdot \log \color{blue}{\left(\mathsf{expm1}\left(\log \left(\mathsf{fma}\left(\frac{1}{e^{\frac{-\pi}{s}} + 1} - \frac{1}{e^{\frac{\pi}{s}} + 1}, u, \frac{1}{e^{\frac{\pi}{s}} + 1}\right)\right) \cdot -1\right)\right)} \]
    3. Taylor expanded in u around inf

      \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \color{blue}{\left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right)} \cdot -1\right)\right) \]
    4. Step-by-step derivation
      1. lower-*.f32N/A

        \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \color{blue}{\left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)}\right) \cdot -1\right)\right) \]
      2. lower--.f32N/A

        \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \color{blue}{\frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}}\right)\right) \cdot -1\right)\right) \]
      3. lower-/.f32N/A

        \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{\color{blue}{1}}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right) \cdot -1\right)\right) \]
      4. lower-+.f32N/A

        \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right) \cdot -1\right)\right) \]
      5. lower-exp.f32N/A

        \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right) \cdot -1\right)\right) \]
      6. lower-*.f32N/A

        \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right) \cdot -1\right)\right) \]
      7. lift-/.f32N/A

        \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\mathsf{PI}\left(\right)}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right) \cdot -1\right)\right) \]
      8. lift-PI.f32N/A

        \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\pi}{s}}} - \frac{1}{1 + e^{\frac{\mathsf{PI}\left(\right)}{s}}}\right)\right) \cdot -1\right)\right) \]
    5. Applied rewrites97.4%

      \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \color{blue}{\left(u \cdot \left(\frac{1}{1 + e^{-1 \cdot \frac{\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right)\right)} \cdot -1\right)\right) \]
    6. Taylor expanded in s around inf

      \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(\frac{1}{2} - \frac{\color{blue}{1}}{1 + e^{\frac{\pi}{s}}}\right)\right) \cdot -1\right)\right) \]
    7. Step-by-step derivation
      1. Applied rewrites37.2%

        \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{expm1}\left(\log \left(u \cdot \left(0.5 - \frac{\color{blue}{1}}{1 + e^{\frac{\pi}{s}}}\right)\right) \cdot -1\right)\right) \]
      2. Add Preprocessing

      Alternative 7: 25.1% accurate, 5.1× speedup?

      \[\begin{array}{l} \\ \left(-s\right) \cdot \log \left(1 + \frac{\pi}{s}\right) \end{array} \]
      (FPCore (u s) :precision binary32 (* (- s) (log (+ 1.0 (/ PI s)))))
      float code(float u, float s) {
      	return -s * logf((1.0f + (((float) M_PI) / s)));
      }
      
      function code(u, s)
      	return Float32(Float32(-s) * log(Float32(Float32(1.0) + Float32(Float32(pi) / s))))
      end
      
      function tmp = code(u, s)
      	tmp = -s * log((single(1.0) + (single(pi) / s)));
      end
      
      \begin{array}{l}
      
      \\
      \left(-s\right) \cdot \log \left(1 + \frac{\pi}{s}\right)
      \end{array}
      
      Derivation
      1. Initial program 98.9%

        \[\left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right) + \frac{1}{1 + e^{\frac{\pi}{s}}}} - 1\right) \]
      2. Taylor expanded in s around inf

        \[\leadsto \left(-s\right) \cdot \log \color{blue}{\left(1 + -4 \cdot \frac{u \cdot \left(\frac{1}{4} \cdot \mathsf{PI}\left(\right) - \frac{-1}{4} \cdot \mathsf{PI}\left(\right)\right) - \frac{1}{4} \cdot \mathsf{PI}\left(\right)}{s}\right)} \]
      3. Step-by-step derivation
        1. +-commutativeN/A

          \[\leadsto \left(-s\right) \cdot \log \left(-4 \cdot \frac{u \cdot \left(\frac{1}{4} \cdot \mathsf{PI}\left(\right) - \frac{-1}{4} \cdot \mathsf{PI}\left(\right)\right) - \frac{1}{4} \cdot \mathsf{PI}\left(\right)}{s} + \color{blue}{1}\right) \]
        2. *-commutativeN/A

          \[\leadsto \left(-s\right) \cdot \log \left(\frac{u \cdot \left(\frac{1}{4} \cdot \mathsf{PI}\left(\right) - \frac{-1}{4} \cdot \mathsf{PI}\left(\right)\right) - \frac{1}{4} \cdot \mathsf{PI}\left(\right)}{s} \cdot -4 + 1\right) \]
        3. lower-fma.f32N/A

          \[\leadsto \left(-s\right) \cdot \log \left(\mathsf{fma}\left(\frac{u \cdot \left(\frac{1}{4} \cdot \mathsf{PI}\left(\right) - \frac{-1}{4} \cdot \mathsf{PI}\left(\right)\right) - \frac{1}{4} \cdot \mathsf{PI}\left(\right)}{s}, \color{blue}{-4}, 1\right)\right) \]
      4. Applied rewrites24.9%

        \[\leadsto \left(-s\right) \cdot \log \color{blue}{\left(\mathsf{fma}\left(\frac{\mathsf{fma}\left(\pi \cdot 0.5, u, -0.25 \cdot \pi\right)}{s}, -4, 1\right)\right)} \]
      5. Taylor expanded in u around 0

        \[\leadsto \left(-s\right) \cdot \log \left(1 + \color{blue}{\frac{\mathsf{PI}\left(\right)}{s}}\right) \]
      6. Step-by-step derivation
        1. lower-+.f32N/A

          \[\leadsto \left(-s\right) \cdot \log \left(1 + \frac{\mathsf{PI}\left(\right)}{\color{blue}{s}}\right) \]
        2. lift-/.f32N/A

          \[\leadsto \left(-s\right) \cdot \log \left(1 + \frac{\mathsf{PI}\left(\right)}{s}\right) \]
        3. lift-PI.f3225.1

          \[\leadsto \left(-s\right) \cdot \log \left(1 + \frac{\pi}{s}\right) \]
      7. Applied rewrites25.1%

        \[\leadsto \left(-s\right) \cdot \log \left(1 + \color{blue}{\frac{\pi}{s}}\right) \]
      8. Add Preprocessing

      Alternative 8: 11.6% accurate, 5.9× speedup?

      \[\begin{array}{l} \\ \mathsf{fma}\left(\pi \cdot 0.5, u, -0.25 \cdot \pi\right) \cdot 4 \end{array} \]
      (FPCore (u s) :precision binary32 (* (fma (* PI 0.5) u (* -0.25 PI)) 4.0))
      float code(float u, float s) {
      	return fmaf((((float) M_PI) * 0.5f), u, (-0.25f * ((float) M_PI))) * 4.0f;
      }
      
      function code(u, s)
      	return Float32(fma(Float32(Float32(pi) * Float32(0.5)), u, Float32(Float32(-0.25) * Float32(pi))) * Float32(4.0))
      end
      
      \begin{array}{l}
      
      \\
      \mathsf{fma}\left(\pi \cdot 0.5, u, -0.25 \cdot \pi\right) \cdot 4
      \end{array}
      
      Derivation
      1. Initial program 98.9%

        \[\left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right) + \frac{1}{1 + e^{\frac{\pi}{s}}}} - 1\right) \]
      2. Taylor expanded in s around inf

        \[\leadsto \color{blue}{4 \cdot \left(u \cdot \left(\frac{1}{4} \cdot \mathsf{PI}\left(\right) - \frac{-1}{4} \cdot \mathsf{PI}\left(\right)\right) - \frac{1}{4} \cdot \mathsf{PI}\left(\right)\right)} \]
      3. Step-by-step derivation
        1. *-commutativeN/A

          \[\leadsto \left(u \cdot \left(\frac{1}{4} \cdot \mathsf{PI}\left(\right) - \frac{-1}{4} \cdot \mathsf{PI}\left(\right)\right) - \frac{1}{4} \cdot \mathsf{PI}\left(\right)\right) \cdot \color{blue}{4} \]
        2. lower-*.f32N/A

          \[\leadsto \left(u \cdot \left(\frac{1}{4} \cdot \mathsf{PI}\left(\right) - \frac{-1}{4} \cdot \mathsf{PI}\left(\right)\right) - \frac{1}{4} \cdot \mathsf{PI}\left(\right)\right) \cdot \color{blue}{4} \]
      4. Applied rewrites11.6%

        \[\leadsto \color{blue}{\mathsf{fma}\left(\pi \cdot 0.5, u, -0.25 \cdot \pi\right) \cdot 4} \]
      5. Add Preprocessing

      Alternative 9: 11.4% accurate, 46.3× speedup?

      \[\begin{array}{l} \\ -\pi \end{array} \]
      (FPCore (u s) :precision binary32 (- PI))
      float code(float u, float s) {
      	return -((float) M_PI);
      }
      
      function code(u, s)
      	return Float32(-Float32(pi))
      end
      
      function tmp = code(u, s)
      	tmp = -single(pi);
      end
      
      \begin{array}{l}
      
      \\
      -\pi
      \end{array}
      
      Derivation
      1. Initial program 98.9%

        \[\left(-s\right) \cdot \log \left(\frac{1}{u \cdot \left(\frac{1}{1 + e^{\frac{-\pi}{s}}} - \frac{1}{1 + e^{\frac{\pi}{s}}}\right) + \frac{1}{1 + e^{\frac{\pi}{s}}}} - 1\right) \]
      2. Taylor expanded in u around 0

        \[\leadsto \color{blue}{-1 \cdot \mathsf{PI}\left(\right)} \]
      3. Step-by-step derivation
        1. mul-1-negN/A

          \[\leadsto \mathsf{neg}\left(\mathsf{PI}\left(\right)\right) \]
        2. lift-neg.f32N/A

          \[\leadsto -\mathsf{PI}\left(\right) \]
        3. lift-PI.f3211.4

          \[\leadsto -\pi \]
      4. Applied rewrites11.4%

        \[\leadsto \color{blue}{-\pi} \]
      5. Add Preprocessing

      Reproduce

      ?
      herbie shell --seed 2025119 
      (FPCore (u s)
        :name "Sample trimmed logistic on [-pi, pi]"
        :precision binary32
        :pre (and (and (<= 2.328306437e-10 u) (<= u 1.0)) (and (<= 0.0 s) (<= s 1.0651631)))
        (* (- s) (log (- (/ 1.0 (+ (* u (- (/ 1.0 (+ 1.0 (exp (/ (- PI) s)))) (/ 1.0 (+ 1.0 (exp (/ PI s)))))) (/ 1.0 (+ 1.0 (exp (/ PI s)))))) 1.0))))