Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added erf(x) Float64 Julia implementation #491

Open
wants to merge 6 commits into
base: master
Choose a base branch
from

Conversation

AhmedYKadah
Copy link

Faster than current wrapper function call (including Float32 function call).
Uses algorithm based on https://github.com/ARM-software/optimized-routines/blob/master/math/erf.c

Copy link

codecov bot commented Mar 31, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 94.35%. Comparing base (f8fd782) to head (6efcec8).

Additional details and impacted files
@@            Coverage Diff             @@
##           master     #491      +/-   ##
==========================================
+ Coverage   94.11%   94.35%   +0.24%     
==========================================
  Files          14       14              
  Lines        2905     3064     +159     
==========================================
+ Hits         2734     2891     +157     
- Misses        171      173       +2     
Flag Coverage Δ
unittests 94.35% <100.00%> (+0.24%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@AhmedYKadah
Copy link
Author

Old:
Float 64
@benchmark SpecialFunctions.erf(data) setup=(data=6*rand(Float64)-3) samples=1000000

BenchmarkTools.Trial: 217729 samples with 1000 evaluations per sample.
Range (min … max): 6.300 ns … 283.700 ns ┊ GC (min … max): 0.00% … 0.00%
Time (median): 29.500 ns ┊ GC (median): 0.00%
Time (mean ± σ): 21.993 ns ± 13.209 ns ┊ GC (mean ± σ): 0.00% ± 0.00%

Float32
@benchmark SpecialFunctions.erf(data) setup=(data=6*rand(Float32)-3) samples=1000000

BenchmarkTools.Trial: 312732 samples with 1000 evaluations per sample.
Range (min … max): 4.300 ns … 125.100 ns ┊ GC (min … max): 0.00% … 0.00%
Time (median): 19.900 ns ┊ GC (median): 0.00%
Time (mean ± σ): 15.035 ns ± 7.951 ns ┊ GC (mean ± σ): 0.00% ± 0.00%

New:
Float64
@benchmark erf(data) setup=(data=6*rand(Float64)-3) samples=1000000

BenchmarkTools.Trial: 507504 samples with 1000 evaluations per sample.
Range (min … max): 5.400 ns … 4.890 μs ┊ GC (min … max): 0.00% … 0.00%
Time (median): 8.700 ns ┊ GC (median): 0.00%
Time (mean ± σ): 8.775 ns ± 9.855 ns ┊ GC (mean ± σ): 0.00% ± 0.00%

Float32
@benchmark Float32(erf(data)) setup=(data=6*rand(Float64)-3) samples=1000000

BenchmarkTools.Trial: 526797 samples with 1000 evaluations per sample.
Range (min … max): 5.400 ns … 195.500 ns ┊ GC (min … max): 0.00% … 0.00%
Time (median): 8.800 ns ┊ GC (median): 0.00%
Time (mean ± σ): 8.521 ns ± 2.236 ns ┊ GC (mean ± σ): 0.00% ± 0.00%

@AhmedYKadah
Copy link
Author

Float32 implementation available, but not faster than Float64 version due to a exp() call.
Float64 version still faster than old Float32.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant