ggml : implement GEGLU_ERF and GEGLU_QUICK ops #14445

CISC · 2025-06-29T14:46:57Z

Complimentary to the other GLU ops, used in mtmd.

Implemented for all currently GLU supported backends, ~~except GEGLU_ERF in Vulkan due to missing erf~~.

qnixsynapse

SYCL code LGTM :)

jeffbolznv

Vulkan code looks fine. I wonder how much precision erf needs and if we could just handcode something in the Vulkan shader.

CISC · 2025-06-29T19:33:25Z

Vulkan code looks fine. I wonder how much precision erf needs and if we could just handcode something in the Vulkan shader.

The following seems to work for Metal, so should be good enough for Vulkan:

llama.cpp/ggml/src/ggml-metal/ggml-metal.metal

Lines 1099 to 1115 in 38593bc

    
           // based on Abramowitz and Stegun formula 7.1.26 or similar Hastings' approximation 
        
           // ref: https://www.johndcook.com/blog/python_erf/ 
        
           constant float p_erf  = 0.3275911f; 
        
           constant float a1_erf = 0.254829592f; 
        
           constant float a2_erf = -0.284496736f; 
        
           constant float a3_erf = 1.421413741f; 
        
           constant float a4_erf = -1.453152027f; 
        
           constant float a5_erf = 1.061405429f; 
        
           template<typename T> 
        
           T erf_approx(T x) { 
        
               T sign_x = sign(x); 
        
               x = fabs(x); 
        
               T t = 1.0f / (1.0f + p_erf * x); 
        
               T y = 1.0f - (((((a5_erf * t + a4_erf) * t) + a3_erf) * t + a2_erf) * t + a1_erf) * t * exp(-x * x); 
        
               return sign_x * y; 
        
           }

Edit: I'll give it a try.

0cc4m

The Vulkan code works on all of my devices.

CISC · 2025-07-01T08:50:21Z

@lhez I'll add OpenCL too, pending results of #14476

lhez · 2025-07-02T03:48:08Z

@CISC OpenCL looks good and works for me.

implement GEGLU_ERF and GEGLU_QUICK ops

6be1307

CISC requested a review from ggerganov June 29, 2025 14:47

CISC requested review from ngxson, jeffbolznv and qnixsynapse June 29, 2025 14:47

qnixsynapse approved these changes Jun 29, 2025

View reviewed changes

fix cut'n'paste error

38593bc

jeffbolznv approved these changes Jun 29, 2025

View reviewed changes

add GEGLU_ERF for vulkan

d5e4a58

CISC requested a review from jeffbolznv June 29, 2025 20:28

0cc4m self-requested a review June 30, 2025 05:40

0cc4m approved these changes Jul 1, 2025

View reviewed changes

CISC added 3 commits July 1, 2025 13:54

poke

d03f958

Merge branch 'master' into cisc/geglu-erf-quick

95e15ae

add GEGLU_ERF and GEGLU_QUICK for opencl

a798eff

CISC requested a review from max-krasnyansky July 1, 2025 12:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ggml : implement GEGLU_ERF and GEGLU_QUICK ops #14445

ggml : implement GEGLU_ERF and GEGLU_QUICK ops #14445

CISC commented Jun 29, 2025 •

edited

Loading

Uh oh!

qnixsynapse left a comment

Uh oh!

jeffbolznv left a comment

Uh oh!

CISC commented Jun 29, 2025 •

edited

Loading

Uh oh!

0cc4m left a comment

Uh oh!

CISC commented Jul 1, 2025

Uh oh!

lhez commented Jul 2, 2025

Uh oh!

Uh oh!

ggml : implement GEGLU_ERF and GEGLU_QUICK ops #14445

Are you sure you want to change the base?

ggml : implement GEGLU_ERF and GEGLU_QUICK ops #14445

Conversation

CISC commented Jun 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

qnixsynapse left a comment

Choose a reason for hiding this comment

Uh oh!

jeffbolznv left a comment

Choose a reason for hiding this comment

Uh oh!

CISC commented Jun 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

0cc4m left a comment

Choose a reason for hiding this comment

Uh oh!

CISC commented Jul 1, 2025

Uh oh!

lhez commented Jul 2, 2025

Uh oh!

Uh oh!

CISC commented Jun 29, 2025 •

edited

Loading

CISC commented Jun 29, 2025 •

edited

Loading