refactor the kernel
Signed-off-by: xiaoyao0115 <[email protected]>
xiaoyao0115 committed Nov 19, 2024
1 parent 35c9836 commit fd83104
Showing 2 changed files with 225 additions and 166 deletions.
2 changes: 0 additions & 2 deletions transformer_engine/pytorch/csrc/extensions.h
@@ -458,8 +458,6 @@ void thd_grad_correction(at::Tensor grad, const at::Tensor &grad_per_step,
at::Tensor thd_get_partitioned_indices(const at::Tensor &cu_seqlens, int total_tokens,
int world_size, int rank);

-__forceinline__ __device__ int binary_search(int target, int *array, int len);
-
/***************************************************************************************************
* multi_tensor_* kernels
**************************************************************************************************/