GitHub - erictang000/GPU-BSW-Work-Stealing: Work Stealing for GPU Accelerated Smith-Waterman Algorithm using CUDA and OpenMP

GPU-BSW Work Stealing

In this project, we aimed to implement CPU work stealing from a shared work queue for the Smith-Waterman algorithm. Research has showed that the most optimal CPU implementation of Smith-Waterman is competitive with GPU implementations for an instruction set (SSE2) an implementation from 2013 - thus it is important to both be able to take advantage of the parallel computation ability of GPUs, and the lack of the need for transfer latency on CPU, in order to optimize the Smith-Waterman algorithm on high performance compute clusters, where the compute environment is heterogeneous.

For the project, we worked off of an implementation of GPU-BSW, which is a batched version of the Smith-Waterman algorithm, for running parallel alignment computations on GPU owning threads. For non GPU owning threads, we used a SIMD version of Smith-Waterman. We integrated the two by updating the kernel calls of the GPU-BSW library to pop work off a shared atomic work queue used by all of the currently running threads. We showed steady performance improvements with this split CPU/GPU approach to tackling Smith-Waterman, and showed how these two implementations can be used in conjunction with one another rather than independently.

In addition to this basic integration of CPU and GPU, we attempted additional optimizations, including work stealing on GPU owning threads, and tuning batch sizes for CPU and GPUs to take or block work off of the shared queue.

ADEPT (GPU-BSW)

GPU-BSW or GPU Batch Smith-Waterman is a GPU accelerated implementation of the Smith-Waterman alignment algorithm based on the ADEPT strategy hence also referenced as ADEPT. Implementation details of ADEPT can be found in the publication here: https://rdcu.be/b7fhY. ADEPT uses GPU's two level parallelism to perform multiple sequence alignments in batches while using fine grained parallelism to accelerate each individual alignment. Overall it provides several time faster performance in comparison to existing SIMD implementations for CPU, a comparative study with existing CPU and GPU methods has been provided in the publication mentioned above. ADEPT performs a complete smith-waterman alignment with affine gap penalities and can align both protein and DNA sequences.

ADEPT provides a driver function that separates CUDA code from the main application which enables easy use and integeration in existing applications, effectively providing a drop in replacement for CPU libraries. The driver also enables balancing of alignments across all the GPUs available on a system.

To Build:

mkdir build

cd build

cmake CMAKE_BUILD_TYPE=Release ..

make

To Execute DNA test run:

./program_gpu dna ../test-data/dna-reference.fasta ../test-data/dna-query.fasta ./out_file

To Execute Protein test run:

./program_gpu aa ../test-data/protein-reference.fasta ../test-data/protein-query.fasta ./out_file

Contact

If you need help modifying the library to match your specific use-case or for other issues and bug reports please open an issue or reach out at [email protected]

Citation

Awan, M.G., Deslippe, J., Buluc, A. et al. ADEPT: a domain independent sequence alignment strategy for gpu architectures. BMC Bioinformatics 21, 406 (2020). https://doi.org/10.1186/s12859-020-03720-1

License:

GPU accelerated Smith-Waterman for performing batch alignments (GPU-BSW) Copyright (c) 2019, The Regents of the University of California, through Lawrence Berkeley National Laboratory (subject to receipt of any required approvals from the U.S. Dept. of Energy). All rights reserved.

If you have questions about your rights to use or distribute this software, please contact Berkeley Lab's Intellectual Property Office at [email protected].

NOTICE. This Software was developed under funding from the U.S. Department of Energy and the U.S. Government consequently retains certain rights. As such, the U.S. Government has been granted for itself and others acting on its behalf a paid-up, nonexclusive, irrevocable, worldwide license in the Software to reproduce, distribute copies to the public, prepare derivative works, and perform publicly and display publicly, and to permit other to do so.

Name		Name	Last commit message	Last commit date
Latest commit History 269 Commits
.vscode		.vscode
additional_data		additional_data
regression_test		regression_test
results		results
test-data		test-data
test-scripts		test-scripts
.DS_Store		.DS_Store
.clang-format		.clang-format
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
README.md		README.md
alignments.cpp		alignments.cpp
alignments.hpp		alignments.hpp
driver.cpp		driver.cpp
driver.hpp		driver.hpp
driver_broken.cpp		driver_broken.cpp
driver_old.cpp		driver_old.cpp
kernel.cpp		kernel.cpp
kernel.hpp		kernel.hpp
license.txt		license.txt
main.cpp		main.cpp
project_report.pdf		project_report.pdf
run_3m_scaling.sh		run_3m_scaling.sh
ssw.c		ssw.c
ssw.h		ssw.h
ssw_cpp.cpp		ssw_cpp.cpp
ssw_cpp.h		ssw_cpp.h
utils.cpp		utils.cpp
utils.hpp		utils.hpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GPU-BSW Work Stealing

ADEPT (GPU-BSW)

To Build:

To Execute DNA test run:

To Execute Protein test run:

Contact

Citation

License:

About

Releases

Packages

Contributors 4

Languages

License

erictang000/GPU-BSW-Work-Stealing

Folders and files

Latest commit

History

Repository files navigation

GPU-BSW Work Stealing

ADEPT (GPU-BSW)

To Build:

To Execute DNA test run:

To Execute Protein test run:

Contact

Citation

License:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages