-
Notifications
You must be signed in to change notification settings - Fork 628
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Test test_low_latency.py failed #15
Comments
Please provide additional information, including:
Furthermore, please share the results of the following reports, if available:
Alternatively, you can simply provide the output of the command: /path/to/nvshmem/dir/bin/nvshmem-info -a |
There is a line You can try to reduce the number of experts. |
We may add H20 compatible changes later. |
Hi Lyric, in addition to reducing the number of experts, is it possible to enable 288 experts by changing other parameters? and I also wonder if the H20 support change will be launched soon? Thanks! |
It is possible, and for an easier solution (I will update the mainstream later), you can simply change (2 places, both dispatch and combine) constexpr int kNumWarpsPerGroup = 10;
constexpr int kNumWarpGroups = 3; into constexpr int kNumWarpsPerGroup = 8;
constexpr int kNumWarpGroups = 4; and remove I don't have H20 for testing, if the performance is not good, please report and file an issue :) |
I encountered an issue while running the test
python test_low_latency.py
. The test is failing with the following error:However, another related test
tests/test_intranode.py
runs successfully and produces the expected output.Environment:
The text was updated successfully, but these errors were encountered: