File tree
3 files changed
+73 -9 lines changed
- tests
- trl
  - models
  - trainer
Diff for: tests/test_grpo_trainer.py
+35
Original file lines | Diff lines | Diff line change |
---|---|---|
498-500 | 498-500 | unchanged |
| 501-535 | + (35 lines added) |
Diff for: trl/models/utils.py
+23 -8
Original file lines | Diff lines | Diff line change |
---|---|---|
20-22 | 20-22 | unchanged |
| 23 | + (1 line added) |
23-25 | 24-26 | unchanged |
... | ... | ... |
37-39 | 38-40 | unchanged |
40-41 | | - (2 lines removed) |
42-44 | 41-43 | unchanged |
... | ... | ... |
176-178 | 175-177 | unchanged |
| 178 | + (1 line added) |
179-181 | 179-181 | unchanged |
182 | | - (1 line removed) |
183 | 182 | unchanged |
184-186 | | - (3 lines removed) |
| 183-203 | + (21 lines added) |
187-188 | 204-205 | unchanged |
189-190 | | - (2 lines removed) |
191-193 | 206-208 | unchanged |
Diff for: trl/trainer/grpo_trainer.py
+15 -1
Original file lines | Diff lines | Diff line change |
---|---|---|
51-53 | 51-53 | unchanged |
54 | | - (1 line removed) |
| 54 | + (1 line added) |
55-57 | 55-57 | unchanged |
... | ... | ... |
492-494 | 492-494 | unchanged |
| 495-508 | + (14 lines added) |
495-497 | 509-511 | unchanged |