Pull requests: AI-Hypercomputer/maxtext

Pull requests list

fix mmlu api
#1868 opened Jun 24, 2025 by shuningjin Draft
4 tasks done
Migrate AttentionOp to use NNX
#1867 opened Jun 24, 2025 by bvandermoon
4 tasks done
Small update to keep elastic training working
#1866 opened Jun 23, 2025 by lukebaumann Draft
4 tasks done
All Gather Once (FSDP Zero-one)
#1865 opened Jun 23, 2025 by wei879-100 Draft
4 tasks
Test notify changes
#1864 opened Jun 23, 2025 by quoctruong Draft
4 tasks done
[maxtext] improve profiling in decode
#1863 opened Jun 23, 2025 by copybara-service bot
Add DataLoader
#1862 opened Jun 23, 2025 by SurbhiJainUSC
4 tasks done
[DEBUGGING] Debugging flaky tests
#1861 opened Jun 22, 2025 by gobbleturk
4 tasks
[DRAFT] Qwen3 0.6b
#1858 opened Jun 21, 2025 by shralex
4 tasks
[WIP] Add user guide to customize model
#1857 opened Jun 21, 2025 by RissyRan Draft
4 tasks done
Use forked splash_attention_kernel
#1856 opened Jun 20, 2025 by copybara-service bot
Refactor: Decouple Core Transformer Blocks
#1852 opened Jun 19, 2025 by parambole
4 tasks done
Fork Splash Attention kernel to MaxText
#1851 opened Jun 19, 2025 by copybara-service bot
Allow moe_test.py to be run on internal tools.
#1847 opened Jun 18, 2025 by copybara-service bot
Resolve pylint errors in moe.py.
#1846 opened Jun 18, 2025 by copybara-service bot
Get Linter / CPU tests to succeed
#1844 opened Jun 18, 2025 by SamuelMarks
4 tasks done
Enable Checkpoint Conversion from Huggingface to Maxtext
#1839 opened Jun 16, 2025 by YixuanWang-99
4 tasks
pyconfig → pydantic
#1836 opened Jun 15, 2025 by SamuelMarks Draft
4 tasks done
Add Qwen3
#1835 opened Jun 14, 2025 by bzantium
4 tasks
Refactor profiler in trainers (label: pull ready)
#1833 opened Jun 14, 2025 by SurbhiJainUSC
4 tasks done
JetStream Offline Engine
#1829 opened Jun 13, 2025 by wenxindongwork
4 tasks done