Pico-MAML: Learning Dynamics Research on Model-Agnostic Meta-Learning

Pico is a framework designed to facilitate research into language model learning dynamics through a comprehensive suite of small- to medium-scale models (1M–1B parameters). Built on a LLaMA-style architecture, Pico emphasizes simplicity, modularity, and research accessibility. This fork, developed for an MPhil project at Cambridge, studies few-shot learning and adaptation dynamics: we implement Subset Masked Language Modeling Tasks (SMLMT), an unsupervised approach that constructs meta-training tasks from unlabeled text, and then use Model-Agnostic Meta-Learning (MAML) to meta-train the model on those tasks.
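
To make the meta-training loop concrete, here is a minimal sketch of the standard MAML update (Finn et al., 2017): an inner loop adapts a copy of the parameters on a task's support set, and the outer loop backpropagates the adapted model's query loss to the original parameters. This is an illustration of the algorithm, not the trainer's actual code; the task structure and function names are hypothetical.

import torch

def maml_meta_step(model, loss_fn, tasks, inner_lr=1e-2, inner_steps=1):
    # One meta-update over a batch of tasks. Each task is a pair
    # ((support_x, support_y), (query_x, query_y)) of tensors.
    meta_loss = 0.0
    for (support_x, support_y), (query_x, query_y) in tasks:
        # Inner loop: adapt a functional copy of the parameters on the support set.
        fast = dict(model.named_parameters())
        for _ in range(inner_steps):
            support_loss = loss_fn(
                torch.func.functional_call(model, fast, (support_x,)), support_y)
            grads = torch.autograd.grad(
                support_loss, list(fast.values()), create_graph=True)
            fast = {name: p - inner_lr * g
                    for (name, p), g in zip(fast.items(), grads)}
        # Outer objective: the adapted parameters' loss on the held-out query set.
        meta_loss = meta_loss + loss_fn(
            torch.func.functional_call(model, fast, (query_x,)), query_y)
    # Calling .backward() on this loss differentiates through the inner
    # updates, which is what makes MAML second-order.
    return meta_loss / len(tasks)

Dropping create_graph=True gives first-order MAML (FOMAML), which is cheaper per step and often nearly as effective.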

🚀 Quick Start

  1. Clone the project:

     git clone https://github.com/DavidDemitriAfrica/pico-maml.git && cd pico-maml

  2. Configure the environment by creating a .env file:

     export HF_TOKEN=your_huggingface_token
     export WANDB_API_KEY=your_wandb_key

  3. Set up dependencies:

     source setup.sh

Exploring the Codebase

The core implementation is organized into these key files and packages:

  • src/model/pico.py: The heart of Pico

    • LLaMA-style transformer implementation
    • Attention mechanism with KV-cache
    • RoPE positional embeddings
    • Documentation references for each component
  • src/training/trainer.py: Training pipeline

    • Distributed training setup
    • Checkpoint management
    • Logging configuration
  • src/training/smlmt.py: SMLMT helper class (the task-construction idea is sketched just after this list)

    • Task generation
    • Task sampling
    • Injecting your own sentences
  • src/config: Model configuration

    • Hyperparameter definitions
    • Model architecture settings
    • Training parameters
  • src/checkpointing: Checkpointing and State Management

    • Training state persistence (model, optimizer, scheduler)
    • Learning dynamics tracking (activations, weights, gradients)
    • Evaluation results storage
    • Automatic storage of a Hugging Face-compatible version of the model for downstream use
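
As promised above, the task construction behind SMLMT (Bansal et al., 2020) is simple enough to sketch: sample N words from the corpus vocabulary, gather K sentences containing each word, mask that word out, and use the word's identity as the class label, turning unlabeled text into an N-way, K-shot classification task. The snippet below is an illustrative reimplementation of that idea, not the actual API of src/training/smlmt.py.

import random
from collections import defaultdict

MASK = "[MASK]"

def build_smlmt_task(sentences, n_way=3, k_shot=4, seed=0):
    # Build one N-way, K-shot SMLMT task from unlabeled sentences.
    # Returns (examples, labels): each example is a sentence with the target
    # word masked out; the label is the index of that word's class.
    rng = random.Random(seed)

    # Index sentences by the distinct words they contain.
    by_word = defaultdict(list)
    for s in sentences:
        for w in set(s.split()):
            by_word[w].append(s)

    # Keep words with at least k_shot supporting sentences; sample n_way of them.
    candidates = [w for w, sents in by_word.items() if len(sents) >= k_shot]
    class_words = rng.sample(candidates, n_way)

    examples, labels = [], []
    for label, w in enumerate(class_words):
        for s in rng.sample(by_word[w], k_shot):
            # A real implementation would mask at the tokenizer level;
            # whitespace tokens keep this sketch self-contained.
            examples.append(" ".join(MASK if tok == w else tok for tok in s.split()))
            labels.append(label)
    return examples, labels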

Common Starting Points

  1. Using pre-trained models:

     from transformers import AutoModelForCausalLM

     # Load a specific model size
     model = AutoModelForCausalLM.from_pretrained("pico-lm/[...]")

  2. Training your own suite:

     # Create a config yaml file, e.g. `my_config.yaml`
     # You can follow the provided demo template in configs/demo.yaml
     # If no config file is provided, the default config values are used
     poetry run train --config_path my_config.yaml
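
As a quick smoke test after loading a checkpoint, the standard transformers generation API applies; the model ID below is the same placeholder as above and must be replaced with a concrete entry from the suite.

from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "pico-lm/[...]"  # placeholder: substitute a concrete model size
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("The learning dynamics of small models", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))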

📚 References

Our implementation draws inspiration from and builds upon:

  • Pico (rdiehlmartinez/pico), the framework this repository forks
  • Finn, C., Abbeel, P., and Levine, S. (2017). Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. ICML 2017.
  • Bansal, T., Jha, R., Munkhdalai, T., and McCallum, A. (2020). Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks. EMNLP 2020.

🤝 Contributing

We welcome contributions in:

  • New features and improvements
  • Documentation and tutorials
  • Bug fixes and testing
  • Research findings and analysis

📝 License

Apache 2.0 License

📫 Contact

Citation

If you use Pico-MAML in your research, please cite:

@software{picomaml2025,
    author = {Africa, David Demitri},
    title = {Pico-MAML: Learning Dynamics Research on Model-Agnostic Meta-Learning},
    year = {2025},
}
