
Introduce export_llama to extension/llm as export_llm #11527

Open
@jackzhxng

Description


🚀 The feature, motivation and pitch

To generalize export_llama beyond Llama to transformer models broadly, as export_llm, the following directories and files in examples/models/llama will be moved to extension/llm:

  • examples/models/llama/config
  • examples/models/llama/source_transformation
  • examples/models/llama/tests
  • attention.py -> Will go into a new attention directory
  • export_llama_hydra.py -> this will be renamed to export_llm.py and will be the main entrypoint to the new API
  • export_llama_lib.py -> this will be renamed to export_llm_lib.py
  • fairseq2.py
  • hf_download.py
  • llama_transformer.py -> this will be renamed to transformer.py
  • model.py -> Llama2Model will be renamed to TransformerModel. The file name itself is also not ideal; this is an opportunity to rename it to something like eager.py, to better reflect its intent, which is to instantiate the eager model.
    • We could remove Llama-specific components, such as Fairseq2 support, from the version in extension/llm and have Llama2Model inherit from it.
  • model_args.py
  • norm.py
  • rope.py
  • static_attention.py -> Will go into a new attention directory
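The inheritance idea above (a generic model in extension/llm, with Llama2Model layering Llama-specific pieces on top) could look roughly like the following. This is a minimal sketch, not the actual API: all class bodies, the `load_state_dict` method, and the `.fairseq2.pt` suffix check are hypothetical placeholders for illustration.

```python
class TransformerModel:
    """Generic eager-model wrapper that would live in extension/llm."""

    def __init__(self, checkpoint_path: str):
        self.checkpoint_path = checkpoint_path

    def load_state_dict(self) -> dict:
        # Generic load path; the real code would load the checkpoint here.
        return {"source": "generic", "path": self.checkpoint_path}


class Llama2Model(TransformerModel):
    """Llama-specific subclass kept in examples/models/llama."""

    def load_state_dict(self) -> dict:
        state = super().load_state_dict()
        if self.checkpoint_path.endswith(".fairseq2.pt"):
            # Llama-only concern: Fairseq2-format checkpoint handling
            # stays out of the generic extension/llm version.
            state["source"] = "fairseq2"
        return state
```

The design point is that only the subclass in examples/models/llama knows about Fairseq2, so the extension/llm code carries no Llama-specific baggage.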

Key notes:

  • export_llama.py stays put; export_llama will still exist to preserve backward compatibility (BC), especially internally.
    • export_llama_args.py then also stays put, supporting the legacy CLI.
    • export_llama_hydra.py will become the new Hydra-powered export_llm and will live in extension/llm. Note that this means the export_llm API will not support the legacy CLI.
  • There are lots of internal callsites for functions in export_llama_lib, such as _prepare_for_llama_export and _export_llama. These will all need to be renamed and refactored.
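The BC note above could be realized as a thin shim: the legacy export_llama.py keeps its argparse CLI surface but delegates to the relocated export_llm API. This is a hypothetical sketch only; the function names, flags, and return value below are illustrative placeholders, not the actual interface.

```python
import argparse


def export_llm(model_name: str, output_path: str) -> str:
    # Stand-in for the new entrypoint that would live in extension/llm.
    return f"exported {model_name} to {output_path}"


def main(argv=None) -> str:
    # Legacy argparse CLI stays put in examples/models/llama, per the
    # BC note above; only the implementation moves.
    parser = argparse.ArgumentParser(prog="export_llama")
    parser.add_argument("--model", default="llama2")
    parser.add_argument("--output", default="llama.pte")
    args = parser.parse_args(argv)
    # Thin shim: forward the legacy-CLI arguments to export_llm.
    return export_llm(args.model, args.output)
```

Internal callsites of the old _prepare_for_llama_export / _export_llama helpers would migrate to the renamed functions, while this shim keeps the old command working during the transition.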

Alternatives

No response

Additional context

No response

RFC (Optional)

No response

cc @larryliu0820 @mergennachin @cccclai @helunwencser

Labels

module: llm (Issues related to LLM examples and apps, and to the extensions/llm/ code), triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
