finetune

me playing around with fine tuning.

Setup

Install pyenv
Install pytorch && torchtune
downloaded models from huggingface
- huggingface-cli download meta-llama/Llama-2-7b-hf --local-dir /home/marek/models/meta-llama/Llama-2-7b-hf --token=$HUGGING_FACE_TOKEN

Learnings:

Need to use the same tokenizer as the model was trained with
Need the data to be formatted in the same way as the model was trained.

want to download a dataset? goodluck finding any info on it. I used this. huggingface-cli download yahma/alpaca-cleaned --local-dir /home/marek/datasets/alpaca-cleaned/ --token=$HUGGING_FACE_TOKEN --repo-type dataset

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
llama3-1b		llama3-1b
llama_take2		llama_take2
qwen_take2		qwen_take2
rag		rag
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
custom_config.yaml		custom_config.yaml
custom_generation_config.yaml		custom_generation_config.yaml
custom_small_config.yaml		custom_small_config.yaml
generate_qwen.yaml		generate_qwen.yaml
main.py		main.py
qlora_config.yaml		qlora_config.yaml
quen2_5_lora_single_device_finetune.yaml		quen2_5_lora_single_device_finetune.yaml
test-dataset.txt		test-dataset.txt
test.yaml		test.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

finetune

Setup

Learnings:

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

null-channel/finetune

Folders and files

Latest commit

History

Repository files navigation

finetune

Setup

Learnings:

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages