3 files changed: +55 -1 lines changed
+target_task: tasks/i18n/ja.md
+base_model_id: meta-llama/Meta-Llama-3-8B
+model_name: Llama-3-8B-ja-wikipedia-v0.1
+output_base_dir: output
+dataset_id: wikimedia/wikipedia
+dataset_load_config: 20231101.ja
+dataset_input_field_name: text
+dataset_train_split_seed: 42
+dataset_train_split_test_size: 0.2
+lora_r: 8
+lora_alpha: 16
+lora_dropout: 0.05
+train_claim_gpu_num: 8
+train_per_device_train_batch_size: 1
+train_gradient_accumulation_steps: 128
+train_num_train_epochs: 1
+inference_max_new_tokens: 32
+evaluations:
+  -
+    prompt: " 2つ以上の文字を組み合わせて図案化したもののことで、特にルイ・ヴィトンのものが知られるのは何でしょう?"
+    expected_output: " モノグラム"
+  -
+    prompt: " 幾つかの布切れを縫いあわせ、飾りや模様を作る手芸方法を何というでしょう?"
+    expected_output: " パッチワーク"
+  -
+    prompt: " 格闘家ボブ・サップの出身国はどこでしょう?"
+    expected_output: " アメリカ"
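For orientation, the batch-related and LoRA values in the config above combine as follows. This is a minimal sketch: it assumes `train_claim_gpu_num` is the number of data-parallel GPUs and that the low-rank update is scaled by `lora_alpha / lora_r` (the standard LoRA convention). Neither assumption is confirmed by the diff itself, and the helper names are hypothetical.

```python
# Values copied from the config above.
config = {
    "lora_r": 8,
    "lora_alpha": 16,
    "train_claim_gpu_num": 8,
    "train_per_device_train_batch_size": 1,
    "train_gradient_accumulation_steps": 128,
}


def effective_batch_size(cfg: dict) -> int:
    """Sequences contributing to one optimizer step (assumes pure data parallelism)."""
    return (
        cfg["train_claim_gpu_num"]
        * cfg["train_per_device_train_batch_size"]
        * cfg["train_gradient_accumulation_steps"]
    )


def lora_scaling(cfg: dict) -> float:
    """Standard LoRA scaling factor applied to the low-rank update: alpha / r."""
    return cfg["lora_alpha"] / cfg["lora_r"]


print(effective_batch_size(config))  # 1024
print(lora_scaling(config))          # 2.0
```

Under these assumptions each optimizer step sees 1024 sequences, which is why a per-device batch size of 1 is workable here.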
@@ -13,7 +13,7 @@ lora_dropout: 0.05
 train_claim_gpu_num: 8
 train_per_device_train_batch_size: 1
 train_gradient_accumulation_steps: 128
-train_num_train_epochs: 5
+train_num_train_epochs: 1
 inference_max_new_tokens: 32
 evaluations:
   -
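The effect of lowering `train_num_train_epochs` from 5 to 1 can be estimated roughly. The sketch below assumes pure data parallelism across `train_claim_gpu_num` GPUs and one optimizer step per accumulated batch, rounding a partial final batch up. `n_train_examples` is a placeholder, not the real size of any dataset, and `optimizer_steps` is a hypothetical helper; an actual trainer may round slightly differently.

```python
import math


def optimizer_steps(
    n_train_examples: int,
    epochs: int,
    gpus: int = 8,              # train_claim_gpu_num
    per_device_batch: int = 1,  # train_per_device_train_batch_size
    grad_accum: int = 128,      # train_gradient_accumulation_steps
) -> int:
    """Rough count of optimizer steps over the whole run."""
    per_step = gpus * per_device_batch * grad_accum
    return math.ceil(n_train_examples / per_step) * epochs


n_train_examples = 800_000  # placeholder value for illustration only

before = optimizer_steps(n_train_examples, epochs=5)
after = optimizer_steps(n_train_examples, epochs=1)
print(before, after)  # the run shrinks by a factor of 5
```

Whatever the real dataset size, the step count scales linearly with the epoch count, so this change cuts training time to about one fifth.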
+target_task: tasks/i18n/ja.md
+base_model_id: google/gemma-7b
+model_name: gemma-7b-ja-wikipedia-v0.1
+output_base_dir: output
+dataset_id: wikimedia/wikipedia
+dataset_load_config: 20231101.ja
+dataset_input_field_name: text
+dataset_train_split_seed: 42
+dataset_train_split_test_size: 0.2
+lora_r: 8
+lora_alpha: 16
+lora_dropout: 0.05
+train_claim_gpu_num: 8
+train_per_device_train_batch_size: 1
+train_gradient_accumulation_steps: 128
+train_num_train_epochs: 1
+inference_max_new_tokens: 32
+evaluations:
+  -
+    prompt: " 2つ以上の文字を組み合わせて図案化したもののことで、特にルイ・ヴィトンのものが知られるのは何でしょう?"
+    expected_output: " モノグラム"
+  -
+    prompt: " 幾つかの布切れを縫いあわせ、飾りや模様を作る手芸方法を何というでしょう?"
+    expected_output: " パッチワーク"
+  -
+    prompt: " 格闘家ボブ・サップの出身国はどこでしょう?"
+    expected_output: " アメリカ"
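The `evaluations` entries pair a prompt with an expected answer. How the pipeline scores them is not shown in this diff. The sketch below assumes a simple substring check after stripping whitespace from the expected answer; `generate` is a hypothetical stand-in for the model call, and `fake_generate` is a stub used only so the example runs without a model.

```python
from typing import Callable

# One entry copied from the config; the structure mirrors the YAML list.
evaluations = [
    {"prompt": " 格闘家ボブ・サップの出身国はどこでしょう?", "expected_output": " アメリカ"},
]


def run_evaluations(
    evals: list[dict], generate: Callable[[str], str]
) -> list[bool]:
    """Return one pass/fail flag per evaluation entry (assumed scoring rule)."""
    results = []
    for item in evals:
        completion = generate(item["prompt"])
        # Assumption: a pass means the expected answer appears in the completion.
        results.append(item["expected_output"].strip() in completion)
    return results


def fake_generate(prompt: str) -> str:
    """Stub standing in for a real model; always returns the same text."""
    return "アメリカ合衆国です。"


print(run_evaluations(evaluations, fake_generate))  # [True]
```

A substring check is lenient on purpose: with `inference_max_new_tokens: 32` the model may produce a full sentence around the answer rather than the bare word.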