2 files changed: +36 -0 lines changed

+ target_task: tasks/text-generation/text2sql.md
+ base_model_id: mistralai/Mistral-7B-v0.1
+ model_name: Mistral-7B-v0.1-coder-sql-en-v0.1
+ output_base_dir: output
+ dataset_id: b-mc2/sql-create-context
+ dataset_input_field_name: question
+ dataset_context_hint: Given the context, generate an SQL query that answers the question
+ dataset_context_field_name: context
+ dataset_output_field_name: answer
+ dataset_train_split_seed: 42
+ dataset_train_split_test_size: 0.2
+ lora_r: 8
+ lora_alpha: 16
+ lora_dropout: 0.05
+ train_claim_gpu_num: 8
+ train_per_device_train_batch_size: 1
+ train_gradient_accumulation_steps: 16
+ train_num_train_epochs: 4
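
Reviewer note: the text2sql config is the only one of the two that sets `dataset_context_hint` and `dataset_context_field_name`. The sketch below shows one way those keys might be combined with the input field into a single prompt. The `build_prompt` helper, the newline-joined layout, and the sample row are assumptions for illustration only; the diff does not show how the training script actually consumes these keys.

```python
# Hypothetical helper, not part of this change set.
CONFIG = {
    "dataset_input_field_name": "question",
    "dataset_context_hint": "Given the context, generate an SQL query that answers the question",
    "dataset_context_field_name": "context",
    "dataset_output_field_name": "answer",
}


def build_prompt(row: dict, config: dict) -> str:
    """Join hint, context, and input into one prompt (assumed layout)."""
    parts = []
    context_field = config.get("dataset_context_field_name")
    if context_field:
        # Only emit the hint when a context field is configured.
        parts.append(config.get("dataset_context_hint", ""))
        parts.append(row[context_field])
    parts.append(row[config["dataset_input_field_name"]])
    # Skip empty pieces so a missing hint does not leave a blank line.
    return "\n".join(p for p in parts if p)


# Made-up row shaped like the configured fields (not real dataset content).
row = {
    "question": "How many users are there?",
    "context": "CREATE TABLE users (id INTEGER)",
    "answer": "SELECT COUNT(*) FROM users",
}
prompt = build_prompt(row, CONFIG)
target = row[CONFIG["dataset_output_field_name"]]
```

A config without the two context keys, like the second file in this diff, would fall through to the input field alone under this sketch.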

+ target_task: tasks/nlp/translation.md
+ base_model_id: TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T
+ model_name: tinyllama-coder-trident-v0.1
+ output_base_dir: output
+ dataset_id: yuiseki/text2geoql
+ dataset_filter_field_name: input_type
+ dataset_filter_field_value: trident
+ dataset_input_field_name: input
+ dataset_output_field_name: output
+ dataset_train_split_seed: 42
+ dataset_train_split_test_size: 0.2
+ lora_r: 8
+ lora_alpha: 16
+ lora_dropout: 0.05
+ train_claim_gpu_num: 8
+ train_per_device_train_batch_size: 8
+ train_gradient_accumulation_steps: 2
+ train_num_train_epochs: 4
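
Reviewer note: the two configs use different per-device batch sizes and accumulation steps, but the products appear to match. The sketch below works through the arithmetic, assuming `train_claim_gpu_num` is the number of data-parallel workers actually used (the diff does not confirm this). The function names are hypothetical; the LoRA scaling factor uses the standard `alpha / r` definition.

```python
def effective_batch_size(gpus: int, per_device: int, grad_accum: int) -> int:
    """Samples contributing to one optimizer step under plain data parallelism."""
    return gpus * per_device * grad_accum


def lora_scaling(r: int, alpha: int) -> float:
    """Standard LoRA scaling applied to the low-rank update: alpha / r."""
    return alpha / r


# Values copied from the two config files in this diff.
mistral_sql = effective_batch_size(gpus=8, per_device=1, grad_accum=16)
tinyllama_trident = effective_batch_size(gpus=8, per_device=8, grad_accum=2)
scaling = lora_scaling(r=8, alpha=16)

print(mistral_sql, tinyllama_trident, scaling)  # 128 128 2.0
```

Under that assumption both runs step the optimizer on 128 samples at a time; the larger model trades per-device batch size for more accumulation steps, presumably to fit in memory.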