difference between the paper and code #30

ttio2tech · 2023-04-09T23:44:11Z

It seems that in your paper the train dataset is 'InstructorDoctor-205k' but in this repo, from the training command, the dataset is 'HealthCareMagic-100k.json'
In the paper, the training was 'fine tuning on nstructorDoctor-205k (seems to be one step?)', but in this repo: 'Our model was firstly be fine-tuned by Stanford Alpaca's data to have some basic conversational capabilities.' does it mean the repo contains updated method?
Training time difference: paper - 18 hours. repo - 30 minutes
Can you help to provide some clarifications?
Thanks!

Kent0n-Li · 2023-04-10T01:31:06Z

We are still enhancing our model, once finished, we will update the details in our paper.

mehrdad-data · 2023-05-28T22:02:38Z

@KentOn-Li
@ttio2tech
@saharmor
hello, I have filled out the link several times, but I do not receive related weight files. Is there something missing here? (I had check my spam) My email is autogptuser(at)gmail(dot)com could you please send me the pre-trained weights? Thanks a lot.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

difference between the paper and code #30

difference between the paper and code #30

ttio2tech commented Apr 9, 2023

Kent0n-Li commented Apr 10, 2023

mehrdad-data commented May 28, 2023

difference between the paper and code #30

difference between the paper and code #30

Comments

ttio2tech commented Apr 9, 2023

Kent0n-Li commented Apr 10, 2023

mehrdad-data commented May 28, 2023