After running examples/quantization_2of4_sparse_w4a16/llama7b_sparse_w4a16.py, I obtained three models, one per stage: stage_sparsity, stage_finetuning, and stage_quantization. If an error occurs during stage_quantization, how can I resume from the saved stage_finetuning model instead of rerunning everything? Both stage_sparsity and stage_finetuning are resource-intensive, so restarting from the beginning would waste a lot of time and compute.
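To make the request concrete, here is a rough sketch of the resume logic I have in mind. It uses only the Python standard library and does not call any library API. It assumes (this is my assumption, not something the example guarantees) that each stage saves its model to a subdirectory of the output directory named after the stage, and that a non-empty directory means the stage finished. The helper names `last_completed_stage` and `stages_to_run` are hypothetical.

```python
from pathlib import Path

# Stage names as produced by the example, in execution order.
STAGES = ["stage_sparsity", "stage_finetuning", "stage_quantization"]


def last_completed_stage(output_dir, stages=STAGES):
    """Return the last stage whose directory exists and is non-empty, else None."""
    completed = None
    for name in stages:
        stage_dir = Path(output_dir) / name
        # Assumption: a non-empty directory means the stage saved its model.
        if stage_dir.is_dir() and any(stage_dir.iterdir()):
            completed = name
        else:
            # Later stages depend on earlier ones, so stop at the first gap.
            break
    return completed


def stages_to_run(output_dir, stages=STAGES):
    """Return the stages that still need to run, in order."""
    done = last_completed_stage(output_dir, stages)
    if done is None:
        return list(stages)
    return list(stages[stages.index(done) + 1:])
```

With stage_sparsity and stage_finetuning already saved, `stages_to_run(output_dir)` would return only `["stage_quantization"]`. A partially written stage_quantization directory left behind by the failed run would have to be deleted first, since this sketch treats any non-empty directory as complete.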