monsterapi
/

mistral_7b_HalfEpoch_DolphinCoder

Model card Files Files and versions

Zangs3011 commited on Jan 20, 2024

Commit

a2d4dd9

·

verified ·

1 Parent(s): 94f00b0

Update README.md

Files changed (1) hide show

README.md +10 -9

README.md CHANGED Viewed

@@ -3,16 +3,16 @@ library_name: peft
 tags:
 - code
 - instruct
-- code-llama
 datasets:
 - cognitivecomputations/dolphin-coder
-base_model: codellama/CodeLlama-7b-hf
 license: apache-2.0
 ---
 ### Finetuning Overview:
-**Model Used:** codellama/CodeLlama-7b-hf
 **Dataset:** cognitivecomputations/dolphin-coder
@@ -25,21 +25,22 @@ license: apache-2.0
 With the utilization of [MonsterAPI](https://monsterapi.ai)'s [no-code LLM finetuner](https://monsterapi.ai/finetuning), this finetuning:
 - Was achieved with great cost-effectiveness.
-- Completed in a total duration of 15hr 31mins for 1 epochs using an A6000 48GB GPU.
-- Costed `$31.31` for the entire 1 epoch.
 #### Hyperparameters & Additional Details:
-- **Epochs:** 1
-- **Total Finetuning Cost:** $31.31
-- **Model Path:** codellama/CodeLlama-7b-hf
 - **Learning Rate:** 0.0002
 - **Data Split:** 100% train
 - **Gradient Accumulation Steps:** 128
 - **lora r:** 32
 - **lora alpha:** 64
-![Train Loss](https://cdn-uploads.huggingface.co/production/uploads/63ba46aa0a9866b28cb19a14/aNujXePogMlJZmoi1Bq56.png)
 ---
 license: apache-2.0

 tags:
 - code
 - instruct
+- mistral
 datasets:
 - cognitivecomputations/dolphin-coder
+base_model: mistralai/Mistral-7B-v0.1
 license: apache-2.0
 ---
 ### Finetuning Overview:
+**Model Used:** mistralai/Mistral-7B-v0.1
 **Dataset:** cognitivecomputations/dolphin-coder
 With the utilization of [MonsterAPI](https://monsterapi.ai)'s [no-code LLM finetuner](https://monsterapi.ai/finetuning), this finetuning:
 - Was achieved with great cost-effectiveness.
+- Completed in a total duration of 7hrs 36min for 0.1 epochs using an A6000 48GB GPU.
+- Costed `$15.2` for the entire run
 #### Hyperparameters & Additional Details:
+- **Epochs:** 0.1
+- **Cost for full run:** $15.2
+- **Model Path:** mistralai/Mistral-7B-v0.1
 - **Learning Rate:** 0.0002
 - **Data Split:** 100% train
 - **Gradient Accumulation Steps:** 128
 - **lora r:** 32
 - **lora alpha:** 64
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6313732454e6e5d9f0f797cd/0O1VKp3SJNfrhTd5earci.png)
 ---
 license: apache-2.0