Update README.md
    	
        README.md
    CHANGED
    
    | @@ -1,50 +1,22 @@ | |
| 1 | 
             
            ---
         | 
| 2 | 
             
            base_model:
         | 
| 3 | 
            -
            -  | 
| 4 | 
             
            - NeverSleep/Lumimaid-v0.2-12B
         | 
| 5 | 
             
            - Undi95/LocalC-12B-e2.0
         | 
| 6 | 
            -
            -  | 
| 7 | 
             
            library_name: transformers
         | 
| 8 | 
             
            tags:
         | 
| 9 | 
             
            - mergekit
         | 
| 10 | 
             
            - merge
         | 
| 11 |  | 
| 12 | 
             
            ---
         | 
| 13 | 
            -
            # out
         | 
| 14 | 
            -
             | 
| 15 | 
            -
            This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
         | 
| 16 | 
            -
             | 
| 17 | 
            -
            ## Merge Details
         | 
| 18 | 
            -
            ### Merge Method
         | 
| 19 | 
            -
             | 
| 20 | 
            -
            This model was merged using the [DELLA](https://arxiv.org/abs/2406.11617) merge method using [mistralai/Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) as a base.
         | 
| 21 | 
            -
             | 
| 22 | 
            -
            ### Models Merged
         | 
| 23 |  | 
| 24 | 
            -
             | 
| 25 | 
            -
            * [anthracite-org/magnum-v4-12b](https://huggingface.co/anthracite-org/magnum-v4-12b)
         | 
| 26 | 
            -
            * [NeverSleep/Lumimaid-v0.2-12B](https://huggingface.co/NeverSleep/Lumimaid-v0.2-12B)
         | 
| 27 | 
            -
            * [Undi95/LocalC-12B-e2.0](https://huggingface.co/Undi95/LocalC-12B-e2.0)
         | 
| 28 |  | 
| 29 | 
            -
             | 
| 30 |  | 
| 31 | 
            -
             | 
| 32 |  | 
| 33 | 
            -
            ```yaml
         | 
| 34 | 
            -
            base_model: mistralai/Mistral-Nemo-Instruct-2407
         | 
| 35 | 
            -
            merge_method: della
         | 
| 36 | 
            -
            dtype: bfloat16
         | 
| 37 | 
            -
            models:
         | 
| 38 | 
            -
              - model: anthracite-org/magnum-v4-12b
         | 
| 39 | 
            -
                parameters:
         | 
| 40 | 
            -
                  weight: 1.0
         | 
| 41 | 
            -
              - model: Undi95/LocalC-12B-e2.0
         | 
| 42 | 
            -
                parameters:
         | 
| 43 | 
            -
                  weight: 1.0
         | 
| 44 | 
            -
              - model: NeverSleep/Lumimaid-v0.2-12B
         | 
| 45 | 
            -
                parameters:
         | 
| 46 | 
            -
                  weight: 1.0
         | 
| 47 | 
            -
              - model: mistralai/Mistral-Nemo-Instruct-2407
         | 
| 48 | 
            -
                parameters:
         | 
| 49 | 
            -
                  weight: 1.0
         | 
| 50 | 
             
            ```
         | 
|  | |
|  | 
|  | |
| 1 | 
             
            ---
         | 
| 2 | 
             
            base_model:
         | 
| 3 | 
            +
            - mistralai/Mistral-Nemo-Instruct-2407
         | 
| 4 | 
             
            - NeverSleep/Lumimaid-v0.2-12B
         | 
| 5 | 
             
            - Undi95/LocalC-12B-e2.0
         | 
| 6 | 
            +
            - anthracite-org/magnum-v4-12b
         | 
| 7 | 
             
            library_name: transformers
         | 
| 8 | 
             
            tags:
         | 
| 9 | 
             
            - mergekit
         | 
| 10 | 
             
            - merge
         | 
| 11 |  | 
| 12 | 
             
            ---
         | 
| 13 |  | 
| 14 | 
            +
            Merge of Lumimaid and Magnum, as requested by some users. <b>UPDATE: Magnum v4 is used in this merge, as requested [here](https://huggingface.co/Undi95/Lumimaid-Magnum-12B/discussions/4)</b>
         | 
| 15 |  | 
| 16 | 
            +
            I used the DELLA merge method in mergekit and added a finetune of Nemo, trained only on Claude input at 16k context, to the mix.
         | 
| 17 |  | 
| 18 | 
            +
            # Prompt template: Mistral
         | 
| 19 |  | 
| 20 | 
             
            ```
         | 
| 21 | 
            +
            <s>[INST] {input} [/INST] {output}</s>
         | 
| 22 | 
            +
            ```
         | 
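
As a rough illustration of the Mistral prompt template added in this change, the sketch below fills it in with plain string formatting. The helper names `format_prompt` and `format_example` are hypothetical and not part of the model card. It assumes the literal `<s>` and `</s>` markers are written into the string by hand; many tokenizers insert the BOS token themselves, so you may need to drop the leading `<s>` or disable automatic special tokens to avoid duplicating it.

```python
# Template copied from the README; {input} is the user turn, {output} the reply.
TEMPLATE = "<s>[INST] {input} [/INST] {output}</s>"


def format_prompt(user_input: str) -> str:
    """Build the text sent to the model for generation.

    Hypothetical helper: stops right after [/INST] so the model
    writes the {output} part itself.
    """
    return "<s>[INST] {input} [/INST]".format(input=user_input)


def format_example(user_input: str, output: str) -> str:
    """Build a complete single-turn exchange, e.g. for a training sample."""
    return TEMPLATE.format(input=user_input, output=output)


if __name__ == "__main__":
    print(format_prompt("Write a haiku about merging models."))
    print(format_example("Hi", "Hello!"))
```

Keeping the generation prompt and the full example as separate helpers makes it harder to send a closing `</s>` to the model by mistake when you only want it to continue after `[/INST]`.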
