Mingke977 committed 11 days ago

Commit

facd22e

verified

1 Parent(s): bb94af7

Add files using upload-large-folder tool

Files changed (44)

README.md +45 -374
config.json +9 -4
mergekit_config.yml +21 -0
model-1-of-40.safetensors +1 -1
model-10-of-40.safetensors +2 -2
model-11-of-40.safetensors +2 -2
model-12-of-40.safetensors +2 -2
model-13-of-40.safetensors +2 -2
model-14-of-40.safetensors +2 -2
model-15-of-40.safetensors +2 -2
model-16-of-40.safetensors +2 -2
model-17-of-40.safetensors +2 -2
model-18-of-40.safetensors +2 -2
model-19-of-40.safetensors +2 -2
model-2-of-40.safetensors +2 -2
model-20-of-40.safetensors +2 -2
model-21-of-40.safetensors +2 -2
model-22-of-40.safetensors +2 -2
model-23-of-40.safetensors +2 -2
model-24-of-40.safetensors +2 -2
model-25-of-40.safetensors +2 -2
model-26-of-40.safetensors +2 -2
model-27-of-40.safetensors +2 -2
model-28-of-40.safetensors +2 -2
model-29-of-40.safetensors +2 -2
model-3-of-40.safetensors +2 -2
model-30-of-40.safetensors +2 -2
model-31-of-40.safetensors +2 -2
model-32-of-40.safetensors +2 -2
model-33-of-40.safetensors +2 -2
model-34-of-40.safetensors +2 -2
model-35-of-40.safetensors +2 -2
model-36-of-40.safetensors +2 -2
model-37-of-40.safetensors +2 -2
model-38-of-40.safetensors +2 -2
model-39-of-40.safetensors +2 -2
model-4-of-40.safetensors +2 -2
model-40-of-40.safetensors +2 -2
model-5-of-40.safetensors +2 -2
model-6-of-40.safetensors +2 -2
model-7-of-40.safetensors +2 -2
model-8-of-40.safetensors +2 -2
model-9-of-40.safetensors +2 -2
model-non-layer.safetensors +1 -1

README.md CHANGED

@@ -1,378 +1,49 @@
 ---
-language:
-- zh
-- en
-pipeline_tag: text-generation
 library_name: transformers
----
-<div align="center">
-  <picture>
-      <img src="figures/joyai-logo.png" width="30%" alt="JoyAI-LLM Flash">
-  </picture>
-</div>
-<hr>
-<div align="center" style="line-height: 1;">
-  <a href="https://huggingface.co/jdopensource" target="_blank"><img alt="Hugging Face" src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-JD-ffc107?color=ffc107&logoColor=white"/></a>
-  <a href="https://huggingface.co/jdopensource/JoyAI-LLM-Flash/blob/main/LICENSE"><img alt="License" src="https://img.shields.io/badge/License-Modified_MIT-f5de53?&color=f5de53"/></a>
-</div>
-## 1. Model Introduction
-JoyAI-LLM-Flash is a state-of-the-art medium-sized instruct language model with 3 billion activated parameters and 48 billion total parameters. JoyAI-LLM-Flash was pretrained on 20 trillion text tokens using Muon optimizer, followed by large-scale supervised fine-tuning (SFT), direct preference optimization (DPO), and reinforcement learning (RL) across diverse environments. JoyAI-LLM-Flash achieves strong performance across frontier knowledge, reasoning, coding tasks and agentic capabilities.
-### Key Features
-- Fiber Bundle RL: Introduces fiber bundle theory into reinforcement learning, proposing a novel optimization framework, FiberPO. This method is specifically designed to handle the challenges of large-scale and heterogeneous agent training, improving stability and robustness under complex data distributions.
-- Training-Inference Collaboration: apply Muon optimizer with dense MTP, develop novel optimization techniques to resolve instabilities while scaling up, delivering 1.3× to 1.7× the throughput of the non-MTP version.
-- Agentic Intelligence: designed for tool use, reasoning, and autonomous problem-solving.
-## 2. Model Summary
-|                                             |                          |
-| :-----------------------------------------: | :----------------------: |
-|              **Architecture**               | Mixture-of-Experts (MoE) |
-|            **Total Parameters**             |           48B            |
-|          **Activated Parameters**           |            3B            |
-| **Number of Layers** (Dense layer included) |            40            |
-|         **Number of Dense Layers**          |            1             |
-|       **Attention Hidden Dimension**        |           2048           |
-|    **MoE Hidden Dimension** (per Expert)    |           768            |
-|        **Number of Attention Heads**        |            32            |
-|            **Number of Experts**            |           256            |
-|       **Selected Experts per Token**        |            8             |
-|        **Number of Shared Experts**         |            1             |
-|             **Vocabulary Size**             |           129K           |
-|             **Context Length**              |           128K           |
-|           **Attention Mechanism**           |           MLA            |
-|           **Activation Function**           |          SwiGLU          |
-|                   </div>                    |                          |
-## 3. Evaluation Results
-<table>
-<thead>
-<tr>
-<th align="center">Benchmark</th>
-<th align="center"><sup>JoyAI-LLM Flash</sup></th>
-<th align="center"><sup>Qwen3-30B-A3B-Instuct-2507</sup></th>
-<th align="center"><sup>GLM-4.7-Flash<br>(Non-thinking)</sup></th>
-</tr>
-</thead>
-<tbody>
-<tr>
-<td align="center" colspan=8><strong>Knowledge &amp; Alignment</strong></td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">MMLU</td>
-<td align="center" style="vertical-align: middle"><strong>89.50</strong></td>
-<td align="center" style="vertical-align: middle">86.87</td>
-<td align="center" style="vertical-align: middle">80.53</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">MMLU-Pro</td>
-<td align="center" style="vertical-align: middle"><strong>81.02</strong></td>
-<td align="center" style="vertical-align: middle">73.88</td>
-<td align="center" style="vertical-align: middle">63.62</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">CMMLU</td>
-<td align="center" style="vertical-align: middle"><strong>87.03</strong></td>
-<td align="center" style="vertical-align: middle">85.88</td>
-<td align="center" style="vertical-align: middle">75.85</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">GPQA-Diamond</td>
-<td align="center" style="vertical-align: middle"><strong>74.43</strong></td>
-<td align="center" style="vertical-align: middle">68.69</td>
-<td align="center" style="vertical-align: middle">39.90</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">SuperGPQA</td>
-<td align="center" style="vertical-align: middle"><strong>55.00</strong></td>
-<td align="center" style="vertical-align: middle">52.00</td>
-<td align="center" style="vertical-align: middle">32.00</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">LiveBench</td>
-<td align="center" style="vertical-align: middle"><strong>72.90</strong></td>
-<td align="center" style="vertical-align: middle">59.70</td>
-<td align="center" style="vertical-align: middle">43.10</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">IFEval</td>
-<td align="center" style="vertical-align: middle"><strong>86.69</strong></td>
-<td align="center" style="vertical-align: middle">83.18</td>
-<td align="center" style="vertical-align: middle">82.44</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">AlignBench</td>
-<td align="center" style="vertical-align: middle"><strong>8.24</strong></td>
-<td align="center" style="vertical-align: middle">8.07</td>
-<td align="center" style="vertical-align: middle">6.85</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">HellaSwag</td>
-<td align="center" style="vertical-align: middle"><strong>91.79</strong></td>
-<td align="center" style="vertical-align: middle">89.90</td>
-<td align="center" style="vertical-align: middle">60.84</td>
-</tr>
-<tr>
-<td align="center" colspan=8><strong>Coding</strong></td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">HumanEval</td>
-<td align="center" style="vertical-align: middle"><strong>96.34</strong></td>
-<td align="center" style="vertical-align: middle">95.12</td>
-<td align="center" style="vertical-align: middle">74.39</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">LiveCodeBench</td>
-<td align="center" style="vertical-align: middle"><strong>65.60</strong></td>
-<td align="center" style="vertical-align: middle">39.71</td>
-<td align="center" style="vertical-align: middle">27.43</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">SciCode</td>
-<td align="center" style="vertical-align: middle"><strong>3.08/22.92</strong></td>
-<td align="center" style="vertical-align: middle"><strong>3.08/22.92</strong></td>
-<td align="center" style="vertical-align: middle">3.08/15.11</td>
-</tr>
-<tr>
-<td align="center" colspan=8><strong>Mathematics</strong></td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">GSM8K</td>
-<td align="center" style="vertical-align: middle"><strong>95.83</strong></td>
-<td align="center" style="vertical-align: middle">79.83</td>
-<td align="center" style="vertical-align: middle">81.88</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">AIME2025</td>
-<td align="center" style="vertical-align: middle"><strong>65.83</strong></td>
-<td align="center" style="vertical-align: middle">62.08</td>
-<td align="center" style="vertical-align: middle">24.17</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">MATH 500</td>
-<td align="center" style="vertical-align: middle"><strong>97.10</strong></td>
-<td align="center" style="vertical-align: middle">89.80</td>
-<td align="center" style="vertical-align: middle">90.90</td>
-</tr>
-<tr>
-<td align="center" colspan=8><strong>Agentic</strong></td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">SWE-bench Verified</td>
-<td align="center" style="vertical-align: middle"><strong>60.60</strong></td>
-<td align="center" style="vertical-align: middle">24.44</td>
-<td align="center" style="vertical-align: middle">51.60</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">Tau2-Retail</td>
-<td align="center" style="vertical-align: middle"><strong>67.55</strong></td>
-<td align="center" style="vertical-align: middle">53.51</td>
-<td align="center" style="vertical-align: middle">62.28</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">Tau2-Airline</td>
-<td align="center" style="vertical-align: middle"><strong>54.00</strong></td>
-<td align="center" style="vertical-align: middle">32.00</td>
-<td align="center" style="vertical-align: middle">52.00</td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">Tau2-Telecom</td>
-<td align="center" style="vertical-align: middle">79.83</td>
-<td align="center" style="vertical-align: middle">4.39</td>
-<td align="center" style="vertical-align: middle"><strong>88.60</strong></td>
-</tr>
-<tr>
-<td align="center" colspan=8><strong>Long Context</strong></td>
-</tr>
-<tr>
-<td align="center" style="vertical-align: middle">RULER</td>
-<td align="center" style="vertical-align: middle"><strong>95.60</strong></td>
-<td align="center" style="vertical-align: middle">89.66</td>
-<td align="center" style="vertical-align: middle">56.12</td>
-</tr>
-</tbody>
-</table>
-## 4. Deployment
-> [!Note]
-> You can access JoyAI-LLM Flash API on https://docs.jdcloud.com/cn/jdaip/chat and we provide OpenAI/Anthropic-compatible API for you.
-> Currently, JoyAI-LLM-Flash-Block-INT8 is recommended to run on the following inference engines:
-* SGLang
-Deployment examples can be found in the [Model Deployment Guide](docs/deploy_guidance.md).
-## 5. Model Usage
-The usage demos below demonstrate how to call our official API.
-For third-party APIs deployed with vLLM or SGLang, please note that:
-> [!Note] Recommended sampling parameters: `temperature=0.6`, `top_p=1.0`
-### Chat Completion
-This is a simple chat completion script which shows how to call JoyAI-Flash API.
-```python
-from openai import OpenAI
-client = OpenAI(base_url="http://IP:PORT/v1", api_key="EMPTY")
-def simple_chat(client: OpenAI):
-    messages = [
-        {
-            "role": "user",
-            "content": [
-                {
-                    "type": "text",
-                    "text": "which one is bigger, 9.11 or 9.9? think carefully.",
-                }
-            ],
-        },
-    ]
-    model_name = client.models.list().data[0].id
-    response = client.chat.completions.create(
-        model=model_name, messages=messages, stream=False, max_tokens=4096
-    )
-    print(f"response: {response.choices[0].message.content}")
-if __name__ == "__main__":
-    simple_chat(client)
-```
-### Tool call Completion
-This is a simple toll call completion script which shows how to call JoyAI-Flash API.
-```python
-import json
-from openai import OpenAI
-client = OpenAI(base_url="http://IP:PORT/v1", api_key="EMPTY")
-def my_calculator(expression: str) -> str:
-    return str(eval(expression))
-def rewrite(expression: str) -> str:
-    return str(expression)
-def simple_tool_call(client: OpenAI):
-    messages = [
-        {
-            "role": "user",
-            "content": [
-                {
-                    "type": "text",
-                    "text": "use my functions to compute the results for the equations: 6+1",
-                },
-            ],
-        },
-    ]
-    tools = [
-        {
-            "type": "function",
-            "function": {
-                "name": "my_calculator",
-                "description": "A calculator that can evaluate a mathematical equation and compute its results.",
-                "parameters": {
-                    "type": "object",
-                    "properties": {
-                        "expression": {
-                            "type": "string",
-                            "description": "The mathematical expression to evaluate.",
-                        },
-                    },
-                    "required": ["expression"],
-                },
-            },
-        },
-        {
-            "type": "function",
-            "function": {
-                "name": "rewrite",
-                "description": "Rewrite a given text for improved clarity",
-                "parameters": {
-                    "type": "object",
-                    "properties": {
-                        "text": {
-                            "type": "string",
-                            "description": "The input text to rewrite",
-                        }
-                    },
-                },
-            },
-        },
-    ]
-    model_name = client.models.list().data[0].id
-    response = client.chat.completions.create(
-        model=model_name,
-        messages=messages,
-        temperature=1.0,
-        max_tokens=1024,
-        tools=tools,
-        tool_choice="auto",
-    )
-    tool_calls = response.choices[0].message.tool_calls
-    results = []
-    for tool_call in tool_calls:
-        function_name = tool_call.function.name
-        function_args = tool_call.function.arguments
-        if function_name == "my_calculator":
-            result = my_calculator(**json.loads(function_args))
-            results.append(result)
-    messages.append({"role": "assistant", "tool_calls": tool_calls})
-    for tool_call, result in zip(tool_calls, results):
-        messages.append(
-            {
-                "role": "tool",
-                "tool_call_id": tool_call.id,
-                "name": tool_call.function.name,
-                "content": result,
-            }
-        )
-    response = client.chat.completions.create(
-        model=model_name,
-        messages=messages,
-        temperature=1.0,
-        max_tokens=1024,
-    )
-    print(response.choices[0].message.content)
-if __name__ == "__main__":
-    simple_tool_call(client)
-```
 ---
-## 6. License
-Both the code repository and the model weights are released under the [Modified MIT License](LICENSE).

 ---
+base_model: []
 library_name: transformers
+tags:
+- mergekit
+- merge
 ---
+# c362_step50_ta05
+This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+## Merge Details
+### Merge Method
+This model was merged using the [Linear DARE](https://arxiv.org/abs/2311.03099) merge method using /root/myCodeLab/host/downloads/models/40Bra as a base.
+### Models Merged
+The following models were included in the merge:
+* /root/myCodeLab/host/verl/ckpts/40bra_k8s_single_domain/40bra_k8s_16node_sd_c362_20260327_205644_unknown/global_step_50/actor/huggingface
+### Configuration
+The following YAML configuration was used to produce this model:
+```yaml
+base_model: /root/myCodeLab/host/downloads/models/40Bra
+dtype: float32
+merge_method: dare_linear
+modules:
+  default:
+    slices:
+    - sources:
+      - layer_range: [0, 40]
+        model: /root/myCodeLab/host/downloads/models/40Bra
+      - layer_range: [0, 40]
+        model: /root/myCodeLab/host/verl/ckpts/40bra_k8s_single_domain/40bra_k8s_16node_sd_c362_20260327_205644_unknown/global_step_50/actor/huggingface
+        parameters:
+          density: 1.0
+          weight:
+          - filter: .mlp.gate.
+            value: 0.0
+          - value: 0.5
+    - sources:
+      - layer_range: [40, 41]
+        model: /root/myCodeLab/host/downloads/models/40Bra
+out_dtype: bfloat16
+```
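
As a rough illustration of what the `dare_linear` configuration above computes, the sketch below applies DARE-style delta pruning and a linear combination to one flat tensor in plain Python. It is not mergekit's implementation: the helper names `weight_for` and `dare_linear`, the substring matching of `filter`, the absence of weight normalization, and the example tensor names are all assumptions made for illustration. With `density: 1.0` nothing is pruned, so each tensor reduces to `base + weight * (tuned - base)`.

```python
import random

# Weight rules copied from the YAML above; a rule without "filter" is the default.
WEIGHT_RULES = [{"filter": ".mlp.gate.", "value": 0.0}, {"value": 0.5}]


def weight_for(tensor_name, rules):
    """Pick a weight for a tensor (assumption: a filter matches as a substring)."""
    default = None
    for rule in rules:
        if "filter" not in rule:
            default = rule["value"]
        elif rule["filter"] in tensor_name:
            return rule["value"]
    return default


def dare_linear(base, tuned, weight, density, seed=0):
    """Merge one flat tensor: base + weight * (pruned, rescaled delta)."""
    rng = random.Random(seed)
    merged = []
    for b, t in zip(base, tuned):
        delta = t - b
        if density < 1.0:
            # DARE: keep each delta entry with probability `density`,
            # then rescale the survivors by 1/density.
            delta = delta / density if rng.random() < density else 0.0
        merged.append(b + weight * delta)
    return merged


base = [1.0, 2.0]
tuned = [3.0, 2.0]

# Hypothetical tensor names, chosen only to exercise the filter.
expert_w = weight_for("model.layers.3.mlp.experts.0.up_proj.weight", WEIGHT_RULES)
router_w = weight_for("model.layers.3.mlp.gate.weight", WEIGHT_RULES)

expert = dare_linear(base, tuned, expert_w, density=1.0)
router = dare_linear(base, tuned, router_w, density=1.0)
print(expert)  # [2.0, 2.0]
print(router)  # [1.0, 2.0]
```

Under these assumptions, tensors whose names contain `.mlp.gate.` receive weight 0.0 and keep the base values, while every other tensor in layers 0 to 39 lands halfway between the base and the `global_step_50` checkpoint.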

config.json CHANGED

@@ -10,16 +10,18 @@
         "AutoModelForCausalLM": "modeling_deepseek.DeepseekV3ForCausalLM"
     },
     "bos_token_id": 0,
     "eos_token_id": 1,
     "ep_size": 1,
     "first_k_dense_replace": 1,
     "hidden_act": "silu",
     "hidden_size": 2048,
     "initializer_range": 0.02,
     "intermediate_size": 7168,
     "kv_lora_rank": 512,
     "max_position_embeddings": 131072,
-    "model_type": "joyai_llm_flash",
     "moe_intermediate_size": 768,
     "moe_layer_freq": 1,
     "n_group": 1,
@@ -31,7 +33,9 @@
     "num_hidden_layers": 40,
     "num_key_value_heads": 32,
     "num_nextn_predict_layers": 1,
     "q_lora_rank": 1536,
     "qk_nope_head_dim": 128,
     "qk_rope_head_dim": 64,
     "quantization_config": {
@@ -43,15 +47,16 @@
         ]
     },
     "rms_norm_eps": 1e-06,
     "rope_theta": 32000000,
     "routed_scaling_factor": 2.5,
     "scoring_func": "sigmoid",
     "tie_word_embeddings": false,
     "topk_group": 1,
     "topk_method": "noaux_tc",
-    "torch_dtype": "bfloat16",
-    "transformers_version": "4.44.2",
     "use_cache": true,
     "v_head_dim": 128,
     "vocab_size": 129280
-}

         "AutoModelForCausalLM": "modeling_deepseek.DeepseekV3ForCausalLM"
     },
     "bos_token_id": 0,
+    "dtype": "bfloat16",
     "eos_token_id": 1,
     "ep_size": 1,
     "first_k_dense_replace": 1,
+    "head_dim": 64,
     "hidden_act": "silu",
     "hidden_size": 2048,
     "initializer_range": 0.02,
     "intermediate_size": 7168,
     "kv_lora_rank": 512,
     "max_position_embeddings": 131072,
+    "model_type":"joyai_llm_flash",
     "moe_intermediate_size": 768,
     "moe_layer_freq": 1,
     "n_group": 1,
     "num_hidden_layers": 40,
     "num_key_value_heads": 32,
     "num_nextn_predict_layers": 1,
+    "pretraining_tp": 1,
     "q_lora_rank": 1536,
+    "qk_head_dim": 192,
     "qk_nope_head_dim": 128,
     "qk_rope_head_dim": 64,
     "quantization_config": {
         ]
     },
     "rms_norm_eps": 1e-06,
+    "rope_interleave": true,
+    "rope_scaling": null,
     "rope_theta": 32000000,
     "routed_scaling_factor": 2.5,
     "scoring_func": "sigmoid",
     "tie_word_embeddings": false,
     "topk_group": 1,
     "topk_method": "noaux_tc",
+    "transformers_version": "4.57.3",
     "use_cache": true,
     "v_head_dim": 128,
     "vocab_size": 129280
+}
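
The newly added `qk_head_dim` value happens to equal the sum of the two query/key dimensions that were already in the file. The short check below makes that arithmetic explicit; the relationship is inferred from the numbers in this diff rather than stated by it, and the helper name `check_qk_dims` is hypothetical.

```python
# Values copied from the new side of the config.json diff above.
config = {
    "qk_nope_head_dim": 128,
    "qk_rope_head_dim": 64,
    "qk_head_dim": 192,
    "v_head_dim": 128,
}


def check_qk_dims(cfg):
    """Return True when qk_head_dim equals the non-RoPE plus RoPE dimensions."""
    expected = cfg["qk_nope_head_dim"] + cfg["qk_rope_head_dim"]
    return cfg["qk_head_dim"] == expected


qk_ok = check_qk_dims(config)
print(qk_ok)  # True
```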

mergekit_config.yml ADDED

@@ -0,0 +1,21 @@

+base_model: /root/myCodeLab/host/downloads/models/40Bra
+dtype: float32
+merge_method: dare_linear
+modules:
+  default:
+    slices:
+    - sources:
+      - layer_range: [0, 40]
+        model: /root/myCodeLab/host/downloads/models/40Bra
+      - layer_range: [0, 40]
+        model: /root/myCodeLab/host/verl/ckpts/40bra_k8s_single_domain/40bra_k8s_16node_sd_c362_20260327_205644_unknown/global_step_50/actor/huggingface
+        parameters:
+          density: 1.0
+          weight:
+          - filter: .mlp.gate.
+            value: 0.0
+          - value: 0.5
+    - sources:
+      - layer_range: [40, 41]
+        model: /root/myCodeLab/host/downloads/models/40Bra
+out_dtype: bfloat16

model-1-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4f48ddf430827ffdb51975cfd39f9426dc705d549159e0b3031c9386ad60cd92
 size 70417424

 version https://git-lfs.github.com/spec/v1
+oid sha256:d3e78b8a79349b638ac5cb7139dbee0189474e2a41043e49189150bd46fd0b50
 size 70417424

model-10-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9c91cf0ce6f06c7dd5ebf4ed4b416fb115e8dc16b96b907571f54af9c8b7addc
-size 1240575152

 version https://git-lfs.github.com/spec/v1
+oid sha256:9da377dcb5bac74fd43f4983126f1cf84e7accc5e5dd26a6664fa47acb0443fc
+size 1240574640

model-11-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c3395a6d10132a7965d3e2bebaac84a072624c3f0dde104c3392a7075d0bdab8
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:19eafdc4c9b4bc748b0e8dea5e19d3dc718e73c0a4ddf8dc318c90ee32d5f4d0
+size 1240576192

model-12-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4ac71db3d3322e5fb3ba411105cbbcd773555cc929a9f59a68473a445030af67
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:a418e1068b40ebb60f4b69124ab631c4a0f708172f3365b6c669689da945c1ab
+size 1240576192

model-13-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a24e8ce9f29b0d3c9ff6e5b4d3493df5eb60ee8c52ed1a67cb2d8cf6db217dc5
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:e11a81dd89b553ea4ec708fcc2a335b2428967d993ab8c81a473c9d01360e484
+size 1240576192

model-14-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:184e58487f757f2e5fe05f51351f41f3c995578a3b152d406573ef46b85529ae
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:4c2f4d9e1f17289700b246b2059ec6eff978db14b4a51298925538471c544802
+size 1240576192

model-15-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7a50c38dd5267e2eb8470c9b0b6ce80c397a70def85147cab046e91f40bacf30
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:acfd3c136db7e3419b22de25de2942bf35a78dc5a5d2fb253e02f766c4be33ea
+size 1240576192

model-16-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:11864879ee5baf912acb086c736130cdeb40a48c9defb2b6ec2446c722bf81d6
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:44a56cb3b937d388bddf3f40407b2705bbe91c0bb1af31dfdd43102c5df2f5d2
+size 1240576192

model-17-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3e03ce933ed6dda2dd9ced2fc05f6f9ff20f22bb5a924530a2ecc5fdc1d643a9
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:7df20a82591ba541f59038b4f4e1cf7ae5e2998614ad954504d425a60b5c0ea4
+size 1240576192

model-18-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3bbe260ad95b9c788ea2a2b7f304fe0c765ddcfaca1e2ff344da558d7f86baef
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:a305a86aa156997d8845359b907b2c7363909f37d59ddd6b532276e2d8c5f8c4
+size 1240576192

model-19-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4569083a64418b328aa7b9083068391be62f46337c662a752ff93b4a501758c8
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:118013021992908478f2c7318cf57a22325769be520c262f86b385ab267678b7
+size 1240576192

model-2-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b51f6e450426f9a1acca1daac7c28b15927c709612fb7b5df9bdd8b7d5cf7ef6
-size 1240575152

 version https://git-lfs.github.com/spec/v1
+oid sha256:77dfe5162ff011af212696ac29b31273396121f0f08b17ad147bcf0592f87319
+size 1240574640

model-20-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e7f08b5e331b4c094f76702e0b8ccfc5ffe540cfed309d6534a9d81a9ca36029
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:ab2c3b4fc1a6d3f0671782bef9beeb8766825d67213889938b05ff6a9551fa9b
+size 1240576192

model-21-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8c04abe8b76c5015f97526e79df0a85a9f17615d9fd3d8662e13b1209854c4a1
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:921c963a2a3948712da54396366ee7871c023d8473799f0d6bda2674cae7e6c0
+size 1240576192

model-22-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c56faaa1cb68b0f89f2cd6afaa00afb2072de41dc56c85a7a1fe69aeca282eab
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:723d02655f20936a6efb99d81295832abab6081a735c36a29149a333d753d845
+size 1240576192

model-23-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:76b32333d89529b8f406dddd0b16571eb3f07a6c055c4f63fa05e8bf1cb61822
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:cd583ecdbc08fb94ac2b701936b763e779669604d34e1608c41770d3fff0dc32
+size 1240576192

model-24-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7dcf0fe97ee36d329d237c55b9231e28b2f5536678d7ba5deab4fae0c9737206
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:1027648adeb8aeb6baa34f1f8fcc0ccde91f7684e0684cac1eacd6b78f11ef53
+size 1240576192

model-25-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:09e63583cb2acb5d279ba8d61674aa8c0e031fda96664b8e833713278a4696a0
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:26425adf5423eeb45dd469b455a0d86f1f07c4ac6b9fb2274c8bf57c173cb8bf
+size 1240576192

model-26-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fba5920b459dbf18927202eccd87ef47013b0537a6863772b9466594d820f625
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:cf7d3674d89ad3e7b75cf2aa1bbafc4d4d59c188ef2ca813d27068a0f181102a
+size 1240576192

model-27-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d9c7ca88ee3f851b2a2fa2edba45bf94677d50554710535310536cbcdc21a4e4
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:23325ad47f7967fbc63e53562cbfc27d7f0f8ad149a34fc224d9ca0faa588600
+size 1240576192

model-28-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cfcef06073d7eff69a26d30a298444eee9c17968b970a9af48bd9386860feec2
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:a0f0c1c41480fa1405a05777294d6925980d8b7157112b05a12aab35f0c007cb
+size 1240576192

model-29-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2a8cab07da30997cc7dfa269362b2f03c8ff8c8bc199557bd0d12aca7fdbcc22
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:e8c478ff960026dd7ee4ba9048b092b81cac6eb38e67be0f68959702601a9e0d
+size 1240576192

model-3-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e41720e0ef2303621353674634234a406bde493f8e05d88b23084ffe112d67db
-size 1240575152

 version https://git-lfs.github.com/spec/v1
+oid sha256:b82527e24104dcb69840afaf49cfbb16449c42465854d01fb73fa6827de7cf95
+size 1240574640

model-30-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cfa7161d6099ff72463fef10762583c290fe5682e3b938e4066c93e4ace5b71e
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:ba4e991f40e5bf3ffa2efc2f31ad31b05aea580cfda986fd14fc6f7fe2913cb6
+size 1240576192

model-31-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ccdd3eb8e24ff9561e77068e3ac74eaea1746d8925e71965a1b2b8127f1972e6
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:ddb4c3211db92ff361c5ea2f04337a97aa67c36eab38de8f2cf18cb4811a2b94
+size 1240576192

model-32-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:96810a9f732857b8fa8f1e08f957392ccf41abf1cfe2a38a95c01a9fd70b9502
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:3d99e57522ed93ba01e14b7d0d98b26f8f0e5de2711c97ae894628f380f310c7
+size 1240576192

model-33-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d3f02a4e22221075d3a84521fe06a2d8e714b1d9bd2c2d07c1bc417a76d52e6d
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:5551f0b231afa4606982c970a30e36b488cda4a9cde7084d3381c602a0f9f8d5
+size 1240576192

model-34-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:82982c5da6ec2217f169924c05f4868eb9353654fcbe52fc555cdf1e54e954ab
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:e8c862fc52f1f34046d81867df9dfbc320f16355b1072b7048484e7a94ee26b1
+size 1240576192

model-35-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b7cddd2ed5e46db1ce96c5ac9e6bca7a813498bc2be2872fd8d847261cee6a10
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:0063484a9d45f05b435069c82b8e0f1630df7b41eb48d8ec0c3d34d5383d15d5
+size 1240576192

model-36-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:85bafba90f7517fcfc50de98af728dd7943c54e4bcc936ef28d3390391370a2b
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:e964f833051abd324f4af376864e262a7dae09c83d30d57c292beab094c01b58
+size 1240576192

model-37-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:932cf6ccecd8f8e1b6dea7ff99318d607432c25cd243278171009f706d3eca8f
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:30d882ce806dd69ab3e1a6a1e8215c037a97dc2b8eebe39b1e73a5d597b10624
+size 1240576192

model-38-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2d02dce21d360f7a2390ec11da920c6516407f0abb86b8e08886e6829794541e
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:01b9a194c9882d07a4fe0344ec9e44267cbbfe77efb37464ca94943ba5cb4894
+size 1240576192

model-39-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:71d231d31a4da0c90c36cb95ffe091937b67199f27bd4da4f97c4511068e73c4
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:b700ada0cd83229be4003b49ad99ebafcbb01de3a3a344be9033a79fb40272a4
+size 1240576192

model-4-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6ebab19e7fbdeb44956dd3ef63c1a1900c36fd4e9b0799f2adddd3045eefadfc
-size 1240575152

 version https://git-lfs.github.com/spec/v1
+oid sha256:3e8cc7d10ab9a1e9f7858ad9d7be017e294d92c2aeab1d5285c7dc32e0171f09
+size 1240574640

model-40-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8d9b8b7aec70e054c9da57d3f76bff33c059453bd7b24512ea980de1e36ffa27
-size 1240576704

 version https://git-lfs.github.com/spec/v1
+oid sha256:d2bfff3bca35a5b9a2f055f5169912d692629a7e94b402df99211185d520985f
+size 1240576192

model-5-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:83f78c8d03ee6a20fcf3c8b1c50da169b9f5fe32cdf66961cf847470e3a76fd6
-size 1240575152

 version https://git-lfs.github.com/spec/v1
+oid sha256:6f053c4ad2283fbca4bc849d358c1aca1a01dfe879c2e415192fa0d730a5804c
+size 1240574640

model-6-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:316b401e76e4db37a6ca37aba97fd91238cf8213cc631e43401c4c40b9298612
-size 1240575152

 version https://git-lfs.github.com/spec/v1
+oid sha256:b1c9591722e7fbfa9a914620a5b7e3b6768e019b249e48b5228700354adf8548
+size 1240574640

model-7-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:91f8cb67739b65bfc0417beca5b5174b3c2f0b3bf6e139c23e4ae91e361a0944
-size 1240575152

 version https://git-lfs.github.com/spec/v1
+oid sha256:ae77ea69752a2e2ea748a20646b640cbbd4cef91e5947f6f31e08c115fb5c780
+size 1240574640

model-8-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:75a4595159bacc23dc5b85f737a53c88d66c1e594c1a1642e9cc0e5e1a2e9402
-size 1240575152

 version https://git-lfs.github.com/spec/v1
+oid sha256:09cf4e2fc6c07b51b48ee048a930a379331e509146dd24cba7f9ccce1844c235
+size 1240574640

model-9-of-40.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9212c9d8925dba5e1367c32032afc05af97c8c3a6d96555aa38ece98ddde31ad
-size 1240575152

 version https://git-lfs.github.com/spec/v1
+oid sha256:131c1e377964f5f059532619e45416af849af7662675595f7d475023daba37cc
+size 1240574640

model-non-layer.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:358810bbba6e784b3be7462f7398fed2ba159a0edcec4f586aa72e877f4187ca
 size 1059066184

 version https://git-lfs.github.com/spec/v1
+oid sha256:9b992b0a3e3ab06a9490d364bf942df3bcf69874dcb9f940e935f674672f09cd
 size 1059066184
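
Each safetensors entry above is a Git LFS pointer file (a `version`, an `oid`, and a `size` line), so the diffs only show that the hash and, for most shards, the byte count changed. As a small sketch of how such pointers can be compared, the snippet below parses the old and new pointers for `model-10-of-40.safetensors` as they appear in this commit; the helper name `parse_lfs_pointer` is hypothetical.

```python
def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Old and new pointer contents for model-10-of-40.safetensors, copied from the diff.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:9c91cf0ce6f06c7dd5ebf4ed4b416fb115e8dc16b96b907571f54af9c8b7addc
size 1240575152
"""
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:9da377dcb5bac74fd43f4983126f1cf84e7accc5e5dd26a6664fa47acb0443fc
size 1240574640
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

content_changed = old["digest"] != new["digest"]
size_delta = new["size"] - old["size"]
print(content_changed, size_delta)  # True -512
```

The same 512-byte reduction appears for the other layer shards, while `model-1-of-40.safetensors` and `model-non-layer.safetensors` change only their hash. The pointers alone do not say what accounts for the size difference.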