Instructions to use eaddario/Llama-Guard-3-8B-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use eaddario/Llama-Guard-3-8B-GGUF with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="eaddario/Llama-Guard-3-8B-GGUF", filename="Llama-Guard-3-8B-F16.gguf", )
llm.create_chat_completion( messages = [ { "role": "user", "content": "What is the capital of France?" } ] ) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use eaddario/Llama-Guard-3-8B-GGUF with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M # Run inference directly in the terminal: llama-cli -hf eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M # Run inference directly in the terminal: llama-cli -hf eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M
Use Docker
docker model run hf.co/eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M
- LM Studio
- Jan
- vLLM
How to use eaddario/Llama-Guard-3-8B-GGUF with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "eaddario/Llama-Guard-3-8B-GGUF" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "eaddario/Llama-Guard-3-8B-GGUF", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M
- Ollama
How to use eaddario/Llama-Guard-3-8B-GGUF with Ollama:
ollama run hf.co/eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M
- Unsloth Studio
How to use eaddario/Llama-Guard-3-8B-GGUF with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for eaddario/Llama-Guard-3-8B-GGUF to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for eaddario/Llama-Guard-3-8B-GGUF to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for eaddario/Llama-Guard-3-8B-GGUF to start chatting
- Docker Model Runner
How to use eaddario/Llama-Guard-3-8B-GGUF with Docker Model Runner:
docker model run hf.co/eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M
- Lemonade
How to use eaddario/Llama-Guard-3-8B-GGUF with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull eaddario/Llama-Guard-3-8B-GGUF:Q4_K_M
Run and chat with the model
lemonade run user.Llama-Guard-3-8B-GGUF-Q4_K_M
List all available models
lemonade list
Add GGUF internal file structure
Browse files- scores/Llama-Guard-3-8B-F16.md +974 -0
- scores/Llama-Guard-3-8B-IQ3_M.md +978 -0
- scores/Llama-Guard-3-8B-IQ3_S.md +978 -0
- scores/Llama-Guard-3-8B-IQ4_NL.md +978 -0
- scores/Llama-Guard-3-8B-Q3_K_L.md +978 -0
- scores/Llama-Guard-3-8B-Q3_K_M.md +978 -0
- scores/Llama-Guard-3-8B-Q3_K_S.md +978 -0
- scores/Llama-Guard-3-8B-Q4_K_M.md +978 -0
- scores/Llama-Guard-3-8B-Q4_K_S.md +978 -0
- scores/Llama-Guard-3-8B-Q5_K_M.md +978 -0
- scores/Llama-Guard-3-8B-Q5_K_S.md +978 -0
- scores/Llama-Guard-3-8B-Q6_K.md +978 -0
- scores/Llama-Guard-3-8B-Q8_0.md +978 -0
scores/Llama-Guard-3-8B-F16.md
ADDED
|
@@ -0,0 +1,974 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-F16.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 35 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 32 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | general.file_type | 1 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 36 |
+
| 26 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 38 |
+
| 28 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 39 |
+
| 29 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 40 |
+
| 30 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 41 |
+
| 31 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 43 |
+
| 33 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 44 |
+
| 34 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 45 |
+
| 35 | UINT32 | 1 | general.quantization_version | 2 |
|
| 46 |
+
|
| 47 |
+
## Tensors Overview ~8B Elements
|
| 48 |
+
|
| 49 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 50 |
+
|
| 51 |
+
- [Llama-Guard-3-8B-F16.gguf - GGUF Internal File Dump](#llama-guard-3-8b-f16gguf---gguf-internal-file-dump)
|
| 52 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 53 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 54 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 55 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 56 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 57 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 58 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 59 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 60 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 61 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 62 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 63 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 64 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 65 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 66 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 67 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 68 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 69 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 70 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 71 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 72 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 73 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 74 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 75 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 76 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 77 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 78 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 79 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 80 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 81 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 82 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 83 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 84 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 85 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 86 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 87 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 88 |
+
|
| 89 |
+
### Tensor Data Offset
|
| 90 |
+
|
| 91 |
+
This table contains the offset and data segment relative to start of file
|
| 92 |
+
|
| 93 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 94 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 95 |
+
| 0 | rope_freqs.weight | 0x779780 | 0x100 |
|
| 96 |
+
| 1 | token_embd.weight | 0x779880 | 0x3ea00000 |
|
| 97 |
+
| 2 | blk.0.attn_norm.weight | 0x3f179880 | 0x4000 |
|
| 98 |
+
| 3 | blk.0.ffn_down.weight | 0x3f17d880 | 0x7000000 |
|
| 99 |
+
| 4 | blk.0.ffn_gate.weight | 0x4617d880 | 0x7000000 |
|
| 100 |
+
| 5 | blk.0.ffn_up.weight | 0x4d17d880 | 0x7000000 |
|
| 101 |
+
| 6 | blk.0.ffn_norm.weight | 0x5417d880 | 0x4000 |
|
| 102 |
+
| 7 | blk.0.attn_k.weight | 0x54181880 | 0x800000 |
|
| 103 |
+
| 8 | blk.0.attn_output.weight | 0x54981880 | 0x2000000 |
|
| 104 |
+
| 9 | blk.0.attn_q.weight | 0x56981880 | 0x2000000 |
|
| 105 |
+
| 10 | blk.0.attn_v.weight | 0x58981880 | 0x800000 |
|
| 106 |
+
| 11 | blk.1.attn_norm.weight | 0x59181880 | 0x4000 |
|
| 107 |
+
| 12 | blk.1.ffn_down.weight | 0x59185880 | 0x7000000 |
|
| 108 |
+
| 13 | blk.1.ffn_gate.weight | 0x60185880 | 0x7000000 |
|
| 109 |
+
| 14 | blk.1.ffn_up.weight | 0x67185880 | 0x7000000 |
|
| 110 |
+
| 15 | blk.1.ffn_norm.weight | 0x6e185880 | 0x4000 |
|
| 111 |
+
| 16 | blk.1.attn_k.weight | 0x6e189880 | 0x800000 |
|
| 112 |
+
| 17 | blk.1.attn_output.weight | 0x6e989880 | 0x2000000 |
|
| 113 |
+
| 18 | blk.1.attn_q.weight | 0x70989880 | 0x2000000 |
|
| 114 |
+
| 19 | blk.1.attn_v.weight | 0x72989880 | 0x800000 |
|
| 115 |
+
| 20 | blk.2.attn_norm.weight | 0x73189880 | 0x4000 |
|
| 116 |
+
| 21 | blk.2.ffn_down.weight | 0x7318d880 | 0x7000000 |
|
| 117 |
+
| 22 | blk.2.ffn_gate.weight | 0x7a18d880 | 0x7000000 |
|
| 118 |
+
| 23 | blk.2.ffn_up.weight | 0x8118d880 | 0x7000000 |
|
| 119 |
+
| 24 | blk.2.ffn_norm.weight | 0x8818d880 | 0x4000 |
|
| 120 |
+
| 25 | blk.2.attn_k.weight | 0x88191880 | 0x800000 |
|
| 121 |
+
| 26 | blk.2.attn_output.weight | 0x88991880 | 0x2000000 |
|
| 122 |
+
| 27 | blk.2.attn_q.weight | 0x8a991880 | 0x2000000 |
|
| 123 |
+
| 28 | blk.2.attn_v.weight | 0x8c991880 | 0x800000 |
|
| 124 |
+
| 29 | blk.3.attn_norm.weight | 0x8d191880 | 0x4000 |
|
| 125 |
+
| 30 | blk.3.ffn_down.weight | 0x8d195880 | 0x7000000 |
|
| 126 |
+
| 31 | blk.3.ffn_gate.weight | 0x94195880 | 0x7000000 |
|
| 127 |
+
| 32 | blk.3.ffn_up.weight | 0x9b195880 | 0x7000000 |
|
| 128 |
+
| 33 | blk.3.ffn_norm.weight | 0xa2195880 | 0x4000 |
|
| 129 |
+
| 34 | blk.3.attn_k.weight | 0xa2199880 | 0x800000 |
|
| 130 |
+
| 35 | blk.3.attn_output.weight | 0xa2999880 | 0x2000000 |
|
| 131 |
+
| 36 | blk.3.attn_q.weight | 0xa4999880 | 0x2000000 |
|
| 132 |
+
| 37 | blk.3.attn_v.weight | 0xa6999880 | 0x800000 |
|
| 133 |
+
| 38 | blk.4.attn_norm.weight | 0xa7199880 | 0x4000 |
|
| 134 |
+
| 39 | blk.4.ffn_down.weight | 0xa719d880 | 0x7000000 |
|
| 135 |
+
| 40 | blk.4.ffn_gate.weight | 0xae19d880 | 0x7000000 |
|
| 136 |
+
| 41 | blk.4.ffn_up.weight | 0xb519d880 | 0x7000000 |
|
| 137 |
+
| 42 | blk.4.ffn_norm.weight | 0xbc19d880 | 0x4000 |
|
| 138 |
+
| 43 | blk.4.attn_k.weight | 0xbc1a1880 | 0x800000 |
|
| 139 |
+
| 44 | blk.4.attn_output.weight | 0xbc9a1880 | 0x2000000 |
|
| 140 |
+
| 45 | blk.4.attn_q.weight | 0xbe9a1880 | 0x2000000 |
|
| 141 |
+
| 46 | blk.4.attn_v.weight | 0xc09a1880 | 0x800000 |
|
| 142 |
+
| 47 | blk.5.attn_norm.weight | 0xc11a1880 | 0x4000 |
|
| 143 |
+
| 48 | blk.5.ffn_down.weight | 0xc11a5880 | 0x7000000 |
|
| 144 |
+
| 49 | blk.5.ffn_gate.weight | 0xc81a5880 | 0x7000000 |
|
| 145 |
+
| 50 | blk.5.ffn_up.weight | 0xcf1a5880 | 0x7000000 |
|
| 146 |
+
| 51 | blk.5.ffn_norm.weight | 0xd61a5880 | 0x4000 |
|
| 147 |
+
| 52 | blk.5.attn_k.weight | 0xd61a9880 | 0x800000 |
|
| 148 |
+
| 53 | blk.5.attn_output.weight | 0xd69a9880 | 0x2000000 |
|
| 149 |
+
| 54 | blk.5.attn_q.weight | 0xd89a9880 | 0x2000000 |
|
| 150 |
+
| 55 | blk.5.attn_v.weight | 0xda9a9880 | 0x800000 |
|
| 151 |
+
| 56 | blk.6.attn_norm.weight | 0xdb1a9880 | 0x4000 |
|
| 152 |
+
| 57 | blk.6.ffn_down.weight | 0xdb1ad880 | 0x7000000 |
|
| 153 |
+
| 58 | blk.6.ffn_gate.weight | 0xe21ad880 | 0x7000000 |
|
| 154 |
+
| 59 | blk.6.ffn_up.weight | 0xe91ad880 | 0x7000000 |
|
| 155 |
+
| 60 | blk.6.ffn_norm.weight | 0xf01ad880 | 0x4000 |
|
| 156 |
+
| 61 | blk.6.attn_k.weight | 0xf01b1880 | 0x800000 |
|
| 157 |
+
| 62 | blk.6.attn_output.weight | 0xf09b1880 | 0x2000000 |
|
| 158 |
+
| 63 | blk.6.attn_q.weight | 0xf29b1880 | 0x2000000 |
|
| 159 |
+
| 64 | blk.6.attn_v.weight | 0xf49b1880 | 0x800000 |
|
| 160 |
+
| 65 | blk.7.attn_norm.weight | 0xf51b1880 | 0x4000 |
|
| 161 |
+
| 66 | blk.7.ffn_down.weight | 0xf51b5880 | 0x7000000 |
|
| 162 |
+
| 67 | blk.7.ffn_gate.weight | 0xfc1b5880 | 0x7000000 |
|
| 163 |
+
| 68 | blk.7.ffn_up.weight | 0x1031b5880 | 0x7000000 |
|
| 164 |
+
| 69 | blk.7.ffn_norm.weight | 0x10a1b5880 | 0x4000 |
|
| 165 |
+
| 70 | blk.7.attn_k.weight | 0x10a1b9880 | 0x800000 |
|
| 166 |
+
| 71 | blk.7.attn_output.weight | 0x10a9b9880 | 0x2000000 |
|
| 167 |
+
| 72 | blk.7.attn_q.weight | 0x10c9b9880 | 0x2000000 |
|
| 168 |
+
| 73 | blk.7.attn_v.weight | 0x10e9b9880 | 0x800000 |
|
| 169 |
+
| 74 | blk.8.attn_norm.weight | 0x10f1b9880 | 0x4000 |
|
| 170 |
+
| 75 | blk.8.ffn_down.weight | 0x10f1bd880 | 0x7000000 |
|
| 171 |
+
| 76 | blk.8.ffn_gate.weight | 0x1161bd880 | 0x7000000 |
|
| 172 |
+
| 77 | blk.8.ffn_up.weight | 0x11d1bd880 | 0x7000000 |
|
| 173 |
+
| 78 | blk.8.ffn_norm.weight | 0x1241bd880 | 0x4000 |
|
| 174 |
+
| 79 | blk.8.attn_k.weight | 0x1241c1880 | 0x800000 |
|
| 175 |
+
| 80 | blk.8.attn_output.weight | 0x1249c1880 | 0x2000000 |
|
| 176 |
+
| 81 | blk.8.attn_q.weight | 0x1269c1880 | 0x2000000 |
|
| 177 |
+
| 82 | blk.8.attn_v.weight | 0x1289c1880 | 0x800000 |
|
| 178 |
+
| 83 | blk.10.attn_norm.weight | 0x1291c1880 | 0x4000 |
|
| 179 |
+
| 84 | blk.10.ffn_down.weight | 0x1291c5880 | 0x7000000 |
|
| 180 |
+
| 85 | blk.10.ffn_gate.weight | 0x1301c5880 | 0x7000000 |
|
| 181 |
+
| 86 | blk.10.ffn_up.weight | 0x1371c5880 | 0x7000000 |
|
| 182 |
+
| 87 | blk.10.ffn_norm.weight | 0x13e1c5880 | 0x4000 |
|
| 183 |
+
| 88 | blk.10.attn_k.weight | 0x13e1c9880 | 0x800000 |
|
| 184 |
+
| 89 | blk.10.attn_output.weight | 0x13e9c9880 | 0x2000000 |
|
| 185 |
+
| 90 | blk.10.attn_q.weight | 0x1409c9880 | 0x2000000 |
|
| 186 |
+
| 91 | blk.10.attn_v.weight | 0x1429c9880 | 0x800000 |
|
| 187 |
+
| 92 | blk.11.attn_norm.weight | 0x1431c9880 | 0x4000 |
|
| 188 |
+
| 93 | blk.11.ffn_down.weight | 0x1431cd880 | 0x7000000 |
|
| 189 |
+
| 94 | blk.11.ffn_gate.weight | 0x14a1cd880 | 0x7000000 |
|
| 190 |
+
| 95 | blk.11.ffn_up.weight | 0x1511cd880 | 0x7000000 |
|
| 191 |
+
| 96 | blk.11.ffn_norm.weight | 0x1581cd880 | 0x4000 |
|
| 192 |
+
| 97 | blk.11.attn_k.weight | 0x1581d1880 | 0x800000 |
|
| 193 |
+
| 98 | blk.11.attn_output.weight | 0x1589d1880 | 0x2000000 |
|
| 194 |
+
| 99 | blk.11.attn_q.weight | 0x15a9d1880 | 0x2000000 |
|
| 195 |
+
| 100 | blk.11.attn_v.weight | 0x15c9d1880 | 0x800000 |
|
| 196 |
+
| 101 | blk.12.attn_norm.weight | 0x15d1d1880 | 0x4000 |
|
| 197 |
+
| 102 | blk.12.ffn_down.weight | 0x15d1d5880 | 0x7000000 |
|
| 198 |
+
| 103 | blk.12.ffn_gate.weight | 0x1641d5880 | 0x7000000 |
|
| 199 |
+
| 104 | blk.12.ffn_up.weight | 0x16b1d5880 | 0x7000000 |
|
| 200 |
+
| 105 | blk.12.ffn_norm.weight | 0x1721d5880 | 0x4000 |
|
| 201 |
+
| 106 | blk.12.attn_k.weight | 0x1721d9880 | 0x800000 |
|
| 202 |
+
| 107 | blk.12.attn_output.weight | 0x1729d9880 | 0x2000000 |
|
| 203 |
+
| 108 | blk.12.attn_q.weight | 0x1749d9880 | 0x2000000 |
|
| 204 |
+
| 109 | blk.12.attn_v.weight | 0x1769d9880 | 0x800000 |
|
| 205 |
+
| 110 | blk.13.attn_norm.weight | 0x1771d9880 | 0x4000 |
|
| 206 |
+
| 111 | blk.13.ffn_down.weight | 0x1771dd880 | 0x7000000 |
|
| 207 |
+
| 112 | blk.13.ffn_gate.weight | 0x17e1dd880 | 0x7000000 |
|
| 208 |
+
| 113 | blk.13.ffn_up.weight | 0x1851dd880 | 0x7000000 |
|
| 209 |
+
| 114 | blk.13.ffn_norm.weight | 0x18c1dd880 | 0x4000 |
|
| 210 |
+
| 115 | blk.13.attn_k.weight | 0x18c1e1880 | 0x800000 |
|
| 211 |
+
| 116 | blk.13.attn_output.weight | 0x18c9e1880 | 0x2000000 |
|
| 212 |
+
| 117 | blk.13.attn_q.weight | 0x18e9e1880 | 0x2000000 |
|
| 213 |
+
| 118 | blk.13.attn_v.weight | 0x1909e1880 | 0x800000 |
|
| 214 |
+
| 119 | blk.14.attn_norm.weight | 0x1911e1880 | 0x4000 |
|
| 215 |
+
| 120 | blk.14.ffn_down.weight | 0x1911e5880 | 0x7000000 |
|
| 216 |
+
| 121 | blk.14.ffn_gate.weight | 0x1981e5880 | 0x7000000 |
|
| 217 |
+
| 122 | blk.14.ffn_up.weight | 0x19f1e5880 | 0x7000000 |
|
| 218 |
+
| 123 | blk.14.ffn_norm.weight | 0x1a61e5880 | 0x4000 |
|
| 219 |
+
| 124 | blk.14.attn_k.weight | 0x1a61e9880 | 0x800000 |
|
| 220 |
+
| 125 | blk.14.attn_output.weight | 0x1a69e9880 | 0x2000000 |
|
| 221 |
+
| 126 | blk.14.attn_q.weight | 0x1a89e9880 | 0x2000000 |
|
| 222 |
+
| 127 | blk.14.attn_v.weight | 0x1aa9e9880 | 0x800000 |
|
| 223 |
+
| 128 | blk.15.attn_norm.weight | 0x1ab1e9880 | 0x4000 |
|
| 224 |
+
| 129 | blk.15.ffn_down.weight | 0x1ab1ed880 | 0x7000000 |
|
| 225 |
+
| 130 | blk.15.ffn_gate.weight | 0x1b21ed880 | 0x7000000 |
|
| 226 |
+
| 131 | blk.15.ffn_up.weight | 0x1b91ed880 | 0x7000000 |
|
| 227 |
+
| 132 | blk.15.ffn_norm.weight | 0x1c01ed880 | 0x4000 |
|
| 228 |
+
| 133 | blk.15.attn_k.weight | 0x1c01f1880 | 0x800000 |
|
| 229 |
+
| 134 | blk.15.attn_output.weight | 0x1c09f1880 | 0x2000000 |
|
| 230 |
+
| 135 | blk.15.attn_q.weight | 0x1c29f1880 | 0x2000000 |
|
| 231 |
+
| 136 | blk.15.attn_v.weight | 0x1c49f1880 | 0x800000 |
|
| 232 |
+
| 137 | blk.16.attn_norm.weight | 0x1c51f1880 | 0x4000 |
|
| 233 |
+
| 138 | blk.16.ffn_down.weight | 0x1c51f5880 | 0x7000000 |
|
| 234 |
+
| 139 | blk.16.ffn_gate.weight | 0x1cc1f5880 | 0x7000000 |
|
| 235 |
+
| 140 | blk.16.ffn_up.weight | 0x1d31f5880 | 0x7000000 |
|
| 236 |
+
| 141 | blk.16.ffn_norm.weight | 0x1da1f5880 | 0x4000 |
|
| 237 |
+
| 142 | blk.16.attn_k.weight | 0x1da1f9880 | 0x800000 |
|
| 238 |
+
| 143 | blk.16.attn_output.weight | 0x1da9f9880 | 0x2000000 |
|
| 239 |
+
| 144 | blk.16.attn_q.weight | 0x1dc9f9880 | 0x2000000 |
|
| 240 |
+
| 145 | blk.16.attn_v.weight | 0x1de9f9880 | 0x800000 |
|
| 241 |
+
| 146 | blk.17.attn_norm.weight | 0x1df1f9880 | 0x4000 |
|
| 242 |
+
| 147 | blk.17.ffn_down.weight | 0x1df1fd880 | 0x7000000 |
|
| 243 |
+
| 148 | blk.17.ffn_gate.weight | 0x1e61fd880 | 0x7000000 |
|
| 244 |
+
| 149 | blk.17.ffn_up.weight | 0x1ed1fd880 | 0x7000000 |
|
| 245 |
+
| 150 | blk.17.ffn_norm.weight | 0x1f41fd880 | 0x4000 |
|
| 246 |
+
| 151 | blk.17.attn_k.weight | 0x1f4201880 | 0x800000 |
|
| 247 |
+
| 152 | blk.17.attn_output.weight | 0x1f4a01880 | 0x2000000 |
|
| 248 |
+
| 153 | blk.17.attn_q.weight | 0x1f6a01880 | 0x2000000 |
|
| 249 |
+
| 154 | blk.17.attn_v.weight | 0x1f8a01880 | 0x800000 |
|
| 250 |
+
| 155 | blk.18.attn_norm.weight | 0x1f9201880 | 0x4000 |
|
| 251 |
+
| 156 | blk.18.ffn_down.weight | 0x1f9205880 | 0x7000000 |
|
| 252 |
+
| 157 | blk.18.ffn_gate.weight | 0x200205880 | 0x7000000 |
|
| 253 |
+
| 158 | blk.18.ffn_up.weight | 0x207205880 | 0x7000000 |
|
| 254 |
+
| 159 | blk.18.ffn_norm.weight | 0x20e205880 | 0x4000 |
|
| 255 |
+
| 160 | blk.18.attn_k.weight | 0x20e209880 | 0x800000 |
|
| 256 |
+
| 161 | blk.18.attn_output.weight | 0x20ea09880 | 0x2000000 |
|
| 257 |
+
| 162 | blk.18.attn_q.weight | 0x210a09880 | 0x2000000 |
|
| 258 |
+
| 163 | blk.18.attn_v.weight | 0x212a09880 | 0x800000 |
|
| 259 |
+
| 164 | blk.19.attn_norm.weight | 0x213209880 | 0x4000 |
|
| 260 |
+
| 165 | blk.19.ffn_down.weight | 0x21320d880 | 0x7000000 |
|
| 261 |
+
| 166 | blk.19.ffn_gate.weight | 0x21a20d880 | 0x7000000 |
|
| 262 |
+
| 167 | blk.19.ffn_up.weight | 0x22120d880 | 0x7000000 |
|
| 263 |
+
| 168 | blk.19.ffn_norm.weight | 0x22820d880 | 0x4000 |
|
| 264 |
+
| 169 | blk.19.attn_k.weight | 0x228211880 | 0x800000 |
|
| 265 |
+
| 170 | blk.19.attn_output.weight | 0x228a11880 | 0x2000000 |
|
| 266 |
+
| 171 | blk.19.attn_q.weight | 0x22aa11880 | 0x2000000 |
|
| 267 |
+
| 172 | blk.19.attn_v.weight | 0x22ca11880 | 0x800000 |
|
| 268 |
+
| 173 | blk.20.ffn_gate.weight | 0x22d211880 | 0x7000000 |
|
| 269 |
+
| 174 | blk.20.attn_k.weight | 0x234211880 | 0x800000 |
|
| 270 |
+
| 175 | blk.20.attn_output.weight | 0x234a11880 | 0x2000000 |
|
| 271 |
+
| 176 | blk.20.attn_q.weight | 0x236a11880 | 0x2000000 |
|
| 272 |
+
| 177 | blk.20.attn_v.weight | 0x238a11880 | 0x800000 |
|
| 273 |
+
| 178 | blk.9.attn_norm.weight | 0x239211880 | 0x4000 |
|
| 274 |
+
| 179 | blk.9.ffn_down.weight | 0x239215880 | 0x7000000 |
|
| 275 |
+
| 180 | blk.9.ffn_gate.weight | 0x240215880 | 0x7000000 |
|
| 276 |
+
| 181 | blk.9.ffn_up.weight | 0x247215880 | 0x7000000 |
|
| 277 |
+
| 182 | blk.9.ffn_norm.weight | 0x24e215880 | 0x4000 |
|
| 278 |
+
| 183 | blk.9.attn_k.weight | 0x24e219880 | 0x800000 |
|
| 279 |
+
| 184 | blk.9.attn_output.weight | 0x24ea19880 | 0x2000000 |
|
| 280 |
+
| 185 | blk.9.attn_q.weight | 0x250a19880 | 0x2000000 |
|
| 281 |
+
| 186 | blk.9.attn_v.weight | 0x252a19880 | 0x800000 |
|
| 282 |
+
| 187 | blk.20.attn_norm.weight | 0x253219880 | 0x4000 |
|
| 283 |
+
| 188 | blk.20.ffn_down.weight | 0x25321d880 | 0x7000000 |
|
| 284 |
+
| 189 | blk.20.ffn_up.weight | 0x25a21d880 | 0x7000000 |
|
| 285 |
+
| 190 | blk.20.ffn_norm.weight | 0x26121d880 | 0x4000 |
|
| 286 |
+
| 191 | blk.21.attn_norm.weight | 0x261221880 | 0x4000 |
|
| 287 |
+
| 192 | blk.21.ffn_down.weight | 0x261225880 | 0x7000000 |
|
| 288 |
+
| 193 | blk.21.ffn_gate.weight | 0x268225880 | 0x7000000 |
|
| 289 |
+
| 194 | blk.21.ffn_up.weight | 0x26f225880 | 0x7000000 |
|
| 290 |
+
| 195 | blk.21.ffn_norm.weight | 0x276225880 | 0x4000 |
|
| 291 |
+
| 196 | blk.21.attn_k.weight | 0x276229880 | 0x800000 |
|
| 292 |
+
| 197 | blk.21.attn_output.weight | 0x276a29880 | 0x2000000 |
|
| 293 |
+
| 198 | blk.21.attn_q.weight | 0x278a29880 | 0x2000000 |
|
| 294 |
+
| 199 | blk.21.attn_v.weight | 0x27aa29880 | 0x800000 |
|
| 295 |
+
| 200 | blk.22.attn_norm.weight | 0x27b229880 | 0x4000 |
|
| 296 |
+
| 201 | blk.22.ffn_down.weight | 0x27b22d880 | 0x7000000 |
|
| 297 |
+
| 202 | blk.22.ffn_gate.weight | 0x28222d880 | 0x7000000 |
|
| 298 |
+
| 203 | blk.22.ffn_up.weight | 0x28922d880 | 0x7000000 |
|
| 299 |
+
| 204 | blk.22.ffn_norm.weight | 0x29022d880 | 0x4000 |
|
| 300 |
+
| 205 | blk.22.attn_k.weight | 0x290231880 | 0x800000 |
|
| 301 |
+
| 206 | blk.22.attn_output.weight | 0x290a31880 | 0x2000000 |
|
| 302 |
+
| 207 | blk.22.attn_q.weight | 0x292a31880 | 0x2000000 |
|
| 303 |
+
| 208 | blk.22.attn_v.weight | 0x294a31880 | 0x800000 |
|
| 304 |
+
| 209 | blk.23.attn_norm.weight | 0x295231880 | 0x4000 |
|
| 305 |
+
| 210 | blk.23.ffn_down.weight | 0x295235880 | 0x7000000 |
|
| 306 |
+
| 211 | blk.23.ffn_gate.weight | 0x29c235880 | 0x7000000 |
|
| 307 |
+
| 212 | blk.23.ffn_up.weight | 0x2a3235880 | 0x7000000 |
|
| 308 |
+
| 213 | blk.23.ffn_norm.weight | 0x2aa235880 | 0x4000 |
|
| 309 |
+
| 214 | blk.23.attn_k.weight | 0x2aa239880 | 0x800000 |
|
| 310 |
+
| 215 | blk.23.attn_output.weight | 0x2aaa39880 | 0x2000000 |
|
| 311 |
+
| 216 | blk.23.attn_q.weight | 0x2aca39880 | 0x2000000 |
|
| 312 |
+
| 217 | blk.23.attn_v.weight | 0x2aea39880 | 0x800000 |
|
| 313 |
+
| 218 | blk.24.attn_norm.weight | 0x2af239880 | 0x4000 |
|
| 314 |
+
| 219 | blk.24.ffn_down.weight | 0x2af23d880 | 0x7000000 |
|
| 315 |
+
| 220 | blk.24.ffn_gate.weight | 0x2b623d880 | 0x7000000 |
|
| 316 |
+
| 221 | blk.24.ffn_up.weight | 0x2bd23d880 | 0x7000000 |
|
| 317 |
+
| 222 | blk.24.ffn_norm.weight | 0x2c423d880 | 0x4000 |
|
| 318 |
+
| 223 | blk.24.attn_k.weight | 0x2c4241880 | 0x800000 |
|
| 319 |
+
| 224 | blk.24.attn_output.weight | 0x2c4a41880 | 0x2000000 |
|
| 320 |
+
| 225 | blk.24.attn_q.weight | 0x2c6a41880 | 0x2000000 |
|
| 321 |
+
| 226 | blk.24.attn_v.weight | 0x2c8a41880 | 0x800000 |
|
| 322 |
+
| 227 | blk.25.attn_norm.weight | 0x2c9241880 | 0x4000 |
|
| 323 |
+
| 228 | blk.25.ffn_down.weight | 0x2c9245880 | 0x7000000 |
|
| 324 |
+
| 229 | blk.25.ffn_gate.weight | 0x2d0245880 | 0x7000000 |
|
| 325 |
+
| 230 | blk.25.ffn_up.weight | 0x2d7245880 | 0x7000000 |
|
| 326 |
+
| 231 | blk.25.ffn_norm.weight | 0x2de245880 | 0x4000 |
|
| 327 |
+
| 232 | blk.25.attn_k.weight | 0x2de249880 | 0x800000 |
|
| 328 |
+
| 233 | blk.25.attn_output.weight | 0x2dea49880 | 0x2000000 |
|
| 329 |
+
| 234 | blk.25.attn_q.weight | 0x2e0a49880 | 0x2000000 |
|
| 330 |
+
| 235 | blk.25.attn_v.weight | 0x2e2a49880 | 0x800000 |
|
| 331 |
+
| 236 | blk.26.attn_norm.weight | 0x2e3249880 | 0x4000 |
|
| 332 |
+
| 237 | blk.26.ffn_down.weight | 0x2e324d880 | 0x7000000 |
|
| 333 |
+
| 238 | blk.26.ffn_gate.weight | 0x2ea24d880 | 0x7000000 |
|
| 334 |
+
| 239 | blk.26.ffn_up.weight | 0x2f124d880 | 0x7000000 |
|
| 335 |
+
| 240 | blk.26.ffn_norm.weight | 0x2f824d880 | 0x4000 |
|
| 336 |
+
| 241 | blk.26.attn_k.weight | 0x2f8251880 | 0x800000 |
|
| 337 |
+
| 242 | blk.26.attn_output.weight | 0x2f8a51880 | 0x2000000 |
|
| 338 |
+
| 243 | blk.26.attn_q.weight | 0x2faa51880 | 0x2000000 |
|
| 339 |
+
| 244 | blk.26.attn_v.weight | 0x2fca51880 | 0x800000 |
|
| 340 |
+
| 245 | blk.27.attn_norm.weight | 0x2fd251880 | 0x4000 |
|
| 341 |
+
| 246 | blk.27.ffn_down.weight | 0x2fd255880 | 0x7000000 |
|
| 342 |
+
| 247 | blk.27.ffn_gate.weight | 0x304255880 | 0x7000000 |
|
| 343 |
+
| 248 | blk.27.ffn_up.weight | 0x30b255880 | 0x7000000 |
|
| 344 |
+
| 249 | blk.27.ffn_norm.weight | 0x312255880 | 0x4000 |
|
| 345 |
+
| 250 | blk.27.attn_k.weight | 0x312259880 | 0x800000 |
|
| 346 |
+
| 251 | blk.27.attn_output.weight | 0x312a59880 | 0x2000000 |
|
| 347 |
+
| 252 | blk.27.attn_q.weight | 0x314a59880 | 0x2000000 |
|
| 348 |
+
| 253 | blk.27.attn_v.weight | 0x316a59880 | 0x800000 |
|
| 349 |
+
| 254 | blk.28.attn_norm.weight | 0x317259880 | 0x4000 |
|
| 350 |
+
| 255 | blk.28.ffn_down.weight | 0x31725d880 | 0x7000000 |
|
| 351 |
+
| 256 | blk.28.ffn_gate.weight | 0x31e25d880 | 0x7000000 |
|
| 352 |
+
| 257 | blk.28.ffn_up.weight | 0x32525d880 | 0x7000000 |
|
| 353 |
+
| 258 | blk.28.ffn_norm.weight | 0x32c25d880 | 0x4000 |
|
| 354 |
+
| 259 | blk.28.attn_k.weight | 0x32c261880 | 0x800000 |
|
| 355 |
+
| 260 | blk.28.attn_output.weight | 0x32ca61880 | 0x2000000 |
|
| 356 |
+
| 261 | blk.28.attn_q.weight | 0x32ea61880 | 0x2000000 |
|
| 357 |
+
| 262 | blk.28.attn_v.weight | 0x330a61880 | 0x800000 |
|
| 358 |
+
| 263 | blk.29.attn_norm.weight | 0x331261880 | 0x4000 |
|
| 359 |
+
| 264 | blk.29.ffn_down.weight | 0x331265880 | 0x7000000 |
|
| 360 |
+
| 265 | blk.29.ffn_gate.weight | 0x338265880 | 0x7000000 |
|
| 361 |
+
| 266 | blk.29.ffn_up.weight | 0x33f265880 | 0x7000000 |
|
| 362 |
+
| 267 | blk.29.ffn_norm.weight | 0x346265880 | 0x4000 |
|
| 363 |
+
| 268 | blk.29.attn_k.weight | 0x346269880 | 0x800000 |
|
| 364 |
+
| 269 | blk.29.attn_output.weight | 0x346a69880 | 0x2000000 |
|
| 365 |
+
| 270 | blk.29.attn_q.weight | 0x348a69880 | 0x2000000 |
|
| 366 |
+
| 271 | blk.29.attn_v.weight | 0x34aa69880 | 0x800000 |
|
| 367 |
+
| 272 | blk.30.attn_norm.weight | 0x34b269880 | 0x4000 |
|
| 368 |
+
| 273 | blk.30.ffn_down.weight | 0x34b26d880 | 0x7000000 |
|
| 369 |
+
| 274 | blk.30.ffn_gate.weight | 0x35226d880 | 0x7000000 |
|
| 370 |
+
| 275 | blk.30.ffn_up.weight | 0x35926d880 | 0x7000000 |
|
| 371 |
+
| 276 | blk.30.ffn_norm.weight | 0x36026d880 | 0x4000 |
|
| 372 |
+
| 277 | blk.30.attn_k.weight | 0x360271880 | 0x800000 |
|
| 373 |
+
| 278 | blk.30.attn_output.weight | 0x360a71880 | 0x2000000 |
|
| 374 |
+
| 279 | blk.30.attn_q.weight | 0x362a71880 | 0x2000000 |
|
| 375 |
+
| 280 | blk.30.attn_v.weight | 0x364a71880 | 0x800000 |
|
| 376 |
+
| 281 | blk.31.ffn_gate.weight | 0x365271880 | 0x7000000 |
|
| 377 |
+
| 282 | blk.31.ffn_up.weight | 0x36c271880 | 0x7000000 |
|
| 378 |
+
| 283 | blk.31.attn_k.weight | 0x373271880 | 0x800000 |
|
| 379 |
+
| 284 | blk.31.attn_output.weight | 0x373a71880 | 0x2000000 |
|
| 380 |
+
| 285 | blk.31.attn_q.weight | 0x375a71880 | 0x2000000 |
|
| 381 |
+
| 286 | blk.31.attn_v.weight | 0x377a71880 | 0x800000 |
|
| 382 |
+
| 287 | output.weight | 0x378271880 | 0x3ea00000 |
|
| 383 |
+
| 288 | blk.31.attn_norm.weight | 0x3b6c71880 | 0x4000 |
|
| 384 |
+
| 289 | blk.31.ffn_down.weight | 0x3b6c75880 | 0x7000000 |
|
| 385 |
+
| 290 | blk.31.ffn_norm.weight | 0x3bdc75880 | 0x4000 |
|
| 386 |
+
| 291 | output_norm.weight | 0x3bdc79880 | 0x4000 |
|
| 387 |
+
|
| 388 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 389 |
+
|
| 390 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 391 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:-----|
|
| 392 |
+
| 0 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 393 |
+
| 1 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | F16 |
|
| 394 |
+
| 287 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | F16 |
|
| 395 |
+
| 291 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 396 |
+
|
| 397 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 398 |
+
- Percentage of total elements: 13.08%
|
| 399 |
+
|
| 400 |
+
|
| 401 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 402 |
+
|
| 403 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 404 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 405 |
+
| 2 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 406 |
+
| 3 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 407 |
+
| 4 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 408 |
+
| 5 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 409 |
+
| 6 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 410 |
+
| 7 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 411 |
+
| 8 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 412 |
+
| 9 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 413 |
+
| 10 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 414 |
+
|
| 415 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 416 |
+
- Percentage of total elements: 2.72%
|
| 417 |
+
|
| 418 |
+
|
| 419 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 420 |
+
|
| 421 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 422 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 423 |
+
| 11 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 424 |
+
| 12 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 425 |
+
| 13 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 426 |
+
| 14 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 427 |
+
| 15 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 428 |
+
| 16 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 429 |
+
| 17 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 430 |
+
| 18 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 431 |
+
| 19 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 432 |
+
|
| 433 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 434 |
+
- Percentage of total elements: 2.72%
|
| 435 |
+
|
| 436 |
+
|
| 437 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 438 |
+
|
| 439 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 440 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 441 |
+
| 20 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 442 |
+
| 21 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 443 |
+
| 22 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 444 |
+
| 23 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 445 |
+
| 24 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 446 |
+
| 25 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 447 |
+
| 26 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 448 |
+
| 27 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 449 |
+
| 28 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 450 |
+
|
| 451 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 452 |
+
- Percentage of total elements: 2.72%
|
| 453 |
+
|
| 454 |
+
|
| 455 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 456 |
+
|
| 457 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 458 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 459 |
+
| 29 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 460 |
+
| 30 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 461 |
+
| 31 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 462 |
+
| 32 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 463 |
+
| 33 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 464 |
+
| 34 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 465 |
+
| 35 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 466 |
+
| 36 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 467 |
+
| 37 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 468 |
+
|
| 469 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 470 |
+
- Percentage of total elements: 2.72%
|
| 471 |
+
|
| 472 |
+
|
| 473 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 474 |
+
|
| 475 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 476 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 477 |
+
| 38 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 478 |
+
| 39 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 479 |
+
| 40 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 480 |
+
| 41 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 481 |
+
| 42 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 482 |
+
| 43 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 483 |
+
| 44 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 484 |
+
| 45 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 485 |
+
| 46 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 486 |
+
|
| 487 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 488 |
+
- Percentage of total elements: 2.72%
|
| 489 |
+
|
| 490 |
+
|
| 491 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 492 |
+
|
| 493 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 494 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 495 |
+
| 47 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 496 |
+
| 48 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 497 |
+
| 49 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 498 |
+
| 50 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 499 |
+
| 51 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 500 |
+
| 52 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 501 |
+
| 53 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 502 |
+
| 54 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 503 |
+
| 55 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 504 |
+
|
| 505 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 506 |
+
- Percentage of total elements: 2.72%
|
| 507 |
+
|
| 508 |
+
|
| 509 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 510 |
+
|
| 511 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 512 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 513 |
+
| 56 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 514 |
+
| 57 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 515 |
+
| 58 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 516 |
+
| 59 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 517 |
+
| 60 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 518 |
+
| 61 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 519 |
+
| 62 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 520 |
+
| 63 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 521 |
+
| 64 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 522 |
+
|
| 523 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 524 |
+
- Percentage of total elements: 2.72%
|
| 525 |
+
|
| 526 |
+
|
| 527 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 528 |
+
|
| 529 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 530 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 531 |
+
| 65 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 532 |
+
| 66 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 533 |
+
| 67 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 534 |
+
| 68 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 535 |
+
| 69 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 536 |
+
| 70 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 537 |
+
| 71 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 538 |
+
| 72 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 539 |
+
| 73 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 540 |
+
|
| 541 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 542 |
+
- Percentage of total elements: 2.72%
|
| 543 |
+
|
| 544 |
+
|
| 545 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 546 |
+
|
| 547 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 548 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 549 |
+
| 74 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 550 |
+
| 75 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 551 |
+
| 76 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 552 |
+
| 77 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 553 |
+
| 78 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 554 |
+
| 79 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 555 |
+
| 80 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 556 |
+
| 81 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 557 |
+
| 82 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 558 |
+
|
| 559 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 560 |
+
- Percentage of total elements: 2.72%
|
| 561 |
+
|
| 562 |
+
|
| 563 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 564 |
+
|
| 565 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 566 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 567 |
+
| 83 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 568 |
+
| 84 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 569 |
+
| 85 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 570 |
+
| 86 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 571 |
+
| 87 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 572 |
+
| 88 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 573 |
+
| 89 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 574 |
+
| 90 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 575 |
+
| 91 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 576 |
+
|
| 577 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 578 |
+
- Percentage of total elements: 2.72%
|
| 579 |
+
|
| 580 |
+
|
| 581 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 582 |
+
|
| 583 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 584 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 585 |
+
| 92 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 586 |
+
| 93 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 587 |
+
| 94 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 588 |
+
| 95 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 589 |
+
| 96 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 590 |
+
| 97 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 591 |
+
| 98 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 592 |
+
| 99 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 593 |
+
| 100 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 594 |
+
|
| 595 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 596 |
+
- Percentage of total elements: 2.72%
|
| 597 |
+
|
| 598 |
+
|
| 599 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 600 |
+
|
| 601 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 602 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 603 |
+
| 101 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 604 |
+
| 102 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 605 |
+
| 103 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 606 |
+
| 104 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 607 |
+
| 105 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 608 |
+
| 106 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 609 |
+
| 107 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 610 |
+
| 108 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 611 |
+
| 109 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 612 |
+
|
| 613 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 614 |
+
- Percentage of total elements: 2.72%
|
| 615 |
+
|
| 616 |
+
|
| 617 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 618 |
+
|
| 619 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 620 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 621 |
+
| 110 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 622 |
+
| 111 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 623 |
+
| 112 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 624 |
+
| 113 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 625 |
+
| 114 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 626 |
+
| 115 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 627 |
+
| 116 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 628 |
+
| 117 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 629 |
+
| 118 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 630 |
+
|
| 631 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 632 |
+
- Percentage of total elements: 2.72%
|
| 633 |
+
|
| 634 |
+
|
| 635 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 636 |
+
|
| 637 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 638 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 639 |
+
| 119 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 640 |
+
| 120 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 641 |
+
| 121 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 642 |
+
| 122 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 643 |
+
| 123 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 644 |
+
| 124 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 645 |
+
| 125 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 646 |
+
| 126 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 647 |
+
| 127 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 648 |
+
|
| 649 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 650 |
+
- Percentage of total elements: 2.72%
|
| 651 |
+
|
| 652 |
+
|
| 653 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 654 |
+
|
| 655 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 656 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 657 |
+
| 128 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 658 |
+
| 129 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 659 |
+
| 130 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 660 |
+
| 131 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 661 |
+
| 132 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 662 |
+
| 133 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 663 |
+
| 134 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 664 |
+
| 135 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 665 |
+
| 136 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 666 |
+
|
| 667 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 668 |
+
- Percentage of total elements: 2.72%
|
| 669 |
+
|
| 670 |
+
|
| 671 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 672 |
+
|
| 673 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 674 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 675 |
+
| 137 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 676 |
+
| 138 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 677 |
+
| 139 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 678 |
+
| 140 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 679 |
+
| 141 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 680 |
+
| 142 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 681 |
+
| 143 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 682 |
+
| 144 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 683 |
+
| 145 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 684 |
+
|
| 685 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 686 |
+
- Percentage of total elements: 2.72%
|
| 687 |
+
|
| 688 |
+
|
| 689 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 690 |
+
|
| 691 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 692 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 693 |
+
| 146 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 694 |
+
| 147 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 695 |
+
| 148 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 696 |
+
| 149 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 697 |
+
| 150 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 698 |
+
| 151 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 699 |
+
| 152 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 700 |
+
| 153 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 701 |
+
| 154 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 702 |
+
|
| 703 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 704 |
+
- Percentage of total elements: 2.72%
|
| 705 |
+
|
| 706 |
+
|
| 707 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 708 |
+
|
| 709 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 710 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 711 |
+
| 155 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 712 |
+
| 156 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 713 |
+
| 157 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 714 |
+
| 158 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 715 |
+
| 159 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 716 |
+
| 160 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 717 |
+
| 161 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 718 |
+
| 162 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 719 |
+
| 163 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 720 |
+
|
| 721 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 722 |
+
- Percentage of total elements: 2.72%
|
| 723 |
+
|
| 724 |
+
|
| 725 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 726 |
+
|
| 727 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 728 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 729 |
+
| 164 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 730 |
+
| 165 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 731 |
+
| 166 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 732 |
+
| 167 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 733 |
+
| 168 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 734 |
+
| 169 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 735 |
+
| 170 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 736 |
+
| 171 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 737 |
+
| 172 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 738 |
+
|
| 739 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 740 |
+
- Percentage of total elements: 2.72%
|
| 741 |
+
|
| 742 |
+
|
| 743 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 744 |
+
|
| 745 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 746 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 747 |
+
| 173 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 748 |
+
| 174 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 749 |
+
| 175 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 750 |
+
| 176 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 751 |
+
| 177 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 752 |
+
| 187 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 188 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 754 |
+
| 189 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 755 |
+
| 190 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 756 |
+
|
| 757 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 758 |
+
- Percentage of total elements: 2.72%
|
| 759 |
+
|
| 760 |
+
|
| 761 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 762 |
+
|
| 763 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 764 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 765 |
+
| 178 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 766 |
+
| 179 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 767 |
+
| 180 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 768 |
+
| 181 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 769 |
+
| 182 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 770 |
+
| 183 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 771 |
+
| 184 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 772 |
+
| 185 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 773 |
+
| 186 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 774 |
+
|
| 775 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 776 |
+
- Percentage of total elements: 2.72%
|
| 777 |
+
|
| 778 |
+
|
| 779 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 780 |
+
|
| 781 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 782 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 783 |
+
| 191 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 784 |
+
| 192 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 785 |
+
| 193 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 786 |
+
| 194 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 787 |
+
| 195 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 788 |
+
| 196 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 789 |
+
| 197 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 790 |
+
| 198 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 791 |
+
| 199 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 792 |
+
|
| 793 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 794 |
+
- Percentage of total elements: 2.72%
|
| 795 |
+
|
| 796 |
+
|
| 797 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 798 |
+
|
| 799 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 800 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 801 |
+
| 200 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 802 |
+
| 201 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 803 |
+
| 202 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 804 |
+
| 203 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 805 |
+
| 204 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 806 |
+
| 205 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 807 |
+
| 206 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 808 |
+
| 207 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 809 |
+
| 208 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 810 |
+
|
| 811 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 812 |
+
- Percentage of total elements: 2.72%
|
| 813 |
+
|
| 814 |
+
|
| 815 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 816 |
+
|
| 817 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 818 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 819 |
+
| 209 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 820 |
+
| 210 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 821 |
+
| 211 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 822 |
+
| 212 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 823 |
+
| 213 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 824 |
+
| 214 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 825 |
+
| 215 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 826 |
+
| 216 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 827 |
+
| 217 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 828 |
+
|
| 829 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 830 |
+
- Percentage of total elements: 2.72%
|
| 831 |
+
|
| 832 |
+
|
| 833 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 834 |
+
|
| 835 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 836 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 837 |
+
| 218 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 838 |
+
| 219 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 839 |
+
| 220 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 840 |
+
| 221 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 841 |
+
| 222 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 842 |
+
| 223 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 843 |
+
| 224 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 844 |
+
| 225 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 845 |
+
| 226 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 846 |
+
|
| 847 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 848 |
+
- Percentage of total elements: 2.72%
|
| 849 |
+
|
| 850 |
+
|
| 851 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 852 |
+
|
| 853 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 854 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 855 |
+
| 227 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 856 |
+
| 228 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 857 |
+
| 229 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 858 |
+
| 230 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 859 |
+
| 231 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 860 |
+
| 232 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 861 |
+
| 233 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 862 |
+
| 234 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 863 |
+
| 235 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 864 |
+
|
| 865 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 866 |
+
- Percentage of total elements: 2.72%
|
| 867 |
+
|
| 868 |
+
|
| 869 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 870 |
+
|
| 871 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 872 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 873 |
+
| 236 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 874 |
+
| 237 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 875 |
+
| 238 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 876 |
+
| 239 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 877 |
+
| 240 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 878 |
+
| 241 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 879 |
+
| 242 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 880 |
+
| 243 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 881 |
+
| 244 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 882 |
+
|
| 883 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 884 |
+
- Percentage of total elements: 2.72%
|
| 885 |
+
|
| 886 |
+
|
| 887 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 888 |
+
|
| 889 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 890 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 891 |
+
| 245 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 892 |
+
| 246 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 893 |
+
| 247 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 894 |
+
| 248 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 895 |
+
| 249 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 896 |
+
| 250 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 897 |
+
| 251 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 898 |
+
| 252 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 899 |
+
| 253 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 900 |
+
|
| 901 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 902 |
+
- Percentage of total elements: 2.72%
|
| 903 |
+
|
| 904 |
+
|
| 905 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 906 |
+
|
| 907 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 908 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 909 |
+
| 254 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 910 |
+
| 255 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 911 |
+
| 256 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 912 |
+
| 257 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 913 |
+
| 258 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 914 |
+
| 259 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 915 |
+
| 260 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 916 |
+
| 261 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 917 |
+
| 262 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 918 |
+
|
| 919 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 920 |
+
- Percentage of total elements: 2.72%
|
| 921 |
+
|
| 922 |
+
|
| 923 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 924 |
+
|
| 925 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 926 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 927 |
+
| 263 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 928 |
+
| 264 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 929 |
+
| 265 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 930 |
+
| 266 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 931 |
+
| 267 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 932 |
+
| 268 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 933 |
+
| 269 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 934 |
+
| 270 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 935 |
+
| 271 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 936 |
+
|
| 937 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 938 |
+
- Percentage of total elements: 2.72%
|
| 939 |
+
|
| 940 |
+
|
| 941 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 942 |
+
|
| 943 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 944 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 945 |
+
| 272 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 946 |
+
| 273 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 947 |
+
| 274 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 948 |
+
| 275 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 949 |
+
| 276 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 950 |
+
| 277 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 951 |
+
| 278 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 952 |
+
| 279 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 953 |
+
| 280 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 954 |
+
|
| 955 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 956 |
+
- Percentage of total elements: 2.72%
|
| 957 |
+
|
| 958 |
+
|
| 959 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 960 |
+
|
| 961 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 962 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 963 |
+
| 281 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 964 |
+
| 282 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | F16 |
|
| 965 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 966 |
+
| 284 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 967 |
+
| 285 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | F16 |
|
| 968 |
+
| 286 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 969 |
+
| 288 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 970 |
+
| 289 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | F16 |
|
| 971 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 972 |
+
|
| 973 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 974 |
+
- Percentage of total elements: 2.72%
|
scores/Llama-Guard-3-8B-IQ3_M.md
ADDED
|
@@ -0,0 +1,978 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-IQ3_M.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 39 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 36 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 36 |
+
| 26 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 38 |
+
| 28 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 39 |
+
| 29 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 40 |
+
| 30 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 41 |
+
| 31 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 43 |
+
| 33 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 44 |
+
| 34 | UINT32 | 1 | general.quantization_version | 2 |
|
| 45 |
+
| 35 | UINT32 | 1 | general.file_type | 27 |
|
| 46 |
+
| 36 | STRING | 1 | quantize.imatrix.file | `./imatrix/imatrix-Llama-Guard-3-8B-small.dat` |
|
| 47 |
+
| 37 | STRING | 1 | quantize.imatrix.dataset | `../../datasets/imatrix/calibration_eur_small.txt` |
|
| 48 |
+
| 38 | INT32 | 1 | quantize.imatrix.entries_count | 224 |
|
| 49 |
+
| 39 | INT32 | 1 | quantize.imatrix.chunks_count | 962 |
|
| 50 |
+
|
| 51 |
+
## Tensors Overview ~8B Elements
|
| 52 |
+
|
| 53 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 54 |
+
|
| 55 |
+
- [Llama-Guard-3-8B-IQ3\_M.gguf - GGUF Internal File Dump](#llama-guard-3-8b-iq3_mgguf---gguf-internal-file-dump)
|
| 56 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 57 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 58 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 59 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 60 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 61 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 62 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 63 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 64 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 65 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 66 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 67 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 68 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 69 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 70 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 71 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 72 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 73 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 74 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 75 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 76 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 77 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 78 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 79 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 80 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 81 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 82 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 83 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 84 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 85 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 86 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 87 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 88 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 89 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 90 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 91 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 92 |
+
|
| 93 |
+
### Tensor Data Offset
|
| 94 |
+
|
| 95 |
+
This table contains the offset and data segment relative to start of file
|
| 96 |
+
|
| 97 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 98 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 99 |
+
| 0 | output.weight | 0x779880 | 0xd746000 |
|
| 100 |
+
| 1 | output_norm.weight | 0xdebf880 | 0x4000 |
|
| 101 |
+
| 2 | rope_freqs.weight | 0xdec3880 | 0x100 |
|
| 102 |
+
| 3 | token_embd.weight | 0xdec3980 | 0xd746000 |
|
| 103 |
+
| 4 | blk.0.attn_k.weight | 0x1b609980 | 0x188000 |
|
| 104 |
+
| 5 | blk.0.attn_norm.weight | 0x1b791980 | 0x4000 |
|
| 105 |
+
| 6 | blk.0.attn_output.weight | 0x1b795980 | 0x900000 |
|
| 106 |
+
| 7 | blk.0.attn_q.weight | 0x1c095980 | 0x620000 |
|
| 107 |
+
| 8 | blk.0.attn_v.weight | 0x1c6b5980 | 0x1b8000 |
|
| 108 |
+
| 9 | blk.0.ffn_down.weight | 0x1c86d980 | 0x1f80000 |
|
| 109 |
+
| 10 | blk.0.ffn_gate.weight | 0x1e7ed980 | 0x1570000 |
|
| 110 |
+
| 11 | blk.0.ffn_norm.weight | 0x1fd5d980 | 0x4000 |
|
| 111 |
+
| 12 | blk.0.ffn_up.weight | 0x1fd61980 | 0x1570000 |
|
| 112 |
+
| 13 | blk.1.attn_k.weight | 0x212d1980 | 0x188000 |
|
| 113 |
+
| 14 | blk.1.attn_norm.weight | 0x21459980 | 0x4000 |
|
| 114 |
+
| 15 | blk.1.attn_output.weight | 0x2145d980 | 0x900000 |
|
| 115 |
+
| 16 | blk.1.attn_q.weight | 0x21d5d980 | 0x620000 |
|
| 116 |
+
| 17 | blk.1.attn_v.weight | 0x2237d980 | 0x1b8000 |
|
| 117 |
+
| 18 | blk.1.ffn_down.weight | 0x22535980 | 0x1f80000 |
|
| 118 |
+
| 19 | blk.1.ffn_gate.weight | 0x244b5980 | 0x1570000 |
|
| 119 |
+
| 20 | blk.1.ffn_norm.weight | 0x25a25980 | 0x4000 |
|
| 120 |
+
| 21 | blk.1.ffn_up.weight | 0x25a29980 | 0x1570000 |
|
| 121 |
+
| 22 | blk.2.attn_k.weight | 0x26f99980 | 0x188000 |
|
| 122 |
+
| 23 | blk.2.attn_norm.weight | 0x27121980 | 0x4000 |
|
| 123 |
+
| 24 | blk.2.attn_output.weight | 0x27125980 | 0x900000 |
|
| 124 |
+
| 25 | blk.2.attn_q.weight | 0x27a25980 | 0x620000 |
|
| 125 |
+
| 26 | blk.2.attn_v.weight | 0x28045980 | 0x1b8000 |
|
| 126 |
+
| 27 | blk.2.ffn_down.weight | 0x281fd980 | 0x1f80000 |
|
| 127 |
+
| 28 | blk.2.ffn_gate.weight | 0x2a17d980 | 0x1570000 |
|
| 128 |
+
| 29 | blk.2.ffn_norm.weight | 0x2b6ed980 | 0x4000 |
|
| 129 |
+
| 30 | blk.2.ffn_up.weight | 0x2b6f1980 | 0x1570000 |
|
| 130 |
+
| 31 | blk.3.attn_k.weight | 0x2cc61980 | 0x188000 |
|
| 131 |
+
| 32 | blk.3.attn_norm.weight | 0x2cde9980 | 0x4000 |
|
| 132 |
+
| 33 | blk.3.attn_output.weight | 0x2cded980 | 0x900000 |
|
| 133 |
+
| 34 | blk.3.attn_q.weight | 0x2d6ed980 | 0x620000 |
|
| 134 |
+
| 35 | blk.3.attn_v.weight | 0x2dd0d980 | 0x1b8000 |
|
| 135 |
+
| 36 | blk.3.ffn_down.weight | 0x2dec5980 | 0x1f80000 |
|
| 136 |
+
| 37 | blk.3.ffn_gate.weight | 0x2fe45980 | 0x1570000 |
|
| 137 |
+
| 38 | blk.3.ffn_norm.weight | 0x313b5980 | 0x4000 |
|
| 138 |
+
| 39 | blk.3.ffn_up.weight | 0x313b9980 | 0x1570000 |
|
| 139 |
+
| 40 | blk.4.attn_k.weight | 0x32929980 | 0x188000 |
|
| 140 |
+
| 41 | blk.4.attn_norm.weight | 0x32ab1980 | 0x4000 |
|
| 141 |
+
| 42 | blk.4.attn_output.weight | 0x32ab5980 | 0x900000 |
|
| 142 |
+
| 43 | blk.4.attn_q.weight | 0x333b5980 | 0x620000 |
|
| 143 |
+
| 44 | blk.4.attn_v.weight | 0x339d5980 | 0x1b8000 |
|
| 144 |
+
| 45 | blk.4.ffn_down.weight | 0x33b8d980 | 0x1f80000 |
|
| 145 |
+
| 46 | blk.4.ffn_gate.weight | 0x35b0d980 | 0x1570000 |
|
| 146 |
+
| 47 | blk.4.ffn_norm.weight | 0x3707d980 | 0x4000 |
|
| 147 |
+
| 48 | blk.4.ffn_up.weight | 0x37081980 | 0x1570000 |
|
| 148 |
+
| 49 | blk.5.attn_k.weight | 0x385f1980 | 0x188000 |
|
| 149 |
+
| 50 | blk.5.attn_norm.weight | 0x38779980 | 0x4000 |
|
| 150 |
+
| 51 | blk.5.attn_output.weight | 0x3877d980 | 0x900000 |
|
| 151 |
+
| 52 | blk.5.attn_q.weight | 0x3907d980 | 0x620000 |
|
| 152 |
+
| 53 | blk.5.attn_v.weight | 0x3969d980 | 0x1b8000 |
|
| 153 |
+
| 54 | blk.5.ffn_down.weight | 0x39855980 | 0x1f80000 |
|
| 154 |
+
| 55 | blk.5.ffn_gate.weight | 0x3b7d5980 | 0x1570000 |
|
| 155 |
+
| 56 | blk.5.ffn_norm.weight | 0x3cd45980 | 0x4000 |
|
| 156 |
+
| 57 | blk.5.ffn_up.weight | 0x3cd49980 | 0x1570000 |
|
| 157 |
+
| 58 | blk.6.attn_k.weight | 0x3e2b9980 | 0x188000 |
|
| 158 |
+
| 59 | blk.6.attn_norm.weight | 0x3e441980 | 0x4000 |
|
| 159 |
+
| 60 | blk.6.attn_output.weight | 0x3e445980 | 0x900000 |
|
| 160 |
+
| 61 | blk.6.attn_q.weight | 0x3ed45980 | 0x620000 |
|
| 161 |
+
| 62 | blk.6.attn_v.weight | 0x3f365980 | 0x1b8000 |
|
| 162 |
+
| 63 | blk.6.ffn_down.weight | 0x3f51d980 | 0x1f80000 |
|
| 163 |
+
| 64 | blk.6.ffn_gate.weight | 0x4149d980 | 0x1570000 |
|
| 164 |
+
| 65 | blk.6.ffn_norm.weight | 0x42a0d980 | 0x4000 |
|
| 165 |
+
| 66 | blk.6.ffn_up.weight | 0x42a11980 | 0x1570000 |
|
| 166 |
+
| 67 | blk.7.attn_k.weight | 0x43f81980 | 0x188000 |
|
| 167 |
+
| 68 | blk.7.attn_norm.weight | 0x44109980 | 0x4000 |
|
| 168 |
+
| 69 | blk.7.attn_output.weight | 0x4410d980 | 0x900000 |
|
| 169 |
+
| 70 | blk.7.attn_q.weight | 0x44a0d980 | 0x620000 |
|
| 170 |
+
| 71 | blk.7.attn_v.weight | 0x4502d980 | 0x1b8000 |
|
| 171 |
+
| 72 | blk.7.ffn_down.weight | 0x451e5980 | 0x1f80000 |
|
| 172 |
+
| 73 | blk.7.ffn_gate.weight | 0x47165980 | 0x1570000 |
|
| 173 |
+
| 74 | blk.7.ffn_norm.weight | 0x486d5980 | 0x4000 |
|
| 174 |
+
| 75 | blk.7.ffn_up.weight | 0x486d9980 | 0x1570000 |
|
| 175 |
+
| 76 | blk.8.attn_k.weight | 0x49c49980 | 0x188000 |
|
| 176 |
+
| 77 | blk.8.attn_norm.weight | 0x49dd1980 | 0x4000 |
|
| 177 |
+
| 78 | blk.8.attn_output.weight | 0x49dd5980 | 0x900000 |
|
| 178 |
+
| 79 | blk.8.attn_q.weight | 0x4a6d5980 | 0x620000 |
|
| 179 |
+
| 80 | blk.8.attn_v.weight | 0x4acf5980 | 0x1b8000 |
|
| 180 |
+
| 81 | blk.8.ffn_down.weight | 0x4aead980 | 0x1f80000 |
|
| 181 |
+
| 82 | blk.8.ffn_gate.weight | 0x4ce2d980 | 0x1570000 |
|
| 182 |
+
| 83 | blk.8.ffn_norm.weight | 0x4e39d980 | 0x4000 |
|
| 183 |
+
| 84 | blk.8.ffn_up.weight | 0x4e3a1980 | 0x1570000 |
|
| 184 |
+
| 85 | blk.9.attn_k.weight | 0x4f911980 | 0x188000 |
|
| 185 |
+
| 86 | blk.9.attn_norm.weight | 0x4fa99980 | 0x4000 |
|
| 186 |
+
| 87 | blk.9.attn_output.weight | 0x4fa9d980 | 0x900000 |
|
| 187 |
+
| 88 | blk.9.attn_q.weight | 0x5039d980 | 0x620000 |
|
| 188 |
+
| 89 | blk.9.attn_v.weight | 0x509bd980 | 0x1b8000 |
|
| 189 |
+
| 90 | blk.9.ffn_down.weight | 0x50b75980 | 0x1f80000 |
|
| 190 |
+
| 91 | blk.9.ffn_gate.weight | 0x52af5980 | 0x1570000 |
|
| 191 |
+
| 92 | blk.9.ffn_norm.weight | 0x54065980 | 0x4000 |
|
| 192 |
+
| 93 | blk.9.ffn_up.weight | 0x54069980 | 0x1570000 |
|
| 193 |
+
| 94 | blk.10.attn_k.weight | 0x555d9980 | 0x188000 |
|
| 194 |
+
| 95 | blk.10.attn_norm.weight | 0x55761980 | 0x4000 |
|
| 195 |
+
| 96 | blk.10.attn_output.weight | 0x55765980 | 0x900000 |
|
| 196 |
+
| 97 | blk.10.attn_q.weight | 0x56065980 | 0x620000 |
|
| 197 |
+
| 98 | blk.10.attn_v.weight | 0x56685980 | 0x1b8000 |
|
| 198 |
+
| 99 | blk.10.ffn_down.weight | 0x5683d980 | 0x1f80000 |
|
| 199 |
+
| 100 | blk.10.ffn_gate.weight | 0x587bd980 | 0x1570000 |
|
| 200 |
+
| 101 | blk.10.ffn_norm.weight | 0x59d2d980 | 0x4000 |
|
| 201 |
+
| 102 | blk.10.ffn_up.weight | 0x59d31980 | 0x1570000 |
|
| 202 |
+
| 103 | blk.11.attn_k.weight | 0x5b2a1980 | 0x188000 |
|
| 203 |
+
| 104 | blk.11.attn_norm.weight | 0x5b429980 | 0x4000 |
|
| 204 |
+
| 105 | blk.11.attn_output.weight | 0x5b42d980 | 0x900000 |
|
| 205 |
+
| 106 | blk.11.attn_q.weight | 0x5bd2d980 | 0x620000 |
|
| 206 |
+
| 107 | blk.11.attn_v.weight | 0x5c34d980 | 0x1b8000 |
|
| 207 |
+
| 108 | blk.11.ffn_down.weight | 0x5c505980 | 0x1f80000 |
|
| 208 |
+
| 109 | blk.11.ffn_gate.weight | 0x5e485980 | 0x1570000 |
|
| 209 |
+
| 110 | blk.11.ffn_norm.weight | 0x5f9f5980 | 0x4000 |
|
| 210 |
+
| 111 | blk.11.ffn_up.weight | 0x5f9f9980 | 0x1570000 |
|
| 211 |
+
| 112 | blk.12.attn_k.weight | 0x60f69980 | 0x188000 |
|
| 212 |
+
| 113 | blk.12.attn_norm.weight | 0x610f1980 | 0x4000 |
|
| 213 |
+
| 114 | blk.12.attn_output.weight | 0x610f5980 | 0x900000 |
|
| 214 |
+
| 115 | blk.12.attn_q.weight | 0x619f5980 | 0x620000 |
|
| 215 |
+
| 116 | blk.12.attn_v.weight | 0x62015980 | 0x1b8000 |
|
| 216 |
+
| 117 | blk.12.ffn_down.weight | 0x621cd980 | 0x1f80000 |
|
| 217 |
+
| 118 | blk.12.ffn_gate.weight | 0x6414d980 | 0x1570000 |
|
| 218 |
+
| 119 | blk.12.ffn_norm.weight | 0x656bd980 | 0x4000 |
|
| 219 |
+
| 120 | blk.12.ffn_up.weight | 0x656c1980 | 0x1570000 |
|
| 220 |
+
| 121 | blk.13.attn_k.weight | 0x66c31980 | 0x1b8000 |
|
| 221 |
+
| 122 | blk.13.attn_norm.weight | 0x66de9980 | 0x4000 |
|
| 222 |
+
| 123 | blk.13.attn_output.weight | 0x66ded980 | 0x900000 |
|
| 223 |
+
| 124 | blk.13.attn_q.weight | 0x676ed980 | 0x6e0000 |
|
| 224 |
+
| 125 | blk.13.attn_v.weight | 0x67dcd980 | 0x240000 |
|
| 225 |
+
| 126 | blk.13.ffn_down.weight | 0x6800d980 | 0x1f80000 |
|
| 226 |
+
| 127 | blk.13.ffn_gate.weight | 0x69f8d980 | 0x1570000 |
|
| 227 |
+
| 128 | blk.13.ffn_norm.weight | 0x6b4fd980 | 0x4000 |
|
| 228 |
+
| 129 | blk.13.ffn_up.weight | 0x6b501980 | 0x1570000 |
|
| 229 |
+
| 130 | blk.14.attn_k.weight | 0x6ca71980 | 0x1b8000 |
|
| 230 |
+
| 131 | blk.14.attn_norm.weight | 0x6cc29980 | 0x4000 |
|
| 231 |
+
| 132 | blk.14.attn_output.weight | 0x6cc2d980 | 0x900000 |
|
| 232 |
+
| 133 | blk.14.attn_q.weight | 0x6d52d980 | 0x6e0000 |
|
| 233 |
+
| 134 | blk.14.attn_v.weight | 0x6dc0d980 | 0x240000 |
|
| 234 |
+
| 135 | blk.14.ffn_down.weight | 0x6de4d980 | 0x1f80000 |
|
| 235 |
+
| 136 | blk.14.ffn_gate.weight | 0x6fdcd980 | 0x1570000 |
|
| 236 |
+
| 137 | blk.14.ffn_norm.weight | 0x7133d980 | 0x4000 |
|
| 237 |
+
| 138 | blk.14.ffn_up.weight | 0x71341980 | 0x1570000 |
|
| 238 |
+
| 139 | blk.15.attn_k.weight | 0x728b1980 | 0x188000 |
|
| 239 |
+
| 140 | blk.15.attn_norm.weight | 0x72a39980 | 0x4000 |
|
| 240 |
+
| 141 | blk.15.attn_output.weight | 0x72a3d980 | 0x900000 |
|
| 241 |
+
| 142 | blk.15.attn_q.weight | 0x7333d980 | 0x620000 |
|
| 242 |
+
| 143 | blk.15.attn_v.weight | 0x7395d980 | 0x1b8000 |
|
| 243 |
+
| 144 | blk.15.ffn_down.weight | 0x73b15980 | 0x1f80000 |
|
| 244 |
+
| 145 | blk.15.ffn_gate.weight | 0x75a95980 | 0x1570000 |
|
| 245 |
+
| 146 | blk.15.ffn_norm.weight | 0x77005980 | 0x4000 |
|
| 246 |
+
| 147 | blk.15.ffn_up.weight | 0x77009980 | 0x1570000 |
|
| 247 |
+
| 148 | blk.16.attn_k.weight | 0x78579980 | 0x1b8000 |
|
| 248 |
+
| 149 | blk.16.attn_norm.weight | 0x78731980 | 0x4000 |
|
| 249 |
+
| 150 | blk.16.attn_output.weight | 0x78735980 | 0x900000 |
|
| 250 |
+
| 151 | blk.16.attn_q.weight | 0x79035980 | 0x6e0000 |
|
| 251 |
+
| 152 | blk.16.attn_v.weight | 0x79715980 | 0x240000 |
|
| 252 |
+
| 153 | blk.16.ffn_down.weight | 0x79955980 | 0x1f80000 |
|
| 253 |
+
| 154 | blk.16.ffn_gate.weight | 0x7b8d5980 | 0x1810000 |
|
| 254 |
+
| 155 | blk.16.ffn_norm.weight | 0x7d0e5980 | 0x4000 |
|
| 255 |
+
| 156 | blk.16.ffn_up.weight | 0x7d0e9980 | 0x1810000 |
|
| 256 |
+
| 157 | blk.17.attn_k.weight | 0x7e8f9980 | 0x188000 |
|
| 257 |
+
| 158 | blk.17.attn_norm.weight | 0x7ea81980 | 0x4000 |
|
| 258 |
+
| 159 | blk.17.attn_output.weight | 0x7ea85980 | 0x900000 |
|
| 259 |
+
| 160 | blk.17.attn_q.weight | 0x7f385980 | 0x620000 |
|
| 260 |
+
| 161 | blk.17.attn_v.weight | 0x7f9a5980 | 0x1b8000 |
|
| 261 |
+
| 162 | blk.17.ffn_down.weight | 0x7fb5d980 | 0x1f80000 |
|
| 262 |
+
| 163 | blk.17.ffn_gate.weight | 0x81add980 | 0x1810000 |
|
| 263 |
+
| 164 | blk.17.ffn_norm.weight | 0x832ed980 | 0x4000 |
|
| 264 |
+
| 165 | blk.17.ffn_up.weight | 0x832f1980 | 0x1810000 |
|
| 265 |
+
| 166 | blk.18.attn_k.weight | 0x84b01980 | 0x1b8000 |
|
| 266 |
+
| 167 | blk.18.attn_norm.weight | 0x84cb9980 | 0x4000 |
|
| 267 |
+
| 168 | blk.18.attn_output.weight | 0x84cbd980 | 0x900000 |
|
| 268 |
+
| 169 | blk.18.attn_q.weight | 0x855bd980 | 0x6e0000 |
|
| 269 |
+
| 170 | blk.18.attn_v.weight | 0x85c9d980 | 0x240000 |
|
| 270 |
+
| 171 | blk.18.ffn_down.weight | 0x85edd980 | 0x1f80000 |
|
| 271 |
+
| 172 | blk.18.ffn_gate.weight | 0x87e5d980 | 0x1810000 |
|
| 272 |
+
| 173 | blk.18.ffn_norm.weight | 0x8966d980 | 0x4000 |
|
| 273 |
+
| 174 | blk.18.ffn_up.weight | 0x89671980 | 0x1810000 |
|
| 274 |
+
| 175 | blk.19.attn_k.weight | 0x8ae81980 | 0x1b8000 |
|
| 275 |
+
| 176 | blk.19.attn_norm.weight | 0x8b039980 | 0x4000 |
|
| 276 |
+
| 177 | blk.19.attn_output.weight | 0x8b03d980 | 0x900000 |
|
| 277 |
+
| 178 | blk.19.attn_q.weight | 0x8b93d980 | 0x6e0000 |
|
| 278 |
+
| 179 | blk.19.attn_v.weight | 0x8c01d980 | 0x240000 |
|
| 279 |
+
| 180 | blk.19.ffn_down.weight | 0x8c25d980 | 0x1f80000 |
|
| 280 |
+
| 181 | blk.19.ffn_gate.weight | 0x8e1dd980 | 0x1810000 |
|
| 281 |
+
| 182 | blk.19.ffn_norm.weight | 0x8f9ed980 | 0x4000 |
|
| 282 |
+
| 183 | blk.19.ffn_up.weight | 0x8f9f1980 | 0x1810000 |
|
| 283 |
+
| 184 | blk.20.attn_k.weight | 0x91201980 | 0x1b8000 |
|
| 284 |
+
| 185 | blk.20.attn_norm.weight | 0x913b9980 | 0x4000 |
|
| 285 |
+
| 186 | blk.20.attn_output.weight | 0x913bd980 | 0x900000 |
|
| 286 |
+
| 187 | blk.20.attn_q.weight | 0x91cbd980 | 0x6e0000 |
|
| 287 |
+
| 188 | blk.20.attn_v.weight | 0x9239d980 | 0x240000 |
|
| 288 |
+
| 189 | blk.20.ffn_down.weight | 0x925dd980 | 0x1f80000 |
|
| 289 |
+
| 190 | blk.20.ffn_gate.weight | 0x9455d980 | 0x1810000 |
|
| 290 |
+
| 191 | blk.20.ffn_norm.weight | 0x95d6d980 | 0x4000 |
|
| 291 |
+
| 192 | blk.20.ffn_up.weight | 0x95d71980 | 0x1810000 |
|
| 292 |
+
| 193 | blk.21.attn_k.weight | 0x97581980 | 0x1b8000 |
|
| 293 |
+
| 194 | blk.21.attn_norm.weight | 0x97739980 | 0x4000 |
|
| 294 |
+
| 195 | blk.21.attn_output.weight | 0x9773d980 | 0x900000 |
|
| 295 |
+
| 196 | blk.21.attn_q.weight | 0x9803d980 | 0x6e0000 |
|
| 296 |
+
| 197 | blk.21.attn_v.weight | 0x9871d980 | 0x240000 |
|
| 297 |
+
| 198 | blk.21.ffn_down.weight | 0x9895d980 | 0x1f80000 |
|
| 298 |
+
| 199 | blk.21.ffn_gate.weight | 0x9a8dd980 | 0x1810000 |
|
| 299 |
+
| 200 | blk.21.ffn_norm.weight | 0x9c0ed980 | 0x4000 |
|
| 300 |
+
| 201 | blk.21.ffn_up.weight | 0x9c0f1980 | 0x1810000 |
|
| 301 |
+
| 202 | blk.22.attn_k.weight | 0x9d901980 | 0x1b8000 |
|
| 302 |
+
| 203 | blk.22.attn_norm.weight | 0x9dab9980 | 0x4000 |
|
| 303 |
+
| 204 | blk.22.attn_output.weight | 0x9dabd980 | 0x900000 |
|
| 304 |
+
| 205 | blk.22.attn_q.weight | 0x9e3bd980 | 0x6e0000 |
|
| 305 |
+
| 206 | blk.22.attn_v.weight | 0x9ea9d980 | 0x240000 |
|
| 306 |
+
| 207 | blk.22.ffn_down.weight | 0x9ecdd980 | 0x1f80000 |
|
| 307 |
+
| 208 | blk.22.ffn_gate.weight | 0xa0c5d980 | 0x1810000 |
|
| 308 |
+
| 209 | blk.22.ffn_norm.weight | 0xa246d980 | 0x4000 |
|
| 309 |
+
| 210 | blk.22.ffn_up.weight | 0xa2471980 | 0x1810000 |
|
| 310 |
+
| 211 | blk.23.attn_k.weight | 0xa3c81980 | 0x1b8000 |
|
| 311 |
+
| 212 | blk.23.attn_norm.weight | 0xa3e39980 | 0x4000 |
|
| 312 |
+
| 213 | blk.23.attn_output.weight | 0xa3e3d980 | 0x900000 |
|
| 313 |
+
| 214 | blk.23.attn_q.weight | 0xa473d980 | 0x6e0000 |
|
| 314 |
+
| 215 | blk.23.attn_v.weight | 0xa4e1d980 | 0x240000 |
|
| 315 |
+
| 216 | blk.23.ffn_down.weight | 0xa505d980 | 0x1f80000 |
|
| 316 |
+
| 217 | blk.23.ffn_gate.weight | 0xa6fdd980 | 0x1810000 |
|
| 317 |
+
| 218 | blk.23.ffn_norm.weight | 0xa87ed980 | 0x4000 |
|
| 318 |
+
| 219 | blk.23.ffn_up.weight | 0xa87f1980 | 0x1810000 |
|
| 319 |
+
| 220 | blk.24.attn_k.weight | 0xaa001980 | 0x1b8000 |
|
| 320 |
+
| 221 | blk.24.attn_norm.weight | 0xaa1b9980 | 0x4000 |
|
| 321 |
+
| 222 | blk.24.attn_output.weight | 0xaa1bd980 | 0x900000 |
|
| 322 |
+
| 223 | blk.24.attn_q.weight | 0xaaabd980 | 0x6e0000 |
|
| 323 |
+
| 224 | blk.24.attn_v.weight | 0xab19d980 | 0x240000 |
|
| 324 |
+
| 225 | blk.24.ffn_down.weight | 0xab3dd980 | 0x1f80000 |
|
| 325 |
+
| 226 | blk.24.ffn_gate.weight | 0xad35d980 | 0x1810000 |
|
| 326 |
+
| 227 | blk.24.ffn_norm.weight | 0xaeb6d980 | 0x4000 |
|
| 327 |
+
| 228 | blk.24.ffn_up.weight | 0xaeb71980 | 0x1810000 |
|
| 328 |
+
| 229 | blk.25.attn_k.weight | 0xb0381980 | 0x1b8000 |
|
| 329 |
+
| 230 | blk.25.attn_norm.weight | 0xb0539980 | 0x4000 |
|
| 330 |
+
| 231 | blk.25.attn_output.weight | 0xb053d980 | 0x900000 |
|
| 331 |
+
| 232 | blk.25.attn_q.weight | 0xb0e3d980 | 0x6e0000 |
|
| 332 |
+
| 233 | blk.25.attn_v.weight | 0xb151d980 | 0x240000 |
|
| 333 |
+
| 234 | blk.25.ffn_down.weight | 0xb175d980 | 0x1f80000 |
|
| 334 |
+
| 235 | blk.25.ffn_gate.weight | 0xb36dd980 | 0x1810000 |
|
| 335 |
+
| 236 | blk.25.ffn_norm.weight | 0xb4eed980 | 0x4000 |
|
| 336 |
+
| 237 | blk.25.ffn_up.weight | 0xb4ef1980 | 0x1810000 |
|
| 337 |
+
| 238 | blk.26.attn_k.weight | 0xb6701980 | 0x1b8000 |
|
| 338 |
+
| 239 | blk.26.attn_norm.weight | 0xb68b9980 | 0x4000 |
|
| 339 |
+
| 240 | blk.26.attn_output.weight | 0xb68bd980 | 0x900000 |
|
| 340 |
+
| 241 | blk.26.attn_q.weight | 0xb71bd980 | 0x6e0000 |
|
| 341 |
+
| 242 | blk.26.attn_v.weight | 0xb789d980 | 0x240000 |
|
| 342 |
+
| 243 | blk.26.ffn_down.weight | 0xb7add980 | 0x1f80000 |
|
| 343 |
+
| 244 | blk.26.ffn_gate.weight | 0xb9a5d980 | 0x1810000 |
|
| 344 |
+
| 245 | blk.26.ffn_norm.weight | 0xbb26d980 | 0x4000 |
|
| 345 |
+
| 246 | blk.26.ffn_up.weight | 0xbb271980 | 0x1810000 |
|
| 346 |
+
| 247 | blk.27.attn_k.weight | 0xbca81980 | 0x1b8000 |
|
| 347 |
+
| 248 | blk.27.attn_norm.weight | 0xbcc39980 | 0x4000 |
|
| 348 |
+
| 249 | blk.27.attn_output.weight | 0xbcc3d980 | 0x900000 |
|
| 349 |
+
| 250 | blk.27.attn_q.weight | 0xbd53d980 | 0x6e0000 |
|
| 350 |
+
| 251 | blk.27.attn_v.weight | 0xbdc1d980 | 0x240000 |
|
| 351 |
+
| 252 | blk.27.ffn_down.weight | 0xbde5d980 | 0x1f80000 |
|
| 352 |
+
| 253 | blk.27.ffn_gate.weight | 0xbfddd980 | 0x1810000 |
|
| 353 |
+
| 254 | blk.27.ffn_norm.weight | 0xc15ed980 | 0x4000 |
|
| 354 |
+
| 255 | blk.27.ffn_up.weight | 0xc15f1980 | 0x1810000 |
|
| 355 |
+
| 256 | blk.28.attn_k.weight | 0xc2e01980 | 0x1b8000 |
|
| 356 |
+
| 257 | blk.28.attn_norm.weight | 0xc2fb9980 | 0x4000 |
|
| 357 |
+
| 258 | blk.28.attn_output.weight | 0xc2fbd980 | 0x900000 |
|
| 358 |
+
| 259 | blk.28.attn_q.weight | 0xc38bd980 | 0x6e0000 |
|
| 359 |
+
| 260 | blk.28.attn_v.weight | 0xc3f9d980 | 0x240000 |
|
| 360 |
+
| 261 | blk.28.ffn_down.weight | 0xc41dd980 | 0x1f80000 |
|
| 361 |
+
| 262 | blk.28.ffn_gate.weight | 0xc615d980 | 0x1810000 |
|
| 362 |
+
| 263 | blk.28.ffn_norm.weight | 0xc796d980 | 0x4000 |
|
| 363 |
+
| 264 | blk.28.ffn_up.weight | 0xc7971980 | 0x1810000 |
|
| 364 |
+
| 265 | blk.29.attn_k.weight | 0xc9181980 | 0x1b8000 |
|
| 365 |
+
| 266 | blk.29.attn_norm.weight | 0xc9339980 | 0x4000 |
|
| 366 |
+
| 267 | blk.29.attn_output.weight | 0xc933d980 | 0x900000 |
|
| 367 |
+
| 268 | blk.29.attn_q.weight | 0xc9c3d980 | 0x6e0000 |
|
| 368 |
+
| 269 | blk.29.attn_v.weight | 0xca31d980 | 0x240000 |
|
| 369 |
+
| 270 | blk.29.ffn_down.weight | 0xca55d980 | 0x1f80000 |
|
| 370 |
+
| 271 | blk.29.ffn_gate.weight | 0xcc4dd980 | 0x1810000 |
|
| 371 |
+
| 272 | blk.29.ffn_norm.weight | 0xcdced980 | 0x4000 |
|
| 372 |
+
| 273 | blk.29.ffn_up.weight | 0xcdcf1980 | 0x1810000 |
|
| 373 |
+
| 274 | blk.30.attn_k.weight | 0xcf501980 | 0x1b8000 |
|
| 374 |
+
| 275 | blk.30.attn_norm.weight | 0xcf6b9980 | 0x4000 |
|
| 375 |
+
| 276 | blk.30.attn_output.weight | 0xcf6bd980 | 0x900000 |
|
| 376 |
+
| 277 | blk.30.attn_q.weight | 0xcffbd980 | 0x6e0000 |
|
| 377 |
+
| 278 | blk.30.attn_v.weight | 0xd069d980 | 0x240000 |
|
| 378 |
+
| 279 | blk.30.ffn_down.weight | 0xd08dd980 | 0x1f80000 |
|
| 379 |
+
| 280 | blk.30.ffn_gate.weight | 0xd285d980 | 0x1810000 |
|
| 380 |
+
| 281 | blk.30.ffn_norm.weight | 0xd406d980 | 0x4000 |
|
| 381 |
+
| 282 | blk.30.ffn_up.weight | 0xd4071980 | 0x1810000 |
|
| 382 |
+
| 283 | blk.31.attn_k.weight | 0xd5881980 | 0x188000 |
|
| 383 |
+
| 284 | blk.31.attn_norm.weight | 0xd5a09980 | 0x4000 |
|
| 384 |
+
| 285 | blk.31.attn_output.weight | 0xd5a0d980 | 0x900000 |
|
| 385 |
+
| 286 | blk.31.attn_q.weight | 0xd630d980 | 0x620000 |
|
| 386 |
+
| 287 | blk.31.attn_v.weight | 0xd692d980 | 0x1b8000 |
|
| 387 |
+
| 288 | blk.31.ffn_down.weight | 0xd6ae5980 | 0x1f80000 |
|
| 388 |
+
| 289 | blk.31.ffn_gate.weight | 0xd8a65980 | 0x1810000 |
|
| 389 |
+
| 290 | blk.31.ffn_norm.weight | 0xda275980 | 0x4000 |
|
| 390 |
+
| 291 | blk.31.ffn_up.weight | 0xda279980 | 0x1810000 |
|
| 391 |
+
|
| 392 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 393 |
+
|
| 394 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 395 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:------|
|
| 396 |
+
| 0 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | IQ3_S |
|
| 397 |
+
| 1 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 398 |
+
| 2 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 399 |
+
| 3 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | IQ3_S |
|
| 400 |
+
|
| 401 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 402 |
+
- Percentage of total elements: 13.08%
|
| 403 |
+
|
| 404 |
+
|
| 405 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 406 |
+
|
| 407 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 408 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 409 |
+
| 4 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 410 |
+
| 5 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 411 |
+
| 6 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 412 |
+
| 7 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 413 |
+
| 8 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 414 |
+
| 9 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 415 |
+
| 10 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 416 |
+
| 11 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 417 |
+
| 12 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 418 |
+
|
| 419 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 420 |
+
- Percentage of total elements: 2.72%
|
| 421 |
+
|
| 422 |
+
|
| 423 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 424 |
+
|
| 425 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 426 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 427 |
+
| 13 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 428 |
+
| 14 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 429 |
+
| 15 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 430 |
+
| 16 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 431 |
+
| 17 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 432 |
+
| 18 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 433 |
+
| 19 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 434 |
+
| 20 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 435 |
+
| 21 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 436 |
+
|
| 437 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 438 |
+
- Percentage of total elements: 2.72%
|
| 439 |
+
|
| 440 |
+
|
| 441 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 442 |
+
|
| 443 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 444 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 445 |
+
| 22 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 446 |
+
| 23 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 447 |
+
| 24 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 448 |
+
| 25 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 449 |
+
| 26 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 450 |
+
| 27 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 451 |
+
| 28 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 452 |
+
| 29 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 453 |
+
| 30 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 454 |
+
|
| 455 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 456 |
+
- Percentage of total elements: 2.72%
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 460 |
+
|
| 461 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 462 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 463 |
+
| 31 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 464 |
+
| 32 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 465 |
+
| 33 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 466 |
+
| 34 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 467 |
+
| 35 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 468 |
+
| 36 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 469 |
+
| 37 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 470 |
+
| 38 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 471 |
+
| 39 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 472 |
+
|
| 473 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 474 |
+
- Percentage of total elements: 2.72%
|
| 475 |
+
|
| 476 |
+
|
| 477 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 478 |
+
|
| 479 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 480 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 481 |
+
| 40 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 482 |
+
| 41 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 483 |
+
| 42 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 484 |
+
| 43 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 485 |
+
| 44 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 486 |
+
| 45 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 487 |
+
| 46 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 488 |
+
| 47 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 489 |
+
| 48 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 490 |
+
|
| 491 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 492 |
+
- Percentage of total elements: 2.72%
|
| 493 |
+
|
| 494 |
+
|
| 495 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 496 |
+
|
| 497 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 498 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 499 |
+
| 49 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 500 |
+
| 50 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 501 |
+
| 51 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 502 |
+
| 52 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 503 |
+
| 53 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 504 |
+
| 54 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 505 |
+
| 55 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 506 |
+
| 56 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 507 |
+
| 57 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 508 |
+
|
| 509 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 510 |
+
- Percentage of total elements: 2.72%
|
| 511 |
+
|
| 512 |
+
|
| 513 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 514 |
+
|
| 515 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 516 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 517 |
+
| 58 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 518 |
+
| 59 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 519 |
+
| 60 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 520 |
+
| 61 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 521 |
+
| 62 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 522 |
+
| 63 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 523 |
+
| 64 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 524 |
+
| 65 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 525 |
+
| 66 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 526 |
+
|
| 527 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 528 |
+
- Percentage of total elements: 2.72%
|
| 529 |
+
|
| 530 |
+
|
| 531 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 532 |
+
|
| 533 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 534 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 535 |
+
| 67 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 536 |
+
| 68 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 537 |
+
| 69 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 538 |
+
| 70 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 539 |
+
| 71 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 540 |
+
| 72 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 541 |
+
| 73 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 542 |
+
| 74 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 543 |
+
| 75 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 544 |
+
|
| 545 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 546 |
+
- Percentage of total elements: 2.72%
|
| 547 |
+
|
| 548 |
+
|
| 549 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 550 |
+
|
| 551 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 552 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 553 |
+
| 76 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 554 |
+
| 77 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 555 |
+
| 78 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 556 |
+
| 79 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 557 |
+
| 80 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 558 |
+
| 81 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 559 |
+
| 82 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 560 |
+
| 83 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 561 |
+
| 84 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 562 |
+
|
| 563 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 564 |
+
- Percentage of total elements: 2.72%
|
| 565 |
+
|
| 566 |
+
|
| 567 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 568 |
+
|
| 569 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 570 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 571 |
+
| 85 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 572 |
+
| 86 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 573 |
+
| 87 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 574 |
+
| 88 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 575 |
+
| 89 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 576 |
+
| 90 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 577 |
+
| 91 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 578 |
+
| 92 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 579 |
+
| 93 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 580 |
+
|
| 581 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 582 |
+
- Percentage of total elements: 2.72%
|
| 583 |
+
|
| 584 |
+
|
| 585 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 586 |
+
|
| 587 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 588 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 589 |
+
| 94 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 590 |
+
| 95 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 591 |
+
| 96 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 592 |
+
| 97 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 593 |
+
| 98 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 594 |
+
| 99 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 595 |
+
| 100 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 596 |
+
| 101 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 597 |
+
| 102 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 598 |
+
|
| 599 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 600 |
+
- Percentage of total elements: 2.72%
|
| 601 |
+
|
| 602 |
+
|
| 603 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 604 |
+
|
| 605 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 606 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 607 |
+
| 103 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 608 |
+
| 104 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 609 |
+
| 105 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 610 |
+
| 106 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 611 |
+
| 107 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 612 |
+
| 108 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 613 |
+
| 109 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 614 |
+
| 110 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 615 |
+
| 111 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 616 |
+
|
| 617 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 618 |
+
- Percentage of total elements: 2.72%
|
| 619 |
+
|
| 620 |
+
|
| 621 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 622 |
+
|
| 623 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 624 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 625 |
+
| 112 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 626 |
+
| 113 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 627 |
+
| 114 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 628 |
+
| 115 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 629 |
+
| 116 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 630 |
+
| 117 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 631 |
+
| 118 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 632 |
+
| 119 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 633 |
+
| 120 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 634 |
+
|
| 635 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 636 |
+
- Percentage of total elements: 2.72%
|
| 637 |
+
|
| 638 |
+
|
| 639 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 640 |
+
|
| 641 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 642 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 643 |
+
| 121 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 644 |
+
| 122 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 645 |
+
| 123 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 646 |
+
| 124 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 647 |
+
| 125 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 648 |
+
| 126 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 649 |
+
| 127 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 650 |
+
| 128 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 651 |
+
| 129 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 652 |
+
|
| 653 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 654 |
+
- Percentage of total elements: 2.72%
|
| 655 |
+
|
| 656 |
+
|
| 657 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 658 |
+
|
| 659 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 660 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 661 |
+
| 130 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 662 |
+
| 131 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 663 |
+
| 132 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 664 |
+
| 133 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 665 |
+
| 134 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 666 |
+
| 135 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 667 |
+
| 136 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 668 |
+
| 137 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 669 |
+
| 138 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 670 |
+
|
| 671 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 672 |
+
- Percentage of total elements: 2.72%
|
| 673 |
+
|
| 674 |
+
|
| 675 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 676 |
+
|
| 677 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 678 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 679 |
+
| 139 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 680 |
+
| 140 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 681 |
+
| 141 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 682 |
+
| 142 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 683 |
+
| 143 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 684 |
+
| 144 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 685 |
+
| 145 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 686 |
+
| 146 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 687 |
+
| 147 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 688 |
+
|
| 689 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 690 |
+
- Percentage of total elements: 2.72%
|
| 691 |
+
|
| 692 |
+
|
| 693 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 694 |
+
|
| 695 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 696 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 697 |
+
| 148 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 698 |
+
| 149 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 699 |
+
| 150 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 700 |
+
| 151 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 701 |
+
| 152 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 702 |
+
| 153 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 703 |
+
| 154 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 704 |
+
| 155 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 705 |
+
| 156 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 706 |
+
|
| 707 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 708 |
+
- Percentage of total elements: 2.72%
|
| 709 |
+
|
| 710 |
+
|
| 711 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 712 |
+
|
| 713 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 714 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 715 |
+
| 157 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 716 |
+
| 158 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 717 |
+
| 159 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 718 |
+
| 160 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 719 |
+
| 161 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 720 |
+
| 162 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 721 |
+
| 163 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 722 |
+
| 164 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 723 |
+
| 165 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 724 |
+
|
| 725 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 726 |
+
- Percentage of total elements: 2.72%
|
| 727 |
+
|
| 728 |
+
|
| 729 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 730 |
+
|
| 731 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 732 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 733 |
+
| 166 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 734 |
+
| 167 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 735 |
+
| 168 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 736 |
+
| 169 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 737 |
+
| 170 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 738 |
+
| 171 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 739 |
+
| 172 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 740 |
+
| 173 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 741 |
+
| 174 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 742 |
+
|
| 743 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 744 |
+
- Percentage of total elements: 2.72%
|
| 745 |
+
|
| 746 |
+
|
| 747 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 748 |
+
|
| 749 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 750 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 751 |
+
| 175 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 752 |
+
| 176 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 177 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 754 |
+
| 178 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 755 |
+
| 179 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 756 |
+
| 180 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 757 |
+
| 181 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 758 |
+
| 182 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 759 |
+
| 183 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 760 |
+
|
| 761 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 762 |
+
- Percentage of total elements: 2.72%
|
| 763 |
+
|
| 764 |
+
|
| 765 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 766 |
+
|
| 767 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 768 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 769 |
+
| 184 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 770 |
+
| 185 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 771 |
+
| 186 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 772 |
+
| 187 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 773 |
+
| 188 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 774 |
+
| 189 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 775 |
+
| 190 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 776 |
+
| 191 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 777 |
+
| 192 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 778 |
+
|
| 779 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 780 |
+
- Percentage of total elements: 2.72%
|
| 781 |
+
|
| 782 |
+
|
| 783 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 784 |
+
|
| 785 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 786 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 787 |
+
| 193 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 788 |
+
| 194 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 789 |
+
| 195 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 790 |
+
| 196 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 791 |
+
| 197 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 792 |
+
| 198 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 793 |
+
| 199 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 794 |
+
| 200 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 795 |
+
| 201 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 796 |
+
|
| 797 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 798 |
+
- Percentage of total elements: 2.72%
|
| 799 |
+
|
| 800 |
+
|
| 801 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 802 |
+
|
| 803 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 804 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 805 |
+
| 202 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 806 |
+
| 203 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 807 |
+
| 204 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 808 |
+
| 205 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 809 |
+
| 206 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 810 |
+
| 207 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 811 |
+
| 208 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 812 |
+
| 209 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 813 |
+
| 210 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 814 |
+
|
| 815 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 816 |
+
- Percentage of total elements: 2.72%
|
| 817 |
+
|
| 818 |
+
|
| 819 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 820 |
+
|
| 821 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 822 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 823 |
+
| 211 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 824 |
+
| 212 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 825 |
+
| 213 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 826 |
+
| 214 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 827 |
+
| 215 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 828 |
+
| 216 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 829 |
+
| 217 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 830 |
+
| 218 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 831 |
+
| 219 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 832 |
+
|
| 833 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 834 |
+
- Percentage of total elements: 2.72%
|
| 835 |
+
|
| 836 |
+
|
| 837 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 838 |
+
|
| 839 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 840 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 841 |
+
| 220 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 842 |
+
| 221 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 843 |
+
| 222 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 844 |
+
| 223 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 845 |
+
| 224 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 846 |
+
| 225 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 847 |
+
| 226 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 848 |
+
| 227 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 849 |
+
| 228 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 850 |
+
|
| 851 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 852 |
+
- Percentage of total elements: 2.72%
|
| 853 |
+
|
| 854 |
+
|
| 855 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 856 |
+
|
| 857 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 858 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 859 |
+
| 229 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 860 |
+
| 230 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 861 |
+
| 231 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 862 |
+
| 232 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 863 |
+
| 233 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 864 |
+
| 234 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 865 |
+
| 235 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 866 |
+
| 236 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 867 |
+
| 237 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 868 |
+
|
| 869 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 870 |
+
- Percentage of total elements: 2.72%
|
| 871 |
+
|
| 872 |
+
|
| 873 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 874 |
+
|
| 875 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 876 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 877 |
+
| 238 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 878 |
+
| 239 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 879 |
+
| 240 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 880 |
+
| 241 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 881 |
+
| 242 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 882 |
+
| 243 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 883 |
+
| 244 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 884 |
+
| 245 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 885 |
+
| 246 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 886 |
+
|
| 887 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 888 |
+
- Percentage of total elements: 2.72%
|
| 889 |
+
|
| 890 |
+
|
| 891 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 892 |
+
|
| 893 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 894 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 895 |
+
| 247 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 896 |
+
| 248 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 897 |
+
| 249 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 898 |
+
| 250 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 899 |
+
| 251 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 900 |
+
| 252 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 901 |
+
| 253 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 902 |
+
| 254 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 903 |
+
| 255 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 904 |
+
|
| 905 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 906 |
+
- Percentage of total elements: 2.72%
|
| 907 |
+
|
| 908 |
+
|
| 909 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 910 |
+
|
| 911 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 912 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 913 |
+
| 256 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 914 |
+
| 257 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 915 |
+
| 258 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 916 |
+
| 259 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 917 |
+
| 260 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 918 |
+
| 261 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 919 |
+
| 262 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 920 |
+
| 263 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 921 |
+
| 264 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 922 |
+
|
| 923 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 924 |
+
- Percentage of total elements: 2.72%
|
| 925 |
+
|
| 926 |
+
|
| 927 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 928 |
+
|
| 929 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 930 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 931 |
+
| 265 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 932 |
+
| 266 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 933 |
+
| 267 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 934 |
+
| 268 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 935 |
+
| 269 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 936 |
+
| 270 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 937 |
+
| 271 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 938 |
+
| 272 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 939 |
+
| 273 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 940 |
+
|
| 941 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 942 |
+
- Percentage of total elements: 2.72%
|
| 943 |
+
|
| 944 |
+
|
| 945 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 946 |
+
|
| 947 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 948 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 949 |
+
| 274 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 950 |
+
| 275 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 951 |
+
| 276 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 952 |
+
| 277 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 953 |
+
| 278 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 954 |
+
| 279 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 955 |
+
| 280 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 956 |
+
| 281 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 957 |
+
| 282 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 958 |
+
|
| 959 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 960 |
+
- Percentage of total elements: 2.72%
|
| 961 |
+
|
| 962 |
+
|
| 963 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 964 |
+
|
| 965 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 966 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 967 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 968 |
+
| 284 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 969 |
+
| 285 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 970 |
+
| 286 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 971 |
+
| 287 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 972 |
+
| 288 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 973 |
+
| 289 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 974 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 975 |
+
| 291 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 976 |
+
|
| 977 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 978 |
+
- Percentage of total elements: 2.72%
|
scores/Llama-Guard-3-8B-IQ3_S.md
ADDED
|
@@ -0,0 +1,978 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-IQ3_S.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 39 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 36 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 36 |
+
| 26 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 38 |
+
| 28 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 39 |
+
| 29 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 40 |
+
| 30 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 41 |
+
| 31 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 43 |
+
| 33 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 44 |
+
| 34 | UINT32 | 1 | general.quantization_version | 2 |
|
| 45 |
+
| 35 | UINT32 | 1 | general.file_type | 26 |
|
| 46 |
+
| 36 | STRING | 1 | quantize.imatrix.file | `./imatrix/imatrix-Llama-Guard-3-8B-small.dat` |
|
| 47 |
+
| 37 | STRING | 1 | quantize.imatrix.dataset | `../../datasets/imatrix/calibration_eur_small.txt` |
|
| 48 |
+
| 38 | INT32 | 1 | quantize.imatrix.entries_count | 224 |
|
| 49 |
+
| 39 | INT32 | 1 | quantize.imatrix.chunks_count | 962 |
|
| 50 |
+
|
| 51 |
+
## Tensors Overview ~8B Elements
|
| 52 |
+
|
| 53 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 54 |
+
|
| 55 |
+
- [Llama-Guard-3-8B-IQ3\_S.gguf - GGUF Internal File Dump](#llama-guard-3-8b-iq3_sgguf---gguf-internal-file-dump)
|
| 56 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 57 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 58 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 59 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 60 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 61 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 62 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 63 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 64 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 65 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 66 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 67 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 68 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 69 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 70 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 71 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 72 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 73 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 74 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 75 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 76 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 77 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 78 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 79 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 80 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 81 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 82 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 83 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 84 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 85 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 86 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 87 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 88 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 89 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 90 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 91 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 92 |
+
|
| 93 |
+
### Tensor Data Offset
|
| 94 |
+
|
| 95 |
+
This table contains the offset and data segment relative to start of file
|
| 96 |
+
|
| 97 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 98 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 99 |
+
| 0 | output.weight | 0x779880 | 0xbfca000 |
|
| 100 |
+
| 1 | output_norm.weight | 0xc743880 | 0x4000 |
|
| 101 |
+
| 2 | rope_freqs.weight | 0xc747880 | 0x100 |
|
| 102 |
+
| 3 | token_embd.weight | 0xc747980 | 0xbfca000 |
|
| 103 |
+
| 4 | blk.0.attn_k.weight | 0x18711980 | 0x188000 |
|
| 104 |
+
| 5 | blk.0.attn_norm.weight | 0x18899980 | 0x4000 |
|
| 105 |
+
| 6 | blk.0.attn_output.weight | 0x1889d980 | 0x6e0000 |
|
| 106 |
+
| 7 | blk.0.attn_q.weight | 0x18f7d980 | 0x620000 |
|
| 107 |
+
| 8 | blk.0.attn_v.weight | 0x1959d980 | 0x1b8000 |
|
| 108 |
+
| 9 | blk.0.ffn_down.weight | 0x19755980 | 0x1810000 |
|
| 109 |
+
| 10 | blk.0.ffn_gate.weight | 0x1af65980 | 0x1570000 |
|
| 110 |
+
| 11 | blk.0.ffn_norm.weight | 0x1c4d5980 | 0x4000 |
|
| 111 |
+
| 12 | blk.0.ffn_up.weight | 0x1c4d9980 | 0x1570000 |
|
| 112 |
+
| 13 | blk.1.attn_k.weight | 0x1da49980 | 0x188000 |
|
| 113 |
+
| 14 | blk.1.attn_norm.weight | 0x1dbd1980 | 0x4000 |
|
| 114 |
+
| 15 | blk.1.attn_output.weight | 0x1dbd5980 | 0x6e0000 |
|
| 115 |
+
| 16 | blk.1.attn_q.weight | 0x1e2b5980 | 0x620000 |
|
| 116 |
+
| 17 | blk.1.attn_v.weight | 0x1e8d5980 | 0x1b8000 |
|
| 117 |
+
| 18 | blk.1.ffn_down.weight | 0x1ea8d980 | 0x1f80000 |
|
| 118 |
+
| 19 | blk.1.ffn_gate.weight | 0x20a0d980 | 0x1570000 |
|
| 119 |
+
| 20 | blk.1.ffn_norm.weight | 0x21f7d980 | 0x4000 |
|
| 120 |
+
| 21 | blk.1.ffn_up.weight | 0x21f81980 | 0x1570000 |
|
| 121 |
+
| 22 | blk.2.attn_k.weight | 0x234f1980 | 0x188000 |
|
| 122 |
+
| 23 | blk.2.attn_norm.weight | 0x23679980 | 0x4000 |
|
| 123 |
+
| 24 | blk.2.attn_output.weight | 0x2367d980 | 0x6e0000 |
|
| 124 |
+
| 25 | blk.2.attn_q.weight | 0x23d5d980 | 0x620000 |
|
| 125 |
+
| 26 | blk.2.attn_v.weight | 0x2437d980 | 0x1b8000 |
|
| 126 |
+
| 27 | blk.2.ffn_down.weight | 0x24535980 | 0x1810000 |
|
| 127 |
+
| 28 | blk.2.ffn_gate.weight | 0x25d45980 | 0x1570000 |
|
| 128 |
+
| 29 | blk.2.ffn_norm.weight | 0x272b5980 | 0x4000 |
|
| 129 |
+
| 30 | blk.2.ffn_up.weight | 0x272b9980 | 0x1570000 |
|
| 130 |
+
| 31 | blk.3.attn_k.weight | 0x28829980 | 0x188000 |
|
| 131 |
+
| 32 | blk.3.attn_norm.weight | 0x289b1980 | 0x4000 |
|
| 132 |
+
| 33 | blk.3.attn_output.weight | 0x289b5980 | 0x6e0000 |
|
| 133 |
+
| 34 | blk.3.attn_q.weight | 0x29095980 | 0x620000 |
|
| 134 |
+
| 35 | blk.3.attn_v.weight | 0x296b5980 | 0x1b8000 |
|
| 135 |
+
| 36 | blk.3.ffn_down.weight | 0x2986d980 | 0x1810000 |
|
| 136 |
+
| 37 | blk.3.ffn_gate.weight | 0x2b07d980 | 0x1570000 |
|
| 137 |
+
| 38 | blk.3.ffn_norm.weight | 0x2c5ed980 | 0x4000 |
|
| 138 |
+
| 39 | blk.3.ffn_up.weight | 0x2c5f1980 | 0x1570000 |
|
| 139 |
+
| 40 | blk.4.attn_k.weight | 0x2db61980 | 0x188000 |
|
| 140 |
+
| 41 | blk.4.attn_norm.weight | 0x2dce9980 | 0x4000 |
|
| 141 |
+
| 42 | blk.4.attn_output.weight | 0x2dced980 | 0x6e0000 |
|
| 142 |
+
| 43 | blk.4.attn_q.weight | 0x2e3cd980 | 0x620000 |
|
| 143 |
+
| 44 | blk.4.attn_v.weight | 0x2e9ed980 | 0x1b8000 |
|
| 144 |
+
| 45 | blk.4.ffn_down.weight | 0x2eba5980 | 0x1810000 |
|
| 145 |
+
| 46 | blk.4.ffn_gate.weight | 0x303b5980 | 0x1570000 |
|
| 146 |
+
| 47 | blk.4.ffn_norm.weight | 0x31925980 | 0x4000 |
|
| 147 |
+
| 48 | blk.4.ffn_up.weight | 0x31929980 | 0x1570000 |
|
| 148 |
+
| 49 | blk.5.attn_k.weight | 0x32e99980 | 0x188000 |
|
| 149 |
+
| 50 | blk.5.attn_norm.weight | 0x33021980 | 0x4000 |
|
| 150 |
+
| 51 | blk.5.attn_output.weight | 0x33025980 | 0x6e0000 |
|
| 151 |
+
| 52 | blk.5.attn_q.weight | 0x33705980 | 0x620000 |
|
| 152 |
+
| 53 | blk.5.attn_v.weight | 0x33d25980 | 0x1b8000 |
|
| 153 |
+
| 54 | blk.5.ffn_down.weight | 0x33edd980 | 0x1810000 |
|
| 154 |
+
| 55 | blk.5.ffn_gate.weight | 0x356ed980 | 0x1570000 |
|
| 155 |
+
| 56 | blk.5.ffn_norm.weight | 0x36c5d980 | 0x4000 |
|
| 156 |
+
| 57 | blk.5.ffn_up.weight | 0x36c61980 | 0x1570000 |
|
| 157 |
+
| 58 | blk.6.attn_k.weight | 0x381d1980 | 0x188000 |
|
| 158 |
+
| 59 | blk.6.attn_norm.weight | 0x38359980 | 0x4000 |
|
| 159 |
+
| 60 | blk.6.attn_output.weight | 0x3835d980 | 0x6e0000 |
|
| 160 |
+
| 61 | blk.6.attn_q.weight | 0x38a3d980 | 0x620000 |
|
| 161 |
+
| 62 | blk.6.attn_v.weight | 0x3905d980 | 0x1b8000 |
|
| 162 |
+
| 63 | blk.6.ffn_down.weight | 0x39215980 | 0x1810000 |
|
| 163 |
+
| 64 | blk.6.ffn_gate.weight | 0x3aa25980 | 0x1570000 |
|
| 164 |
+
| 65 | blk.6.ffn_norm.weight | 0x3bf95980 | 0x4000 |
|
| 165 |
+
| 66 | blk.6.ffn_up.weight | 0x3bf99980 | 0x1570000 |
|
| 166 |
+
| 67 | blk.7.attn_k.weight | 0x3d509980 | 0x188000 |
|
| 167 |
+
| 68 | blk.7.attn_norm.weight | 0x3d691980 | 0x4000 |
|
| 168 |
+
| 69 | blk.7.attn_output.weight | 0x3d695980 | 0x6e0000 |
|
| 169 |
+
| 70 | blk.7.attn_q.weight | 0x3dd75980 | 0x620000 |
|
| 170 |
+
| 71 | blk.7.attn_v.weight | 0x3e395980 | 0x1b8000 |
|
| 171 |
+
| 72 | blk.7.ffn_down.weight | 0x3e54d980 | 0x1810000 |
|
| 172 |
+
| 73 | blk.7.ffn_gate.weight | 0x3fd5d980 | 0x1570000 |
|
| 173 |
+
| 74 | blk.7.ffn_norm.weight | 0x412cd980 | 0x4000 |
|
| 174 |
+
| 75 | blk.7.ffn_up.weight | 0x412d1980 | 0x1570000 |
|
| 175 |
+
| 76 | blk.8.attn_k.weight | 0x42841980 | 0x188000 |
|
| 176 |
+
| 77 | blk.8.attn_norm.weight | 0x429c9980 | 0x4000 |
|
| 177 |
+
| 78 | blk.8.attn_output.weight | 0x429cd980 | 0x6e0000 |
|
| 178 |
+
| 79 | blk.8.attn_q.weight | 0x430ad980 | 0x620000 |
|
| 179 |
+
| 80 | blk.8.attn_v.weight | 0x436cd980 | 0x1b8000 |
|
| 180 |
+
| 81 | blk.8.ffn_down.weight | 0x43885980 | 0x1810000 |
|
| 181 |
+
| 82 | blk.8.ffn_gate.weight | 0x45095980 | 0x1570000 |
|
| 182 |
+
| 83 | blk.8.ffn_norm.weight | 0x46605980 | 0x4000 |
|
| 183 |
+
| 84 | blk.8.ffn_up.weight | 0x46609980 | 0x1570000 |
|
| 184 |
+
| 85 | blk.9.attn_k.weight | 0x47b79980 | 0x188000 |
|
| 185 |
+
| 86 | blk.9.attn_norm.weight | 0x47d01980 | 0x4000 |
|
| 186 |
+
| 87 | blk.9.attn_output.weight | 0x47d05980 | 0x6e0000 |
|
| 187 |
+
| 88 | blk.9.attn_q.weight | 0x483e5980 | 0x620000 |
|
| 188 |
+
| 89 | blk.9.attn_v.weight | 0x48a05980 | 0x1b8000 |
|
| 189 |
+
| 90 | blk.9.ffn_down.weight | 0x48bbd980 | 0x1810000 |
|
| 190 |
+
| 91 | blk.9.ffn_gate.weight | 0x4a3cd980 | 0x1570000 |
|
| 191 |
+
| 92 | blk.9.ffn_norm.weight | 0x4b93d980 | 0x4000 |
|
| 192 |
+
| 93 | blk.9.ffn_up.weight | 0x4b941980 | 0x1570000 |
|
| 193 |
+
| 94 | blk.10.attn_k.weight | 0x4ceb1980 | 0x188000 |
|
| 194 |
+
| 95 | blk.10.attn_norm.weight | 0x4d039980 | 0x4000 |
|
| 195 |
+
| 96 | blk.10.attn_output.weight | 0x4d03d980 | 0x6e0000 |
|
| 196 |
+
| 97 | blk.10.attn_q.weight | 0x4d71d980 | 0x620000 |
|
| 197 |
+
| 98 | blk.10.attn_v.weight | 0x4dd3d980 | 0x1b8000 |
|
| 198 |
+
| 99 | blk.10.ffn_down.weight | 0x4def5980 | 0x1810000 |
|
| 199 |
+
| 100 | blk.10.ffn_gate.weight | 0x4f705980 | 0x1570000 |
|
| 200 |
+
| 101 | blk.10.ffn_norm.weight | 0x50c75980 | 0x4000 |
|
| 201 |
+
| 102 | blk.10.ffn_up.weight | 0x50c79980 | 0x1570000 |
|
| 202 |
+
| 103 | blk.11.attn_k.weight | 0x521e9980 | 0x188000 |
|
| 203 |
+
| 104 | blk.11.attn_norm.weight | 0x52371980 | 0x4000 |
|
| 204 |
+
| 105 | blk.11.attn_output.weight | 0x52375980 | 0x6e0000 |
|
| 205 |
+
| 106 | blk.11.attn_q.weight | 0x52a55980 | 0x620000 |
|
| 206 |
+
| 107 | blk.11.attn_v.weight | 0x53075980 | 0x1b8000 |
|
| 207 |
+
| 108 | blk.11.ffn_down.weight | 0x5322d980 | 0x1810000 |
|
| 208 |
+
| 109 | blk.11.ffn_gate.weight | 0x54a3d980 | 0x1570000 |
|
| 209 |
+
| 110 | blk.11.ffn_norm.weight | 0x55fad980 | 0x4000 |
|
| 210 |
+
| 111 | blk.11.ffn_up.weight | 0x55fb1980 | 0x1570000 |
|
| 211 |
+
| 112 | blk.12.attn_k.weight | 0x57521980 | 0x188000 |
|
| 212 |
+
| 113 | blk.12.attn_norm.weight | 0x576a9980 | 0x4000 |
|
| 213 |
+
| 114 | blk.12.attn_output.weight | 0x576ad980 | 0x6e0000 |
|
| 214 |
+
| 115 | blk.12.attn_q.weight | 0x57d8d980 | 0x620000 |
|
| 215 |
+
| 116 | blk.12.attn_v.weight | 0x583ad980 | 0x1b8000 |
|
| 216 |
+
| 117 | blk.12.ffn_down.weight | 0x58565980 | 0x1810000 |
|
| 217 |
+
| 118 | blk.12.ffn_gate.weight | 0x59d75980 | 0x1570000 |
|
| 218 |
+
| 119 | blk.12.ffn_norm.weight | 0x5b2e5980 | 0x4000 |
|
| 219 |
+
| 120 | blk.12.ffn_up.weight | 0x5b2e9980 | 0x1570000 |
|
| 220 |
+
| 121 | blk.13.attn_k.weight | 0x5c859980 | 0x1b8000 |
|
| 221 |
+
| 122 | blk.13.attn_norm.weight | 0x5ca11980 | 0x4000 |
|
| 222 |
+
| 123 | blk.13.attn_output.weight | 0x5ca15980 | 0x6e0000 |
|
| 223 |
+
| 124 | blk.13.attn_q.weight | 0x5d0f5980 | 0x6e0000 |
|
| 224 |
+
| 125 | blk.13.attn_v.weight | 0x5d7d5980 | 0x1b8000 |
|
| 225 |
+
| 126 | blk.13.ffn_down.weight | 0x5d98d980 | 0x1810000 |
|
| 226 |
+
| 127 | blk.13.ffn_gate.weight | 0x5f19d980 | 0x1570000 |
|
| 227 |
+
| 128 | blk.13.ffn_norm.weight | 0x6070d980 | 0x4000 |
|
| 228 |
+
| 129 | blk.13.ffn_up.weight | 0x60711980 | 0x1570000 |
|
| 229 |
+
| 130 | blk.14.attn_k.weight | 0x61c81980 | 0x1b8000 |
|
| 230 |
+
| 131 | blk.14.attn_norm.weight | 0x61e39980 | 0x4000 |
|
| 231 |
+
| 132 | blk.14.attn_output.weight | 0x61e3d980 | 0x6e0000 |
|
| 232 |
+
| 133 | blk.14.attn_q.weight | 0x6251d980 | 0x6e0000 |
|
| 233 |
+
| 134 | blk.14.attn_v.weight | 0x62bfd980 | 0x1b8000 |
|
| 234 |
+
| 135 | blk.14.ffn_down.weight | 0x62db5980 | 0x1810000 |
|
| 235 |
+
| 136 | blk.14.ffn_gate.weight | 0x645c5980 | 0x1570000 |
|
| 236 |
+
| 137 | blk.14.ffn_norm.weight | 0x65b35980 | 0x4000 |
|
| 237 |
+
| 138 | blk.14.ffn_up.weight | 0x65b39980 | 0x1570000 |
|
| 238 |
+
| 139 | blk.15.attn_k.weight | 0x670a9980 | 0x188000 |
|
| 239 |
+
| 140 | blk.15.attn_norm.weight | 0x67231980 | 0x4000 |
|
| 240 |
+
| 141 | blk.15.attn_output.weight | 0x67235980 | 0x6e0000 |
|
| 241 |
+
| 142 | blk.15.attn_q.weight | 0x67915980 | 0x620000 |
|
| 242 |
+
| 143 | blk.15.attn_v.weight | 0x67f35980 | 0x1b8000 |
|
| 243 |
+
| 144 | blk.15.ffn_down.weight | 0x680ed980 | 0x1810000 |
|
| 244 |
+
| 145 | blk.15.ffn_gate.weight | 0x698fd980 | 0x1570000 |
|
| 245 |
+
| 146 | blk.15.ffn_norm.weight | 0x6ae6d980 | 0x4000 |
|
| 246 |
+
| 147 | blk.15.ffn_up.weight | 0x6ae71980 | 0x1570000 |
|
| 247 |
+
| 148 | blk.16.attn_k.weight | 0x6c3e1980 | 0x1b8000 |
|
| 248 |
+
| 149 | blk.16.attn_norm.weight | 0x6c599980 | 0x4000 |
|
| 249 |
+
| 150 | blk.16.attn_output.weight | 0x6c59d980 | 0x6e0000 |
|
| 250 |
+
| 151 | blk.16.attn_q.weight | 0x6cc7d980 | 0x6e0000 |
|
| 251 |
+
| 152 | blk.16.attn_v.weight | 0x6d35d980 | 0x1b8000 |
|
| 252 |
+
| 153 | blk.16.ffn_down.weight | 0x6d515980 | 0x1810000 |
|
| 253 |
+
| 154 | blk.16.ffn_gate.weight | 0x6ed25980 | 0x1810000 |
|
| 254 |
+
| 155 | blk.16.ffn_norm.weight | 0x70535980 | 0x4000 |
|
| 255 |
+
| 156 | blk.16.ffn_up.weight | 0x70539980 | 0x1810000 |
|
| 256 |
+
| 157 | blk.17.attn_k.weight | 0x71d49980 | 0x188000 |
|
| 257 |
+
| 158 | blk.17.attn_norm.weight | 0x71ed1980 | 0x4000 |
|
| 258 |
+
| 159 | blk.17.attn_output.weight | 0x71ed5980 | 0x6e0000 |
|
| 259 |
+
| 160 | blk.17.attn_q.weight | 0x725b5980 | 0x620000 |
|
| 260 |
+
| 161 | blk.17.attn_v.weight | 0x72bd5980 | 0x1b8000 |
|
| 261 |
+
| 162 | blk.17.ffn_down.weight | 0x72d8d980 | 0x1f80000 |
|
| 262 |
+
| 163 | blk.17.ffn_gate.weight | 0x74d0d980 | 0x1810000 |
|
| 263 |
+
| 164 | blk.17.ffn_norm.weight | 0x7651d980 | 0x4000 |
|
| 264 |
+
| 165 | blk.17.ffn_up.weight | 0x76521980 | 0x1810000 |
|
| 265 |
+
| 166 | blk.18.attn_k.weight | 0x77d31980 | 0x1b8000 |
|
| 266 |
+
| 167 | blk.18.attn_norm.weight | 0x77ee9980 | 0x4000 |
|
| 267 |
+
| 168 | blk.18.attn_output.weight | 0x77eed980 | 0x6e0000 |
|
| 268 |
+
| 169 | blk.18.attn_q.weight | 0x785cd980 | 0x6e0000 |
|
| 269 |
+
| 170 | blk.18.attn_v.weight | 0x78cad980 | 0x1b8000 |
|
| 270 |
+
| 171 | blk.18.ffn_down.weight | 0x78e65980 | 0x1f80000 |
|
| 271 |
+
| 172 | blk.18.ffn_gate.weight | 0x7ade5980 | 0x1810000 |
|
| 272 |
+
| 173 | blk.18.ffn_norm.weight | 0x7c5f5980 | 0x4000 |
|
| 273 |
+
| 174 | blk.18.ffn_up.weight | 0x7c5f9980 | 0x1810000 |
|
| 274 |
+
| 175 | blk.19.attn_k.weight | 0x7de09980 | 0x1b8000 |
|
| 275 |
+
| 176 | blk.19.attn_norm.weight | 0x7dfc1980 | 0x4000 |
|
| 276 |
+
| 177 | blk.19.attn_output.weight | 0x7dfc5980 | 0x6e0000 |
|
| 277 |
+
| 178 | blk.19.attn_q.weight | 0x7e6a5980 | 0x6e0000 |
|
| 278 |
+
| 179 | blk.19.attn_v.weight | 0x7ed85980 | 0x1b8000 |
|
| 279 |
+
| 180 | blk.19.ffn_down.weight | 0x7ef3d980 | 0x1f80000 |
|
| 280 |
+
| 181 | blk.19.ffn_gate.weight | 0x80ebd980 | 0x1810000 |
|
| 281 |
+
| 182 | blk.19.ffn_norm.weight | 0x826cd980 | 0x4000 |
|
| 282 |
+
| 183 | blk.19.ffn_up.weight | 0x826d1980 | 0x1810000 |
|
| 283 |
+
| 184 | blk.20.attn_k.weight | 0x83ee1980 | 0x1b8000 |
|
| 284 |
+
| 185 | blk.20.attn_norm.weight | 0x84099980 | 0x4000 |
|
| 285 |
+
| 186 | blk.20.attn_output.weight | 0x8409d980 | 0x6e0000 |
|
| 286 |
+
| 187 | blk.20.attn_q.weight | 0x8477d980 | 0x6e0000 |
|
| 287 |
+
| 188 | blk.20.attn_v.weight | 0x84e5d980 | 0x1b8000 |
|
| 288 |
+
| 189 | blk.20.ffn_down.weight | 0x85015980 | 0x1f80000 |
|
| 289 |
+
| 190 | blk.20.ffn_gate.weight | 0x86f95980 | 0x1810000 |
|
| 290 |
+
| 191 | blk.20.ffn_norm.weight | 0x887a5980 | 0x4000 |
|
| 291 |
+
| 192 | blk.20.ffn_up.weight | 0x887a9980 | 0x1810000 |
|
| 292 |
+
| 193 | blk.21.attn_k.weight | 0x89fb9980 | 0x1b8000 |
|
| 293 |
+
| 194 | blk.21.attn_norm.weight | 0x8a171980 | 0x4000 |
|
| 294 |
+
| 195 | blk.21.attn_output.weight | 0x8a175980 | 0x6e0000 |
|
| 295 |
+
| 196 | blk.21.attn_q.weight | 0x8a855980 | 0x6e0000 |
|
| 296 |
+
| 197 | blk.21.attn_v.weight | 0x8af35980 | 0x1b8000 |
|
| 297 |
+
| 198 | blk.21.ffn_down.weight | 0x8b0ed980 | 0x1f80000 |
|
| 298 |
+
| 199 | blk.21.ffn_gate.weight | 0x8d06d980 | 0x1810000 |
|
| 299 |
+
| 200 | blk.21.ffn_norm.weight | 0x8e87d980 | 0x4000 |
|
| 300 |
+
| 201 | blk.21.ffn_up.weight | 0x8e881980 | 0x1810000 |
|
| 301 |
+
| 202 | blk.22.attn_k.weight | 0x90091980 | 0x1b8000 |
|
| 302 |
+
| 203 | blk.22.attn_norm.weight | 0x90249980 | 0x4000 |
|
| 303 |
+
| 204 | blk.22.attn_output.weight | 0x9024d980 | 0x6e0000 |
|
| 304 |
+
| 205 | blk.22.attn_q.weight | 0x9092d980 | 0x6e0000 |
|
| 305 |
+
| 206 | blk.22.attn_v.weight | 0x9100d980 | 0x1b8000 |
|
| 306 |
+
| 207 | blk.22.ffn_down.weight | 0x911c5980 | 0x1f80000 |
|
| 307 |
+
| 208 | blk.22.ffn_gate.weight | 0x93145980 | 0x1810000 |
|
| 308 |
+
| 209 | blk.22.ffn_norm.weight | 0x94955980 | 0x4000 |
|
| 309 |
+
| 210 | blk.22.ffn_up.weight | 0x94959980 | 0x1810000 |
|
| 310 |
+
| 211 | blk.23.attn_k.weight | 0x96169980 | 0x1b8000 |
|
| 311 |
+
| 212 | blk.23.attn_norm.weight | 0x96321980 | 0x4000 |
|
| 312 |
+
| 213 | blk.23.attn_output.weight | 0x96325980 | 0x6e0000 |
|
| 313 |
+
| 214 | blk.23.attn_q.weight | 0x96a05980 | 0x6e0000 |
|
| 314 |
+
| 215 | blk.23.attn_v.weight | 0x970e5980 | 0x1b8000 |
|
| 315 |
+
| 216 | blk.23.ffn_down.weight | 0x9729d980 | 0x1f80000 |
|
| 316 |
+
| 217 | blk.23.ffn_gate.weight | 0x9921d980 | 0x1810000 |
|
| 317 |
+
| 218 | blk.23.ffn_norm.weight | 0x9aa2d980 | 0x4000 |
|
| 318 |
+
| 219 | blk.23.ffn_up.weight | 0x9aa31980 | 0x1810000 |
|
| 319 |
+
| 220 | blk.24.attn_k.weight | 0x9c241980 | 0x1b8000 |
|
| 320 |
+
| 221 | blk.24.attn_norm.weight | 0x9c3f9980 | 0x4000 |
|
| 321 |
+
| 222 | blk.24.attn_output.weight | 0x9c3fd980 | 0x6e0000 |
|
| 322 |
+
| 223 | blk.24.attn_q.weight | 0x9cadd980 | 0x6e0000 |
|
| 323 |
+
| 224 | blk.24.attn_v.weight | 0x9d1bd980 | 0x1b8000 |
|
| 324 |
+
| 225 | blk.24.ffn_down.weight | 0x9d375980 | 0x1f80000 |
|
| 325 |
+
| 226 | blk.24.ffn_gate.weight | 0x9f2f5980 | 0x1810000 |
|
| 326 |
+
| 227 | blk.24.ffn_norm.weight | 0xa0b05980 | 0x4000 |
|
| 327 |
+
| 228 | blk.24.ffn_up.weight | 0xa0b09980 | 0x1810000 |
|
| 328 |
+
| 229 | blk.25.attn_k.weight | 0xa2319980 | 0x1b8000 |
|
| 329 |
+
| 230 | blk.25.attn_norm.weight | 0xa24d1980 | 0x4000 |
|
| 330 |
+
| 231 | blk.25.attn_output.weight | 0xa24d5980 | 0x6e0000 |
|
| 331 |
+
| 232 | blk.25.attn_q.weight | 0xa2bb5980 | 0x6e0000 |
|
| 332 |
+
| 233 | blk.25.attn_v.weight | 0xa3295980 | 0x1b8000 |
|
| 333 |
+
| 234 | blk.25.ffn_down.weight | 0xa344d980 | 0x1f80000 |
|
| 334 |
+
| 235 | blk.25.ffn_gate.weight | 0xa53cd980 | 0x1810000 |
|
| 335 |
+
| 236 | blk.25.ffn_norm.weight | 0xa6bdd980 | 0x4000 |
|
| 336 |
+
| 237 | blk.25.ffn_up.weight | 0xa6be1980 | 0x1810000 |
|
| 337 |
+
| 238 | blk.26.attn_k.weight | 0xa83f1980 | 0x1b8000 |
|
| 338 |
+
| 239 | blk.26.attn_norm.weight | 0xa85a9980 | 0x4000 |
|
| 339 |
+
| 240 | blk.26.attn_output.weight | 0xa85ad980 | 0x6e0000 |
|
| 340 |
+
| 241 | blk.26.attn_q.weight | 0xa8c8d980 | 0x6e0000 |
|
| 341 |
+
| 242 | blk.26.attn_v.weight | 0xa936d980 | 0x1b8000 |
|
| 342 |
+
| 243 | blk.26.ffn_down.weight | 0xa9525980 | 0x1f80000 |
|
| 343 |
+
| 244 | blk.26.ffn_gate.weight | 0xab4a5980 | 0x1810000 |
|
| 344 |
+
| 245 | blk.26.ffn_norm.weight | 0xaccb5980 | 0x4000 |
|
| 345 |
+
| 246 | blk.26.ffn_up.weight | 0xaccb9980 | 0x1810000 |
|
| 346 |
+
| 247 | blk.27.attn_k.weight | 0xae4c9980 | 0x1b8000 |
|
| 347 |
+
| 248 | blk.27.attn_norm.weight | 0xae681980 | 0x4000 |
|
| 348 |
+
| 249 | blk.27.attn_output.weight | 0xae685980 | 0x6e0000 |
|
| 349 |
+
| 250 | blk.27.attn_q.weight | 0xaed65980 | 0x6e0000 |
|
| 350 |
+
| 251 | blk.27.attn_v.weight | 0xaf445980 | 0x1b8000 |
|
| 351 |
+
| 252 | blk.27.ffn_down.weight | 0xaf5fd980 | 0x1f80000 |
|
| 352 |
+
| 253 | blk.27.ffn_gate.weight | 0xb157d980 | 0x1810000 |
|
| 353 |
+
| 254 | blk.27.ffn_norm.weight | 0xb2d8d980 | 0x4000 |
|
| 354 |
+
| 255 | blk.27.ffn_up.weight | 0xb2d91980 | 0x1810000 |
|
| 355 |
+
| 256 | blk.28.attn_k.weight | 0xb45a1980 | 0x1b8000 |
|
| 356 |
+
| 257 | blk.28.attn_norm.weight | 0xb4759980 | 0x4000 |
|
| 357 |
+
| 258 | blk.28.attn_output.weight | 0xb475d980 | 0x6e0000 |
|
| 358 |
+
| 259 | blk.28.attn_q.weight | 0xb4e3d980 | 0x6e0000 |
|
| 359 |
+
| 260 | blk.28.attn_v.weight | 0xb551d980 | 0x1b8000 |
|
| 360 |
+
| 261 | blk.28.ffn_down.weight | 0xb56d5980 | 0x1f80000 |
|
| 361 |
+
| 262 | blk.28.ffn_gate.weight | 0xb7655980 | 0x1810000 |
|
| 362 |
+
| 263 | blk.28.ffn_norm.weight | 0xb8e65980 | 0x4000 |
|
| 363 |
+
| 264 | blk.28.ffn_up.weight | 0xb8e69980 | 0x1810000 |
|
| 364 |
+
| 265 | blk.29.attn_k.weight | 0xba679980 | 0x1b8000 |
|
| 365 |
+
| 266 | blk.29.attn_norm.weight | 0xba831980 | 0x4000 |
|
| 366 |
+
| 267 | blk.29.attn_output.weight | 0xba835980 | 0x6e0000 |
|
| 367 |
+
| 268 | blk.29.attn_q.weight | 0xbaf15980 | 0x6e0000 |
|
| 368 |
+
| 269 | blk.29.attn_v.weight | 0xbb5f5980 | 0x1b8000 |
|
| 369 |
+
| 270 | blk.29.ffn_down.weight | 0xbb7ad980 | 0x1f80000 |
|
| 370 |
+
| 271 | blk.29.ffn_gate.weight | 0xbd72d980 | 0x1810000 |
|
| 371 |
+
| 272 | blk.29.ffn_norm.weight | 0xbef3d980 | 0x4000 |
|
| 372 |
+
| 273 | blk.29.ffn_up.weight | 0xbef41980 | 0x1810000 |
|
| 373 |
+
| 274 | blk.30.attn_k.weight | 0xc0751980 | 0x1b8000 |
|
| 374 |
+
| 275 | blk.30.attn_norm.weight | 0xc0909980 | 0x4000 |
|
| 375 |
+
| 276 | blk.30.attn_output.weight | 0xc090d980 | 0x6e0000 |
|
| 376 |
+
| 277 | blk.30.attn_q.weight | 0xc0fed980 | 0x6e0000 |
|
| 377 |
+
| 278 | blk.30.attn_v.weight | 0xc16cd980 | 0x1b8000 |
|
| 378 |
+
| 279 | blk.30.ffn_down.weight | 0xc1885980 | 0x1f80000 |
|
| 379 |
+
| 280 | blk.30.ffn_gate.weight | 0xc3805980 | 0x1810000 |
|
| 380 |
+
| 281 | blk.30.ffn_norm.weight | 0xc5015980 | 0x4000 |
|
| 381 |
+
| 282 | blk.30.ffn_up.weight | 0xc5019980 | 0x1810000 |
|
| 382 |
+
| 283 | blk.31.attn_k.weight | 0xc6829980 | 0x188000 |
|
| 383 |
+
| 284 | blk.31.attn_norm.weight | 0xc69b1980 | 0x4000 |
|
| 384 |
+
| 285 | blk.31.attn_output.weight | 0xc69b5980 | 0x6e0000 |
|
| 385 |
+
| 286 | blk.31.attn_q.weight | 0xc7095980 | 0x620000 |
|
| 386 |
+
| 287 | blk.31.attn_v.weight | 0xc76b5980 | 0x1b8000 |
|
| 387 |
+
| 288 | blk.31.ffn_down.weight | 0xc786d980 | 0x1f80000 |
|
| 388 |
+
| 289 | blk.31.ffn_gate.weight | 0xc97ed980 | 0x1810000 |
|
| 389 |
+
| 290 | blk.31.ffn_norm.weight | 0xcaffd980 | 0x4000 |
|
| 390 |
+
| 291 | blk.31.ffn_up.weight | 0xcb001980 | 0x1810000 |
|
| 391 |
+
|
| 392 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 393 |
+
|
| 394 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 395 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:--------|
|
| 396 |
+
| 0 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | IQ3_XXS |
|
| 397 |
+
| 1 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 398 |
+
| 2 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 399 |
+
| 3 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | IQ3_XXS |
|
| 400 |
+
|
| 401 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 402 |
+
- Percentage of total elements: 13.08%
|
| 403 |
+
|
| 404 |
+
|
| 405 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 406 |
+
|
| 407 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 408 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 409 |
+
| 4 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 410 |
+
| 5 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 411 |
+
| 6 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 412 |
+
| 7 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 413 |
+
| 8 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 414 |
+
| 9 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 415 |
+
| 10 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 416 |
+
| 11 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 417 |
+
| 12 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 418 |
+
|
| 419 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 420 |
+
- Percentage of total elements: 2.72%
|
| 421 |
+
|
| 422 |
+
|
| 423 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 424 |
+
|
| 425 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 426 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 427 |
+
| 13 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 428 |
+
| 14 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 429 |
+
| 15 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 430 |
+
| 16 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 431 |
+
| 17 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 432 |
+
| 18 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 433 |
+
| 19 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 434 |
+
| 20 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 435 |
+
| 21 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 436 |
+
|
| 437 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 438 |
+
- Percentage of total elements: 2.72%
|
| 439 |
+
|
| 440 |
+
|
| 441 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 442 |
+
|
| 443 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 444 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 445 |
+
| 22 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 446 |
+
| 23 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 447 |
+
| 24 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 448 |
+
| 25 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 449 |
+
| 26 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 450 |
+
| 27 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 451 |
+
| 28 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 452 |
+
| 29 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 453 |
+
| 30 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 454 |
+
|
| 455 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 456 |
+
- Percentage of total elements: 2.72%
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 460 |
+
|
| 461 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 462 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 463 |
+
| 31 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 464 |
+
| 32 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 465 |
+
| 33 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 466 |
+
| 34 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 467 |
+
| 35 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 468 |
+
| 36 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 469 |
+
| 37 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 470 |
+
| 38 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 471 |
+
| 39 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 472 |
+
|
| 473 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 474 |
+
- Percentage of total elements: 2.72%
|
| 475 |
+
|
| 476 |
+
|
| 477 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 478 |
+
|
| 479 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 480 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 481 |
+
| 40 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 482 |
+
| 41 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 483 |
+
| 42 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 484 |
+
| 43 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 485 |
+
| 44 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 486 |
+
| 45 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 487 |
+
| 46 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 488 |
+
| 47 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 489 |
+
| 48 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 490 |
+
|
| 491 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 492 |
+
- Percentage of total elements: 2.72%
|
| 493 |
+
|
| 494 |
+
|
| 495 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 496 |
+
|
| 497 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 498 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 499 |
+
| 49 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 500 |
+
| 50 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 501 |
+
| 51 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 502 |
+
| 52 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 503 |
+
| 53 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 504 |
+
| 54 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 505 |
+
| 55 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 506 |
+
| 56 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 507 |
+
| 57 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 508 |
+
|
| 509 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 510 |
+
- Percentage of total elements: 2.72%
|
| 511 |
+
|
| 512 |
+
|
| 513 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 514 |
+
|
| 515 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 516 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 517 |
+
| 58 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 518 |
+
| 59 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 519 |
+
| 60 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 520 |
+
| 61 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 521 |
+
| 62 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 522 |
+
| 63 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 523 |
+
| 64 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 524 |
+
| 65 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 525 |
+
| 66 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 526 |
+
|
| 527 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 528 |
+
- Percentage of total elements: 2.72%
|
| 529 |
+
|
| 530 |
+
|
| 531 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 532 |
+
|
| 533 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 534 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 535 |
+
| 67 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 536 |
+
| 68 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 537 |
+
| 69 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 538 |
+
| 70 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 539 |
+
| 71 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 540 |
+
| 72 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 541 |
+
| 73 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 542 |
+
| 74 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 543 |
+
| 75 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 544 |
+
|
| 545 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 546 |
+
- Percentage of total elements: 2.72%
|
| 547 |
+
|
| 548 |
+
|
| 549 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 550 |
+
|
| 551 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 552 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 553 |
+
| 76 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 554 |
+
| 77 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 555 |
+
| 78 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 556 |
+
| 79 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 557 |
+
| 80 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 558 |
+
| 81 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 559 |
+
| 82 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 560 |
+
| 83 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 561 |
+
| 84 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 562 |
+
|
| 563 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 564 |
+
- Percentage of total elements: 2.72%
|
| 565 |
+
|
| 566 |
+
|
| 567 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 568 |
+
|
| 569 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 570 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:--------|
|
| 571 |
+
| 85 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 572 |
+
| 86 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 573 |
+
| 87 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 574 |
+
| 88 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 575 |
+
| 89 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 576 |
+
| 90 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 577 |
+
| 91 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 578 |
+
| 92 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 579 |
+
| 93 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 580 |
+
|
| 581 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 582 |
+
- Percentage of total elements: 2.72%
|
| 583 |
+
|
| 584 |
+
|
| 585 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 586 |
+
|
| 587 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 588 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 589 |
+
| 94 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 590 |
+
| 95 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 591 |
+
| 96 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 592 |
+
| 97 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 593 |
+
| 98 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 594 |
+
| 99 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 595 |
+
| 100 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 596 |
+
| 101 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 597 |
+
| 102 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 598 |
+
|
| 599 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 600 |
+
- Percentage of total elements: 2.72%
|
| 601 |
+
|
| 602 |
+
|
| 603 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 604 |
+
|
| 605 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 606 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 607 |
+
| 103 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 608 |
+
| 104 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 609 |
+
| 105 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 610 |
+
| 106 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 611 |
+
| 107 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 612 |
+
| 108 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 613 |
+
| 109 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 614 |
+
| 110 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 615 |
+
| 111 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 616 |
+
|
| 617 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 618 |
+
- Percentage of total elements: 2.72%
|
| 619 |
+
|
| 620 |
+
|
| 621 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 622 |
+
|
| 623 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 624 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 625 |
+
| 112 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 626 |
+
| 113 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 627 |
+
| 114 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 628 |
+
| 115 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 629 |
+
| 116 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 630 |
+
| 117 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 631 |
+
| 118 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 632 |
+
| 119 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 633 |
+
| 120 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 634 |
+
|
| 635 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 636 |
+
- Percentage of total elements: 2.72%
|
| 637 |
+
|
| 638 |
+
|
| 639 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 640 |
+
|
| 641 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 642 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 643 |
+
| 121 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 644 |
+
| 122 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 645 |
+
| 123 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 646 |
+
| 124 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 647 |
+
| 125 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 648 |
+
| 126 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 649 |
+
| 127 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 650 |
+
| 128 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 651 |
+
| 129 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 652 |
+
|
| 653 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 654 |
+
- Percentage of total elements: 2.72%
|
| 655 |
+
|
| 656 |
+
|
| 657 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 658 |
+
|
| 659 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 660 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 661 |
+
| 130 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 662 |
+
| 131 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 663 |
+
| 132 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 664 |
+
| 133 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 665 |
+
| 134 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 666 |
+
| 135 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 667 |
+
| 136 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 668 |
+
| 137 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 669 |
+
| 138 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 670 |
+
|
| 671 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 672 |
+
- Percentage of total elements: 2.72%
|
| 673 |
+
|
| 674 |
+
|
| 675 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 676 |
+
|
| 677 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 678 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 679 |
+
| 139 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 680 |
+
| 140 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 681 |
+
| 141 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 682 |
+
| 142 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 683 |
+
| 143 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 684 |
+
| 144 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 685 |
+
| 145 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 686 |
+
| 146 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 687 |
+
| 147 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_XXS |
|
| 688 |
+
|
| 689 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 690 |
+
- Percentage of total elements: 2.72%
|
| 691 |
+
|
| 692 |
+
|
| 693 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 694 |
+
|
| 695 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 696 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:------|
|
| 697 |
+
| 148 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 698 |
+
| 149 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 699 |
+
| 150 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 700 |
+
| 151 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 701 |
+
| 152 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 702 |
+
| 153 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ3_S |
|
| 703 |
+
| 154 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 704 |
+
| 155 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 705 |
+
| 156 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 706 |
+
|
| 707 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 708 |
+
- Percentage of total elements: 2.72%
|
| 709 |
+
|
| 710 |
+
|
| 711 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 712 |
+
|
| 713 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 714 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 715 |
+
| 157 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 716 |
+
| 158 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 717 |
+
| 159 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 718 |
+
| 160 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 719 |
+
| 161 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 720 |
+
| 162 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 721 |
+
| 163 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 722 |
+
| 164 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 723 |
+
| 165 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 724 |
+
|
| 725 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 726 |
+
- Percentage of total elements: 2.72%
|
| 727 |
+
|
| 728 |
+
|
| 729 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 730 |
+
|
| 731 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 732 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 733 |
+
| 166 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 734 |
+
| 167 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 735 |
+
| 168 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 736 |
+
| 169 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 737 |
+
| 170 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 738 |
+
| 171 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 739 |
+
| 172 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 740 |
+
| 173 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 741 |
+
| 174 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 742 |
+
|
| 743 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 744 |
+
- Percentage of total elements: 2.72%
|
| 745 |
+
|
| 746 |
+
|
| 747 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 748 |
+
|
| 749 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 750 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 751 |
+
| 175 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 752 |
+
| 176 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 177 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 754 |
+
| 178 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 755 |
+
| 179 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 756 |
+
| 180 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 757 |
+
| 181 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 758 |
+
| 182 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 759 |
+
| 183 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 760 |
+
|
| 761 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 762 |
+
- Percentage of total elements: 2.72%
|
| 763 |
+
|
| 764 |
+
|
| 765 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 766 |
+
|
| 767 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 768 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 769 |
+
| 184 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 770 |
+
| 185 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 771 |
+
| 186 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 772 |
+
| 187 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 773 |
+
| 188 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 774 |
+
| 189 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 775 |
+
| 190 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 776 |
+
| 191 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 777 |
+
| 192 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 778 |
+
|
| 779 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 780 |
+
- Percentage of total elements: 2.72%
|
| 781 |
+
|
| 782 |
+
|
| 783 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 784 |
+
|
| 785 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 786 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 787 |
+
| 193 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 788 |
+
| 194 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 789 |
+
| 195 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 790 |
+
| 196 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 791 |
+
| 197 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 792 |
+
| 198 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 793 |
+
| 199 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 794 |
+
| 200 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 795 |
+
| 201 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 796 |
+
|
| 797 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 798 |
+
- Percentage of total elements: 2.72%
|
| 799 |
+
|
| 800 |
+
|
| 801 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 802 |
+
|
| 803 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 804 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 805 |
+
| 202 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 806 |
+
| 203 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 807 |
+
| 204 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 808 |
+
| 205 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 809 |
+
| 206 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 810 |
+
| 207 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 811 |
+
| 208 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 812 |
+
| 209 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 813 |
+
| 210 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 814 |
+
|
| 815 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 816 |
+
- Percentage of total elements: 2.72%
|
| 817 |
+
|
| 818 |
+
|
| 819 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 820 |
+
|
| 821 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 822 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 823 |
+
| 211 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 824 |
+
| 212 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 825 |
+
| 213 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 826 |
+
| 214 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 827 |
+
| 215 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 828 |
+
| 216 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 829 |
+
| 217 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 830 |
+
| 218 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 831 |
+
| 219 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 832 |
+
|
| 833 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 834 |
+
- Percentage of total elements: 2.72%
|
| 835 |
+
|
| 836 |
+
|
| 837 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 838 |
+
|
| 839 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 840 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 841 |
+
| 220 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 842 |
+
| 221 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 843 |
+
| 222 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 844 |
+
| 223 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 845 |
+
| 224 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 846 |
+
| 225 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 847 |
+
| 226 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 848 |
+
| 227 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 849 |
+
| 228 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 850 |
+
|
| 851 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 852 |
+
- Percentage of total elements: 2.72%
|
| 853 |
+
|
| 854 |
+
|
| 855 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 856 |
+
|
| 857 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 858 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 859 |
+
| 229 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 860 |
+
| 230 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 861 |
+
| 231 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 862 |
+
| 232 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 863 |
+
| 233 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 864 |
+
| 234 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 865 |
+
| 235 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 866 |
+
| 236 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 867 |
+
| 237 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 868 |
+
|
| 869 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 870 |
+
- Percentage of total elements: 2.72%
|
| 871 |
+
|
| 872 |
+
|
| 873 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 874 |
+
|
| 875 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 876 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 877 |
+
| 238 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 878 |
+
| 239 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 879 |
+
| 240 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 880 |
+
| 241 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 881 |
+
| 242 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 882 |
+
| 243 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 883 |
+
| 244 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 884 |
+
| 245 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 885 |
+
| 246 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 886 |
+
|
| 887 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 888 |
+
- Percentage of total elements: 2.72%
|
| 889 |
+
|
| 890 |
+
|
| 891 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 892 |
+
|
| 893 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 894 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 895 |
+
| 247 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 896 |
+
| 248 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 897 |
+
| 249 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 898 |
+
| 250 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 899 |
+
| 251 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 900 |
+
| 252 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 901 |
+
| 253 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 902 |
+
| 254 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 903 |
+
| 255 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 904 |
+
|
| 905 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 906 |
+
- Percentage of total elements: 2.72%
|
| 907 |
+
|
| 908 |
+
|
| 909 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 910 |
+
|
| 911 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 912 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 913 |
+
| 256 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 914 |
+
| 257 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 915 |
+
| 258 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 916 |
+
| 259 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 917 |
+
| 260 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 918 |
+
| 261 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 919 |
+
| 262 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 920 |
+
| 263 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 921 |
+
| 264 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 922 |
+
|
| 923 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 924 |
+
- Percentage of total elements: 2.72%
|
| 925 |
+
|
| 926 |
+
|
| 927 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 928 |
+
|
| 929 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 930 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 931 |
+
| 265 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 932 |
+
| 266 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 933 |
+
| 267 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 934 |
+
| 268 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 935 |
+
| 269 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 936 |
+
| 270 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 937 |
+
| 271 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 938 |
+
| 272 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 939 |
+
| 273 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 940 |
+
|
| 941 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 942 |
+
- Percentage of total elements: 2.72%
|
| 943 |
+
|
| 944 |
+
|
| 945 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 946 |
+
|
| 947 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 948 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 949 |
+
| 274 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 950 |
+
| 275 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 951 |
+
| 276 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 952 |
+
| 277 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 953 |
+
| 278 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 954 |
+
| 279 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 955 |
+
| 280 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 956 |
+
| 281 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 957 |
+
| 282 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 958 |
+
|
| 959 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 960 |
+
- Percentage of total elements: 2.72%
|
| 961 |
+
|
| 962 |
+
|
| 963 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 964 |
+
|
| 965 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 966 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:--------|
|
| 967 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_XXS |
|
| 968 |
+
| 284 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 969 |
+
| 285 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 970 |
+
| 286 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_XXS |
|
| 971 |
+
| 287 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 972 |
+
| 288 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | IQ4_NL |
|
| 973 |
+
| 289 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 974 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 975 |
+
| 291 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 976 |
+
|
| 977 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 978 |
+
- Percentage of total elements: 2.72%
|
scores/Llama-Guard-3-8B-IQ4_NL.md
ADDED
|
@@ -0,0 +1,978 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-IQ4_NL.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 39 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 36 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 36 |
+
| 26 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 38 |
+
| 28 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 39 |
+
| 29 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 40 |
+
| 30 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 41 |
+
| 31 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 43 |
+
| 33 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 44 |
+
| 34 | UINT32 | 1 | general.quantization_version | 2 |
|
| 45 |
+
| 35 | UINT32 | 1 | general.file_type | 25 |
|
| 46 |
+
| 36 | STRING | 1 | quantize.imatrix.file | `./imatrix/imatrix-Llama-Guard-3-8B-small.dat` |
|
| 47 |
+
| 37 | STRING | 1 | quantize.imatrix.dataset | `../../datasets/imatrix/calibration_eur_small.txt` |
|
| 48 |
+
| 38 | INT32 | 1 | quantize.imatrix.entries_count | 224 |
|
| 49 |
+
| 39 | INT32 | 1 | quantize.imatrix.chunks_count | 962 |
|
| 50 |
+
|
| 51 |
+
## Tensors Overview ~8B Elements
|
| 52 |
+
|
| 53 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 54 |
+
|
| 55 |
+
- [Llama-Guard-3-8B-IQ4\_NL.gguf - GGUF Internal File Dump](#llama-guard-3-8b-iq4_nlgguf---gguf-internal-file-dump)
|
| 56 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 57 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 58 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 59 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 60 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 61 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 62 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 63 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 64 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 65 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 66 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 67 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 68 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 69 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 70 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 71 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 72 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 73 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 74 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 75 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 76 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 77 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 78 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 79 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 80 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 81 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 82 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 83 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 84 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 85 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 86 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 87 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 88 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 89 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 90 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 91 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 92 |
+
|
| 93 |
+
### Tensor Data Offset
|
| 94 |
+
|
| 95 |
+
This table contains the offset and data segment relative to start of file
|
| 96 |
+
|
| 97 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 98 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 99 |
+
| 0 | output.weight | 0x779880 | 0x119d0000 |
|
| 100 |
+
| 1 | output_norm.weight | 0x12149880 | 0x4000 |
|
| 101 |
+
| 2 | rope_freqs.weight | 0x1214d880 | 0x100 |
|
| 102 |
+
| 3 | token_embd.weight | 0x1214d980 | 0xd746000 |
|
| 103 |
+
| 4 | blk.0.attn_k.weight | 0x1f893980 | 0x1b8000 |
|
| 104 |
+
| 5 | blk.0.attn_norm.weight | 0x1fa4b980 | 0x4000 |
|
| 105 |
+
| 6 | blk.0.attn_output.weight | 0x1fa4f980 | 0x900000 |
|
| 106 |
+
| 7 | blk.0.attn_q.weight | 0x2034f980 | 0x6e0000 |
|
| 107 |
+
| 8 | blk.0.attn_v.weight | 0x20a2f980 | 0x220000 |
|
| 108 |
+
| 9 | blk.0.ffn_down.weight | 0x20c4f980 | 0x2680000 |
|
| 109 |
+
| 10 | blk.0.ffn_gate.weight | 0x232cf980 | 0x1810000 |
|
| 110 |
+
| 11 | blk.0.ffn_norm.weight | 0x24adf980 | 0x4000 |
|
| 111 |
+
| 12 | blk.0.ffn_up.weight | 0x24ae3980 | 0x1810000 |
|
| 112 |
+
| 13 | blk.1.attn_k.weight | 0x262f3980 | 0x1b8000 |
|
| 113 |
+
| 14 | blk.1.attn_norm.weight | 0x264ab980 | 0x4000 |
|
| 114 |
+
| 15 | blk.1.attn_output.weight | 0x264af980 | 0x900000 |
|
| 115 |
+
| 16 | blk.1.attn_q.weight | 0x26daf980 | 0x6e0000 |
|
| 116 |
+
| 17 | blk.1.attn_v.weight | 0x2748f980 | 0x220000 |
|
| 117 |
+
| 18 | blk.1.ffn_down.weight | 0x276af980 | 0x2680000 |
|
| 118 |
+
| 19 | blk.1.ffn_gate.weight | 0x29d2f980 | 0x1810000 |
|
| 119 |
+
| 20 | blk.1.ffn_norm.weight | 0x2b53f980 | 0x4000 |
|
| 120 |
+
| 21 | blk.1.ffn_up.weight | 0x2b543980 | 0x1810000 |
|
| 121 |
+
| 22 | blk.2.attn_k.weight | 0x2cd53980 | 0x1b8000 |
|
| 122 |
+
| 23 | blk.2.attn_norm.weight | 0x2cf0b980 | 0x4000 |
|
| 123 |
+
| 24 | blk.2.attn_output.weight | 0x2cf0f980 | 0x900000 |
|
| 124 |
+
| 25 | blk.2.attn_q.weight | 0x2d80f980 | 0x6e0000 |
|
| 125 |
+
| 26 | blk.2.attn_v.weight | 0x2deef980 | 0x220000 |
|
| 126 |
+
| 27 | blk.2.ffn_down.weight | 0x2e10f980 | 0x2680000 |
|
| 127 |
+
| 28 | blk.2.ffn_gate.weight | 0x3078f980 | 0x1810000 |
|
| 128 |
+
| 29 | blk.2.ffn_norm.weight | 0x31f9f980 | 0x4000 |
|
| 129 |
+
| 30 | blk.2.ffn_up.weight | 0x31fa3980 | 0x1810000 |
|
| 130 |
+
| 31 | blk.3.attn_k.weight | 0x337b3980 | 0x1b8000 |
|
| 131 |
+
| 32 | blk.3.attn_norm.weight | 0x3396b980 | 0x4000 |
|
| 132 |
+
| 33 | blk.3.attn_output.weight | 0x3396f980 | 0x900000 |
|
| 133 |
+
| 34 | blk.3.attn_q.weight | 0x3426f980 | 0x6e0000 |
|
| 134 |
+
| 35 | blk.3.attn_v.weight | 0x3494f980 | 0x220000 |
|
| 135 |
+
| 36 | blk.3.ffn_down.weight | 0x34b6f980 | 0x2680000 |
|
| 136 |
+
| 37 | blk.3.ffn_gate.weight | 0x371ef980 | 0x1810000 |
|
| 137 |
+
| 38 | blk.3.ffn_norm.weight | 0x389ff980 | 0x4000 |
|
| 138 |
+
| 39 | blk.3.ffn_up.weight | 0x38a03980 | 0x1810000 |
|
| 139 |
+
| 40 | blk.4.attn_k.weight | 0x3a213980 | 0x1b8000 |
|
| 140 |
+
| 41 | blk.4.attn_norm.weight | 0x3a3cb980 | 0x4000 |
|
| 141 |
+
| 42 | blk.4.attn_output.weight | 0x3a3cf980 | 0x900000 |
|
| 142 |
+
| 43 | blk.4.attn_q.weight | 0x3accf980 | 0x6e0000 |
|
| 143 |
+
| 44 | blk.4.attn_v.weight | 0x3b3af980 | 0x220000 |
|
| 144 |
+
| 45 | blk.4.ffn_down.weight | 0x3b5cf980 | 0x2680000 |
|
| 145 |
+
| 46 | blk.4.ffn_gate.weight | 0x3dc4f980 | 0x1810000 |
|
| 146 |
+
| 47 | blk.4.ffn_norm.weight | 0x3f45f980 | 0x4000 |
|
| 147 |
+
| 48 | blk.4.ffn_up.weight | 0x3f463980 | 0x1810000 |
|
| 148 |
+
| 49 | blk.5.attn_k.weight | 0x40c73980 | 0x1b8000 |
|
| 149 |
+
| 50 | blk.5.attn_norm.weight | 0x40e2b980 | 0x4000 |
|
| 150 |
+
| 51 | blk.5.attn_output.weight | 0x40e2f980 | 0x900000 |
|
| 151 |
+
| 52 | blk.5.attn_q.weight | 0x4172f980 | 0x6e0000 |
|
| 152 |
+
| 53 | blk.5.attn_v.weight | 0x41e0f980 | 0x220000 |
|
| 153 |
+
| 54 | blk.5.ffn_down.weight | 0x4202f980 | 0x2680000 |
|
| 154 |
+
| 55 | blk.5.ffn_gate.weight | 0x446af980 | 0x1810000 |
|
| 155 |
+
| 56 | blk.5.ffn_norm.weight | 0x45ebf980 | 0x4000 |
|
| 156 |
+
| 57 | blk.5.ffn_up.weight | 0x45ec3980 | 0x1810000 |
|
| 157 |
+
| 58 | blk.6.attn_k.weight | 0x476d3980 | 0x1b8000 |
|
| 158 |
+
| 59 | blk.6.attn_norm.weight | 0x4788b980 | 0x4000 |
|
| 159 |
+
| 60 | blk.6.attn_output.weight | 0x4788f980 | 0x900000 |
|
| 160 |
+
| 61 | blk.6.attn_q.weight | 0x4818f980 | 0x6e0000 |
|
| 161 |
+
| 62 | blk.6.attn_v.weight | 0x4886f980 | 0x220000 |
|
| 162 |
+
| 63 | blk.6.ffn_down.weight | 0x48a8f980 | 0x2680000 |
|
| 163 |
+
| 64 | blk.6.ffn_gate.weight | 0x4b10f980 | 0x1810000 |
|
| 164 |
+
| 65 | blk.6.ffn_norm.weight | 0x4c91f980 | 0x4000 |
|
| 165 |
+
| 66 | blk.6.ffn_up.weight | 0x4c923980 | 0x1810000 |
|
| 166 |
+
| 67 | blk.7.attn_k.weight | 0x4e133980 | 0x1b8000 |
|
| 167 |
+
| 68 | blk.7.attn_norm.weight | 0x4e2eb980 | 0x4000 |
|
| 168 |
+
| 69 | blk.7.attn_output.weight | 0x4e2ef980 | 0x900000 |
|
| 169 |
+
| 70 | blk.7.attn_q.weight | 0x4ebef980 | 0x6e0000 |
|
| 170 |
+
| 71 | blk.7.attn_v.weight | 0x4f2cf980 | 0x220000 |
|
| 171 |
+
| 72 | blk.7.ffn_down.weight | 0x4f4ef980 | 0x2680000 |
|
| 172 |
+
| 73 | blk.7.ffn_gate.weight | 0x51b6f980 | 0x1810000 |
|
| 173 |
+
| 74 | blk.7.ffn_norm.weight | 0x5337f980 | 0x4000 |
|
| 174 |
+
| 75 | blk.7.ffn_up.weight | 0x53383980 | 0x1810000 |
|
| 175 |
+
| 76 | blk.8.attn_k.weight | 0x54b93980 | 0x1b8000 |
|
| 176 |
+
| 77 | blk.8.attn_norm.weight | 0x54d4b980 | 0x4000 |
|
| 177 |
+
| 78 | blk.8.attn_output.weight | 0x54d4f980 | 0x900000 |
|
| 178 |
+
| 79 | blk.8.attn_q.weight | 0x5564f980 | 0x6e0000 |
|
| 179 |
+
| 80 | blk.8.attn_v.weight | 0x55d2f980 | 0x220000 |
|
| 180 |
+
| 81 | blk.8.ffn_down.weight | 0x55f4f980 | 0x2680000 |
|
| 181 |
+
| 82 | blk.8.ffn_gate.weight | 0x585cf980 | 0x1810000 |
|
| 182 |
+
| 83 | blk.8.ffn_norm.weight | 0x59ddf980 | 0x4000 |
|
| 183 |
+
| 84 | blk.8.ffn_up.weight | 0x59de3980 | 0x1810000 |
|
| 184 |
+
| 85 | blk.9.attn_k.weight | 0x5b5f3980 | 0x1b8000 |
|
| 185 |
+
| 86 | blk.9.attn_norm.weight | 0x5b7ab980 | 0x4000 |
|
| 186 |
+
| 87 | blk.9.attn_output.weight | 0x5b7af980 | 0x900000 |
|
| 187 |
+
| 88 | blk.9.attn_q.weight | 0x5c0af980 | 0x6e0000 |
|
| 188 |
+
| 89 | blk.9.attn_v.weight | 0x5c78f980 | 0x220000 |
|
| 189 |
+
| 90 | blk.9.ffn_down.weight | 0x5c9af980 | 0x2680000 |
|
| 190 |
+
| 91 | blk.9.ffn_gate.weight | 0x5f02f980 | 0x1810000 |
|
| 191 |
+
| 92 | blk.9.ffn_norm.weight | 0x6083f980 | 0x4000 |
|
| 192 |
+
| 93 | blk.9.ffn_up.weight | 0x60843980 | 0x1810000 |
|
| 193 |
+
| 94 | blk.10.attn_k.weight | 0x62053980 | 0x1b8000 |
|
| 194 |
+
| 95 | blk.10.attn_norm.weight | 0x6220b980 | 0x4000 |
|
| 195 |
+
| 96 | blk.10.attn_output.weight | 0x6220f980 | 0x900000 |
|
| 196 |
+
| 97 | blk.10.attn_q.weight | 0x62b0f980 | 0x6e0000 |
|
| 197 |
+
| 98 | blk.10.attn_v.weight | 0x631ef980 | 0x220000 |
|
| 198 |
+
| 99 | blk.10.ffn_down.weight | 0x6340f980 | 0x2680000 |
|
| 199 |
+
| 100 | blk.10.ffn_gate.weight | 0x65a8f980 | 0x1810000 |
|
| 200 |
+
| 101 | blk.10.ffn_norm.weight | 0x6729f980 | 0x4000 |
|
| 201 |
+
| 102 | blk.10.ffn_up.weight | 0x672a3980 | 0x1810000 |
|
| 202 |
+
| 103 | blk.11.attn_k.weight | 0x68ab3980 | 0x1b8000 |
|
| 203 |
+
| 104 | blk.11.attn_norm.weight | 0x68c6b980 | 0x4000 |
|
| 204 |
+
| 105 | blk.11.attn_output.weight | 0x68c6f980 | 0x900000 |
|
| 205 |
+
| 106 | blk.11.attn_q.weight | 0x6956f980 | 0x6e0000 |
|
| 206 |
+
| 107 | blk.11.attn_v.weight | 0x69c4f980 | 0x220000 |
|
| 207 |
+
| 108 | blk.11.ffn_down.weight | 0x69e6f980 | 0x2680000 |
|
| 208 |
+
| 109 | blk.11.ffn_gate.weight | 0x6c4ef980 | 0x1810000 |
|
| 209 |
+
| 110 | blk.11.ffn_norm.weight | 0x6dcff980 | 0x4000 |
|
| 210 |
+
| 111 | blk.11.ffn_up.weight | 0x6dd03980 | 0x1810000 |
|
| 211 |
+
| 112 | blk.12.attn_k.weight | 0x6f513980 | 0x1b8000 |
|
| 212 |
+
| 113 | blk.12.attn_norm.weight | 0x6f6cb980 | 0x4000 |
|
| 213 |
+
| 114 | blk.12.attn_output.weight | 0x6f6cf980 | 0x900000 |
|
| 214 |
+
| 115 | blk.12.attn_q.weight | 0x6ffcf980 | 0x6e0000 |
|
| 215 |
+
| 116 | blk.12.attn_v.weight | 0x706af980 | 0x220000 |
|
| 216 |
+
| 117 | blk.12.ffn_down.weight | 0x708cf980 | 0x2680000 |
|
| 217 |
+
| 118 | blk.12.ffn_gate.weight | 0x72f4f980 | 0x1810000 |
|
| 218 |
+
| 119 | blk.12.ffn_norm.weight | 0x7475f980 | 0x4000 |
|
| 219 |
+
| 120 | blk.12.ffn_up.weight | 0x74763980 | 0x1810000 |
|
| 220 |
+
| 121 | blk.13.attn_k.weight | 0x75f73980 | 0x240000 |
|
| 221 |
+
| 122 | blk.13.attn_norm.weight | 0x761b3980 | 0x4000 |
|
| 222 |
+
| 123 | blk.13.attn_output.weight | 0x761b7980 | 0x900000 |
|
| 223 |
+
| 124 | blk.13.attn_q.weight | 0x76ab7980 | 0x900000 |
|
| 224 |
+
| 125 | blk.13.attn_v.weight | 0x773b7980 | 0x240000 |
|
| 225 |
+
| 126 | blk.13.ffn_down.weight | 0x775f7980 | 0x2680000 |
|
| 226 |
+
| 127 | blk.13.ffn_gate.weight | 0x79c77980 | 0x1810000 |
|
| 227 |
+
| 128 | blk.13.ffn_norm.weight | 0x7b487980 | 0x4000 |
|
| 228 |
+
| 129 | blk.13.ffn_up.weight | 0x7b48b980 | 0x1810000 |
|
| 229 |
+
| 130 | blk.14.attn_k.weight | 0x7cc9b980 | 0x240000 |
|
| 230 |
+
| 131 | blk.14.attn_norm.weight | 0x7cedb980 | 0x4000 |
|
| 231 |
+
| 132 | blk.14.attn_output.weight | 0x7cedf980 | 0x900000 |
|
| 232 |
+
| 133 | blk.14.attn_q.weight | 0x7d7df980 | 0x900000 |
|
| 233 |
+
| 134 | blk.14.attn_v.weight | 0x7e0df980 | 0x240000 |
|
| 234 |
+
| 135 | blk.14.ffn_down.weight | 0x7e31f980 | 0x2680000 |
|
| 235 |
+
| 136 | blk.14.ffn_gate.weight | 0x8099f980 | 0x1810000 |
|
| 236 |
+
| 137 | blk.14.ffn_norm.weight | 0x821af980 | 0x4000 |
|
| 237 |
+
| 138 | blk.14.ffn_up.weight | 0x821b3980 | 0x1810000 |
|
| 238 |
+
| 139 | blk.15.attn_k.weight | 0x839c3980 | 0x1b8000 |
|
| 239 |
+
| 140 | blk.15.attn_norm.weight | 0x83b7b980 | 0x4000 |
|
| 240 |
+
| 141 | blk.15.attn_output.weight | 0x83b7f980 | 0x900000 |
|
| 241 |
+
| 142 | blk.15.attn_q.weight | 0x8447f980 | 0x6e0000 |
|
| 242 |
+
| 143 | blk.15.attn_v.weight | 0x84b5f980 | 0x220000 |
|
| 243 |
+
| 144 | blk.15.ffn_down.weight | 0x84d7f980 | 0x2680000 |
|
| 244 |
+
| 145 | blk.15.ffn_gate.weight | 0x873ff980 | 0x1810000 |
|
| 245 |
+
| 146 | blk.15.ffn_norm.weight | 0x88c0f980 | 0x4000 |
|
| 246 |
+
| 147 | blk.15.ffn_up.weight | 0x88c13980 | 0x1810000 |
|
| 247 |
+
| 148 | blk.16.attn_k.weight | 0x8a423980 | 0x240000 |
|
| 248 |
+
| 149 | blk.16.attn_norm.weight | 0x8a663980 | 0x4000 |
|
| 249 |
+
| 150 | blk.16.attn_output.weight | 0x8a667980 | 0x900000 |
|
| 250 |
+
| 151 | blk.16.attn_q.weight | 0x8af67980 | 0x900000 |
|
| 251 |
+
| 152 | blk.16.attn_v.weight | 0x8b867980 | 0x240000 |
|
| 252 |
+
| 153 | blk.16.ffn_down.weight | 0x8baa7980 | 0x2680000 |
|
| 253 |
+
| 154 | blk.16.ffn_gate.weight | 0x8e127980 | 0x1f80000 |
|
| 254 |
+
| 155 | blk.16.ffn_norm.weight | 0x900a7980 | 0x4000 |
|
| 255 |
+
| 156 | blk.16.ffn_up.weight | 0x900ab980 | 0x1f80000 |
|
| 256 |
+
| 157 | blk.17.attn_k.weight | 0x9202b980 | 0x1b8000 |
|
| 257 |
+
| 158 | blk.17.attn_norm.weight | 0x921e3980 | 0x4000 |
|
| 258 |
+
| 159 | blk.17.attn_output.weight | 0x921e7980 | 0x900000 |
|
| 259 |
+
| 160 | blk.17.attn_q.weight | 0x92ae7980 | 0x6e0000 |
|
| 260 |
+
| 161 | blk.17.attn_v.weight | 0x931c7980 | 0x220000 |
|
| 261 |
+
| 162 | blk.17.ffn_down.weight | 0x933e7980 | 0x2680000 |
|
| 262 |
+
| 163 | blk.17.ffn_gate.weight | 0x95a67980 | 0x1f80000 |
|
| 263 |
+
| 164 | blk.17.ffn_norm.weight | 0x979e7980 | 0x4000 |
|
| 264 |
+
| 165 | blk.17.ffn_up.weight | 0x979eb980 | 0x1f80000 |
|
| 265 |
+
| 166 | blk.18.attn_k.weight | 0x9996b980 | 0x240000 |
|
| 266 |
+
| 167 | blk.18.attn_norm.weight | 0x99bab980 | 0x4000 |
|
| 267 |
+
| 168 | blk.18.attn_output.weight | 0x99baf980 | 0x900000 |
|
| 268 |
+
| 169 | blk.18.attn_q.weight | 0x9a4af980 | 0x900000 |
|
| 269 |
+
| 170 | blk.18.attn_v.weight | 0x9adaf980 | 0x240000 |
|
| 270 |
+
| 171 | blk.18.ffn_down.weight | 0x9afef980 | 0x2680000 |
|
| 271 |
+
| 172 | blk.18.ffn_gate.weight | 0x9d66f980 | 0x1f80000 |
|
| 272 |
+
| 173 | blk.18.ffn_norm.weight | 0x9f5ef980 | 0x4000 |
|
| 273 |
+
| 174 | blk.18.ffn_up.weight | 0x9f5f3980 | 0x1f80000 |
|
| 274 |
+
| 175 | blk.19.attn_k.weight | 0xa1573980 | 0x240000 |
|
| 275 |
+
| 176 | blk.19.attn_norm.weight | 0xa17b3980 | 0x4000 |
|
| 276 |
+
| 177 | blk.19.attn_output.weight | 0xa17b7980 | 0x900000 |
|
| 277 |
+
| 178 | blk.19.attn_q.weight | 0xa20b7980 | 0x900000 |
|
| 278 |
+
| 179 | blk.19.attn_v.weight | 0xa29b7980 | 0x240000 |
|
| 279 |
+
| 180 | blk.19.ffn_down.weight | 0xa2bf7980 | 0x2680000 |
|
| 280 |
+
| 181 | blk.19.ffn_gate.weight | 0xa5277980 | 0x1f80000 |
|
| 281 |
+
| 182 | blk.19.ffn_norm.weight | 0xa71f7980 | 0x4000 |
|
| 282 |
+
| 183 | blk.19.ffn_up.weight | 0xa71fb980 | 0x1f80000 |
|
| 283 |
+
| 184 | blk.20.attn_k.weight | 0xa917b980 | 0x240000 |
|
| 284 |
+
| 185 | blk.20.attn_norm.weight | 0xa93bb980 | 0x4000 |
|
| 285 |
+
| 186 | blk.20.attn_output.weight | 0xa93bf980 | 0x900000 |
|
| 286 |
+
| 187 | blk.20.attn_q.weight | 0xa9cbf980 | 0x900000 |
|
| 287 |
+
| 188 | blk.20.attn_v.weight | 0xaa5bf980 | 0x240000 |
|
| 288 |
+
| 189 | blk.20.ffn_down.weight | 0xaa7ff980 | 0x2680000 |
|
| 289 |
+
| 190 | blk.20.ffn_gate.weight | 0xace7f980 | 0x1f80000 |
|
| 290 |
+
| 191 | blk.20.ffn_norm.weight | 0xaedff980 | 0x4000 |
|
| 291 |
+
| 192 | blk.20.ffn_up.weight | 0xaee03980 | 0x1f80000 |
|
| 292 |
+
| 193 | blk.21.attn_k.weight | 0xb0d83980 | 0x240000 |
|
| 293 |
+
| 194 | blk.21.attn_norm.weight | 0xb0fc3980 | 0x4000 |
|
| 294 |
+
| 195 | blk.21.attn_output.weight | 0xb0fc7980 | 0x900000 |
|
| 295 |
+
| 196 | blk.21.attn_q.weight | 0xb18c7980 | 0x900000 |
|
| 296 |
+
| 197 | blk.21.attn_v.weight | 0xb21c7980 | 0x240000 |
|
| 297 |
+
| 198 | blk.21.ffn_down.weight | 0xb2407980 | 0x2680000 |
|
| 298 |
+
| 199 | blk.21.ffn_gate.weight | 0xb4a87980 | 0x1f80000 |
|
| 299 |
+
| 200 | blk.21.ffn_norm.weight | 0xb6a07980 | 0x4000 |
|
| 300 |
+
| 201 | blk.21.ffn_up.weight | 0xb6a0b980 | 0x1f80000 |
|
| 301 |
+
| 202 | blk.22.attn_k.weight | 0xb898b980 | 0x240000 |
|
| 302 |
+
| 203 | blk.22.attn_norm.weight | 0xb8bcb980 | 0x4000 |
|
| 303 |
+
| 204 | blk.22.attn_output.weight | 0xb8bcf980 | 0x900000 |
|
| 304 |
+
| 205 | blk.22.attn_q.weight | 0xb94cf980 | 0x900000 |
|
| 305 |
+
| 206 | blk.22.attn_v.weight | 0xb9dcf980 | 0x240000 |
|
| 306 |
+
| 207 | blk.22.ffn_down.weight | 0xba00f980 | 0x2680000 |
|
| 307 |
+
| 208 | blk.22.ffn_gate.weight | 0xbc68f980 | 0x1f80000 |
|
| 308 |
+
| 209 | blk.22.ffn_norm.weight | 0xbe60f980 | 0x4000 |
|
| 309 |
+
| 210 | blk.22.ffn_up.weight | 0xbe613980 | 0x1f80000 |
|
| 310 |
+
| 211 | blk.23.attn_k.weight | 0xc0593980 | 0x240000 |
|
| 311 |
+
| 212 | blk.23.attn_norm.weight | 0xc07d3980 | 0x4000 |
|
| 312 |
+
| 213 | blk.23.attn_output.weight | 0xc07d7980 | 0x900000 |
|
| 313 |
+
| 214 | blk.23.attn_q.weight | 0xc10d7980 | 0x900000 |
|
| 314 |
+
| 215 | blk.23.attn_v.weight | 0xc19d7980 | 0x240000 |
|
| 315 |
+
| 216 | blk.23.ffn_down.weight | 0xc1c17980 | 0x2680000 |
|
| 316 |
+
| 217 | blk.23.ffn_gate.weight | 0xc4297980 | 0x1f80000 |
|
| 317 |
+
| 218 | blk.23.ffn_norm.weight | 0xc6217980 | 0x4000 |
|
| 318 |
+
| 219 | blk.23.ffn_up.weight | 0xc621b980 | 0x1f80000 |
|
| 319 |
+
| 220 | blk.24.attn_k.weight | 0xc819b980 | 0x240000 |
|
| 320 |
+
| 221 | blk.24.attn_norm.weight | 0xc83db980 | 0x4000 |
|
| 321 |
+
| 222 | blk.24.attn_output.weight | 0xc83df980 | 0x900000 |
|
| 322 |
+
| 223 | blk.24.attn_q.weight | 0xc8cdf980 | 0x900000 |
|
| 323 |
+
| 224 | blk.24.attn_v.weight | 0xc95df980 | 0x240000 |
|
| 324 |
+
| 225 | blk.24.ffn_down.weight | 0xc981f980 | 0x2680000 |
|
| 325 |
+
| 226 | blk.24.ffn_gate.weight | 0xcbe9f980 | 0x1f80000 |
|
| 326 |
+
| 227 | blk.24.ffn_norm.weight | 0xcde1f980 | 0x4000 |
|
| 327 |
+
| 228 | blk.24.ffn_up.weight | 0xcde23980 | 0x1f80000 |
|
| 328 |
+
| 229 | blk.25.attn_k.weight | 0xcfda3980 | 0x240000 |
|
| 329 |
+
| 230 | blk.25.attn_norm.weight | 0xcffe3980 | 0x4000 |
|
| 330 |
+
| 231 | blk.25.attn_output.weight | 0xcffe7980 | 0x900000 |
|
| 331 |
+
| 232 | blk.25.attn_q.weight | 0xd08e7980 | 0x900000 |
|
| 332 |
+
| 233 | blk.25.attn_v.weight | 0xd11e7980 | 0x240000 |
|
| 333 |
+
| 234 | blk.25.ffn_down.weight | 0xd1427980 | 0x2680000 |
|
| 334 |
+
| 235 | blk.25.ffn_gate.weight | 0xd3aa7980 | 0x1f80000 |
|
| 335 |
+
| 236 | blk.25.ffn_norm.weight | 0xd5a27980 | 0x4000 |
|
| 336 |
+
| 237 | blk.25.ffn_up.weight | 0xd5a2b980 | 0x1f80000 |
|
| 337 |
+
| 238 | blk.26.attn_k.weight | 0xd79ab980 | 0x240000 |
|
| 338 |
+
| 239 | blk.26.attn_norm.weight | 0xd7beb980 | 0x4000 |
|
| 339 |
+
| 240 | blk.26.attn_output.weight | 0xd7bef980 | 0x900000 |
|
| 340 |
+
| 241 | blk.26.attn_q.weight | 0xd84ef980 | 0x900000 |
|
| 341 |
+
| 242 | blk.26.attn_v.weight | 0xd8def980 | 0x240000 |
|
| 342 |
+
| 243 | blk.26.ffn_down.weight | 0xd902f980 | 0x2680000 |
|
| 343 |
+
| 244 | blk.26.ffn_gate.weight | 0xdb6af980 | 0x1f80000 |
|
| 344 |
+
| 245 | blk.26.ffn_norm.weight | 0xdd62f980 | 0x4000 |
|
| 345 |
+
| 246 | blk.26.ffn_up.weight | 0xdd633980 | 0x1f80000 |
|
| 346 |
+
| 247 | blk.27.attn_k.weight | 0xdf5b3980 | 0x240000 |
|
| 347 |
+
| 248 | blk.27.attn_norm.weight | 0xdf7f3980 | 0x4000 |
|
| 348 |
+
| 249 | blk.27.attn_output.weight | 0xdf7f7980 | 0x900000 |
|
| 349 |
+
| 250 | blk.27.attn_q.weight | 0xe00f7980 | 0x900000 |
|
| 350 |
+
| 251 | blk.27.attn_v.weight | 0xe09f7980 | 0x240000 |
|
| 351 |
+
| 252 | blk.27.ffn_down.weight | 0xe0c37980 | 0x2680000 |
|
| 352 |
+
| 253 | blk.27.ffn_gate.weight | 0xe32b7980 | 0x1f80000 |
|
| 353 |
+
| 254 | blk.27.ffn_norm.weight | 0xe5237980 | 0x4000 |
|
| 354 |
+
| 255 | blk.27.ffn_up.weight | 0xe523b980 | 0x1f80000 |
|
| 355 |
+
| 256 | blk.28.attn_k.weight | 0xe71bb980 | 0x240000 |
|
| 356 |
+
| 257 | blk.28.attn_norm.weight | 0xe73fb980 | 0x4000 |
|
| 357 |
+
| 258 | blk.28.attn_output.weight | 0xe73ff980 | 0x900000 |
|
| 358 |
+
| 259 | blk.28.attn_q.weight | 0xe7cff980 | 0x900000 |
|
| 359 |
+
| 260 | blk.28.attn_v.weight | 0xe85ff980 | 0x240000 |
|
| 360 |
+
| 261 | blk.28.ffn_down.weight | 0xe883f980 | 0x2680000 |
|
| 361 |
+
| 262 | blk.28.ffn_gate.weight | 0xeaebf980 | 0x1f80000 |
|
| 362 |
+
| 263 | blk.28.ffn_norm.weight | 0xece3f980 | 0x4000 |
|
| 363 |
+
| 264 | blk.28.ffn_up.weight | 0xece43980 | 0x1f80000 |
|
| 364 |
+
| 265 | blk.29.attn_k.weight | 0xeedc3980 | 0x240000 |
|
| 365 |
+
| 266 | blk.29.attn_norm.weight | 0xef003980 | 0x4000 |
|
| 366 |
+
| 267 | blk.29.attn_output.weight | 0xef007980 | 0x900000 |
|
| 367 |
+
| 268 | blk.29.attn_q.weight | 0xef907980 | 0x900000 |
|
| 368 |
+
| 269 | blk.29.attn_v.weight | 0xf0207980 | 0x240000 |
|
| 369 |
+
| 270 | blk.29.ffn_down.weight | 0xf0447980 | 0x2680000 |
|
| 370 |
+
| 271 | blk.29.ffn_gate.weight | 0xf2ac7980 | 0x1f80000 |
|
| 371 |
+
| 272 | blk.29.ffn_norm.weight | 0xf4a47980 | 0x4000 |
|
| 372 |
+
| 273 | blk.29.ffn_up.weight | 0xf4a4b980 | 0x1f80000 |
|
| 373 |
+
| 274 | blk.30.attn_k.weight | 0xf69cb980 | 0x240000 |
|
| 374 |
+
| 275 | blk.30.attn_norm.weight | 0xf6c0b980 | 0x4000 |
|
| 375 |
+
| 276 | blk.30.attn_output.weight | 0xf6c0f980 | 0x900000 |
|
| 376 |
+
| 277 | blk.30.attn_q.weight | 0xf750f980 | 0x900000 |
|
| 377 |
+
| 278 | blk.30.attn_v.weight | 0xf7e0f980 | 0x240000 |
|
| 378 |
+
| 279 | blk.30.ffn_down.weight | 0xf804f980 | 0x2680000 |
|
| 379 |
+
| 280 | blk.30.ffn_gate.weight | 0xfa6cf980 | 0x1f80000 |
|
| 380 |
+
| 281 | blk.30.ffn_norm.weight | 0xfc64f980 | 0x4000 |
|
| 381 |
+
| 282 | blk.30.ffn_up.weight | 0xfc653980 | 0x1f80000 |
|
| 382 |
+
| 283 | blk.31.attn_k.weight | 0xfe5d3980 | 0x1b8000 |
|
| 383 |
+
| 284 | blk.31.attn_norm.weight | 0xfe78b980 | 0x4000 |
|
| 384 |
+
| 285 | blk.31.attn_output.weight | 0xfe78f980 | 0x900000 |
|
| 385 |
+
| 286 | blk.31.attn_q.weight | 0xff08f980 | 0x6e0000 |
|
| 386 |
+
| 287 | blk.31.attn_v.weight | 0xff76f980 | 0x220000 |
|
| 387 |
+
| 288 | blk.31.ffn_down.weight | 0xff98f980 | 0x2680000 |
|
| 388 |
+
| 289 | blk.31.ffn_gate.weight | 0x10200f980 | 0x1f80000 |
|
| 389 |
+
| 290 | blk.31.ffn_norm.weight | 0x103f8f980 | 0x4000 |
|
| 390 |
+
| 291 | blk.31.ffn_up.weight | 0x103f93980 | 0x1f80000 |
|
| 391 |
+
|
| 392 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 393 |
+
|
| 394 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 395 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:-------|
|
| 396 |
+
| 0 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | IQ4_NL |
|
| 397 |
+
| 1 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 398 |
+
| 2 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 399 |
+
| 3 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | IQ3_S |
|
| 400 |
+
|
| 401 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 402 |
+
- Percentage of total elements: 13.08%
|
| 403 |
+
|
| 404 |
+
|
| 405 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 406 |
+
|
| 407 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 408 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-------|
|
| 409 |
+
| 4 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 410 |
+
| 5 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 411 |
+
| 6 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 412 |
+
| 7 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 413 |
+
| 8 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 414 |
+
| 9 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 415 |
+
| 10 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 416 |
+
| 11 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 417 |
+
| 12 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 418 |
+
|
| 419 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 420 |
+
- Percentage of total elements: 2.72%
|
| 421 |
+
|
| 422 |
+
|
| 423 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 424 |
+
|
| 425 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 426 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-------|
|
| 427 |
+
| 13 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 428 |
+
| 14 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 429 |
+
| 15 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 430 |
+
| 16 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 431 |
+
| 17 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 432 |
+
| 18 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 433 |
+
| 19 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 434 |
+
| 20 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 435 |
+
| 21 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 436 |
+
|
| 437 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 438 |
+
- Percentage of total elements: 2.72%
|
| 439 |
+
|
| 440 |
+
|
| 441 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 442 |
+
|
| 443 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 444 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-------|
|
| 445 |
+
| 22 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 446 |
+
| 23 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 447 |
+
| 24 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 448 |
+
| 25 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 449 |
+
| 26 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 450 |
+
| 27 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 451 |
+
| 28 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 452 |
+
| 29 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 453 |
+
| 30 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 454 |
+
|
| 455 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 456 |
+
- Percentage of total elements: 2.72%
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 460 |
+
|
| 461 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 462 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-------|
|
| 463 |
+
| 31 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 464 |
+
| 32 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 465 |
+
| 33 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 466 |
+
| 34 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 467 |
+
| 35 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 468 |
+
| 36 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 469 |
+
| 37 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 470 |
+
| 38 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 471 |
+
| 39 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 472 |
+
|
| 473 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 474 |
+
- Percentage of total elements: 2.72%
|
| 475 |
+
|
| 476 |
+
|
| 477 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 478 |
+
|
| 479 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 480 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-------|
|
| 481 |
+
| 40 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 482 |
+
| 41 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 483 |
+
| 42 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 484 |
+
| 43 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 485 |
+
| 44 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 486 |
+
| 45 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 487 |
+
| 46 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 488 |
+
| 47 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 489 |
+
| 48 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 490 |
+
|
| 491 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 492 |
+
- Percentage of total elements: 2.72%
|
| 493 |
+
|
| 494 |
+
|
| 495 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 496 |
+
|
| 497 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 498 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-------|
|
| 499 |
+
| 49 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 500 |
+
| 50 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 501 |
+
| 51 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 502 |
+
| 52 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 503 |
+
| 53 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 504 |
+
| 54 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 505 |
+
| 55 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 506 |
+
| 56 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 507 |
+
| 57 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 508 |
+
|
| 509 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 510 |
+
- Percentage of total elements: 2.72%
|
| 511 |
+
|
| 512 |
+
|
| 513 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 514 |
+
|
| 515 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 516 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-------|
|
| 517 |
+
| 58 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 518 |
+
| 59 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 519 |
+
| 60 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 520 |
+
| 61 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 521 |
+
| 62 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 522 |
+
| 63 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 523 |
+
| 64 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 524 |
+
| 65 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 525 |
+
| 66 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 526 |
+
|
| 527 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 528 |
+
- Percentage of total elements: 2.72%
|
| 529 |
+
|
| 530 |
+
|
| 531 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 532 |
+
|
| 533 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 534 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-------|
|
| 535 |
+
| 67 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 536 |
+
| 68 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 537 |
+
| 69 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 538 |
+
| 70 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 539 |
+
| 71 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 540 |
+
| 72 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 541 |
+
| 73 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 542 |
+
| 74 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 543 |
+
| 75 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 544 |
+
|
| 545 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 546 |
+
- Percentage of total elements: 2.72%
|
| 547 |
+
|
| 548 |
+
|
| 549 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 550 |
+
|
| 551 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 552 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-------|
|
| 553 |
+
| 76 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 554 |
+
| 77 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 555 |
+
| 78 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 556 |
+
| 79 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 557 |
+
| 80 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 558 |
+
| 81 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 559 |
+
| 82 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 560 |
+
| 83 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 561 |
+
| 84 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 562 |
+
|
| 563 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 564 |
+
- Percentage of total elements: 2.72%
|
| 565 |
+
|
| 566 |
+
|
| 567 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 568 |
+
|
| 569 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 570 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-------|
|
| 571 |
+
| 85 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 572 |
+
| 86 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 573 |
+
| 87 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 574 |
+
| 88 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 575 |
+
| 89 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 576 |
+
| 90 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 577 |
+
| 91 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 578 |
+
| 92 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 579 |
+
| 93 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 580 |
+
|
| 581 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 582 |
+
- Percentage of total elements: 2.72%
|
| 583 |
+
|
| 584 |
+
|
| 585 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 586 |
+
|
| 587 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 588 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 589 |
+
| 94 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 590 |
+
| 95 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 591 |
+
| 96 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 592 |
+
| 97 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 593 |
+
| 98 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 594 |
+
| 99 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 595 |
+
| 100 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 596 |
+
| 101 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 597 |
+
| 102 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 598 |
+
|
| 599 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 600 |
+
- Percentage of total elements: 2.72%
|
| 601 |
+
|
| 602 |
+
|
| 603 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 604 |
+
|
| 605 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 606 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 607 |
+
| 103 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 608 |
+
| 104 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 609 |
+
| 105 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 610 |
+
| 106 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 611 |
+
| 107 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 612 |
+
| 108 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 613 |
+
| 109 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 614 |
+
| 110 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 615 |
+
| 111 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 616 |
+
|
| 617 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 618 |
+
- Percentage of total elements: 2.72%
|
| 619 |
+
|
| 620 |
+
|
| 621 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 622 |
+
|
| 623 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 624 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 625 |
+
| 112 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 626 |
+
| 113 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 627 |
+
| 114 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 628 |
+
| 115 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 629 |
+
| 116 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 630 |
+
| 117 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 631 |
+
| 118 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 632 |
+
| 119 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 633 |
+
| 120 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 634 |
+
|
| 635 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 636 |
+
- Percentage of total elements: 2.72%
|
| 637 |
+
|
| 638 |
+
|
| 639 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 640 |
+
|
| 641 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 642 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 643 |
+
| 121 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 644 |
+
| 122 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 645 |
+
| 123 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 646 |
+
| 124 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 647 |
+
| 125 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 648 |
+
| 126 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 649 |
+
| 127 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 650 |
+
| 128 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 651 |
+
| 129 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 652 |
+
|
| 653 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 654 |
+
- Percentage of total elements: 2.72%
|
| 655 |
+
|
| 656 |
+
|
| 657 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 658 |
+
|
| 659 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 660 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 661 |
+
| 130 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 662 |
+
| 131 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 663 |
+
| 132 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 664 |
+
| 133 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 665 |
+
| 134 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 666 |
+
| 135 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 667 |
+
| 136 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 668 |
+
| 137 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 669 |
+
| 138 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 670 |
+
|
| 671 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 672 |
+
- Percentage of total elements: 2.72%
|
| 673 |
+
|
| 674 |
+
|
| 675 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 676 |
+
|
| 677 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 678 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 679 |
+
| 139 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 680 |
+
| 140 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 681 |
+
| 141 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 682 |
+
| 142 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 683 |
+
| 143 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 684 |
+
| 144 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 685 |
+
| 145 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 686 |
+
| 146 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 687 |
+
| 147 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ3_S |
|
| 688 |
+
|
| 689 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 690 |
+
- Percentage of total elements: 2.72%
|
| 691 |
+
|
| 692 |
+
|
| 693 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 694 |
+
|
| 695 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 696 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 697 |
+
| 148 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 698 |
+
| 149 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 699 |
+
| 150 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 700 |
+
| 151 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 701 |
+
| 152 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 702 |
+
| 153 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 703 |
+
| 154 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 704 |
+
| 155 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 705 |
+
| 156 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 706 |
+
|
| 707 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 708 |
+
- Percentage of total elements: 2.72%
|
| 709 |
+
|
| 710 |
+
|
| 711 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 712 |
+
|
| 713 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 714 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 715 |
+
| 157 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 716 |
+
| 158 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 717 |
+
| 159 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 718 |
+
| 160 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 719 |
+
| 161 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 720 |
+
| 162 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 721 |
+
| 163 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 722 |
+
| 164 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 723 |
+
| 165 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 724 |
+
|
| 725 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 726 |
+
- Percentage of total elements: 2.72%
|
| 727 |
+
|
| 728 |
+
|
| 729 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 730 |
+
|
| 731 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 732 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 733 |
+
| 166 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 734 |
+
| 167 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 735 |
+
| 168 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 736 |
+
| 169 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 737 |
+
| 170 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 738 |
+
| 171 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 739 |
+
| 172 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 740 |
+
| 173 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 741 |
+
| 174 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 742 |
+
|
| 743 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 744 |
+
- Percentage of total elements: 2.72%
|
| 745 |
+
|
| 746 |
+
|
| 747 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 748 |
+
|
| 749 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 750 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 751 |
+
| 175 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 752 |
+
| 176 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 177 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 754 |
+
| 178 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 755 |
+
| 179 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 756 |
+
| 180 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 757 |
+
| 181 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 758 |
+
| 182 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 759 |
+
| 183 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 760 |
+
|
| 761 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 762 |
+
- Percentage of total elements: 2.72%
|
| 763 |
+
|
| 764 |
+
|
| 765 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 766 |
+
|
| 767 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 768 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 769 |
+
| 184 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 770 |
+
| 185 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 771 |
+
| 186 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 772 |
+
| 187 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 773 |
+
| 188 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 774 |
+
| 189 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 775 |
+
| 190 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 776 |
+
| 191 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 777 |
+
| 192 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 778 |
+
|
| 779 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 780 |
+
- Percentage of total elements: 2.72%
|
| 781 |
+
|
| 782 |
+
|
| 783 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 784 |
+
|
| 785 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 786 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 787 |
+
| 193 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 788 |
+
| 194 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 789 |
+
| 195 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 790 |
+
| 196 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 791 |
+
| 197 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 792 |
+
| 198 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 793 |
+
| 199 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 794 |
+
| 200 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 795 |
+
| 201 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 796 |
+
|
| 797 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 798 |
+
- Percentage of total elements: 2.72%
|
| 799 |
+
|
| 800 |
+
|
| 801 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 802 |
+
|
| 803 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 804 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 805 |
+
| 202 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 806 |
+
| 203 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 807 |
+
| 204 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 808 |
+
| 205 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 809 |
+
| 206 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 810 |
+
| 207 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 811 |
+
| 208 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 812 |
+
| 209 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 813 |
+
| 210 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 814 |
+
|
| 815 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 816 |
+
- Percentage of total elements: 2.72%
|
| 817 |
+
|
| 818 |
+
|
| 819 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 820 |
+
|
| 821 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 822 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 823 |
+
| 211 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 824 |
+
| 212 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 825 |
+
| 213 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 826 |
+
| 214 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 827 |
+
| 215 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 828 |
+
| 216 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 829 |
+
| 217 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 830 |
+
| 218 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 831 |
+
| 219 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 832 |
+
|
| 833 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 834 |
+
- Percentage of total elements: 2.72%
|
| 835 |
+
|
| 836 |
+
|
| 837 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 838 |
+
|
| 839 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 840 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 841 |
+
| 220 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 842 |
+
| 221 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 843 |
+
| 222 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 844 |
+
| 223 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 845 |
+
| 224 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 846 |
+
| 225 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 847 |
+
| 226 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 848 |
+
| 227 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 849 |
+
| 228 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 850 |
+
|
| 851 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 852 |
+
- Percentage of total elements: 2.72%
|
| 853 |
+
|
| 854 |
+
|
| 855 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 856 |
+
|
| 857 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 858 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 859 |
+
| 229 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 860 |
+
| 230 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 861 |
+
| 231 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 862 |
+
| 232 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 863 |
+
| 233 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 864 |
+
| 234 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 865 |
+
| 235 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 866 |
+
| 236 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 867 |
+
| 237 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 868 |
+
|
| 869 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 870 |
+
- Percentage of total elements: 2.72%
|
| 871 |
+
|
| 872 |
+
|
| 873 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 874 |
+
|
| 875 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 876 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 877 |
+
| 238 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 878 |
+
| 239 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 879 |
+
| 240 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 880 |
+
| 241 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 881 |
+
| 242 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 882 |
+
| 243 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 883 |
+
| 244 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 884 |
+
| 245 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 885 |
+
| 246 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 886 |
+
|
| 887 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 888 |
+
- Percentage of total elements: 2.72%
|
| 889 |
+
|
| 890 |
+
|
| 891 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 892 |
+
|
| 893 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 894 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 895 |
+
| 247 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 896 |
+
| 248 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 897 |
+
| 249 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 898 |
+
| 250 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 899 |
+
| 251 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 900 |
+
| 252 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 901 |
+
| 253 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 902 |
+
| 254 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 903 |
+
| 255 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 904 |
+
|
| 905 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 906 |
+
- Percentage of total elements: 2.72%
|
| 907 |
+
|
| 908 |
+
|
| 909 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 910 |
+
|
| 911 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 912 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 913 |
+
| 256 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 914 |
+
| 257 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 915 |
+
| 258 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 916 |
+
| 259 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 917 |
+
| 260 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 918 |
+
| 261 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 919 |
+
| 262 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 920 |
+
| 263 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 921 |
+
| 264 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 922 |
+
|
| 923 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 924 |
+
- Percentage of total elements: 2.72%
|
| 925 |
+
|
| 926 |
+
|
| 927 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 928 |
+
|
| 929 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 930 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 931 |
+
| 265 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 932 |
+
| 266 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 933 |
+
| 267 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 934 |
+
| 268 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 935 |
+
| 269 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 936 |
+
| 270 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 937 |
+
| 271 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 938 |
+
| 272 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 939 |
+
| 273 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 940 |
+
|
| 941 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 942 |
+
- Percentage of total elements: 2.72%
|
| 943 |
+
|
| 944 |
+
|
| 945 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 946 |
+
|
| 947 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 948 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 949 |
+
| 274 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 950 |
+
| 275 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 951 |
+
| 276 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 952 |
+
| 277 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 953 |
+
| 278 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_NL |
|
| 954 |
+
| 279 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 955 |
+
| 280 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 956 |
+
| 281 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 957 |
+
| 282 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 958 |
+
|
| 959 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 960 |
+
- Percentage of total elements: 2.72%
|
| 961 |
+
|
| 962 |
+
|
| 963 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 964 |
+
|
| 965 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 966 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-------|
|
| 967 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ3_S |
|
| 968 |
+
| 284 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 969 |
+
| 285 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ4_NL |
|
| 970 |
+
| 286 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | IQ3_S |
|
| 971 |
+
| 287 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | IQ4_XS |
|
| 972 |
+
| 288 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 973 |
+
| 289 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 974 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 975 |
+
| 291 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | IQ4_NL |
|
| 976 |
+
|
| 977 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 978 |
+
- Percentage of total elements: 2.72%
|
scores/Llama-Guard-3-8B-Q3_K_L.md
ADDED
|
@@ -0,0 +1,978 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-Q3_K_L.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 39 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 36 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 36 |
+
| 26 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 38 |
+
| 28 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 39 |
+
| 29 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 40 |
+
| 30 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 41 |
+
| 31 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 43 |
+
| 33 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 44 |
+
| 34 | UINT32 | 1 | general.quantization_version | 2 |
|
| 45 |
+
| 35 | UINT32 | 1 | general.file_type | 13 |
|
| 46 |
+
| 36 | STRING | 1 | quantize.imatrix.file | `./imatrix/imatrix-Llama-Guard-3-8B-small.dat` |
|
| 47 |
+
| 37 | STRING | 1 | quantize.imatrix.dataset | `../../datasets/imatrix/calibration_eur_small.txt` |
|
| 48 |
+
| 38 | INT32 | 1 | quantize.imatrix.entries_count | 224 |
|
| 49 |
+
| 39 | INT32 | 1 | quantize.imatrix.chunks_count | 962 |
|
| 50 |
+
|
| 51 |
+
## Tensors Overview ~8B Elements
|
| 52 |
+
|
| 53 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 54 |
+
|
| 55 |
+
- [Llama-Guard-3-8B-Q3\_K\_L.gguf - GGUF Internal File Dump](#llama-guard-3-8b-q3_k_lgguf---gguf-internal-file-dump)
|
| 56 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 57 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 58 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 59 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 60 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 61 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 62 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 63 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 64 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 65 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 66 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 67 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 68 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 69 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 70 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 71 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 72 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 73 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 74 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 75 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 76 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 77 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 78 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 79 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 80 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 81 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 82 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 83 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 84 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 85 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 86 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 87 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 88 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 89 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 90 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 91 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 92 |
+
|
| 93 |
+
### Tensor Data Offset
|
| 94 |
+
|
| 95 |
+
This table contains the offset and data segment relative to start of file
|
| 96 |
+
|
| 97 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 98 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 99 |
+
| 0 | output.weight | 0x779880 | 0xd746000 |
|
| 100 |
+
| 1 | output_norm.weight | 0xdebf880 | 0x4000 |
|
| 101 |
+
| 2 | rope_freqs.weight | 0xdec3880 | 0x100 |
|
| 102 |
+
| 3 | token_embd.weight | 0xdec3980 | 0xd746000 |
|
| 103 |
+
| 4 | blk.0.attn_k.weight | 0x1b609980 | 0x150000 |
|
| 104 |
+
| 5 | blk.0.attn_norm.weight | 0x1b759980 | 0x4000 |
|
| 105 |
+
| 6 | blk.0.attn_output.weight | 0x1b75d980 | 0xb00000 |
|
| 106 |
+
| 7 | blk.0.attn_q.weight | 0x1c25d980 | 0x540000 |
|
| 107 |
+
| 8 | blk.0.attn_v.weight | 0x1c79d980 | 0x240000 |
|
| 108 |
+
| 9 | blk.0.ffn_down.weight | 0x1c9dd980 | 0x1f80000 |
|
| 109 |
+
| 10 | blk.0.ffn_gate.weight | 0x1e95d980 | 0x1260000 |
|
| 110 |
+
| 11 | blk.0.ffn_norm.weight | 0x1fbbd980 | 0x4000 |
|
| 111 |
+
| 12 | blk.0.ffn_up.weight | 0x1fbc1980 | 0x1260000 |
|
| 112 |
+
| 13 | blk.1.attn_k.weight | 0x20e21980 | 0x150000 |
|
| 113 |
+
| 14 | blk.1.attn_norm.weight | 0x20f71980 | 0x4000 |
|
| 114 |
+
| 15 | blk.1.attn_output.weight | 0x20f75980 | 0xb00000 |
|
| 115 |
+
| 16 | blk.1.attn_q.weight | 0x21a75980 | 0x540000 |
|
| 116 |
+
| 17 | blk.1.attn_v.weight | 0x21fb5980 | 0x240000 |
|
| 117 |
+
| 18 | blk.1.ffn_down.weight | 0x221f5980 | 0x2680000 |
|
| 118 |
+
| 19 | blk.1.ffn_gate.weight | 0x24875980 | 0x1260000 |
|
| 119 |
+
| 20 | blk.1.ffn_norm.weight | 0x25ad5980 | 0x4000 |
|
| 120 |
+
| 21 | blk.1.ffn_up.weight | 0x25ad9980 | 0x1260000 |
|
| 121 |
+
| 22 | blk.2.attn_k.weight | 0x26d39980 | 0x150000 |
|
| 122 |
+
| 23 | blk.2.attn_norm.weight | 0x26e89980 | 0x4000 |
|
| 123 |
+
| 24 | blk.2.attn_output.weight | 0x26e8d980 | 0xb00000 |
|
| 124 |
+
| 25 | blk.2.attn_q.weight | 0x2798d980 | 0x540000 |
|
| 125 |
+
| 26 | blk.2.attn_v.weight | 0x27ecd980 | 0x240000 |
|
| 126 |
+
| 27 | blk.2.ffn_down.weight | 0x2810d980 | 0x1f80000 |
|
| 127 |
+
| 28 | blk.2.ffn_gate.weight | 0x2a08d980 | 0x1260000 |
|
| 128 |
+
| 29 | blk.2.ffn_norm.weight | 0x2b2ed980 | 0x4000 |
|
| 129 |
+
| 30 | blk.2.ffn_up.weight | 0x2b2f1980 | 0x1260000 |
|
| 130 |
+
| 31 | blk.3.attn_k.weight | 0x2c551980 | 0x150000 |
|
| 131 |
+
| 32 | blk.3.attn_norm.weight | 0x2c6a1980 | 0x4000 |
|
| 132 |
+
| 33 | blk.3.attn_output.weight | 0x2c6a5980 | 0xb00000 |
|
| 133 |
+
| 34 | blk.3.attn_q.weight | 0x2d1a5980 | 0x540000 |
|
| 134 |
+
| 35 | blk.3.attn_v.weight | 0x2d6e5980 | 0x240000 |
|
| 135 |
+
| 36 | blk.3.ffn_down.weight | 0x2d925980 | 0x1f80000 |
|
| 136 |
+
| 37 | blk.3.ffn_gate.weight | 0x2f8a5980 | 0x1260000 |
|
| 137 |
+
| 38 | blk.3.ffn_norm.weight | 0x30b05980 | 0x4000 |
|
| 138 |
+
| 39 | blk.3.ffn_up.weight | 0x30b09980 | 0x1260000 |
|
| 139 |
+
| 40 | blk.4.attn_k.weight | 0x31d69980 | 0x150000 |
|
| 140 |
+
| 41 | blk.4.attn_norm.weight | 0x31eb9980 | 0x4000 |
|
| 141 |
+
| 42 | blk.4.attn_output.weight | 0x31ebd980 | 0xb00000 |
|
| 142 |
+
| 43 | blk.4.attn_q.weight | 0x329bd980 | 0x540000 |
|
| 143 |
+
| 44 | blk.4.attn_v.weight | 0x32efd980 | 0x240000 |
|
| 144 |
+
| 45 | blk.4.ffn_down.weight | 0x3313d980 | 0x1f80000 |
|
| 145 |
+
| 46 | blk.4.ffn_gate.weight | 0x350bd980 | 0x1260000 |
|
| 146 |
+
| 47 | blk.4.ffn_norm.weight | 0x3631d980 | 0x4000 |
|
| 147 |
+
| 48 | blk.4.ffn_up.weight | 0x36321980 | 0x1260000 |
|
| 148 |
+
| 49 | blk.5.attn_k.weight | 0x37581980 | 0x150000 |
|
| 149 |
+
| 50 | blk.5.attn_norm.weight | 0x376d1980 | 0x4000 |
|
| 150 |
+
| 51 | blk.5.attn_output.weight | 0x376d5980 | 0xb00000 |
|
| 151 |
+
| 52 | blk.5.attn_q.weight | 0x381d5980 | 0x540000 |
|
| 152 |
+
| 53 | blk.5.attn_v.weight | 0x38715980 | 0x240000 |
|
| 153 |
+
| 54 | blk.5.ffn_down.weight | 0x38955980 | 0x1f80000 |
|
| 154 |
+
| 55 | blk.5.ffn_gate.weight | 0x3a8d5980 | 0x1260000 |
|
| 155 |
+
| 56 | blk.5.ffn_norm.weight | 0x3bb35980 | 0x4000 |
|
| 156 |
+
| 57 | blk.5.ffn_up.weight | 0x3bb39980 | 0x1260000 |
|
| 157 |
+
| 58 | blk.6.attn_k.weight | 0x3cd99980 | 0x150000 |
|
| 158 |
+
| 59 | blk.6.attn_norm.weight | 0x3cee9980 | 0x4000 |
|
| 159 |
+
| 60 | blk.6.attn_output.weight | 0x3ceed980 | 0xb00000 |
|
| 160 |
+
| 61 | blk.6.attn_q.weight | 0x3d9ed980 | 0x540000 |
|
| 161 |
+
| 62 | blk.6.attn_v.weight | 0x3df2d980 | 0x240000 |
|
| 162 |
+
| 63 | blk.6.ffn_down.weight | 0x3e16d980 | 0x1f80000 |
|
| 163 |
+
| 64 | blk.6.ffn_gate.weight | 0x400ed980 | 0x1260000 |
|
| 164 |
+
| 65 | blk.6.ffn_norm.weight | 0x4134d980 | 0x4000 |
|
| 165 |
+
| 66 | blk.6.ffn_up.weight | 0x41351980 | 0x1260000 |
|
| 166 |
+
| 67 | blk.7.attn_k.weight | 0x425b1980 | 0x150000 |
|
| 167 |
+
| 68 | blk.7.attn_norm.weight | 0x42701980 | 0x4000 |
|
| 168 |
+
| 69 | blk.7.attn_output.weight | 0x42705980 | 0xb00000 |
|
| 169 |
+
| 70 | blk.7.attn_q.weight | 0x43205980 | 0x540000 |
|
| 170 |
+
| 71 | blk.7.attn_v.weight | 0x43745980 | 0x240000 |
|
| 171 |
+
| 72 | blk.7.ffn_down.weight | 0x43985980 | 0x1f80000 |
|
| 172 |
+
| 73 | blk.7.ffn_gate.weight | 0x45905980 | 0x1260000 |
|
| 173 |
+
| 74 | blk.7.ffn_norm.weight | 0x46b65980 | 0x4000 |
|
| 174 |
+
| 75 | blk.7.ffn_up.weight | 0x46b69980 | 0x1260000 |
|
| 175 |
+
| 76 | blk.8.attn_k.weight | 0x47dc9980 | 0x150000 |
|
| 176 |
+
| 77 | blk.8.attn_norm.weight | 0x47f19980 | 0x4000 |
|
| 177 |
+
| 78 | blk.8.attn_output.weight | 0x47f1d980 | 0xb00000 |
|
| 178 |
+
| 79 | blk.8.attn_q.weight | 0x48a1d980 | 0x540000 |
|
| 179 |
+
| 80 | blk.8.attn_v.weight | 0x48f5d980 | 0x240000 |
|
| 180 |
+
| 81 | blk.8.ffn_down.weight | 0x4919d980 | 0x1f80000 |
|
| 181 |
+
| 82 | blk.8.ffn_gate.weight | 0x4b11d980 | 0x1260000 |
|
| 182 |
+
| 83 | blk.8.ffn_norm.weight | 0x4c37d980 | 0x4000 |
|
| 183 |
+
| 84 | blk.8.ffn_up.weight | 0x4c381980 | 0x1260000 |
|
| 184 |
+
| 85 | blk.9.attn_k.weight | 0x4d5e1980 | 0x150000 |
|
| 185 |
+
| 86 | blk.9.attn_norm.weight | 0x4d731980 | 0x4000 |
|
| 186 |
+
| 87 | blk.9.attn_output.weight | 0x4d735980 | 0xb00000 |
|
| 187 |
+
| 88 | blk.9.attn_q.weight | 0x4e235980 | 0x540000 |
|
| 188 |
+
| 89 | blk.9.attn_v.weight | 0x4e775980 | 0x240000 |
|
| 189 |
+
| 90 | blk.9.ffn_down.weight | 0x4e9b5980 | 0x1f80000 |
|
| 190 |
+
| 91 | blk.9.ffn_gate.weight | 0x50935980 | 0x1260000 |
|
| 191 |
+
| 92 | blk.9.ffn_norm.weight | 0x51b95980 | 0x4000 |
|
| 192 |
+
| 93 | blk.9.ffn_up.weight | 0x51b99980 | 0x1260000 |
|
| 193 |
+
| 94 | blk.10.attn_k.weight | 0x52df9980 | 0x150000 |
|
| 194 |
+
| 95 | blk.10.attn_norm.weight | 0x52f49980 | 0x4000 |
|
| 195 |
+
| 96 | blk.10.attn_output.weight | 0x52f4d980 | 0xb00000 |
|
| 196 |
+
| 97 | blk.10.attn_q.weight | 0x53a4d980 | 0x540000 |
|
| 197 |
+
| 98 | blk.10.attn_v.weight | 0x53f8d980 | 0x240000 |
|
| 198 |
+
| 99 | blk.10.ffn_down.weight | 0x541cd980 | 0x1f80000 |
|
| 199 |
+
| 100 | blk.10.ffn_gate.weight | 0x5614d980 | 0x1260000 |
|
| 200 |
+
| 101 | blk.10.ffn_norm.weight | 0x573ad980 | 0x4000 |
|
| 201 |
+
| 102 | blk.10.ffn_up.weight | 0x573b1980 | 0x1260000 |
|
| 202 |
+
| 103 | blk.11.attn_k.weight | 0x58611980 | 0x150000 |
|
| 203 |
+
| 104 | blk.11.attn_norm.weight | 0x58761980 | 0x4000 |
|
| 204 |
+
| 105 | blk.11.attn_output.weight | 0x58765980 | 0xb00000 |
|
| 205 |
+
| 106 | blk.11.attn_q.weight | 0x59265980 | 0x540000 |
|
| 206 |
+
| 107 | blk.11.attn_v.weight | 0x597a5980 | 0x240000 |
|
| 207 |
+
| 108 | blk.11.ffn_down.weight | 0x599e5980 | 0x1f80000 |
|
| 208 |
+
| 109 | blk.11.ffn_gate.weight | 0x5b965980 | 0x1260000 |
|
| 209 |
+
| 110 | blk.11.ffn_norm.weight | 0x5cbc5980 | 0x4000 |
|
| 210 |
+
| 111 | blk.11.ffn_up.weight | 0x5cbc9980 | 0x1260000 |
|
| 211 |
+
| 112 | blk.12.attn_k.weight | 0x5de29980 | 0x150000 |
|
| 212 |
+
| 113 | blk.12.attn_norm.weight | 0x5df79980 | 0x4000 |
|
| 213 |
+
| 114 | blk.12.attn_output.weight | 0x5df7d980 | 0xb00000 |
|
| 214 |
+
| 115 | blk.12.attn_q.weight | 0x5ea7d980 | 0x540000 |
|
| 215 |
+
| 116 | blk.12.attn_v.weight | 0x5efbd980 | 0x240000 |
|
| 216 |
+
| 117 | blk.12.ffn_down.weight | 0x5f1fd980 | 0x1f80000 |
|
| 217 |
+
| 118 | blk.12.ffn_gate.weight | 0x6117d980 | 0x1260000 |
|
| 218 |
+
| 119 | blk.12.ffn_norm.weight | 0x623dd980 | 0x4000 |
|
| 219 |
+
| 120 | blk.12.ffn_up.weight | 0x623e1980 | 0x1260000 |
|
| 220 |
+
| 121 | blk.13.attn_k.weight | 0x63641980 | 0x1b8000 |
|
| 221 |
+
| 122 | blk.13.attn_norm.weight | 0x637f9980 | 0x4000 |
|
| 222 |
+
| 123 | blk.13.attn_output.weight | 0x637fd980 | 0xb00000 |
|
| 223 |
+
| 124 | blk.13.attn_q.weight | 0x642fd980 | 0x6e0000 |
|
| 224 |
+
| 125 | blk.13.attn_v.weight | 0x649dd980 | 0x240000 |
|
| 225 |
+
| 126 | blk.13.ffn_down.weight | 0x64c1d980 | 0x1f80000 |
|
| 226 |
+
| 127 | blk.13.ffn_gate.weight | 0x66b9d980 | 0x1260000 |
|
| 227 |
+
| 128 | blk.13.ffn_norm.weight | 0x67dfd980 | 0x4000 |
|
| 228 |
+
| 129 | blk.13.ffn_up.weight | 0x67e01980 | 0x1260000 |
|
| 229 |
+
| 130 | blk.14.attn_k.weight | 0x69061980 | 0x1b8000 |
|
| 230 |
+
| 131 | blk.14.attn_norm.weight | 0x69219980 | 0x4000 |
|
| 231 |
+
| 132 | blk.14.attn_output.weight | 0x6921d980 | 0xb00000 |
|
| 232 |
+
| 133 | blk.14.attn_q.weight | 0x69d1d980 | 0x6e0000 |
|
| 233 |
+
| 134 | blk.14.attn_v.weight | 0x6a3fd980 | 0x240000 |
|
| 234 |
+
| 135 | blk.14.ffn_down.weight | 0x6a63d980 | 0x1f80000 |
|
| 235 |
+
| 136 | blk.14.ffn_gate.weight | 0x6c5bd980 | 0x1260000 |
|
| 236 |
+
| 137 | blk.14.ffn_norm.weight | 0x6d81d980 | 0x4000 |
|
| 237 |
+
| 138 | blk.14.ffn_up.weight | 0x6d821980 | 0x1260000 |
|
| 238 |
+
| 139 | blk.15.attn_k.weight | 0x6ea81980 | 0x150000 |
|
| 239 |
+
| 140 | blk.15.attn_norm.weight | 0x6ebd1980 | 0x4000 |
|
| 240 |
+
| 141 | blk.15.attn_output.weight | 0x6ebd5980 | 0xb00000 |
|
| 241 |
+
| 142 | blk.15.attn_q.weight | 0x6f6d5980 | 0x540000 |
|
| 242 |
+
| 143 | blk.15.attn_v.weight | 0x6fc15980 | 0x240000 |
|
| 243 |
+
| 144 | blk.15.ffn_down.weight | 0x6fe55980 | 0x1f80000 |
|
| 244 |
+
| 145 | blk.15.ffn_gate.weight | 0x71dd5980 | 0x1260000 |
|
| 245 |
+
| 146 | blk.15.ffn_norm.weight | 0x73035980 | 0x4000 |
|
| 246 |
+
| 147 | blk.15.ffn_up.weight | 0x73039980 | 0x1260000 |
|
| 247 |
+
| 148 | blk.16.attn_k.weight | 0x74299980 | 0x1b8000 |
|
| 248 |
+
| 149 | blk.16.attn_norm.weight | 0x74451980 | 0x4000 |
|
| 249 |
+
| 150 | blk.16.attn_output.weight | 0x74455980 | 0xb00000 |
|
| 250 |
+
| 151 | blk.16.attn_q.weight | 0x74f55980 | 0x6e0000 |
|
| 251 |
+
| 152 | blk.16.attn_v.weight | 0x75635980 | 0x240000 |
|
| 252 |
+
| 153 | blk.16.ffn_down.weight | 0x75875980 | 0x1f80000 |
|
| 253 |
+
| 154 | blk.16.ffn_gate.weight | 0x777f5980 | 0x1810000 |
|
| 254 |
+
| 155 | blk.16.ffn_norm.weight | 0x79005980 | 0x4000 |
|
| 255 |
+
| 156 | blk.16.ffn_up.weight | 0x79009980 | 0x1810000 |
|
| 256 |
+
| 157 | blk.17.attn_k.weight | 0x7a819980 | 0x150000 |
|
| 257 |
+
| 158 | blk.17.attn_norm.weight | 0x7a969980 | 0x4000 |
|
| 258 |
+
| 159 | blk.17.attn_output.weight | 0x7a96d980 | 0xb00000 |
|
| 259 |
+
| 160 | blk.17.attn_q.weight | 0x7b46d980 | 0x540000 |
|
| 260 |
+
| 161 | blk.17.attn_v.weight | 0x7b9ad980 | 0x240000 |
|
| 261 |
+
| 162 | blk.17.ffn_down.weight | 0x7bbed980 | 0x2680000 |
|
| 262 |
+
| 163 | blk.17.ffn_gate.weight | 0x7e26d980 | 0x1810000 |
|
| 263 |
+
| 164 | blk.17.ffn_norm.weight | 0x7fa7d980 | 0x4000 |
|
| 264 |
+
| 165 | blk.17.ffn_up.weight | 0x7fa81980 | 0x1810000 |
|
| 265 |
+
| 166 | blk.18.attn_k.weight | 0x81291980 | 0x1b8000 |
|
| 266 |
+
| 167 | blk.18.attn_norm.weight | 0x81449980 | 0x4000 |
|
| 267 |
+
| 168 | blk.18.attn_output.weight | 0x8144d980 | 0xb00000 |
|
| 268 |
+
| 169 | blk.18.attn_q.weight | 0x81f4d980 | 0x6e0000 |
|
| 269 |
+
| 170 | blk.18.attn_v.weight | 0x8262d980 | 0x240000 |
|
| 270 |
+
| 171 | blk.18.ffn_down.weight | 0x8286d980 | 0x2680000 |
|
| 271 |
+
| 172 | blk.18.ffn_gate.weight | 0x84eed980 | 0x1810000 |
|
| 272 |
+
| 173 | blk.18.ffn_norm.weight | 0x866fd980 | 0x4000 |
|
| 273 |
+
| 174 | blk.18.ffn_up.weight | 0x86701980 | 0x1810000 |
|
| 274 |
+
| 175 | blk.19.attn_k.weight | 0x87f11980 | 0x1b8000 |
|
| 275 |
+
| 176 | blk.19.attn_norm.weight | 0x880c9980 | 0x4000 |
|
| 276 |
+
| 177 | blk.19.attn_output.weight | 0x880cd980 | 0xb00000 |
|
| 277 |
+
| 178 | blk.19.attn_q.weight | 0x88bcd980 | 0x6e0000 |
|
| 278 |
+
| 179 | blk.19.attn_v.weight | 0x892ad980 | 0x240000 |
|
| 279 |
+
| 180 | blk.19.ffn_down.weight | 0x894ed980 | 0x2680000 |
|
| 280 |
+
| 181 | blk.19.ffn_gate.weight | 0x8bb6d980 | 0x1810000 |
|
| 281 |
+
| 182 | blk.19.ffn_norm.weight | 0x8d37d980 | 0x4000 |
|
| 282 |
+
| 183 | blk.19.ffn_up.weight | 0x8d381980 | 0x1810000 |
|
| 283 |
+
| 184 | blk.20.attn_k.weight | 0x8eb91980 | 0x1b8000 |
|
| 284 |
+
| 185 | blk.20.attn_norm.weight | 0x8ed49980 | 0x4000 |
|
| 285 |
+
| 186 | blk.20.attn_output.weight | 0x8ed4d980 | 0xb00000 |
|
| 286 |
+
| 187 | blk.20.attn_q.weight | 0x8f84d980 | 0x6e0000 |
|
| 287 |
+
| 188 | blk.20.attn_v.weight | 0x8ff2d980 | 0x240000 |
|
| 288 |
+
| 189 | blk.20.ffn_down.weight | 0x9016d980 | 0x2680000 |
|
| 289 |
+
| 190 | blk.20.ffn_gate.weight | 0x927ed980 | 0x1810000 |
|
| 290 |
+
| 191 | blk.20.ffn_norm.weight | 0x93ffd980 | 0x4000 |
|
| 291 |
+
| 192 | blk.20.ffn_up.weight | 0x94001980 | 0x1810000 |
|
| 292 |
+
| 193 | blk.21.attn_k.weight | 0x95811980 | 0x1b8000 |
|
| 293 |
+
| 194 | blk.21.attn_norm.weight | 0x959c9980 | 0x4000 |
|
| 294 |
+
| 195 | blk.21.attn_output.weight | 0x959cd980 | 0xb00000 |
|
| 295 |
+
| 196 | blk.21.attn_q.weight | 0x964cd980 | 0x6e0000 |
|
| 296 |
+
| 197 | blk.21.attn_v.weight | 0x96bad980 | 0x240000 |
|
| 297 |
+
| 198 | blk.21.ffn_down.weight | 0x96ded980 | 0x2680000 |
|
| 298 |
+
| 199 | blk.21.ffn_gate.weight | 0x9946d980 | 0x1810000 |
|
| 299 |
+
| 200 | blk.21.ffn_norm.weight | 0x9ac7d980 | 0x4000 |
|
| 300 |
+
| 201 | blk.21.ffn_up.weight | 0x9ac81980 | 0x1810000 |
|
| 301 |
+
| 202 | blk.22.attn_k.weight | 0x9c491980 | 0x1b8000 |
|
| 302 |
+
| 203 | blk.22.attn_norm.weight | 0x9c649980 | 0x4000 |
|
| 303 |
+
| 204 | blk.22.attn_output.weight | 0x9c64d980 | 0xb00000 |
|
| 304 |
+
| 205 | blk.22.attn_q.weight | 0x9d14d980 | 0x6e0000 |
|
| 305 |
+
| 206 | blk.22.attn_v.weight | 0x9d82d980 | 0x240000 |
|
| 306 |
+
| 207 | blk.22.ffn_down.weight | 0x9da6d980 | 0x2680000 |
|
| 307 |
+
| 208 | blk.22.ffn_gate.weight | 0xa00ed980 | 0x1810000 |
|
| 308 |
+
| 209 | blk.22.ffn_norm.weight | 0xa18fd980 | 0x4000 |
|
| 309 |
+
| 210 | blk.22.ffn_up.weight | 0xa1901980 | 0x1810000 |
|
| 310 |
+
| 211 | blk.23.attn_k.weight | 0xa3111980 | 0x1b8000 |
|
| 311 |
+
| 212 | blk.23.attn_norm.weight | 0xa32c9980 | 0x4000 |
|
| 312 |
+
| 213 | blk.23.attn_output.weight | 0xa32cd980 | 0xb00000 |
|
| 313 |
+
| 214 | blk.23.attn_q.weight | 0xa3dcd980 | 0x6e0000 |
|
| 314 |
+
| 215 | blk.23.attn_v.weight | 0xa44ad980 | 0x240000 |
|
| 315 |
+
| 216 | blk.23.ffn_down.weight | 0xa46ed980 | 0x2680000 |
|
| 316 |
+
| 217 | blk.23.ffn_gate.weight | 0xa6d6d980 | 0x1810000 |
|
| 317 |
+
| 218 | blk.23.ffn_norm.weight | 0xa857d980 | 0x4000 |
|
| 318 |
+
| 219 | blk.23.ffn_up.weight | 0xa8581980 | 0x1810000 |
|
| 319 |
+
| 220 | blk.24.attn_k.weight | 0xa9d91980 | 0x1b8000 |
|
| 320 |
+
| 221 | blk.24.attn_norm.weight | 0xa9f49980 | 0x4000 |
|
| 321 |
+
| 222 | blk.24.attn_output.weight | 0xa9f4d980 | 0xb00000 |
|
| 322 |
+
| 223 | blk.24.attn_q.weight | 0xaaa4d980 | 0x6e0000 |
|
| 323 |
+
| 224 | blk.24.attn_v.weight | 0xab12d980 | 0x240000 |
|
| 324 |
+
| 225 | blk.24.ffn_down.weight | 0xab36d980 | 0x2680000 |
|
| 325 |
+
| 226 | blk.24.ffn_gate.weight | 0xad9ed980 | 0x1810000 |
|
| 326 |
+
| 227 | blk.24.ffn_norm.weight | 0xaf1fd980 | 0x4000 |
|
| 327 |
+
| 228 | blk.24.ffn_up.weight | 0xaf201980 | 0x1810000 |
|
| 328 |
+
| 229 | blk.25.attn_k.weight | 0xb0a11980 | 0x1b8000 |
|
| 329 |
+
| 230 | blk.25.attn_norm.weight | 0xb0bc9980 | 0x4000 |
|
| 330 |
+
| 231 | blk.25.attn_output.weight | 0xb0bcd980 | 0xb00000 |
|
| 331 |
+
| 232 | blk.25.attn_q.weight | 0xb16cd980 | 0x6e0000 |
|
| 332 |
+
| 233 | blk.25.attn_v.weight | 0xb1dad980 | 0x240000 |
|
| 333 |
+
| 234 | blk.25.ffn_down.weight | 0xb1fed980 | 0x2680000 |
|
| 334 |
+
| 235 | blk.25.ffn_gate.weight | 0xb466d980 | 0x1810000 |
|
| 335 |
+
| 236 | blk.25.ffn_norm.weight | 0xb5e7d980 | 0x4000 |
|
| 336 |
+
| 237 | blk.25.ffn_up.weight | 0xb5e81980 | 0x1810000 |
|
| 337 |
+
| 238 | blk.26.attn_k.weight | 0xb7691980 | 0x1b8000 |
|
| 338 |
+
| 239 | blk.26.attn_norm.weight | 0xb7849980 | 0x4000 |
|
| 339 |
+
| 240 | blk.26.attn_output.weight | 0xb784d980 | 0xb00000 |
|
| 340 |
+
| 241 | blk.26.attn_q.weight | 0xb834d980 | 0x6e0000 |
|
| 341 |
+
| 242 | blk.26.attn_v.weight | 0xb8a2d980 | 0x240000 |
|
| 342 |
+
| 243 | blk.26.ffn_down.weight | 0xb8c6d980 | 0x2680000 |
|
| 343 |
+
| 244 | blk.26.ffn_gate.weight | 0xbb2ed980 | 0x1810000 |
|
| 344 |
+
| 245 | blk.26.ffn_norm.weight | 0xbcafd980 | 0x4000 |
|
| 345 |
+
| 246 | blk.26.ffn_up.weight | 0xbcb01980 | 0x1810000 |
|
| 346 |
+
| 247 | blk.27.attn_k.weight | 0xbe311980 | 0x1b8000 |
|
| 347 |
+
| 248 | blk.27.attn_norm.weight | 0xbe4c9980 | 0x4000 |
|
| 348 |
+
| 249 | blk.27.attn_output.weight | 0xbe4cd980 | 0xb00000 |
|
| 349 |
+
| 250 | blk.27.attn_q.weight | 0xbefcd980 | 0x6e0000 |
|
| 350 |
+
| 251 | blk.27.attn_v.weight | 0xbf6ad980 | 0x240000 |
|
| 351 |
+
| 252 | blk.27.ffn_down.weight | 0xbf8ed980 | 0x2680000 |
|
| 352 |
+
| 253 | blk.27.ffn_gate.weight | 0xc1f6d980 | 0x1810000 |
|
| 353 |
+
| 254 | blk.27.ffn_norm.weight | 0xc377d980 | 0x4000 |
|
| 354 |
+
| 255 | blk.27.ffn_up.weight | 0xc3781980 | 0x1810000 |
|
| 355 |
+
| 256 | blk.28.attn_k.weight | 0xc4f91980 | 0x1b8000 |
|
| 356 |
+
| 257 | blk.28.attn_norm.weight | 0xc5149980 | 0x4000 |
|
| 357 |
+
| 258 | blk.28.attn_output.weight | 0xc514d980 | 0xb00000 |
|
| 358 |
+
| 259 | blk.28.attn_q.weight | 0xc5c4d980 | 0x6e0000 |
|
| 359 |
+
| 260 | blk.28.attn_v.weight | 0xc632d980 | 0x240000 |
|
| 360 |
+
| 261 | blk.28.ffn_down.weight | 0xc656d980 | 0x2680000 |
|
| 361 |
+
| 262 | blk.28.ffn_gate.weight | 0xc8bed980 | 0x1810000 |
|
| 362 |
+
| 263 | blk.28.ffn_norm.weight | 0xca3fd980 | 0x4000 |
|
| 363 |
+
| 264 | blk.28.ffn_up.weight | 0xca401980 | 0x1810000 |
|
| 364 |
+
| 265 | blk.29.attn_k.weight | 0xcbc11980 | 0x1b8000 |
|
| 365 |
+
| 266 | blk.29.attn_norm.weight | 0xcbdc9980 | 0x4000 |
|
| 366 |
+
| 267 | blk.29.attn_output.weight | 0xcbdcd980 | 0xb00000 |
|
| 367 |
+
| 268 | blk.29.attn_q.weight | 0xcc8cd980 | 0x6e0000 |
|
| 368 |
+
| 269 | blk.29.attn_v.weight | 0xccfad980 | 0x240000 |
|
| 369 |
+
| 270 | blk.29.ffn_down.weight | 0xcd1ed980 | 0x2680000 |
|
| 370 |
+
| 271 | blk.29.ffn_gate.weight | 0xcf86d980 | 0x1810000 |
|
| 371 |
+
| 272 | blk.29.ffn_norm.weight | 0xd107d980 | 0x4000 |
|
| 372 |
+
| 273 | blk.29.ffn_up.weight | 0xd1081980 | 0x1810000 |
|
| 373 |
+
| 274 | blk.30.attn_k.weight | 0xd2891980 | 0x1b8000 |
|
| 374 |
+
| 275 | blk.30.attn_norm.weight | 0xd2a49980 | 0x4000 |
|
| 375 |
+
| 276 | blk.30.attn_output.weight | 0xd2a4d980 | 0xb00000 |
|
| 376 |
+
| 277 | blk.30.attn_q.weight | 0xd354d980 | 0x6e0000 |
|
| 377 |
+
| 278 | blk.30.attn_v.weight | 0xd3c2d980 | 0x240000 |
|
| 378 |
+
| 279 | blk.30.ffn_down.weight | 0xd3e6d980 | 0x2680000 |
|
| 379 |
+
| 280 | blk.30.ffn_gate.weight | 0xd64ed980 | 0x1810000 |
|
| 380 |
+
| 281 | blk.30.ffn_norm.weight | 0xd7cfd980 | 0x4000 |
|
| 381 |
+
| 282 | blk.30.ffn_up.weight | 0xd7d01980 | 0x1810000 |
|
| 382 |
+
| 283 | blk.31.attn_k.weight | 0xd9511980 | 0x150000 |
|
| 383 |
+
| 284 | blk.31.attn_norm.weight | 0xd9661980 | 0x4000 |
|
| 384 |
+
| 285 | blk.31.attn_output.weight | 0xd9665980 | 0xb00000 |
|
| 385 |
+
| 286 | blk.31.attn_q.weight | 0xda165980 | 0x540000 |
|
| 386 |
+
| 287 | blk.31.attn_v.weight | 0xda6a5980 | 0x240000 |
|
| 387 |
+
| 288 | blk.31.ffn_down.weight | 0xda8e5980 | 0x2680000 |
|
| 388 |
+
| 289 | blk.31.ffn_gate.weight | 0xdcf65980 | 0x1810000 |
|
| 389 |
+
| 290 | blk.31.ffn_norm.weight | 0xde775980 | 0x4000 |
|
| 390 |
+
| 291 | blk.31.ffn_up.weight | 0xde779980 | 0x1810000 |
|
| 391 |
+
|
| 392 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 393 |
+
|
| 394 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 395 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:-----|
|
| 396 |
+
| 0 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q3_K |
|
| 397 |
+
| 1 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 398 |
+
| 2 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 399 |
+
| 3 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q3_K |
|
| 400 |
+
|
| 401 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 402 |
+
- Percentage of total elements: 13.08%
|
| 403 |
+
|
| 404 |
+
|
| 405 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 406 |
+
|
| 407 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 408 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 409 |
+
| 4 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 410 |
+
| 5 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 411 |
+
| 6 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 412 |
+
| 7 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 413 |
+
| 8 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 414 |
+
| 9 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 415 |
+
| 10 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 416 |
+
| 11 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 417 |
+
| 12 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 418 |
+
|
| 419 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 420 |
+
- Percentage of total elements: 2.72%
|
| 421 |
+
|
| 422 |
+
|
| 423 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 424 |
+
|
| 425 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 426 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 427 |
+
| 13 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 428 |
+
| 14 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 429 |
+
| 15 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 430 |
+
| 16 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 431 |
+
| 17 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 432 |
+
| 18 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 433 |
+
| 19 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 434 |
+
| 20 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 435 |
+
| 21 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 436 |
+
|
| 437 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 438 |
+
- Percentage of total elements: 2.72%
|
| 439 |
+
|
| 440 |
+
|
| 441 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 442 |
+
|
| 443 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 444 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 445 |
+
| 22 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 446 |
+
| 23 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 447 |
+
| 24 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 448 |
+
| 25 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 449 |
+
| 26 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 450 |
+
| 27 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 451 |
+
| 28 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 452 |
+
| 29 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 453 |
+
| 30 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 454 |
+
|
| 455 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 456 |
+
- Percentage of total elements: 2.72%
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 460 |
+
|
| 461 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 462 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 463 |
+
| 31 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 464 |
+
| 32 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 465 |
+
| 33 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 466 |
+
| 34 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 467 |
+
| 35 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 468 |
+
| 36 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 469 |
+
| 37 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 470 |
+
| 38 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 471 |
+
| 39 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 472 |
+
|
| 473 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 474 |
+
- Percentage of total elements: 2.72%
|
| 475 |
+
|
| 476 |
+
|
| 477 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 478 |
+
|
| 479 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 480 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 481 |
+
| 40 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 482 |
+
| 41 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 483 |
+
| 42 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 484 |
+
| 43 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 485 |
+
| 44 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 486 |
+
| 45 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 487 |
+
| 46 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 488 |
+
| 47 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 489 |
+
| 48 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 490 |
+
|
| 491 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 492 |
+
- Percentage of total elements: 2.72%
|
| 493 |
+
|
| 494 |
+
|
| 495 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 496 |
+
|
| 497 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 498 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 499 |
+
| 49 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 500 |
+
| 50 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 501 |
+
| 51 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 502 |
+
| 52 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 503 |
+
| 53 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 504 |
+
| 54 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 505 |
+
| 55 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 506 |
+
| 56 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 507 |
+
| 57 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 508 |
+
|
| 509 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 510 |
+
- Percentage of total elements: 2.72%
|
| 511 |
+
|
| 512 |
+
|
| 513 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 514 |
+
|
| 515 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 516 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 517 |
+
| 58 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 518 |
+
| 59 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 519 |
+
| 60 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 520 |
+
| 61 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 521 |
+
| 62 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 522 |
+
| 63 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 523 |
+
| 64 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 524 |
+
| 65 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 525 |
+
| 66 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 526 |
+
|
| 527 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 528 |
+
- Percentage of total elements: 2.72%
|
| 529 |
+
|
| 530 |
+
|
| 531 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 532 |
+
|
| 533 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 534 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 535 |
+
| 67 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 536 |
+
| 68 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 537 |
+
| 69 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 538 |
+
| 70 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 539 |
+
| 71 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 540 |
+
| 72 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 541 |
+
| 73 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 542 |
+
| 74 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 543 |
+
| 75 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 544 |
+
|
| 545 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 546 |
+
- Percentage of total elements: 2.72%
|
| 547 |
+
|
| 548 |
+
|
| 549 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 550 |
+
|
| 551 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 552 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 553 |
+
| 76 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 554 |
+
| 77 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 555 |
+
| 78 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 556 |
+
| 79 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 557 |
+
| 80 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 558 |
+
| 81 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 559 |
+
| 82 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 560 |
+
| 83 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 561 |
+
| 84 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 562 |
+
|
| 563 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 564 |
+
- Percentage of total elements: 2.72%
|
| 565 |
+
|
| 566 |
+
|
| 567 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 568 |
+
|
| 569 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 570 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 571 |
+
| 85 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 572 |
+
| 86 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 573 |
+
| 87 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 574 |
+
| 88 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 575 |
+
| 89 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 576 |
+
| 90 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 577 |
+
| 91 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 578 |
+
| 92 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 579 |
+
| 93 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 580 |
+
|
| 581 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 582 |
+
- Percentage of total elements: 2.72%
|
| 583 |
+
|
| 584 |
+
|
| 585 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 586 |
+
|
| 587 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 588 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 589 |
+
| 94 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 590 |
+
| 95 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 591 |
+
| 96 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 592 |
+
| 97 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 593 |
+
| 98 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 594 |
+
| 99 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 595 |
+
| 100 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 596 |
+
| 101 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 597 |
+
| 102 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 598 |
+
|
| 599 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 600 |
+
- Percentage of total elements: 2.72%
|
| 601 |
+
|
| 602 |
+
|
| 603 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 604 |
+
|
| 605 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 606 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 607 |
+
| 103 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 608 |
+
| 104 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 609 |
+
| 105 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 610 |
+
| 106 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 611 |
+
| 107 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 612 |
+
| 108 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 613 |
+
| 109 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 614 |
+
| 110 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 615 |
+
| 111 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 616 |
+
|
| 617 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 618 |
+
- Percentage of total elements: 2.72%
|
| 619 |
+
|
| 620 |
+
|
| 621 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 622 |
+
|
| 623 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 624 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 625 |
+
| 112 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 626 |
+
| 113 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 627 |
+
| 114 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 628 |
+
| 115 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 629 |
+
| 116 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 630 |
+
| 117 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 631 |
+
| 118 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 632 |
+
| 119 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 633 |
+
| 120 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 634 |
+
|
| 635 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 636 |
+
- Percentage of total elements: 2.72%
|
| 637 |
+
|
| 638 |
+
|
| 639 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 640 |
+
|
| 641 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 642 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 643 |
+
| 121 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 644 |
+
| 122 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 645 |
+
| 123 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 646 |
+
| 124 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 647 |
+
| 125 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 648 |
+
| 126 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 649 |
+
| 127 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 650 |
+
| 128 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 651 |
+
| 129 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 652 |
+
|
| 653 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 654 |
+
- Percentage of total elements: 2.72%
|
| 655 |
+
|
| 656 |
+
|
| 657 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 658 |
+
|
| 659 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 660 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 661 |
+
| 130 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 662 |
+
| 131 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 663 |
+
| 132 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 664 |
+
| 133 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 665 |
+
| 134 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 666 |
+
| 135 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 667 |
+
| 136 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 668 |
+
| 137 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 669 |
+
| 138 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 670 |
+
|
| 671 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 672 |
+
- Percentage of total elements: 2.72%
|
| 673 |
+
|
| 674 |
+
|
| 675 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 676 |
+
|
| 677 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 678 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 679 |
+
| 139 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 680 |
+
| 140 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 681 |
+
| 141 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 682 |
+
| 142 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 683 |
+
| 143 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 684 |
+
| 144 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 685 |
+
| 145 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 686 |
+
| 146 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 687 |
+
| 147 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 688 |
+
|
| 689 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 690 |
+
- Percentage of total elements: 2.72%
|
| 691 |
+
|
| 692 |
+
|
| 693 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 694 |
+
|
| 695 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 696 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 697 |
+
| 148 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 698 |
+
| 149 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 699 |
+
| 150 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 700 |
+
| 151 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 701 |
+
| 152 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 702 |
+
| 153 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 703 |
+
| 154 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 704 |
+
| 155 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 705 |
+
| 156 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 706 |
+
|
| 707 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 708 |
+
- Percentage of total elements: 2.72%
|
| 709 |
+
|
| 710 |
+
|
| 711 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 712 |
+
|
| 713 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 714 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 715 |
+
| 157 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 716 |
+
| 158 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 717 |
+
| 159 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 718 |
+
| 160 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 719 |
+
| 161 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 720 |
+
| 162 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 721 |
+
| 163 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 722 |
+
| 164 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 723 |
+
| 165 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 724 |
+
|
| 725 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 726 |
+
- Percentage of total elements: 2.72%
|
| 727 |
+
|
| 728 |
+
|
| 729 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 730 |
+
|
| 731 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 732 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 733 |
+
| 166 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 734 |
+
| 167 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 735 |
+
| 168 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 736 |
+
| 169 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 737 |
+
| 170 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 738 |
+
| 171 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 739 |
+
| 172 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 740 |
+
| 173 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 741 |
+
| 174 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 742 |
+
|
| 743 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 744 |
+
- Percentage of total elements: 2.72%
|
| 745 |
+
|
| 746 |
+
|
| 747 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 748 |
+
|
| 749 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 750 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 751 |
+
| 175 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 752 |
+
| 176 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 177 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 754 |
+
| 178 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 755 |
+
| 179 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 756 |
+
| 180 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 757 |
+
| 181 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 758 |
+
| 182 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 759 |
+
| 183 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 760 |
+
|
| 761 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 762 |
+
- Percentage of total elements: 2.72%
|
| 763 |
+
|
| 764 |
+
|
| 765 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 766 |
+
|
| 767 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 768 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 769 |
+
| 184 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 770 |
+
| 185 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 771 |
+
| 186 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 772 |
+
| 187 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 773 |
+
| 188 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 774 |
+
| 189 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 775 |
+
| 190 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 776 |
+
| 191 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 777 |
+
| 192 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 778 |
+
|
| 779 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 780 |
+
- Percentage of total elements: 2.72%
|
| 781 |
+
|
| 782 |
+
|
| 783 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 784 |
+
|
| 785 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 786 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 787 |
+
| 193 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 788 |
+
| 194 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 789 |
+
| 195 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 790 |
+
| 196 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 791 |
+
| 197 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 792 |
+
| 198 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 793 |
+
| 199 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 794 |
+
| 200 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 795 |
+
| 201 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 796 |
+
|
| 797 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 798 |
+
- Percentage of total elements: 2.72%
|
| 799 |
+
|
| 800 |
+
|
| 801 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 802 |
+
|
| 803 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 804 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 805 |
+
| 202 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 806 |
+
| 203 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 807 |
+
| 204 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 808 |
+
| 205 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 809 |
+
| 206 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 810 |
+
| 207 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 811 |
+
| 208 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 812 |
+
| 209 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 813 |
+
| 210 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 814 |
+
|
| 815 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 816 |
+
- Percentage of total elements: 2.72%
|
| 817 |
+
|
| 818 |
+
|
| 819 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 820 |
+
|
| 821 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 822 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 823 |
+
| 211 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 824 |
+
| 212 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 825 |
+
| 213 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 826 |
+
| 214 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 827 |
+
| 215 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 828 |
+
| 216 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 829 |
+
| 217 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 830 |
+
| 218 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 831 |
+
| 219 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 832 |
+
|
| 833 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 834 |
+
- Percentage of total elements: 2.72%
|
| 835 |
+
|
| 836 |
+
|
| 837 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 838 |
+
|
| 839 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 840 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 841 |
+
| 220 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 842 |
+
| 221 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 843 |
+
| 222 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 844 |
+
| 223 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 845 |
+
| 224 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 846 |
+
| 225 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 847 |
+
| 226 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 848 |
+
| 227 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 849 |
+
| 228 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 850 |
+
|
| 851 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 852 |
+
- Percentage of total elements: 2.72%
|
| 853 |
+
|
| 854 |
+
|
| 855 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 856 |
+
|
| 857 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 858 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 859 |
+
| 229 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 860 |
+
| 230 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 861 |
+
| 231 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 862 |
+
| 232 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 863 |
+
| 233 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 864 |
+
| 234 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 865 |
+
| 235 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 866 |
+
| 236 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 867 |
+
| 237 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 868 |
+
|
| 869 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 870 |
+
- Percentage of total elements: 2.72%
|
| 871 |
+
|
| 872 |
+
|
| 873 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 874 |
+
|
| 875 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 876 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 877 |
+
| 238 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 878 |
+
| 239 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 879 |
+
| 240 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 880 |
+
| 241 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 881 |
+
| 242 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 882 |
+
| 243 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 883 |
+
| 244 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 884 |
+
| 245 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 885 |
+
| 246 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 886 |
+
|
| 887 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 888 |
+
- Percentage of total elements: 2.72%
|
| 889 |
+
|
| 890 |
+
|
| 891 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 892 |
+
|
| 893 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 894 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 895 |
+
| 247 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 896 |
+
| 248 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 897 |
+
| 249 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 898 |
+
| 250 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 899 |
+
| 251 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 900 |
+
| 252 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 901 |
+
| 253 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 902 |
+
| 254 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 903 |
+
| 255 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 904 |
+
|
| 905 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 906 |
+
- Percentage of total elements: 2.72%
|
| 907 |
+
|
| 908 |
+
|
| 909 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 910 |
+
|
| 911 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 912 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 913 |
+
| 256 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 914 |
+
| 257 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 915 |
+
| 258 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 916 |
+
| 259 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 917 |
+
| 260 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 918 |
+
| 261 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 919 |
+
| 262 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 920 |
+
| 263 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 921 |
+
| 264 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 922 |
+
|
| 923 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 924 |
+
- Percentage of total elements: 2.72%
|
| 925 |
+
|
| 926 |
+
|
| 927 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 928 |
+
|
| 929 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 930 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 931 |
+
| 265 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 932 |
+
| 266 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 933 |
+
| 267 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 934 |
+
| 268 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 935 |
+
| 269 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 936 |
+
| 270 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 937 |
+
| 271 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 938 |
+
| 272 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 939 |
+
| 273 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 940 |
+
|
| 941 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 942 |
+
- Percentage of total elements: 2.72%
|
| 943 |
+
|
| 944 |
+
|
| 945 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 946 |
+
|
| 947 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 948 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 949 |
+
| 274 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 950 |
+
| 275 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 951 |
+
| 276 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 952 |
+
| 277 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 953 |
+
| 278 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 954 |
+
| 279 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 955 |
+
| 280 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 956 |
+
| 281 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 957 |
+
| 282 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 958 |
+
|
| 959 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 960 |
+
- Percentage of total elements: 2.72%
|
| 961 |
+
|
| 962 |
+
|
| 963 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 964 |
+
|
| 965 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 966 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 967 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 968 |
+
| 284 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 969 |
+
| 285 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 970 |
+
| 286 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 971 |
+
| 287 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 972 |
+
| 288 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 973 |
+
| 289 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 974 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 975 |
+
| 291 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 976 |
+
|
| 977 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 978 |
+
- Percentage of total elements: 2.72%
|
scores/Llama-Guard-3-8B-Q3_K_M.md
ADDED
|
@@ -0,0 +1,978 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-Q3_K_M.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 39 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 36 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 36 |
+
| 26 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 38 |
+
| 28 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 39 |
+
| 29 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 40 |
+
| 30 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 41 |
+
| 31 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 43 |
+
| 33 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 44 |
+
| 34 | UINT32 | 1 | general.quantization_version | 2 |
|
| 45 |
+
| 35 | UINT32 | 1 | general.file_type | 12 |
|
| 46 |
+
| 36 | STRING | 1 | quantize.imatrix.file | `./imatrix/imatrix-Llama-Guard-3-8B-small.dat` |
|
| 47 |
+
| 37 | STRING | 1 | quantize.imatrix.dataset | `../../datasets/imatrix/calibration_eur_small.txt` |
|
| 48 |
+
| 38 | INT32 | 1 | quantize.imatrix.entries_count | 224 |
|
| 49 |
+
| 39 | INT32 | 1 | quantize.imatrix.chunks_count | 962 |
|
| 50 |
+
|
| 51 |
+
## Tensors Overview ~8B Elements
|
| 52 |
+
|
| 53 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 54 |
+
|
| 55 |
+
- [Llama-Guard-3-8B-Q3\_K\_M.gguf - GGUF Internal File Dump](#llama-guard-3-8b-q3_k_mgguf---gguf-internal-file-dump)
|
| 56 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 57 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 58 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 59 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 60 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 61 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 62 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 63 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 64 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 65 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 66 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 67 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 68 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 69 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 70 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 71 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 72 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 73 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 74 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 75 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 76 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 77 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 78 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 79 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 80 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 81 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 82 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 83 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 84 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 85 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 86 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 87 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 88 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 89 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 90 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 91 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 92 |
+
|
| 93 |
+
### Tensor Data Offset
|
| 94 |
+
|
| 95 |
+
This table contains the offset and data segment relative to start of file
|
| 96 |
+
|
| 97 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 98 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 99 |
+
| 0 | output.weight | 0x779880 | 0xd746000 |
|
| 100 |
+
| 1 | output_norm.weight | 0xdebf880 | 0x4000 |
|
| 101 |
+
| 2 | rope_freqs.weight | 0xdec3880 | 0x100 |
|
| 102 |
+
| 3 | token_embd.weight | 0xdec3980 | 0xd746000 |
|
| 103 |
+
| 4 | blk.0.attn_k.weight | 0x1b609980 | 0x150000 |
|
| 104 |
+
| 5 | blk.0.attn_norm.weight | 0x1b759980 | 0x4000 |
|
| 105 |
+
| 6 | blk.0.attn_output.weight | 0x1b75d980 | 0x900000 |
|
| 106 |
+
| 7 | blk.0.attn_q.weight | 0x1c05d980 | 0x540000 |
|
| 107 |
+
| 8 | blk.0.attn_v.weight | 0x1c59d980 | 0x1b8000 |
|
| 108 |
+
| 9 | blk.0.ffn_down.weight | 0x1c755980 | 0x1f80000 |
|
| 109 |
+
| 10 | blk.0.ffn_gate.weight | 0x1e6d5980 | 0x1260000 |
|
| 110 |
+
| 11 | blk.0.ffn_norm.weight | 0x1f935980 | 0x4000 |
|
| 111 |
+
| 12 | blk.0.ffn_up.weight | 0x1f939980 | 0x1260000 |
|
| 112 |
+
| 13 | blk.1.attn_k.weight | 0x20b99980 | 0x150000 |
|
| 113 |
+
| 14 | blk.1.attn_norm.weight | 0x20ce9980 | 0x4000 |
|
| 114 |
+
| 15 | blk.1.attn_output.weight | 0x20ced980 | 0x900000 |
|
| 115 |
+
| 16 | blk.1.attn_q.weight | 0x215ed980 | 0x540000 |
|
| 116 |
+
| 17 | blk.1.attn_v.weight | 0x21b2d980 | 0x1b8000 |
|
| 117 |
+
| 18 | blk.1.ffn_down.weight | 0x21ce5980 | 0x1f80000 |
|
| 118 |
+
| 19 | blk.1.ffn_gate.weight | 0x23c65980 | 0x1260000 |
|
| 119 |
+
| 20 | blk.1.ffn_norm.weight | 0x24ec5980 | 0x4000 |
|
| 120 |
+
| 21 | blk.1.ffn_up.weight | 0x24ec9980 | 0x1260000 |
|
| 121 |
+
| 22 | blk.2.attn_k.weight | 0x26129980 | 0x150000 |
|
| 122 |
+
| 23 | blk.2.attn_norm.weight | 0x26279980 | 0x4000 |
|
| 123 |
+
| 24 | blk.2.attn_output.weight | 0x2627d980 | 0x900000 |
|
| 124 |
+
| 25 | blk.2.attn_q.weight | 0x26b7d980 | 0x540000 |
|
| 125 |
+
| 26 | blk.2.attn_v.weight | 0x270bd980 | 0x1b8000 |
|
| 126 |
+
| 27 | blk.2.ffn_down.weight | 0x27275980 | 0x1f80000 |
|
| 127 |
+
| 28 | blk.2.ffn_gate.weight | 0x291f5980 | 0x1260000 |
|
| 128 |
+
| 29 | blk.2.ffn_norm.weight | 0x2a455980 | 0x4000 |
|
| 129 |
+
| 30 | blk.2.ffn_up.weight | 0x2a459980 | 0x1260000 |
|
| 130 |
+
| 31 | blk.3.attn_k.weight | 0x2b6b9980 | 0x150000 |
|
| 131 |
+
| 32 | blk.3.attn_norm.weight | 0x2b809980 | 0x4000 |
|
| 132 |
+
| 33 | blk.3.attn_output.weight | 0x2b80d980 | 0x900000 |
|
| 133 |
+
| 34 | blk.3.attn_q.weight | 0x2c10d980 | 0x540000 |
|
| 134 |
+
| 35 | blk.3.attn_v.weight | 0x2c64d980 | 0x1b8000 |
|
| 135 |
+
| 36 | blk.3.ffn_down.weight | 0x2c805980 | 0x1f80000 |
|
| 136 |
+
| 37 | blk.3.ffn_gate.weight | 0x2e785980 | 0x1260000 |
|
| 137 |
+
| 38 | blk.3.ffn_norm.weight | 0x2f9e5980 | 0x4000 |
|
| 138 |
+
| 39 | blk.3.ffn_up.weight | 0x2f9e9980 | 0x1260000 |
|
| 139 |
+
| 40 | blk.4.attn_k.weight | 0x30c49980 | 0x150000 |
|
| 140 |
+
| 41 | blk.4.attn_norm.weight | 0x30d99980 | 0x4000 |
|
| 141 |
+
| 42 | blk.4.attn_output.weight | 0x30d9d980 | 0x900000 |
|
| 142 |
+
| 43 | blk.4.attn_q.weight | 0x3169d980 | 0x540000 |
|
| 143 |
+
| 44 | blk.4.attn_v.weight | 0x31bdd980 | 0x1b8000 |
|
| 144 |
+
| 45 | blk.4.ffn_down.weight | 0x31d95980 | 0x1f80000 |
|
| 145 |
+
| 46 | blk.4.ffn_gate.weight | 0x33d15980 | 0x1260000 |
|
| 146 |
+
| 47 | blk.4.ffn_norm.weight | 0x34f75980 | 0x4000 |
|
| 147 |
+
| 48 | blk.4.ffn_up.weight | 0x34f79980 | 0x1260000 |
|
| 148 |
+
| 49 | blk.5.attn_k.weight | 0x361d9980 | 0x150000 |
|
| 149 |
+
| 50 | blk.5.attn_norm.weight | 0x36329980 | 0x4000 |
|
| 150 |
+
| 51 | blk.5.attn_output.weight | 0x3632d980 | 0x900000 |
|
| 151 |
+
| 52 | blk.5.attn_q.weight | 0x36c2d980 | 0x540000 |
|
| 152 |
+
| 53 | blk.5.attn_v.weight | 0x3716d980 | 0x1b8000 |
|
| 153 |
+
| 54 | blk.5.ffn_down.weight | 0x37325980 | 0x1f80000 |
|
| 154 |
+
| 55 | blk.5.ffn_gate.weight | 0x392a5980 | 0x1260000 |
|
| 155 |
+
| 56 | blk.5.ffn_norm.weight | 0x3a505980 | 0x4000 |
|
| 156 |
+
| 57 | blk.5.ffn_up.weight | 0x3a509980 | 0x1260000 |
|
| 157 |
+
| 58 | blk.6.attn_k.weight | 0x3b769980 | 0x150000 |
|
| 158 |
+
| 59 | blk.6.attn_norm.weight | 0x3b8b9980 | 0x4000 |
|
| 159 |
+
| 60 | blk.6.attn_output.weight | 0x3b8bd980 | 0x900000 |
|
| 160 |
+
| 61 | blk.6.attn_q.weight | 0x3c1bd980 | 0x540000 |
|
| 161 |
+
| 62 | blk.6.attn_v.weight | 0x3c6fd980 | 0x1b8000 |
|
| 162 |
+
| 63 | blk.6.ffn_down.weight | 0x3c8b5980 | 0x1f80000 |
|
| 163 |
+
| 64 | blk.6.ffn_gate.weight | 0x3e835980 | 0x1260000 |
|
| 164 |
+
| 65 | blk.6.ffn_norm.weight | 0x3fa95980 | 0x4000 |
|
| 165 |
+
| 66 | blk.6.ffn_up.weight | 0x3fa99980 | 0x1260000 |
|
| 166 |
+
| 67 | blk.7.attn_k.weight | 0x40cf9980 | 0x150000 |
|
| 167 |
+
| 68 | blk.7.attn_norm.weight | 0x40e49980 | 0x4000 |
|
| 168 |
+
| 69 | blk.7.attn_output.weight | 0x40e4d980 | 0x900000 |
|
| 169 |
+
| 70 | blk.7.attn_q.weight | 0x4174d980 | 0x540000 |
|
| 170 |
+
| 71 | blk.7.attn_v.weight | 0x41c8d980 | 0x1b8000 |
|
| 171 |
+
| 72 | blk.7.ffn_down.weight | 0x41e45980 | 0x1f80000 |
|
| 172 |
+
| 73 | blk.7.ffn_gate.weight | 0x43dc5980 | 0x1260000 |
|
| 173 |
+
| 74 | blk.7.ffn_norm.weight | 0x45025980 | 0x4000 |
|
| 174 |
+
| 75 | blk.7.ffn_up.weight | 0x45029980 | 0x1260000 |
|
| 175 |
+
| 76 | blk.8.attn_k.weight | 0x46289980 | 0x150000 |
|
| 176 |
+
| 77 | blk.8.attn_norm.weight | 0x463d9980 | 0x4000 |
|
| 177 |
+
| 78 | blk.8.attn_output.weight | 0x463dd980 | 0x900000 |
|
| 178 |
+
| 79 | blk.8.attn_q.weight | 0x46cdd980 | 0x540000 |
|
| 179 |
+
| 80 | blk.8.attn_v.weight | 0x4721d980 | 0x1b8000 |
|
| 180 |
+
| 81 | blk.8.ffn_down.weight | 0x473d5980 | 0x1f80000 |
|
| 181 |
+
| 82 | blk.8.ffn_gate.weight | 0x49355980 | 0x1260000 |
|
| 182 |
+
| 83 | blk.8.ffn_norm.weight | 0x4a5b5980 | 0x4000 |
|
| 183 |
+
| 84 | blk.8.ffn_up.weight | 0x4a5b9980 | 0x1260000 |
|
| 184 |
+
| 85 | blk.9.attn_k.weight | 0x4b819980 | 0x150000 |
|
| 185 |
+
| 86 | blk.9.attn_norm.weight | 0x4b969980 | 0x4000 |
|
| 186 |
+
| 87 | blk.9.attn_output.weight | 0x4b96d980 | 0x900000 |
|
| 187 |
+
| 88 | blk.9.attn_q.weight | 0x4c26d980 | 0x540000 |
|
| 188 |
+
| 89 | blk.9.attn_v.weight | 0x4c7ad980 | 0x1b8000 |
|
| 189 |
+
| 90 | blk.9.ffn_down.weight | 0x4c965980 | 0x1f80000 |
|
| 190 |
+
| 91 | blk.9.ffn_gate.weight | 0x4e8e5980 | 0x1260000 |
|
| 191 |
+
| 92 | blk.9.ffn_norm.weight | 0x4fb45980 | 0x4000 |
|
| 192 |
+
| 93 | blk.9.ffn_up.weight | 0x4fb49980 | 0x1260000 |
|
| 193 |
+
| 94 | blk.10.attn_k.weight | 0x50da9980 | 0x150000 |
|
| 194 |
+
| 95 | blk.10.attn_norm.weight | 0x50ef9980 | 0x4000 |
|
| 195 |
+
| 96 | blk.10.attn_output.weight | 0x50efd980 | 0x900000 |
|
| 196 |
+
| 97 | blk.10.attn_q.weight | 0x517fd980 | 0x540000 |
|
| 197 |
+
| 98 | blk.10.attn_v.weight | 0x51d3d980 | 0x1b8000 |
|
| 198 |
+
| 99 | blk.10.ffn_down.weight | 0x51ef5980 | 0x1f80000 |
|
| 199 |
+
| 100 | blk.10.ffn_gate.weight | 0x53e75980 | 0x1260000 |
|
| 200 |
+
| 101 | blk.10.ffn_norm.weight | 0x550d5980 | 0x4000 |
|
| 201 |
+
| 102 | blk.10.ffn_up.weight | 0x550d9980 | 0x1260000 |
|
| 202 |
+
| 103 | blk.11.attn_k.weight | 0x56339980 | 0x150000 |
|
| 203 |
+
| 104 | blk.11.attn_norm.weight | 0x56489980 | 0x4000 |
|
| 204 |
+
| 105 | blk.11.attn_output.weight | 0x5648d980 | 0x900000 |
|
| 205 |
+
| 106 | blk.11.attn_q.weight | 0x56d8d980 | 0x540000 |
|
| 206 |
+
| 107 | blk.11.attn_v.weight | 0x572cd980 | 0x1b8000 |
|
| 207 |
+
| 108 | blk.11.ffn_down.weight | 0x57485980 | 0x1f80000 |
|
| 208 |
+
| 109 | blk.11.ffn_gate.weight | 0x59405980 | 0x1260000 |
|
| 209 |
+
| 110 | blk.11.ffn_norm.weight | 0x5a665980 | 0x4000 |
|
| 210 |
+
| 111 | blk.11.ffn_up.weight | 0x5a669980 | 0x1260000 |
|
| 211 |
+
| 112 | blk.12.attn_k.weight | 0x5b8c9980 | 0x150000 |
|
| 212 |
+
| 113 | blk.12.attn_norm.weight | 0x5ba19980 | 0x4000 |
|
| 213 |
+
| 114 | blk.12.attn_output.weight | 0x5ba1d980 | 0x900000 |
|
| 214 |
+
| 115 | blk.12.attn_q.weight | 0x5c31d980 | 0x540000 |
|
| 215 |
+
| 116 | blk.12.attn_v.weight | 0x5c85d980 | 0x1b8000 |
|
| 216 |
+
| 117 | blk.12.ffn_down.weight | 0x5ca15980 | 0x1f80000 |
|
| 217 |
+
| 118 | blk.12.ffn_gate.weight | 0x5e995980 | 0x1260000 |
|
| 218 |
+
| 119 | blk.12.ffn_norm.weight | 0x5fbf5980 | 0x4000 |
|
| 219 |
+
| 120 | blk.12.ffn_up.weight | 0x5fbf9980 | 0x1260000 |
|
| 220 |
+
| 121 | blk.13.attn_k.weight | 0x60e59980 | 0x1b8000 |
|
| 221 |
+
| 122 | blk.13.attn_norm.weight | 0x61011980 | 0x4000 |
|
| 222 |
+
| 123 | blk.13.attn_output.weight | 0x61015980 | 0x900000 |
|
| 223 |
+
| 124 | blk.13.attn_q.weight | 0x61915980 | 0x6e0000 |
|
| 224 |
+
| 125 | blk.13.attn_v.weight | 0x61ff5980 | 0x240000 |
|
| 225 |
+
| 126 | blk.13.ffn_down.weight | 0x62235980 | 0x1f80000 |
|
| 226 |
+
| 127 | blk.13.ffn_gate.weight | 0x641b5980 | 0x1260000 |
|
| 227 |
+
| 128 | blk.13.ffn_norm.weight | 0x65415980 | 0x4000 |
|
| 228 |
+
| 129 | blk.13.ffn_up.weight | 0x65419980 | 0x1260000 |
|
| 229 |
+
| 130 | blk.14.attn_k.weight | 0x66679980 | 0x1b8000 |
|
| 230 |
+
| 131 | blk.14.attn_norm.weight | 0x66831980 | 0x4000 |
|
| 231 |
+
| 132 | blk.14.attn_output.weight | 0x66835980 | 0x900000 |
|
| 232 |
+
| 133 | blk.14.attn_q.weight | 0x67135980 | 0x6e0000 |
|
| 233 |
+
| 134 | blk.14.attn_v.weight | 0x67815980 | 0x240000 |
|
| 234 |
+
| 135 | blk.14.ffn_down.weight | 0x67a55980 | 0x1f80000 |
|
| 235 |
+
| 136 | blk.14.ffn_gate.weight | 0x699d5980 | 0x1260000 |
|
| 236 |
+
| 137 | blk.14.ffn_norm.weight | 0x6ac35980 | 0x4000 |
|
| 237 |
+
| 138 | blk.14.ffn_up.weight | 0x6ac39980 | 0x1260000 |
|
| 238 |
+
| 139 | blk.15.attn_k.weight | 0x6be99980 | 0x150000 |
|
| 239 |
+
| 140 | blk.15.attn_norm.weight | 0x6bfe9980 | 0x4000 |
|
| 240 |
+
| 141 | blk.15.attn_output.weight | 0x6bfed980 | 0x900000 |
|
| 241 |
+
| 142 | blk.15.attn_q.weight | 0x6c8ed980 | 0x540000 |
|
| 242 |
+
| 143 | blk.15.attn_v.weight | 0x6ce2d980 | 0x1b8000 |
|
| 243 |
+
| 144 | blk.15.ffn_down.weight | 0x6cfe5980 | 0x1f80000 |
|
| 244 |
+
| 145 | blk.15.ffn_gate.weight | 0x6ef65980 | 0x1260000 |
|
| 245 |
+
| 146 | blk.15.ffn_norm.weight | 0x701c5980 | 0x4000 |
|
| 246 |
+
| 147 | blk.15.ffn_up.weight | 0x701c9980 | 0x1260000 |
|
| 247 |
+
| 148 | blk.16.attn_k.weight | 0x71429980 | 0x1b8000 |
|
| 248 |
+
| 149 | blk.16.attn_norm.weight | 0x715e1980 | 0x4000 |
|
| 249 |
+
| 150 | blk.16.attn_output.weight | 0x715e5980 | 0x900000 |
|
| 250 |
+
| 151 | blk.16.attn_q.weight | 0x71ee5980 | 0x6e0000 |
|
| 251 |
+
| 152 | blk.16.attn_v.weight | 0x725c5980 | 0x240000 |
|
| 252 |
+
| 153 | blk.16.ffn_down.weight | 0x72805980 | 0x1f80000 |
|
| 253 |
+
| 154 | blk.16.ffn_gate.weight | 0x74785980 | 0x1810000 |
|
| 254 |
+
| 155 | blk.16.ffn_norm.weight | 0x75f95980 | 0x4000 |
|
| 255 |
+
| 156 | blk.16.ffn_up.weight | 0x75f99980 | 0x1810000 |
|
| 256 |
+
| 157 | blk.17.attn_k.weight | 0x777a9980 | 0x150000 |
|
| 257 |
+
| 158 | blk.17.attn_norm.weight | 0x778f9980 | 0x4000 |
|
| 258 |
+
| 159 | blk.17.attn_output.weight | 0x778fd980 | 0x900000 |
|
| 259 |
+
| 160 | blk.17.attn_q.weight | 0x781fd980 | 0x540000 |
|
| 260 |
+
| 161 | blk.17.attn_v.weight | 0x7873d980 | 0x1b8000 |
|
| 261 |
+
| 162 | blk.17.ffn_down.weight | 0x788f5980 | 0x1f80000 |
|
| 262 |
+
| 163 | blk.17.ffn_gate.weight | 0x7a875980 | 0x1810000 |
|
| 263 |
+
| 164 | blk.17.ffn_norm.weight | 0x7c085980 | 0x4000 |
|
| 264 |
+
| 165 | blk.17.ffn_up.weight | 0x7c089980 | 0x1810000 |
|
| 265 |
+
| 166 | blk.18.attn_k.weight | 0x7d899980 | 0x1b8000 |
|
| 266 |
+
| 167 | blk.18.attn_norm.weight | 0x7da51980 | 0x4000 |
|
| 267 |
+
| 168 | blk.18.attn_output.weight | 0x7da55980 | 0x900000 |
|
| 268 |
+
| 169 | blk.18.attn_q.weight | 0x7e355980 | 0x6e0000 |
|
| 269 |
+
| 170 | blk.18.attn_v.weight | 0x7ea35980 | 0x240000 |
|
| 270 |
+
| 171 | blk.18.ffn_down.weight | 0x7ec75980 | 0x1f80000 |
|
| 271 |
+
| 172 | blk.18.ffn_gate.weight | 0x80bf5980 | 0x1810000 |
|
| 272 |
+
| 173 | blk.18.ffn_norm.weight | 0x82405980 | 0x4000 |
|
| 273 |
+
| 174 | blk.18.ffn_up.weight | 0x82409980 | 0x1810000 |
|
| 274 |
+
| 175 | blk.19.attn_k.weight | 0x83c19980 | 0x1b8000 |
|
| 275 |
+
| 176 | blk.19.attn_norm.weight | 0x83dd1980 | 0x4000 |
|
| 276 |
+
| 177 | blk.19.attn_output.weight | 0x83dd5980 | 0x900000 |
|
| 277 |
+
| 178 | blk.19.attn_q.weight | 0x846d5980 | 0x6e0000 |
|
| 278 |
+
| 179 | blk.19.attn_v.weight | 0x84db5980 | 0x240000 |
|
| 279 |
+
| 180 | blk.19.ffn_down.weight | 0x84ff5980 | 0x1f80000 |
|
| 280 |
+
| 181 | blk.19.ffn_gate.weight | 0x86f75980 | 0x1810000 |
|
| 281 |
+
| 182 | blk.19.ffn_norm.weight | 0x88785980 | 0x4000 |
|
| 282 |
+
| 183 | blk.19.ffn_up.weight | 0x88789980 | 0x1810000 |
|
| 283 |
+
| 184 | blk.20.attn_k.weight | 0x89f99980 | 0x1b8000 |
|
| 284 |
+
| 185 | blk.20.attn_norm.weight | 0x8a151980 | 0x4000 |
|
| 285 |
+
| 186 | blk.20.attn_output.weight | 0x8a155980 | 0x900000 |
|
| 286 |
+
| 187 | blk.20.attn_q.weight | 0x8aa55980 | 0x6e0000 |
|
| 287 |
+
| 188 | blk.20.attn_v.weight | 0x8b135980 | 0x240000 |
|
| 288 |
+
| 189 | blk.20.ffn_down.weight | 0x8b375980 | 0x1f80000 |
|
| 289 |
+
| 190 | blk.20.ffn_gate.weight | 0x8d2f5980 | 0x1810000 |
|
| 290 |
+
| 191 | blk.20.ffn_norm.weight | 0x8eb05980 | 0x4000 |
|
| 291 |
+
| 192 | blk.20.ffn_up.weight | 0x8eb09980 | 0x1810000 |
|
| 292 |
+
| 193 | blk.21.attn_k.weight | 0x90319980 | 0x1b8000 |
|
| 293 |
+
| 194 | blk.21.attn_norm.weight | 0x904d1980 | 0x4000 |
|
| 294 |
+
| 195 | blk.21.attn_output.weight | 0x904d5980 | 0x900000 |
|
| 295 |
+
| 196 | blk.21.attn_q.weight | 0x90dd5980 | 0x6e0000 |
|
| 296 |
+
| 197 | blk.21.attn_v.weight | 0x914b5980 | 0x240000 |
|
| 297 |
+
| 198 | blk.21.ffn_down.weight | 0x916f5980 | 0x1f80000 |
|
| 298 |
+
| 199 | blk.21.ffn_gate.weight | 0x93675980 | 0x1810000 |
|
| 299 |
+
| 200 | blk.21.ffn_norm.weight | 0x94e85980 | 0x4000 |
|
| 300 |
+
| 201 | blk.21.ffn_up.weight | 0x94e89980 | 0x1810000 |
|
| 301 |
+
| 202 | blk.22.attn_k.weight | 0x96699980 | 0x1b8000 |
|
| 302 |
+
| 203 | blk.22.attn_norm.weight | 0x96851980 | 0x4000 |
|
| 303 |
+
| 204 | blk.22.attn_output.weight | 0x96855980 | 0x900000 |
|
| 304 |
+
| 205 | blk.22.attn_q.weight | 0x97155980 | 0x6e0000 |
|
| 305 |
+
| 206 | blk.22.attn_v.weight | 0x97835980 | 0x240000 |
|
| 306 |
+
| 207 | blk.22.ffn_down.weight | 0x97a75980 | 0x1f80000 |
|
| 307 |
+
| 208 | blk.22.ffn_gate.weight | 0x999f5980 | 0x1810000 |
|
| 308 |
+
| 209 | blk.22.ffn_norm.weight | 0x9b205980 | 0x4000 |
|
| 309 |
+
| 210 | blk.22.ffn_up.weight | 0x9b209980 | 0x1810000 |
|
| 310 |
+
| 211 | blk.23.attn_k.weight | 0x9ca19980 | 0x1b8000 |
|
| 311 |
+
| 212 | blk.23.attn_norm.weight | 0x9cbd1980 | 0x4000 |
|
| 312 |
+
| 213 | blk.23.attn_output.weight | 0x9cbd5980 | 0x900000 |
|
| 313 |
+
| 214 | blk.23.attn_q.weight | 0x9d4d5980 | 0x6e0000 |
|
| 314 |
+
| 215 | blk.23.attn_v.weight | 0x9dbb5980 | 0x240000 |
|
| 315 |
+
| 216 | blk.23.ffn_down.weight | 0x9ddf5980 | 0x1f80000 |
|
| 316 |
+
| 217 | blk.23.ffn_gate.weight | 0x9fd75980 | 0x1810000 |
|
| 317 |
+
| 218 | blk.23.ffn_norm.weight | 0xa1585980 | 0x4000 |
|
| 318 |
+
| 219 | blk.23.ffn_up.weight | 0xa1589980 | 0x1810000 |
|
| 319 |
+
| 220 | blk.24.attn_k.weight | 0xa2d99980 | 0x1b8000 |
|
| 320 |
+
| 221 | blk.24.attn_norm.weight | 0xa2f51980 | 0x4000 |
|
| 321 |
+
| 222 | blk.24.attn_output.weight | 0xa2f55980 | 0x900000 |
|
| 322 |
+
| 223 | blk.24.attn_q.weight | 0xa3855980 | 0x6e0000 |
|
| 323 |
+
| 224 | blk.24.attn_v.weight | 0xa3f35980 | 0x240000 |
|
| 324 |
+
| 225 | blk.24.ffn_down.weight | 0xa4175980 | 0x1f80000 |
|
| 325 |
+
| 226 | blk.24.ffn_gate.weight | 0xa60f5980 | 0x1810000 |
|
| 326 |
+
| 227 | blk.24.ffn_norm.weight | 0xa7905980 | 0x4000 |
|
| 327 |
+
| 228 | blk.24.ffn_up.weight | 0xa7909980 | 0x1810000 |
|
| 328 |
+
| 229 | blk.25.attn_k.weight | 0xa9119980 | 0x1b8000 |
|
| 329 |
+
| 230 | blk.25.attn_norm.weight | 0xa92d1980 | 0x4000 |
|
| 330 |
+
| 231 | blk.25.attn_output.weight | 0xa92d5980 | 0x900000 |
|
| 331 |
+
| 232 | blk.25.attn_q.weight | 0xa9bd5980 | 0x6e0000 |
|
| 332 |
+
| 233 | blk.25.attn_v.weight | 0xaa2b5980 | 0x240000 |
|
| 333 |
+
| 234 | blk.25.ffn_down.weight | 0xaa4f5980 | 0x1f80000 |
|
| 334 |
+
| 235 | blk.25.ffn_gate.weight | 0xac475980 | 0x1810000 |
|
| 335 |
+
| 236 | blk.25.ffn_norm.weight | 0xadc85980 | 0x4000 |
|
| 336 |
+
| 237 | blk.25.ffn_up.weight | 0xadc89980 | 0x1810000 |
|
| 337 |
+
| 238 | blk.26.attn_k.weight | 0xaf499980 | 0x1b8000 |
|
| 338 |
+
| 239 | blk.26.attn_norm.weight | 0xaf651980 | 0x4000 |
|
| 339 |
+
| 240 | blk.26.attn_output.weight | 0xaf655980 | 0x900000 |
|
| 340 |
+
| 241 | blk.26.attn_q.weight | 0xaff55980 | 0x6e0000 |
|
| 341 |
+
| 242 | blk.26.attn_v.weight | 0xb0635980 | 0x240000 |
|
| 342 |
+
| 243 | blk.26.ffn_down.weight | 0xb0875980 | 0x1f80000 |
|
| 343 |
+
| 244 | blk.26.ffn_gate.weight | 0xb27f5980 | 0x1810000 |
|
| 344 |
+
| 245 | blk.26.ffn_norm.weight | 0xb4005980 | 0x4000 |
|
| 345 |
+
| 246 | blk.26.ffn_up.weight | 0xb4009980 | 0x1810000 |
|
| 346 |
+
| 247 | blk.27.attn_k.weight | 0xb5819980 | 0x1b8000 |
|
| 347 |
+
| 248 | blk.27.attn_norm.weight | 0xb59d1980 | 0x4000 |
|
| 348 |
+
| 249 | blk.27.attn_output.weight | 0xb59d5980 | 0x900000 |
|
| 349 |
+
| 250 | blk.27.attn_q.weight | 0xb62d5980 | 0x6e0000 |
|
| 350 |
+
| 251 | blk.27.attn_v.weight | 0xb69b5980 | 0x240000 |
|
| 351 |
+
| 252 | blk.27.ffn_down.weight | 0xb6bf5980 | 0x1f80000 |
|
| 352 |
+
| 253 | blk.27.ffn_gate.weight | 0xb8b75980 | 0x1810000 |
|
| 353 |
+
| 254 | blk.27.ffn_norm.weight | 0xba385980 | 0x4000 |
|
| 354 |
+
| 255 | blk.27.ffn_up.weight | 0xba389980 | 0x1810000 |
|
| 355 |
+
| 256 | blk.28.attn_k.weight | 0xbbb99980 | 0x1b8000 |
|
| 356 |
+
| 257 | blk.28.attn_norm.weight | 0xbbd51980 | 0x4000 |
|
| 357 |
+
| 258 | blk.28.attn_output.weight | 0xbbd55980 | 0x900000 |
|
| 358 |
+
| 259 | blk.28.attn_q.weight | 0xbc655980 | 0x6e0000 |
|
| 359 |
+
| 260 | blk.28.attn_v.weight | 0xbcd35980 | 0x240000 |
|
| 360 |
+
| 261 | blk.28.ffn_down.weight | 0xbcf75980 | 0x1f80000 |
|
| 361 |
+
| 262 | blk.28.ffn_gate.weight | 0xbeef5980 | 0x1810000 |
|
| 362 |
+
| 263 | blk.28.ffn_norm.weight | 0xc0705980 | 0x4000 |
|
| 363 |
+
| 264 | blk.28.ffn_up.weight | 0xc0709980 | 0x1810000 |
|
| 364 |
+
| 265 | blk.29.attn_k.weight | 0xc1f19980 | 0x1b8000 |
|
| 365 |
+
| 266 | blk.29.attn_norm.weight | 0xc20d1980 | 0x4000 |
|
| 366 |
+
| 267 | blk.29.attn_output.weight | 0xc20d5980 | 0x900000 |
|
| 367 |
+
| 268 | blk.29.attn_q.weight | 0xc29d5980 | 0x6e0000 |
|
| 368 |
+
| 269 | blk.29.attn_v.weight | 0xc30b5980 | 0x240000 |
|
| 369 |
+
| 270 | blk.29.ffn_down.weight | 0xc32f5980 | 0x1f80000 |
|
| 370 |
+
| 271 | blk.29.ffn_gate.weight | 0xc5275980 | 0x1810000 |
|
| 371 |
+
| 272 | blk.29.ffn_norm.weight | 0xc6a85980 | 0x4000 |
|
| 372 |
+
| 273 | blk.29.ffn_up.weight | 0xc6a89980 | 0x1810000 |
|
| 373 |
+
| 274 | blk.30.attn_k.weight | 0xc8299980 | 0x1b8000 |
|
| 374 |
+
| 275 | blk.30.attn_norm.weight | 0xc8451980 | 0x4000 |
|
| 375 |
+
| 276 | blk.30.attn_output.weight | 0xc8455980 | 0x900000 |
|
| 376 |
+
| 277 | blk.30.attn_q.weight | 0xc8d55980 | 0x6e0000 |
|
| 377 |
+
| 278 | blk.30.attn_v.weight | 0xc9435980 | 0x240000 |
|
| 378 |
+
| 279 | blk.30.ffn_down.weight | 0xc9675980 | 0x1f80000 |
|
| 379 |
+
| 280 | blk.30.ffn_gate.weight | 0xcb5f5980 | 0x1810000 |
|
| 380 |
+
| 281 | blk.30.ffn_norm.weight | 0xcce05980 | 0x4000 |
|
| 381 |
+
| 282 | blk.30.ffn_up.weight | 0xcce09980 | 0x1810000 |
|
| 382 |
+
| 283 | blk.31.attn_k.weight | 0xce619980 | 0x150000 |
|
| 383 |
+
| 284 | blk.31.attn_norm.weight | 0xce769980 | 0x4000 |
|
| 384 |
+
| 285 | blk.31.attn_output.weight | 0xce76d980 | 0x900000 |
|
| 385 |
+
| 286 | blk.31.attn_q.weight | 0xcf06d980 | 0x540000 |
|
| 386 |
+
| 287 | blk.31.attn_v.weight | 0xcf5ad980 | 0x1b8000 |
|
| 387 |
+
| 288 | blk.31.ffn_down.weight | 0xcf765980 | 0x1f80000 |
|
| 388 |
+
| 289 | blk.31.ffn_gate.weight | 0xd16e5980 | 0x1810000 |
|
| 389 |
+
| 290 | blk.31.ffn_norm.weight | 0xd2ef5980 | 0x4000 |
|
| 390 |
+
| 291 | blk.31.ffn_up.weight | 0xd2ef9980 | 0x1810000 |
|
| 391 |
+
|
| 392 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 393 |
+
|
| 394 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 395 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:-----|
|
| 396 |
+
| 0 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q3_K |
|
| 397 |
+
| 1 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 398 |
+
| 2 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 399 |
+
| 3 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q3_K |
|
| 400 |
+
|
| 401 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 402 |
+
- Percentage of total elements: 13.08%
|
| 403 |
+
|
| 404 |
+
|
| 405 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 406 |
+
|
| 407 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 408 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 409 |
+
| 4 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 410 |
+
| 5 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 411 |
+
| 6 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 412 |
+
| 7 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 413 |
+
| 8 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 414 |
+
| 9 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 415 |
+
| 10 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 416 |
+
| 11 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 417 |
+
| 12 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 418 |
+
|
| 419 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 420 |
+
- Percentage of total elements: 2.72%
|
| 421 |
+
|
| 422 |
+
|
| 423 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 424 |
+
|
| 425 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 426 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 427 |
+
| 13 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 428 |
+
| 14 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 429 |
+
| 15 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 430 |
+
| 16 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 431 |
+
| 17 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 432 |
+
| 18 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 433 |
+
| 19 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 434 |
+
| 20 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 435 |
+
| 21 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 436 |
+
|
| 437 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 438 |
+
- Percentage of total elements: 2.72%
|
| 439 |
+
|
| 440 |
+
|
| 441 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 442 |
+
|
| 443 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 444 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 445 |
+
| 22 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 446 |
+
| 23 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 447 |
+
| 24 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 448 |
+
| 25 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 449 |
+
| 26 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 450 |
+
| 27 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 451 |
+
| 28 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 452 |
+
| 29 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 453 |
+
| 30 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 454 |
+
|
| 455 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 456 |
+
- Percentage of total elements: 2.72%
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 460 |
+
|
| 461 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 462 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 463 |
+
| 31 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 464 |
+
| 32 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 465 |
+
| 33 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 466 |
+
| 34 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 467 |
+
| 35 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 468 |
+
| 36 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 469 |
+
| 37 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 470 |
+
| 38 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 471 |
+
| 39 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 472 |
+
|
| 473 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 474 |
+
- Percentage of total elements: 2.72%
|
| 475 |
+
|
| 476 |
+
|
| 477 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 478 |
+
|
| 479 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 480 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 481 |
+
| 40 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 482 |
+
| 41 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 483 |
+
| 42 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 484 |
+
| 43 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 485 |
+
| 44 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 486 |
+
| 45 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 487 |
+
| 46 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 488 |
+
| 47 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 489 |
+
| 48 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 490 |
+
|
| 491 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 492 |
+
- Percentage of total elements: 2.72%
|
| 493 |
+
|
| 494 |
+
|
| 495 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 496 |
+
|
| 497 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 498 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 499 |
+
| 49 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 500 |
+
| 50 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 501 |
+
| 51 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 502 |
+
| 52 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 503 |
+
| 53 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 504 |
+
| 54 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 505 |
+
| 55 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 506 |
+
| 56 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 507 |
+
| 57 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 508 |
+
|
| 509 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 510 |
+
- Percentage of total elements: 2.72%
|
| 511 |
+
|
| 512 |
+
|
| 513 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 514 |
+
|
| 515 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 516 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 517 |
+
| 58 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 518 |
+
| 59 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 519 |
+
| 60 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 520 |
+
| 61 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 521 |
+
| 62 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 522 |
+
| 63 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 523 |
+
| 64 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 524 |
+
| 65 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 525 |
+
| 66 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 526 |
+
|
| 527 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 528 |
+
- Percentage of total elements: 2.72%
|
| 529 |
+
|
| 530 |
+
|
| 531 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 532 |
+
|
| 533 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 534 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 535 |
+
| 67 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 536 |
+
| 68 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 537 |
+
| 69 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 538 |
+
| 70 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 539 |
+
| 71 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 540 |
+
| 72 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 541 |
+
| 73 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 542 |
+
| 74 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 543 |
+
| 75 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 544 |
+
|
| 545 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 546 |
+
- Percentage of total elements: 2.72%
|
| 547 |
+
|
| 548 |
+
|
| 549 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 550 |
+
|
| 551 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 552 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 553 |
+
| 76 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 554 |
+
| 77 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 555 |
+
| 78 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 556 |
+
| 79 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 557 |
+
| 80 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 558 |
+
| 81 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 559 |
+
| 82 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 560 |
+
| 83 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 561 |
+
| 84 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 562 |
+
|
| 563 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 564 |
+
- Percentage of total elements: 2.72%
|
| 565 |
+
|
| 566 |
+
|
| 567 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 568 |
+
|
| 569 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 570 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 571 |
+
| 85 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 572 |
+
| 86 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 573 |
+
| 87 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 574 |
+
| 88 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 575 |
+
| 89 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 576 |
+
| 90 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 577 |
+
| 91 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 578 |
+
| 92 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 579 |
+
| 93 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 580 |
+
|
| 581 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 582 |
+
- Percentage of total elements: 2.72%
|
| 583 |
+
|
| 584 |
+
|
| 585 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 586 |
+
|
| 587 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 588 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 589 |
+
| 94 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 590 |
+
| 95 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 591 |
+
| 96 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 592 |
+
| 97 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 593 |
+
| 98 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 594 |
+
| 99 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 595 |
+
| 100 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 596 |
+
| 101 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 597 |
+
| 102 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 598 |
+
|
| 599 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 600 |
+
- Percentage of total elements: 2.72%
|
| 601 |
+
|
| 602 |
+
|
| 603 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 604 |
+
|
| 605 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 606 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 607 |
+
| 103 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 608 |
+
| 104 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 609 |
+
| 105 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 610 |
+
| 106 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 611 |
+
| 107 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 612 |
+
| 108 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 613 |
+
| 109 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 614 |
+
| 110 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 615 |
+
| 111 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 616 |
+
|
| 617 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 618 |
+
- Percentage of total elements: 2.72%
|
| 619 |
+
|
| 620 |
+
|
| 621 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 622 |
+
|
| 623 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 624 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 625 |
+
| 112 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 626 |
+
| 113 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 627 |
+
| 114 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 628 |
+
| 115 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 629 |
+
| 116 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 630 |
+
| 117 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 631 |
+
| 118 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 632 |
+
| 119 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 633 |
+
| 120 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 634 |
+
|
| 635 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 636 |
+
- Percentage of total elements: 2.72%
|
| 637 |
+
|
| 638 |
+
|
| 639 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 640 |
+
|
| 641 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 642 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 643 |
+
| 121 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 644 |
+
| 122 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 645 |
+
| 123 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 646 |
+
| 124 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 647 |
+
| 125 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 648 |
+
| 126 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 649 |
+
| 127 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 650 |
+
| 128 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 651 |
+
| 129 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 652 |
+
|
| 653 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 654 |
+
- Percentage of total elements: 2.72%
|
| 655 |
+
|
| 656 |
+
|
| 657 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 658 |
+
|
| 659 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 660 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 661 |
+
| 130 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 662 |
+
| 131 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 663 |
+
| 132 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 664 |
+
| 133 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 665 |
+
| 134 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 666 |
+
| 135 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 667 |
+
| 136 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 668 |
+
| 137 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 669 |
+
| 138 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 670 |
+
|
| 671 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 672 |
+
- Percentage of total elements: 2.72%
|
| 673 |
+
|
| 674 |
+
|
| 675 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 676 |
+
|
| 677 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 678 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 679 |
+
| 139 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 680 |
+
| 140 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 681 |
+
| 141 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 682 |
+
| 142 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 683 |
+
| 143 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 684 |
+
| 144 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 685 |
+
| 145 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 686 |
+
| 146 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 687 |
+
| 147 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 688 |
+
|
| 689 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 690 |
+
- Percentage of total elements: 2.72%
|
| 691 |
+
|
| 692 |
+
|
| 693 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 694 |
+
|
| 695 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 696 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 697 |
+
| 148 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 698 |
+
| 149 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 699 |
+
| 150 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 700 |
+
| 151 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 701 |
+
| 152 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 702 |
+
| 153 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 703 |
+
| 154 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 704 |
+
| 155 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 705 |
+
| 156 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 706 |
+
|
| 707 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 708 |
+
- Percentage of total elements: 2.72%
|
| 709 |
+
|
| 710 |
+
|
| 711 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 712 |
+
|
| 713 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 714 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 715 |
+
| 157 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 716 |
+
| 158 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 717 |
+
| 159 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 718 |
+
| 160 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 719 |
+
| 161 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 720 |
+
| 162 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 721 |
+
| 163 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 722 |
+
| 164 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 723 |
+
| 165 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 724 |
+
|
| 725 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 726 |
+
- Percentage of total elements: 2.72%
|
| 727 |
+
|
| 728 |
+
|
| 729 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 730 |
+
|
| 731 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 732 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 733 |
+
| 166 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 734 |
+
| 167 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 735 |
+
| 168 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 736 |
+
| 169 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 737 |
+
| 170 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 738 |
+
| 171 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 739 |
+
| 172 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 740 |
+
| 173 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 741 |
+
| 174 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 742 |
+
|
| 743 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 744 |
+
- Percentage of total elements: 2.72%
|
| 745 |
+
|
| 746 |
+
|
| 747 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 748 |
+
|
| 749 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 750 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 751 |
+
| 175 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 752 |
+
| 176 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 177 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 754 |
+
| 178 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 755 |
+
| 179 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 756 |
+
| 180 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 757 |
+
| 181 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 758 |
+
| 182 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 759 |
+
| 183 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 760 |
+
|
| 761 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 762 |
+
- Percentage of total elements: 2.72%
|
| 763 |
+
|
| 764 |
+
|
| 765 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 766 |
+
|
| 767 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 768 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 769 |
+
| 184 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 770 |
+
| 185 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 771 |
+
| 186 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 772 |
+
| 187 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 773 |
+
| 188 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 774 |
+
| 189 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 775 |
+
| 190 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 776 |
+
| 191 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 777 |
+
| 192 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 778 |
+
|
| 779 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 780 |
+
- Percentage of total elements: 2.72%
|
| 781 |
+
|
| 782 |
+
|
| 783 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 784 |
+
|
| 785 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 786 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 787 |
+
| 193 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 788 |
+
| 194 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 789 |
+
| 195 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 790 |
+
| 196 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 791 |
+
| 197 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 792 |
+
| 198 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 793 |
+
| 199 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 794 |
+
| 200 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 795 |
+
| 201 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 796 |
+
|
| 797 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 798 |
+
- Percentage of total elements: 2.72%
|
| 799 |
+
|
| 800 |
+
|
| 801 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 802 |
+
|
| 803 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 804 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 805 |
+
| 202 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 806 |
+
| 203 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 807 |
+
| 204 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 808 |
+
| 205 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 809 |
+
| 206 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 810 |
+
| 207 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 811 |
+
| 208 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 812 |
+
| 209 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 813 |
+
| 210 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 814 |
+
|
| 815 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 816 |
+
- Percentage of total elements: 2.72%
|
| 817 |
+
|
| 818 |
+
|
| 819 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 820 |
+
|
| 821 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 822 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 823 |
+
| 211 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 824 |
+
| 212 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 825 |
+
| 213 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 826 |
+
| 214 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 827 |
+
| 215 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 828 |
+
| 216 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 829 |
+
| 217 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 830 |
+
| 218 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 831 |
+
| 219 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 832 |
+
|
| 833 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 834 |
+
- Percentage of total elements: 2.72%
|
| 835 |
+
|
| 836 |
+
|
| 837 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 838 |
+
|
| 839 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 840 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 841 |
+
| 220 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 842 |
+
| 221 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 843 |
+
| 222 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 844 |
+
| 223 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 845 |
+
| 224 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 846 |
+
| 225 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 847 |
+
| 226 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 848 |
+
| 227 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 849 |
+
| 228 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 850 |
+
|
| 851 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 852 |
+
- Percentage of total elements: 2.72%
|
| 853 |
+
|
| 854 |
+
|
| 855 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 856 |
+
|
| 857 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 858 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 859 |
+
| 229 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 860 |
+
| 230 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 861 |
+
| 231 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 862 |
+
| 232 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 863 |
+
| 233 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 864 |
+
| 234 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 865 |
+
| 235 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 866 |
+
| 236 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 867 |
+
| 237 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 868 |
+
|
| 869 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 870 |
+
- Percentage of total elements: 2.72%
|
| 871 |
+
|
| 872 |
+
|
| 873 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 874 |
+
|
| 875 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 876 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 877 |
+
| 238 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 878 |
+
| 239 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 879 |
+
| 240 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 880 |
+
| 241 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 881 |
+
| 242 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 882 |
+
| 243 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 883 |
+
| 244 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 884 |
+
| 245 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 885 |
+
| 246 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 886 |
+
|
| 887 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 888 |
+
- Percentage of total elements: 2.72%
|
| 889 |
+
|
| 890 |
+
|
| 891 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 892 |
+
|
| 893 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 894 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 895 |
+
| 247 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 896 |
+
| 248 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 897 |
+
| 249 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 898 |
+
| 250 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 899 |
+
| 251 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 900 |
+
| 252 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 901 |
+
| 253 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 902 |
+
| 254 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 903 |
+
| 255 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 904 |
+
|
| 905 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 906 |
+
- Percentage of total elements: 2.72%
|
| 907 |
+
|
| 908 |
+
|
| 909 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 910 |
+
|
| 911 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 912 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 913 |
+
| 256 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 914 |
+
| 257 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 915 |
+
| 258 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 916 |
+
| 259 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 917 |
+
| 260 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 918 |
+
| 261 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 919 |
+
| 262 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 920 |
+
| 263 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 921 |
+
| 264 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 922 |
+
|
| 923 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 924 |
+
- Percentage of total elements: 2.72%
|
| 925 |
+
|
| 926 |
+
|
| 927 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 928 |
+
|
| 929 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 930 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 931 |
+
| 265 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 932 |
+
| 266 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 933 |
+
| 267 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 934 |
+
| 268 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 935 |
+
| 269 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 936 |
+
| 270 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 937 |
+
| 271 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 938 |
+
| 272 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 939 |
+
| 273 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 940 |
+
|
| 941 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 942 |
+
- Percentage of total elements: 2.72%
|
| 943 |
+
|
| 944 |
+
|
| 945 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 946 |
+
|
| 947 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 948 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 949 |
+
| 274 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 950 |
+
| 275 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 951 |
+
| 276 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 952 |
+
| 277 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 953 |
+
| 278 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 954 |
+
| 279 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 955 |
+
| 280 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 956 |
+
| 281 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 957 |
+
| 282 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 958 |
+
|
| 959 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 960 |
+
- Percentage of total elements: 2.72%
|
| 961 |
+
|
| 962 |
+
|
| 963 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 964 |
+
|
| 965 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 966 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 967 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 968 |
+
| 284 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 969 |
+
| 285 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 970 |
+
| 286 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 971 |
+
| 287 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 972 |
+
| 288 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 973 |
+
| 289 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 974 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 975 |
+
| 291 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 976 |
+
|
| 977 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 978 |
+
- Percentage of total elements: 2.72%
|
scores/Llama-Guard-3-8B-Q3_K_S.md
ADDED
|
@@ -0,0 +1,978 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-Q3_K_S.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 39 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 36 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 36 |
+
| 26 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 38 |
+
| 28 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 39 |
+
| 29 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 40 |
+
| 30 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 41 |
+
| 31 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 43 |
+
| 33 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 44 |
+
| 34 | UINT32 | 1 | general.quantization_version | 2 |
|
| 45 |
+
| 35 | UINT32 | 1 | general.file_type | 11 |
|
| 46 |
+
| 36 | STRING | 1 | quantize.imatrix.file | `./imatrix/imatrix-Llama-Guard-3-8B-small.dat` |
|
| 47 |
+
| 37 | STRING | 1 | quantize.imatrix.dataset | `../../datasets/imatrix/calibration_eur_small.txt` |
|
| 48 |
+
| 38 | INT32 | 1 | quantize.imatrix.entries_count | 224 |
|
| 49 |
+
| 39 | INT32 | 1 | quantize.imatrix.chunks_count | 962 |
|
| 50 |
+
|
| 51 |
+
## Tensors Overview ~8B Elements
|
| 52 |
+
|
| 53 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 54 |
+
|
| 55 |
+
- [Llama-Guard-3-8B-Q3\_K\_S.gguf - GGUF Internal File Dump](#llama-guard-3-8b-q3_k_sgguf---gguf-internal-file-dump)
|
| 56 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 57 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 58 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 59 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 60 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 61 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 62 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 63 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 64 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 65 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 66 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 67 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 68 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 69 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 70 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 71 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 72 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 73 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 74 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 75 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 76 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 77 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 78 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 79 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 80 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 81 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 82 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 83 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 84 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 85 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 86 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 87 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 88 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 89 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 90 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 91 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 92 |
+
|
| 93 |
+
### Tensor Data Offset
|
| 94 |
+
|
| 95 |
+
This table contains the offset and data segment relative to start of file
|
| 96 |
+
|
| 97 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 98 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 99 |
+
| 0 | output.weight | 0x779880 | 0xd746000 |
|
| 100 |
+
| 1 | output_norm.weight | 0xdebf880 | 0x4000 |
|
| 101 |
+
| 2 | rope_freqs.weight | 0xdec3880 | 0x100 |
|
| 102 |
+
| 3 | token_embd.weight | 0xdec3980 | 0xa464000 |
|
| 103 |
+
| 4 | blk.0.attn_k.weight | 0x18327980 | 0x150000 |
|
| 104 |
+
| 5 | blk.0.attn_norm.weight | 0x18477980 | 0x4000 |
|
| 105 |
+
| 6 | blk.0.attn_output.weight | 0x1847b980 | 0x6e0000 |
|
| 106 |
+
| 7 | blk.0.attn_q.weight | 0x18b5b980 | 0x540000 |
|
| 107 |
+
| 8 | blk.0.attn_v.weight | 0x1909b980 | 0x1b8000 |
|
| 108 |
+
| 9 | blk.0.ffn_down.weight | 0x19253980 | 0x1810000 |
|
| 109 |
+
| 10 | blk.0.ffn_gate.weight | 0x1aa63980 | 0x1260000 |
|
| 110 |
+
| 11 | blk.0.ffn_norm.weight | 0x1bcc3980 | 0x4000 |
|
| 111 |
+
| 12 | blk.0.ffn_up.weight | 0x1bcc7980 | 0x1260000 |
|
| 112 |
+
| 13 | blk.1.attn_k.weight | 0x1cf27980 | 0x150000 |
|
| 113 |
+
| 14 | blk.1.attn_norm.weight | 0x1d077980 | 0x4000 |
|
| 114 |
+
| 15 | blk.1.attn_output.weight | 0x1d07b980 | 0x6e0000 |
|
| 115 |
+
| 16 | blk.1.attn_q.weight | 0x1d75b980 | 0x540000 |
|
| 116 |
+
| 17 | blk.1.attn_v.weight | 0x1dc9b980 | 0x1b8000 |
|
| 117 |
+
| 18 | blk.1.ffn_down.weight | 0x1de53980 | 0x1f80000 |
|
| 118 |
+
| 19 | blk.1.ffn_gate.weight | 0x1fdd3980 | 0x1260000 |
|
| 119 |
+
| 20 | blk.1.ffn_norm.weight | 0x21033980 | 0x4000 |
|
| 120 |
+
| 21 | blk.1.ffn_up.weight | 0x21037980 | 0x1260000 |
|
| 121 |
+
| 22 | blk.2.attn_k.weight | 0x22297980 | 0x150000 |
|
| 122 |
+
| 23 | blk.2.attn_norm.weight | 0x223e7980 | 0x4000 |
|
| 123 |
+
| 24 | blk.2.attn_output.weight | 0x223eb980 | 0x6e0000 |
|
| 124 |
+
| 25 | blk.2.attn_q.weight | 0x22acb980 | 0x540000 |
|
| 125 |
+
| 26 | blk.2.attn_v.weight | 0x2300b980 | 0x1b8000 |
|
| 126 |
+
| 27 | blk.2.ffn_down.weight | 0x231c3980 | 0x1810000 |
|
| 127 |
+
| 28 | blk.2.ffn_gate.weight | 0x249d3980 | 0x1260000 |
|
| 128 |
+
| 29 | blk.2.ffn_norm.weight | 0x25c33980 | 0x4000 |
|
| 129 |
+
| 30 | blk.2.ffn_up.weight | 0x25c37980 | 0x1260000 |
|
| 130 |
+
| 31 | blk.3.attn_k.weight | 0x26e97980 | 0x150000 |
|
| 131 |
+
| 32 | blk.3.attn_norm.weight | 0x26fe7980 | 0x4000 |
|
| 132 |
+
| 33 | blk.3.attn_output.weight | 0x26feb980 | 0x6e0000 |
|
| 133 |
+
| 34 | blk.3.attn_q.weight | 0x276cb980 | 0x540000 |
|
| 134 |
+
| 35 | blk.3.attn_v.weight | 0x27c0b980 | 0x1b8000 |
|
| 135 |
+
| 36 | blk.3.ffn_down.weight | 0x27dc3980 | 0x1810000 |
|
| 136 |
+
| 37 | blk.3.ffn_gate.weight | 0x295d3980 | 0x1260000 |
|
| 137 |
+
| 38 | blk.3.ffn_norm.weight | 0x2a833980 | 0x4000 |
|
| 138 |
+
| 39 | blk.3.ffn_up.weight | 0x2a837980 | 0x1260000 |
|
| 139 |
+
| 40 | blk.4.attn_k.weight | 0x2ba97980 | 0x150000 |
|
| 140 |
+
| 41 | blk.4.attn_norm.weight | 0x2bbe7980 | 0x4000 |
|
| 141 |
+
| 42 | blk.4.attn_output.weight | 0x2bbeb980 | 0x6e0000 |
|
| 142 |
+
| 43 | blk.4.attn_q.weight | 0x2c2cb980 | 0x540000 |
|
| 143 |
+
| 44 | blk.4.attn_v.weight | 0x2c80b980 | 0x1b8000 |
|
| 144 |
+
| 45 | blk.4.ffn_down.weight | 0x2c9c3980 | 0x1810000 |
|
| 145 |
+
| 46 | blk.4.ffn_gate.weight | 0x2e1d3980 | 0x1260000 |
|
| 146 |
+
| 47 | blk.4.ffn_norm.weight | 0x2f433980 | 0x4000 |
|
| 147 |
+
| 48 | blk.4.ffn_up.weight | 0x2f437980 | 0x1260000 |
|
| 148 |
+
| 49 | blk.5.attn_k.weight | 0x30697980 | 0x150000 |
|
| 149 |
+
| 50 | blk.5.attn_norm.weight | 0x307e7980 | 0x4000 |
|
| 150 |
+
| 51 | blk.5.attn_output.weight | 0x307eb980 | 0x6e0000 |
|
| 151 |
+
| 52 | blk.5.attn_q.weight | 0x30ecb980 | 0x540000 |
|
| 152 |
+
| 53 | blk.5.attn_v.weight | 0x3140b980 | 0x1b8000 |
|
| 153 |
+
| 54 | blk.5.ffn_down.weight | 0x315c3980 | 0x1810000 |
|
| 154 |
+
| 55 | blk.5.ffn_gate.weight | 0x32dd3980 | 0x1260000 |
|
| 155 |
+
| 56 | blk.5.ffn_norm.weight | 0x34033980 | 0x4000 |
|
| 156 |
+
| 57 | blk.5.ffn_up.weight | 0x34037980 | 0x1260000 |
|
| 157 |
+
| 58 | blk.6.attn_k.weight | 0x35297980 | 0x150000 |
|
| 158 |
+
| 59 | blk.6.attn_norm.weight | 0x353e7980 | 0x4000 |
|
| 159 |
+
| 60 | blk.6.attn_output.weight | 0x353eb980 | 0x6e0000 |
|
| 160 |
+
| 61 | blk.6.attn_q.weight | 0x35acb980 | 0x540000 |
|
| 161 |
+
| 62 | blk.6.attn_v.weight | 0x3600b980 | 0x1b8000 |
|
| 162 |
+
| 63 | blk.6.ffn_down.weight | 0x361c3980 | 0x1810000 |
|
| 163 |
+
| 64 | blk.6.ffn_gate.weight | 0x379d3980 | 0x1260000 |
|
| 164 |
+
| 65 | blk.6.ffn_norm.weight | 0x38c33980 | 0x4000 |
|
| 165 |
+
| 66 | blk.6.ffn_up.weight | 0x38c37980 | 0x1260000 |
|
| 166 |
+
| 67 | blk.7.attn_k.weight | 0x39e97980 | 0x150000 |
|
| 167 |
+
| 68 | blk.7.attn_norm.weight | 0x39fe7980 | 0x4000 |
|
| 168 |
+
| 69 | blk.7.attn_output.weight | 0x39feb980 | 0x6e0000 |
|
| 169 |
+
| 70 | blk.7.attn_q.weight | 0x3a6cb980 | 0x540000 |
|
| 170 |
+
| 71 | blk.7.attn_v.weight | 0x3ac0b980 | 0x1b8000 |
|
| 171 |
+
| 72 | blk.7.ffn_down.weight | 0x3adc3980 | 0x1810000 |
|
| 172 |
+
| 73 | blk.7.ffn_gate.weight | 0x3c5d3980 | 0x1260000 |
|
| 173 |
+
| 74 | blk.7.ffn_norm.weight | 0x3d833980 | 0x4000 |
|
| 174 |
+
| 75 | blk.7.ffn_up.weight | 0x3d837980 | 0x1260000 |
|
| 175 |
+
| 76 | blk.8.attn_k.weight | 0x3ea97980 | 0x150000 |
|
| 176 |
+
| 77 | blk.8.attn_norm.weight | 0x3ebe7980 | 0x4000 |
|
| 177 |
+
| 78 | blk.8.attn_output.weight | 0x3ebeb980 | 0x6e0000 |
|
| 178 |
+
| 79 | blk.8.attn_q.weight | 0x3f2cb980 | 0x540000 |
|
| 179 |
+
| 80 | blk.8.attn_v.weight | 0x3f80b980 | 0x1b8000 |
|
| 180 |
+
| 81 | blk.8.ffn_down.weight | 0x3f9c3980 | 0x1810000 |
|
| 181 |
+
| 82 | blk.8.ffn_gate.weight | 0x411d3980 | 0x1260000 |
|
| 182 |
+
| 83 | blk.8.ffn_norm.weight | 0x42433980 | 0x4000 |
|
| 183 |
+
| 84 | blk.8.ffn_up.weight | 0x42437980 | 0x1260000 |
|
| 184 |
+
| 85 | blk.9.attn_k.weight | 0x43697980 | 0x150000 |
|
| 185 |
+
| 86 | blk.9.attn_norm.weight | 0x437e7980 | 0x4000 |
|
| 186 |
+
| 87 | blk.9.attn_output.weight | 0x437eb980 | 0x6e0000 |
|
| 187 |
+
| 88 | blk.9.attn_q.weight | 0x43ecb980 | 0x540000 |
|
| 188 |
+
| 89 | blk.9.attn_v.weight | 0x4440b980 | 0x1b8000 |
|
| 189 |
+
| 90 | blk.9.ffn_down.weight | 0x445c3980 | 0x1810000 |
|
| 190 |
+
| 91 | blk.9.ffn_gate.weight | 0x45dd3980 | 0x1260000 |
|
| 191 |
+
| 92 | blk.9.ffn_norm.weight | 0x47033980 | 0x4000 |
|
| 192 |
+
| 93 | blk.9.ffn_up.weight | 0x47037980 | 0x1260000 |
|
| 193 |
+
| 94 | blk.10.attn_k.weight | 0x48297980 | 0x150000 |
|
| 194 |
+
| 95 | blk.10.attn_norm.weight | 0x483e7980 | 0x4000 |
|
| 195 |
+
| 96 | blk.10.attn_output.weight | 0x483eb980 | 0x6e0000 |
|
| 196 |
+
| 97 | blk.10.attn_q.weight | 0x48acb980 | 0x540000 |
|
| 197 |
+
| 98 | blk.10.attn_v.weight | 0x4900b980 | 0x1b8000 |
|
| 198 |
+
| 99 | blk.10.ffn_down.weight | 0x491c3980 | 0x1810000 |
|
| 199 |
+
| 100 | blk.10.ffn_gate.weight | 0x4a9d3980 | 0x1260000 |
|
| 200 |
+
| 101 | blk.10.ffn_norm.weight | 0x4bc33980 | 0x4000 |
|
| 201 |
+
| 102 | blk.10.ffn_up.weight | 0x4bc37980 | 0x1260000 |
|
| 202 |
+
| 103 | blk.11.attn_k.weight | 0x4ce97980 | 0x150000 |
|
| 203 |
+
| 104 | blk.11.attn_norm.weight | 0x4cfe7980 | 0x4000 |
|
| 204 |
+
| 105 | blk.11.attn_output.weight | 0x4cfeb980 | 0x6e0000 |
|
| 205 |
+
| 106 | blk.11.attn_q.weight | 0x4d6cb980 | 0x540000 |
|
| 206 |
+
| 107 | blk.11.attn_v.weight | 0x4dc0b980 | 0x1b8000 |
|
| 207 |
+
| 108 | blk.11.ffn_down.weight | 0x4ddc3980 | 0x1810000 |
|
| 208 |
+
| 109 | blk.11.ffn_gate.weight | 0x4f5d3980 | 0x1260000 |
|
| 209 |
+
| 110 | blk.11.ffn_norm.weight | 0x50833980 | 0x4000 |
|
| 210 |
+
| 111 | blk.11.ffn_up.weight | 0x50837980 | 0x1260000 |
|
| 211 |
+
| 112 | blk.12.attn_k.weight | 0x51a97980 | 0x150000 |
|
| 212 |
+
| 113 | blk.12.attn_norm.weight | 0x51be7980 | 0x4000 |
|
| 213 |
+
| 114 | blk.12.attn_output.weight | 0x51beb980 | 0x6e0000 |
|
| 214 |
+
| 115 | blk.12.attn_q.weight | 0x522cb980 | 0x540000 |
|
| 215 |
+
| 116 | blk.12.attn_v.weight | 0x5280b980 | 0x1b8000 |
|
| 216 |
+
| 117 | blk.12.ffn_down.weight | 0x529c3980 | 0x1810000 |
|
| 217 |
+
| 118 | blk.12.ffn_gate.weight | 0x541d3980 | 0x1260000 |
|
| 218 |
+
| 119 | blk.12.ffn_norm.weight | 0x55433980 | 0x4000 |
|
| 219 |
+
| 120 | blk.12.ffn_up.weight | 0x55437980 | 0x1260000 |
|
| 220 |
+
| 121 | blk.13.attn_k.weight | 0x56697980 | 0x1b8000 |
|
| 221 |
+
| 122 | blk.13.attn_norm.weight | 0x5684f980 | 0x4000 |
|
| 222 |
+
| 123 | blk.13.attn_output.weight | 0x56853980 | 0x6e0000 |
|
| 223 |
+
| 124 | blk.13.attn_q.weight | 0x56f33980 | 0x6e0000 |
|
| 224 |
+
| 125 | blk.13.attn_v.weight | 0x57613980 | 0x1b8000 |
|
| 225 |
+
| 126 | blk.13.ffn_down.weight | 0x577cb980 | 0x1810000 |
|
| 226 |
+
| 127 | blk.13.ffn_gate.weight | 0x58fdb980 | 0x1260000 |
|
| 227 |
+
| 128 | blk.13.ffn_norm.weight | 0x5a23b980 | 0x4000 |
|
| 228 |
+
| 129 | blk.13.ffn_up.weight | 0x5a23f980 | 0x1260000 |
|
| 229 |
+
| 130 | blk.14.attn_k.weight | 0x5b49f980 | 0x1b8000 |
|
| 230 |
+
| 131 | blk.14.attn_norm.weight | 0x5b657980 | 0x4000 |
|
| 231 |
+
| 132 | blk.14.attn_output.weight | 0x5b65b980 | 0x6e0000 |
|
| 232 |
+
| 133 | blk.14.attn_q.weight | 0x5bd3b980 | 0x6e0000 |
|
| 233 |
+
| 134 | blk.14.attn_v.weight | 0x5c41b980 | 0x1b8000 |
|
| 234 |
+
| 135 | blk.14.ffn_down.weight | 0x5c5d3980 | 0x1810000 |
|
| 235 |
+
| 136 | blk.14.ffn_gate.weight | 0x5dde3980 | 0x1260000 |
|
| 236 |
+
| 137 | blk.14.ffn_norm.weight | 0x5f043980 | 0x4000 |
|
| 237 |
+
| 138 | blk.14.ffn_up.weight | 0x5f047980 | 0x1260000 |
|
| 238 |
+
| 139 | blk.15.attn_k.weight | 0x602a7980 | 0x150000 |
|
| 239 |
+
| 140 | blk.15.attn_norm.weight | 0x603f7980 | 0x4000 |
|
| 240 |
+
| 141 | blk.15.attn_output.weight | 0x603fb980 | 0x6e0000 |
|
| 241 |
+
| 142 | blk.15.attn_q.weight | 0x60adb980 | 0x540000 |
|
| 242 |
+
| 143 | blk.15.attn_v.weight | 0x6101b980 | 0x1b8000 |
|
| 243 |
+
| 144 | blk.15.ffn_down.weight | 0x611d3980 | 0x1810000 |
|
| 244 |
+
| 145 | blk.15.ffn_gate.weight | 0x629e3980 | 0x1260000 |
|
| 245 |
+
| 146 | blk.15.ffn_norm.weight | 0x63c43980 | 0x4000 |
|
| 246 |
+
| 147 | blk.15.ffn_up.weight | 0x63c47980 | 0x1260000 |
|
| 247 |
+
| 148 | blk.16.attn_k.weight | 0x64ea7980 | 0x1b8000 |
|
| 248 |
+
| 149 | blk.16.attn_norm.weight | 0x6505f980 | 0x4000 |
|
| 249 |
+
| 150 | blk.16.attn_output.weight | 0x65063980 | 0x6e0000 |
|
| 250 |
+
| 151 | blk.16.attn_q.weight | 0x65743980 | 0x6e0000 |
|
| 251 |
+
| 152 | blk.16.attn_v.weight | 0x65e23980 | 0x1b8000 |
|
| 252 |
+
| 153 | blk.16.ffn_down.weight | 0x65fdb980 | 0x1810000 |
|
| 253 |
+
| 154 | blk.16.ffn_gate.weight | 0x677eb980 | 0x1810000 |
|
| 254 |
+
| 155 | blk.16.ffn_norm.weight | 0x68ffb980 | 0x4000 |
|
| 255 |
+
| 156 | blk.16.ffn_up.weight | 0x68fff980 | 0x1810000 |
|
| 256 |
+
| 157 | blk.17.attn_k.weight | 0x6a80f980 | 0x150000 |
|
| 257 |
+
| 158 | blk.17.attn_norm.weight | 0x6a95f980 | 0x4000 |
|
| 258 |
+
| 159 | blk.17.attn_output.weight | 0x6a963980 | 0x6e0000 |
|
| 259 |
+
| 160 | blk.17.attn_q.weight | 0x6b043980 | 0x540000 |
|
| 260 |
+
| 161 | blk.17.attn_v.weight | 0x6b583980 | 0x1b8000 |
|
| 261 |
+
| 162 | blk.17.ffn_down.weight | 0x6b73b980 | 0x1f80000 |
|
| 262 |
+
| 163 | blk.17.ffn_gate.weight | 0x6d6bb980 | 0x1810000 |
|
| 263 |
+
| 164 | blk.17.ffn_norm.weight | 0x6eecb980 | 0x4000 |
|
| 264 |
+
| 165 | blk.17.ffn_up.weight | 0x6eecf980 | 0x1810000 |
|
| 265 |
+
| 166 | blk.18.attn_k.weight | 0x706df980 | 0x1b8000 |
|
| 266 |
+
| 167 | blk.18.attn_norm.weight | 0x70897980 | 0x4000 |
|
| 267 |
+
| 168 | blk.18.attn_output.weight | 0x7089b980 | 0x6e0000 |
|
| 268 |
+
| 169 | blk.18.attn_q.weight | 0x70f7b980 | 0x6e0000 |
|
| 269 |
+
| 170 | blk.18.attn_v.weight | 0x7165b980 | 0x1b8000 |
|
| 270 |
+
| 171 | blk.18.ffn_down.weight | 0x71813980 | 0x1f80000 |
|
| 271 |
+
| 172 | blk.18.ffn_gate.weight | 0x73793980 | 0x1810000 |
|
| 272 |
+
| 173 | blk.18.ffn_norm.weight | 0x74fa3980 | 0x4000 |
|
| 273 |
+
| 174 | blk.18.ffn_up.weight | 0x74fa7980 | 0x1810000 |
|
| 274 |
+
| 175 | blk.19.attn_k.weight | 0x767b7980 | 0x1b8000 |
|
| 275 |
+
| 176 | blk.19.attn_norm.weight | 0x7696f980 | 0x4000 |
|
| 276 |
+
| 177 | blk.19.attn_output.weight | 0x76973980 | 0x6e0000 |
|
| 277 |
+
| 178 | blk.19.attn_q.weight | 0x77053980 | 0x6e0000 |
|
| 278 |
+
| 179 | blk.19.attn_v.weight | 0x77733980 | 0x1b8000 |
|
| 279 |
+
| 180 | blk.19.ffn_down.weight | 0x778eb980 | 0x1f80000 |
|
| 280 |
+
| 181 | blk.19.ffn_gate.weight | 0x7986b980 | 0x1810000 |
|
| 281 |
+
| 182 | blk.19.ffn_norm.weight | 0x7b07b980 | 0x4000 |
|
| 282 |
+
| 183 | blk.19.ffn_up.weight | 0x7b07f980 | 0x1810000 |
|
| 283 |
+
| 184 | blk.20.attn_k.weight | 0x7c88f980 | 0x1b8000 |
|
| 284 |
+
| 185 | blk.20.attn_norm.weight | 0x7ca47980 | 0x4000 |
|
| 285 |
+
| 186 | blk.20.attn_output.weight | 0x7ca4b980 | 0x6e0000 |
|
| 286 |
+
| 187 | blk.20.attn_q.weight | 0x7d12b980 | 0x6e0000 |
|
| 287 |
+
| 188 | blk.20.attn_v.weight | 0x7d80b980 | 0x1b8000 |
|
| 288 |
+
| 189 | blk.20.ffn_down.weight | 0x7d9c3980 | 0x1f80000 |
|
| 289 |
+
| 190 | blk.20.ffn_gate.weight | 0x7f943980 | 0x1810000 |
|
| 290 |
+
| 191 | blk.20.ffn_norm.weight | 0x81153980 | 0x4000 |
|
| 291 |
+
| 192 | blk.20.ffn_up.weight | 0x81157980 | 0x1810000 |
|
| 292 |
+
| 193 | blk.21.attn_k.weight | 0x82967980 | 0x1b8000 |
|
| 293 |
+
| 194 | blk.21.attn_norm.weight | 0x82b1f980 | 0x4000 |
|
| 294 |
+
| 195 | blk.21.attn_output.weight | 0x82b23980 | 0x6e0000 |
|
| 295 |
+
| 196 | blk.21.attn_q.weight | 0x83203980 | 0x6e0000 |
|
| 296 |
+
| 197 | blk.21.attn_v.weight | 0x838e3980 | 0x1b8000 |
|
| 297 |
+
| 198 | blk.21.ffn_down.weight | 0x83a9b980 | 0x1f80000 |
|
| 298 |
+
| 199 | blk.21.ffn_gate.weight | 0x85a1b980 | 0x1810000 |
|
| 299 |
+
| 200 | blk.21.ffn_norm.weight | 0x8722b980 | 0x4000 |
|
| 300 |
+
| 201 | blk.21.ffn_up.weight | 0x8722f980 | 0x1810000 |
|
| 301 |
+
| 202 | blk.22.attn_k.weight | 0x88a3f980 | 0x1b8000 |
|
| 302 |
+
| 203 | blk.22.attn_norm.weight | 0x88bf7980 | 0x4000 |
|
| 303 |
+
| 204 | blk.22.attn_output.weight | 0x88bfb980 | 0x6e0000 |
|
| 304 |
+
| 205 | blk.22.attn_q.weight | 0x892db980 | 0x6e0000 |
|
| 305 |
+
| 206 | blk.22.attn_v.weight | 0x899bb980 | 0x1b8000 |
|
| 306 |
+
| 207 | blk.22.ffn_down.weight | 0x89b73980 | 0x1f80000 |
|
| 307 |
+
| 208 | blk.22.ffn_gate.weight | 0x8baf3980 | 0x1810000 |
|
| 308 |
+
| 209 | blk.22.ffn_norm.weight | 0x8d303980 | 0x4000 |
|
| 309 |
+
| 210 | blk.22.ffn_up.weight | 0x8d307980 | 0x1810000 |
|
| 310 |
+
| 211 | blk.23.attn_k.weight | 0x8eb17980 | 0x1b8000 |
|
| 311 |
+
| 212 | blk.23.attn_norm.weight | 0x8eccf980 | 0x4000 |
|
| 312 |
+
| 213 | blk.23.attn_output.weight | 0x8ecd3980 | 0x6e0000 |
|
| 313 |
+
| 214 | blk.23.attn_q.weight | 0x8f3b3980 | 0x6e0000 |
|
| 314 |
+
| 215 | blk.23.attn_v.weight | 0x8fa93980 | 0x1b8000 |
|
| 315 |
+
| 216 | blk.23.ffn_down.weight | 0x8fc4b980 | 0x1f80000 |
|
| 316 |
+
| 217 | blk.23.ffn_gate.weight | 0x91bcb980 | 0x1810000 |
|
| 317 |
+
| 218 | blk.23.ffn_norm.weight | 0x933db980 | 0x4000 |
|
| 318 |
+
| 219 | blk.23.ffn_up.weight | 0x933df980 | 0x1810000 |
|
| 319 |
+
| 220 | blk.24.attn_k.weight | 0x94bef980 | 0x1b8000 |
|
| 320 |
+
| 221 | blk.24.attn_norm.weight | 0x94da7980 | 0x4000 |
|
| 321 |
+
| 222 | blk.24.attn_output.weight | 0x94dab980 | 0x6e0000 |
|
| 322 |
+
| 223 | blk.24.attn_q.weight | 0x9548b980 | 0x6e0000 |
|
| 323 |
+
| 224 | blk.24.attn_v.weight | 0x95b6b980 | 0x1b8000 |
|
| 324 |
+
| 225 | blk.24.ffn_down.weight | 0x95d23980 | 0x1f80000 |
|
| 325 |
+
| 226 | blk.24.ffn_gate.weight | 0x97ca3980 | 0x1810000 |
|
| 326 |
+
| 227 | blk.24.ffn_norm.weight | 0x994b3980 | 0x4000 |
|
| 327 |
+
| 228 | blk.24.ffn_up.weight | 0x994b7980 | 0x1810000 |
|
| 328 |
+
| 229 | blk.25.attn_k.weight | 0x9acc7980 | 0x1b8000 |
|
| 329 |
+
| 230 | blk.25.attn_norm.weight | 0x9ae7f980 | 0x4000 |
|
| 330 |
+
| 231 | blk.25.attn_output.weight | 0x9ae83980 | 0x6e0000 |
|
| 331 |
+
| 232 | blk.25.attn_q.weight | 0x9b563980 | 0x6e0000 |
|
| 332 |
+
| 233 | blk.25.attn_v.weight | 0x9bc43980 | 0x1b8000 |
|
| 333 |
+
| 234 | blk.25.ffn_down.weight | 0x9bdfb980 | 0x1f80000 |
|
| 334 |
+
| 235 | blk.25.ffn_gate.weight | 0x9dd7b980 | 0x1810000 |
|
| 335 |
+
| 236 | blk.25.ffn_norm.weight | 0x9f58b980 | 0x4000 |
|
| 336 |
+
| 237 | blk.25.ffn_up.weight | 0x9f58f980 | 0x1810000 |
|
| 337 |
+
| 238 | blk.26.attn_k.weight | 0xa0d9f980 | 0x1b8000 |
|
| 338 |
+
| 239 | blk.26.attn_norm.weight | 0xa0f57980 | 0x4000 |
|
| 339 |
+
| 240 | blk.26.attn_output.weight | 0xa0f5b980 | 0x6e0000 |
|
| 340 |
+
| 241 | blk.26.attn_q.weight | 0xa163b980 | 0x6e0000 |
|
| 341 |
+
| 242 | blk.26.attn_v.weight | 0xa1d1b980 | 0x1b8000 |
|
| 342 |
+
| 243 | blk.26.ffn_down.weight | 0xa1ed3980 | 0x1f80000 |
|
| 343 |
+
| 244 | blk.26.ffn_gate.weight | 0xa3e53980 | 0x1810000 |
|
| 344 |
+
| 245 | blk.26.ffn_norm.weight | 0xa5663980 | 0x4000 |
|
| 345 |
+
| 246 | blk.26.ffn_up.weight | 0xa5667980 | 0x1810000 |
|
| 346 |
+
| 247 | blk.27.attn_k.weight | 0xa6e77980 | 0x1b8000 |
|
| 347 |
+
| 248 | blk.27.attn_norm.weight | 0xa702f980 | 0x4000 |
|
| 348 |
+
| 249 | blk.27.attn_output.weight | 0xa7033980 | 0x6e0000 |
|
| 349 |
+
| 250 | blk.27.attn_q.weight | 0xa7713980 | 0x6e0000 |
|
| 350 |
+
| 251 | blk.27.attn_v.weight | 0xa7df3980 | 0x1b8000 |
|
| 351 |
+
| 252 | blk.27.ffn_down.weight | 0xa7fab980 | 0x1f80000 |
|
| 352 |
+
| 253 | blk.27.ffn_gate.weight | 0xa9f2b980 | 0x1810000 |
|
| 353 |
+
| 254 | blk.27.ffn_norm.weight | 0xab73b980 | 0x4000 |
|
| 354 |
+
| 255 | blk.27.ffn_up.weight | 0xab73f980 | 0x1810000 |
|
| 355 |
+
| 256 | blk.28.attn_k.weight | 0xacf4f980 | 0x1b8000 |
|
| 356 |
+
| 257 | blk.28.attn_norm.weight | 0xad107980 | 0x4000 |
|
| 357 |
+
| 258 | blk.28.attn_output.weight | 0xad10b980 | 0x6e0000 |
|
| 358 |
+
| 259 | blk.28.attn_q.weight | 0xad7eb980 | 0x6e0000 |
|
| 359 |
+
| 260 | blk.28.attn_v.weight | 0xadecb980 | 0x1b8000 |
|
| 360 |
+
| 261 | blk.28.ffn_down.weight | 0xae083980 | 0x1f80000 |
|
| 361 |
+
| 262 | blk.28.ffn_gate.weight | 0xb0003980 | 0x1810000 |
|
| 362 |
+
| 263 | blk.28.ffn_norm.weight | 0xb1813980 | 0x4000 |
|
| 363 |
+
| 264 | blk.28.ffn_up.weight | 0xb1817980 | 0x1810000 |
|
| 364 |
+
| 265 | blk.29.attn_k.weight | 0xb3027980 | 0x1b8000 |
|
| 365 |
+
| 266 | blk.29.attn_norm.weight | 0xb31df980 | 0x4000 |
|
| 366 |
+
| 267 | blk.29.attn_output.weight | 0xb31e3980 | 0x6e0000 |
|
| 367 |
+
| 268 | blk.29.attn_q.weight | 0xb38c3980 | 0x6e0000 |
|
| 368 |
+
| 269 | blk.29.attn_v.weight | 0xb3fa3980 | 0x1b8000 |
|
| 369 |
+
| 270 | blk.29.ffn_down.weight | 0xb415b980 | 0x1f80000 |
|
| 370 |
+
| 271 | blk.29.ffn_gate.weight | 0xb60db980 | 0x1810000 |
|
| 371 |
+
| 272 | blk.29.ffn_norm.weight | 0xb78eb980 | 0x4000 |
|
| 372 |
+
| 273 | blk.29.ffn_up.weight | 0xb78ef980 | 0x1810000 |
|
| 373 |
+
| 274 | blk.30.attn_k.weight | 0xb90ff980 | 0x1b8000 |
|
| 374 |
+
| 275 | blk.30.attn_norm.weight | 0xb92b7980 | 0x4000 |
|
| 375 |
+
| 276 | blk.30.attn_output.weight | 0xb92bb980 | 0x6e0000 |
|
| 376 |
+
| 277 | blk.30.attn_q.weight | 0xb999b980 | 0x6e0000 |
|
| 377 |
+
| 278 | blk.30.attn_v.weight | 0xba07b980 | 0x1b8000 |
|
| 378 |
+
| 279 | blk.30.ffn_down.weight | 0xba233980 | 0x1f80000 |
|
| 379 |
+
| 280 | blk.30.ffn_gate.weight | 0xbc1b3980 | 0x1810000 |
|
| 380 |
+
| 281 | blk.30.ffn_norm.weight | 0xbd9c3980 | 0x4000 |
|
| 381 |
+
| 282 | blk.30.ffn_up.weight | 0xbd9c7980 | 0x1810000 |
|
| 382 |
+
| 283 | blk.31.attn_k.weight | 0xbf1d7980 | 0x150000 |
|
| 383 |
+
| 284 | blk.31.attn_norm.weight | 0xbf327980 | 0x4000 |
|
| 384 |
+
| 285 | blk.31.attn_output.weight | 0xbf32b980 | 0x6e0000 |
|
| 385 |
+
| 286 | blk.31.attn_q.weight | 0xbfa0b980 | 0x540000 |
|
| 386 |
+
| 287 | blk.31.attn_v.weight | 0xbff4b980 | 0x1b8000 |
|
| 387 |
+
| 288 | blk.31.ffn_down.weight | 0xc0103980 | 0x1f80000 |
|
| 388 |
+
| 289 | blk.31.ffn_gate.weight | 0xc2083980 | 0x1810000 |
|
| 389 |
+
| 290 | blk.31.ffn_norm.weight | 0xc3893980 | 0x4000 |
|
| 390 |
+
| 291 | blk.31.ffn_up.weight | 0xc3897980 | 0x1810000 |
|
| 391 |
+
|
| 392 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 393 |
+
|
| 394 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 395 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:-----|
|
| 396 |
+
| 0 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q3_K |
|
| 397 |
+
| 1 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 398 |
+
| 2 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 399 |
+
| 3 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q2_K |
|
| 400 |
+
|
| 401 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 402 |
+
- Percentage of total elements: 13.08%
|
| 403 |
+
|
| 404 |
+
|
| 405 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 406 |
+
|
| 407 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 408 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 409 |
+
| 4 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 410 |
+
| 5 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 411 |
+
| 6 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 412 |
+
| 7 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 413 |
+
| 8 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 414 |
+
| 9 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 415 |
+
| 10 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 416 |
+
| 11 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 417 |
+
| 12 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 418 |
+
|
| 419 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 420 |
+
- Percentage of total elements: 2.72%
|
| 421 |
+
|
| 422 |
+
|
| 423 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 424 |
+
|
| 425 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 426 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 427 |
+
| 13 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 428 |
+
| 14 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 429 |
+
| 15 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 430 |
+
| 16 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 431 |
+
| 17 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 432 |
+
| 18 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 433 |
+
| 19 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 434 |
+
| 20 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 435 |
+
| 21 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 436 |
+
|
| 437 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 438 |
+
- Percentage of total elements: 2.72%
|
| 439 |
+
|
| 440 |
+
|
| 441 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 442 |
+
|
| 443 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 444 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 445 |
+
| 22 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 446 |
+
| 23 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 447 |
+
| 24 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 448 |
+
| 25 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 449 |
+
| 26 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 450 |
+
| 27 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 451 |
+
| 28 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 452 |
+
| 29 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 453 |
+
| 30 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 454 |
+
|
| 455 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 456 |
+
- Percentage of total elements: 2.72%
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 460 |
+
|
| 461 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 462 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 463 |
+
| 31 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 464 |
+
| 32 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 465 |
+
| 33 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 466 |
+
| 34 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 467 |
+
| 35 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 468 |
+
| 36 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 469 |
+
| 37 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 470 |
+
| 38 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 471 |
+
| 39 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 472 |
+
|
| 473 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 474 |
+
- Percentage of total elements: 2.72%
|
| 475 |
+
|
| 476 |
+
|
| 477 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 478 |
+
|
| 479 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 480 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 481 |
+
| 40 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 482 |
+
| 41 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 483 |
+
| 42 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 484 |
+
| 43 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 485 |
+
| 44 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 486 |
+
| 45 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 487 |
+
| 46 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 488 |
+
| 47 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 489 |
+
| 48 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 490 |
+
|
| 491 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 492 |
+
- Percentage of total elements: 2.72%
|
| 493 |
+
|
| 494 |
+
|
| 495 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 496 |
+
|
| 497 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 498 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 499 |
+
| 49 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 500 |
+
| 50 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 501 |
+
| 51 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 502 |
+
| 52 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 503 |
+
| 53 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 504 |
+
| 54 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 505 |
+
| 55 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 506 |
+
| 56 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 507 |
+
| 57 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 508 |
+
|
| 509 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 510 |
+
- Percentage of total elements: 2.72%
|
| 511 |
+
|
| 512 |
+
|
| 513 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 514 |
+
|
| 515 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 516 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 517 |
+
| 58 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 518 |
+
| 59 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 519 |
+
| 60 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 520 |
+
| 61 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 521 |
+
| 62 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 522 |
+
| 63 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 523 |
+
| 64 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 524 |
+
| 65 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 525 |
+
| 66 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 526 |
+
|
| 527 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 528 |
+
- Percentage of total elements: 2.72%
|
| 529 |
+
|
| 530 |
+
|
| 531 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 532 |
+
|
| 533 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 534 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 535 |
+
| 67 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 536 |
+
| 68 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 537 |
+
| 69 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 538 |
+
| 70 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 539 |
+
| 71 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 540 |
+
| 72 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 541 |
+
| 73 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 542 |
+
| 74 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 543 |
+
| 75 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 544 |
+
|
| 545 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 546 |
+
- Percentage of total elements: 2.72%
|
| 547 |
+
|
| 548 |
+
|
| 549 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 550 |
+
|
| 551 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 552 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 553 |
+
| 76 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 554 |
+
| 77 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 555 |
+
| 78 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 556 |
+
| 79 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 557 |
+
| 80 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 558 |
+
| 81 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 559 |
+
| 82 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 560 |
+
| 83 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 561 |
+
| 84 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 562 |
+
|
| 563 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 564 |
+
- Percentage of total elements: 2.72%
|
| 565 |
+
|
| 566 |
+
|
| 567 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 568 |
+
|
| 569 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 570 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 571 |
+
| 85 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 572 |
+
| 86 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 573 |
+
| 87 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 574 |
+
| 88 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 575 |
+
| 89 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 576 |
+
| 90 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 577 |
+
| 91 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 578 |
+
| 92 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 579 |
+
| 93 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 580 |
+
|
| 581 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 582 |
+
- Percentage of total elements: 2.72%
|
| 583 |
+
|
| 584 |
+
|
| 585 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 586 |
+
|
| 587 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 588 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 589 |
+
| 94 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 590 |
+
| 95 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 591 |
+
| 96 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 592 |
+
| 97 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 593 |
+
| 98 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 594 |
+
| 99 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 595 |
+
| 100 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 596 |
+
| 101 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 597 |
+
| 102 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 598 |
+
|
| 599 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 600 |
+
- Percentage of total elements: 2.72%
|
| 601 |
+
|
| 602 |
+
|
| 603 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 604 |
+
|
| 605 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 606 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 607 |
+
| 103 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 608 |
+
| 104 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 609 |
+
| 105 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 610 |
+
| 106 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 611 |
+
| 107 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 612 |
+
| 108 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 613 |
+
| 109 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 614 |
+
| 110 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 615 |
+
| 111 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 616 |
+
|
| 617 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 618 |
+
- Percentage of total elements: 2.72%
|
| 619 |
+
|
| 620 |
+
|
| 621 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 622 |
+
|
| 623 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 624 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 625 |
+
| 112 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 626 |
+
| 113 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 627 |
+
| 114 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 628 |
+
| 115 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 629 |
+
| 116 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 630 |
+
| 117 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 631 |
+
| 118 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 632 |
+
| 119 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 633 |
+
| 120 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 634 |
+
|
| 635 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 636 |
+
- Percentage of total elements: 2.72%
|
| 637 |
+
|
| 638 |
+
|
| 639 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 640 |
+
|
| 641 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 642 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 643 |
+
| 121 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 644 |
+
| 122 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 645 |
+
| 123 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 646 |
+
| 124 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 647 |
+
| 125 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 648 |
+
| 126 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 649 |
+
| 127 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 650 |
+
| 128 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 651 |
+
| 129 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 652 |
+
|
| 653 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 654 |
+
- Percentage of total elements: 2.72%
|
| 655 |
+
|
| 656 |
+
|
| 657 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 658 |
+
|
| 659 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 660 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 661 |
+
| 130 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 662 |
+
| 131 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 663 |
+
| 132 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 664 |
+
| 133 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 665 |
+
| 134 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 666 |
+
| 135 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 667 |
+
| 136 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 668 |
+
| 137 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 669 |
+
| 138 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 670 |
+
|
| 671 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 672 |
+
- Percentage of total elements: 2.72%
|
| 673 |
+
|
| 674 |
+
|
| 675 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 676 |
+
|
| 677 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 678 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 679 |
+
| 139 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 680 |
+
| 140 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 681 |
+
| 141 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 682 |
+
| 142 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 683 |
+
| 143 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 684 |
+
| 144 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 685 |
+
| 145 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 686 |
+
| 146 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 687 |
+
| 147 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q2_K |
|
| 688 |
+
|
| 689 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 690 |
+
- Percentage of total elements: 2.72%
|
| 691 |
+
|
| 692 |
+
|
| 693 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 694 |
+
|
| 695 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 696 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 697 |
+
| 148 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 698 |
+
| 149 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 699 |
+
| 150 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 700 |
+
| 151 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 701 |
+
| 152 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 702 |
+
| 153 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q3_K |
|
| 703 |
+
| 154 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 704 |
+
| 155 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 705 |
+
| 156 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 706 |
+
|
| 707 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 708 |
+
- Percentage of total elements: 2.72%
|
| 709 |
+
|
| 710 |
+
|
| 711 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 712 |
+
|
| 713 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 714 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 715 |
+
| 157 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 716 |
+
| 158 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 717 |
+
| 159 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 718 |
+
| 160 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 719 |
+
| 161 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 720 |
+
| 162 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 721 |
+
| 163 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 722 |
+
| 164 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 723 |
+
| 165 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 724 |
+
|
| 725 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 726 |
+
- Percentage of total elements: 2.72%
|
| 727 |
+
|
| 728 |
+
|
| 729 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 730 |
+
|
| 731 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 732 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 733 |
+
| 166 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 734 |
+
| 167 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 735 |
+
| 168 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 736 |
+
| 169 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 737 |
+
| 170 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 738 |
+
| 171 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 739 |
+
| 172 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 740 |
+
| 173 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 741 |
+
| 174 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 742 |
+
|
| 743 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 744 |
+
- Percentage of total elements: 2.72%
|
| 745 |
+
|
| 746 |
+
|
| 747 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 748 |
+
|
| 749 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 750 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 751 |
+
| 175 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 752 |
+
| 176 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 177 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 754 |
+
| 178 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 755 |
+
| 179 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 756 |
+
| 180 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 757 |
+
| 181 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 758 |
+
| 182 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 759 |
+
| 183 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 760 |
+
|
| 761 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 762 |
+
- Percentage of total elements: 2.72%
|
| 763 |
+
|
| 764 |
+
|
| 765 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 766 |
+
|
| 767 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 768 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 769 |
+
| 184 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 770 |
+
| 185 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 771 |
+
| 186 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 772 |
+
| 187 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 773 |
+
| 188 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 774 |
+
| 189 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 775 |
+
| 190 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 776 |
+
| 191 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 777 |
+
| 192 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 778 |
+
|
| 779 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 780 |
+
- Percentage of total elements: 2.72%
|
| 781 |
+
|
| 782 |
+
|
| 783 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 784 |
+
|
| 785 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 786 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 787 |
+
| 193 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 788 |
+
| 194 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 789 |
+
| 195 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 790 |
+
| 196 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 791 |
+
| 197 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 792 |
+
| 198 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 793 |
+
| 199 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 794 |
+
| 200 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 795 |
+
| 201 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 796 |
+
|
| 797 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 798 |
+
- Percentage of total elements: 2.72%
|
| 799 |
+
|
| 800 |
+
|
| 801 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 802 |
+
|
| 803 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 804 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 805 |
+
| 202 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 806 |
+
| 203 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 807 |
+
| 204 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 808 |
+
| 205 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 809 |
+
| 206 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 810 |
+
| 207 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 811 |
+
| 208 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 812 |
+
| 209 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 813 |
+
| 210 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 814 |
+
|
| 815 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 816 |
+
- Percentage of total elements: 2.72%
|
| 817 |
+
|
| 818 |
+
|
| 819 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 820 |
+
|
| 821 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 822 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 823 |
+
| 211 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 824 |
+
| 212 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 825 |
+
| 213 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 826 |
+
| 214 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 827 |
+
| 215 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 828 |
+
| 216 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 829 |
+
| 217 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 830 |
+
| 218 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 831 |
+
| 219 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 832 |
+
|
| 833 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 834 |
+
- Percentage of total elements: 2.72%
|
| 835 |
+
|
| 836 |
+
|
| 837 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 838 |
+
|
| 839 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 840 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 841 |
+
| 220 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 842 |
+
| 221 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 843 |
+
| 222 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 844 |
+
| 223 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 845 |
+
| 224 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 846 |
+
| 225 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 847 |
+
| 226 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 848 |
+
| 227 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 849 |
+
| 228 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 850 |
+
|
| 851 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 852 |
+
- Percentage of total elements: 2.72%
|
| 853 |
+
|
| 854 |
+
|
| 855 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 856 |
+
|
| 857 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 858 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 859 |
+
| 229 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 860 |
+
| 230 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 861 |
+
| 231 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 862 |
+
| 232 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 863 |
+
| 233 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 864 |
+
| 234 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 865 |
+
| 235 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 866 |
+
| 236 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 867 |
+
| 237 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 868 |
+
|
| 869 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 870 |
+
- Percentage of total elements: 2.72%
|
| 871 |
+
|
| 872 |
+
|
| 873 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 874 |
+
|
| 875 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 876 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 877 |
+
| 238 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 878 |
+
| 239 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 879 |
+
| 240 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 880 |
+
| 241 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 881 |
+
| 242 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 882 |
+
| 243 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 883 |
+
| 244 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 884 |
+
| 245 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 885 |
+
| 246 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 886 |
+
|
| 887 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 888 |
+
- Percentage of total elements: 2.72%
|
| 889 |
+
|
| 890 |
+
|
| 891 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 892 |
+
|
| 893 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 894 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 895 |
+
| 247 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 896 |
+
| 248 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 897 |
+
| 249 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 898 |
+
| 250 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 899 |
+
| 251 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 900 |
+
| 252 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 901 |
+
| 253 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 902 |
+
| 254 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 903 |
+
| 255 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 904 |
+
|
| 905 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 906 |
+
- Percentage of total elements: 2.72%
|
| 907 |
+
|
| 908 |
+
|
| 909 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 910 |
+
|
| 911 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 912 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 913 |
+
| 256 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 914 |
+
| 257 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 915 |
+
| 258 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 916 |
+
| 259 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 917 |
+
| 260 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 918 |
+
| 261 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 919 |
+
| 262 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 920 |
+
| 263 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 921 |
+
| 264 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 922 |
+
|
| 923 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 924 |
+
- Percentage of total elements: 2.72%
|
| 925 |
+
|
| 926 |
+
|
| 927 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 928 |
+
|
| 929 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 930 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 931 |
+
| 265 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 932 |
+
| 266 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 933 |
+
| 267 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 934 |
+
| 268 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 935 |
+
| 269 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 936 |
+
| 270 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 937 |
+
| 271 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 938 |
+
| 272 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 939 |
+
| 273 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 940 |
+
|
| 941 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 942 |
+
- Percentage of total elements: 2.72%
|
| 943 |
+
|
| 944 |
+
|
| 945 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 946 |
+
|
| 947 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 948 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 949 |
+
| 274 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 950 |
+
| 275 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 951 |
+
| 276 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 952 |
+
| 277 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 953 |
+
| 278 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 954 |
+
| 279 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 955 |
+
| 280 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 956 |
+
| 281 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 957 |
+
| 282 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 958 |
+
|
| 959 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 960 |
+
- Percentage of total elements: 2.72%
|
| 961 |
+
|
| 962 |
+
|
| 963 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 964 |
+
|
| 965 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 966 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 967 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q2_K |
|
| 968 |
+
| 284 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 969 |
+
| 285 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 970 |
+
| 286 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q2_K |
|
| 971 |
+
| 287 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 972 |
+
| 288 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 973 |
+
| 289 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 974 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 975 |
+
| 291 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 976 |
+
|
| 977 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 978 |
+
- Percentage of total elements: 2.72%
|
scores/Llama-Guard-3-8B-Q4_K_M.md
ADDED
|
@@ -0,0 +1,978 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-Q4_K_M.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 39 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 36 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 36 |
+
| 26 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 38 |
+
| 28 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 39 |
+
| 29 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 40 |
+
| 30 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 41 |
+
| 31 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 43 |
+
| 33 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 44 |
+
| 34 | UINT32 | 1 | general.quantization_version | 2 |
|
| 45 |
+
| 35 | UINT32 | 1 | general.file_type | 15 |
|
| 46 |
+
| 36 | STRING | 1 | quantize.imatrix.file | `./imatrix/imatrix-Llama-Guard-3-8B-small.dat` |
|
| 47 |
+
| 37 | STRING | 1 | quantize.imatrix.dataset | `../../datasets/imatrix/calibration_eur_small.txt` |
|
| 48 |
+
| 38 | INT32 | 1 | quantize.imatrix.entries_count | 224 |
|
| 49 |
+
| 39 | INT32 | 1 | quantize.imatrix.chunks_count | 962 |
|
| 50 |
+
|
| 51 |
+
## Tensors Overview ~8B Elements
|
| 52 |
+
|
| 53 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 54 |
+
|
| 55 |
+
- [Llama-Guard-3-8B-Q4\_K\_M.gguf - GGUF Internal File Dump](#llama-guard-3-8b-q4_k_mgguf---gguf-internal-file-dump)
|
| 56 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 57 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 58 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 59 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 60 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 61 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 62 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 63 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 64 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 65 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 66 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 67 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 68 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 69 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 70 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 71 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 72 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 73 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 74 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 75 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 76 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 77 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 78 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 79 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 80 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 81 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 82 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 83 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 84 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 85 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 86 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 87 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 88 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 89 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 90 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 91 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 92 |
+
|
| 93 |
+
### Tensor Data Offset
|
| 94 |
+
|
| 95 |
+
This table contains the offset and data segment relative to start of file
|
| 96 |
+
|
| 97 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 98 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 99 |
+
| 0 | output.weight | 0x779880 | 0x119d0000 |
|
| 100 |
+
| 1 | output_norm.weight | 0x12149880 | 0x4000 |
|
| 101 |
+
| 2 | rope_freqs.weight | 0x1214d880 | 0x100 |
|
| 102 |
+
| 3 | token_embd.weight | 0x1214d980 | 0xd746000 |
|
| 103 |
+
| 4 | blk.0.attn_k.weight | 0x1f893980 | 0x1b8000 |
|
| 104 |
+
| 5 | blk.0.attn_norm.weight | 0x1fa4b980 | 0x4000 |
|
| 105 |
+
| 6 | blk.0.attn_output.weight | 0x1fa4f980 | 0x900000 |
|
| 106 |
+
| 7 | blk.0.attn_q.weight | 0x2034f980 | 0x6e0000 |
|
| 107 |
+
| 8 | blk.0.attn_v.weight | 0x20a2f980 | 0x240000 |
|
| 108 |
+
| 9 | blk.0.ffn_down.weight | 0x20c6f980 | 0x2680000 |
|
| 109 |
+
| 10 | blk.0.ffn_gate.weight | 0x232ef980 | 0x1810000 |
|
| 110 |
+
| 11 | blk.0.ffn_norm.weight | 0x24aff980 | 0x4000 |
|
| 111 |
+
| 12 | blk.0.ffn_up.weight | 0x24b03980 | 0x1810000 |
|
| 112 |
+
| 13 | blk.1.attn_k.weight | 0x26313980 | 0x1b8000 |
|
| 113 |
+
| 14 | blk.1.attn_norm.weight | 0x264cb980 | 0x4000 |
|
| 114 |
+
| 15 | blk.1.attn_output.weight | 0x264cf980 | 0x900000 |
|
| 115 |
+
| 16 | blk.1.attn_q.weight | 0x26dcf980 | 0x6e0000 |
|
| 116 |
+
| 17 | blk.1.attn_v.weight | 0x274af980 | 0x240000 |
|
| 117 |
+
| 18 | blk.1.ffn_down.weight | 0x276ef980 | 0x2680000 |
|
| 118 |
+
| 19 | blk.1.ffn_gate.weight | 0x29d6f980 | 0x1810000 |
|
| 119 |
+
| 20 | blk.1.ffn_norm.weight | 0x2b57f980 | 0x4000 |
|
| 120 |
+
| 21 | blk.1.ffn_up.weight | 0x2b583980 | 0x1810000 |
|
| 121 |
+
| 22 | blk.2.attn_k.weight | 0x2cd93980 | 0x1b8000 |
|
| 122 |
+
| 23 | blk.2.attn_norm.weight | 0x2cf4b980 | 0x4000 |
|
| 123 |
+
| 24 | blk.2.attn_output.weight | 0x2cf4f980 | 0x900000 |
|
| 124 |
+
| 25 | blk.2.attn_q.weight | 0x2d84f980 | 0x6e0000 |
|
| 125 |
+
| 26 | blk.2.attn_v.weight | 0x2df2f980 | 0x240000 |
|
| 126 |
+
| 27 | blk.2.ffn_down.weight | 0x2e16f980 | 0x2680000 |
|
| 127 |
+
| 28 | blk.2.ffn_gate.weight | 0x307ef980 | 0x1810000 |
|
| 128 |
+
| 29 | blk.2.ffn_norm.weight | 0x31fff980 | 0x4000 |
|
| 129 |
+
| 30 | blk.2.ffn_up.weight | 0x32003980 | 0x1810000 |
|
| 130 |
+
| 31 | blk.3.attn_k.weight | 0x33813980 | 0x1b8000 |
|
| 131 |
+
| 32 | blk.3.attn_norm.weight | 0x339cb980 | 0x4000 |
|
| 132 |
+
| 33 | blk.3.attn_output.weight | 0x339cf980 | 0x900000 |
|
| 133 |
+
| 34 | blk.3.attn_q.weight | 0x342cf980 | 0x6e0000 |
|
| 134 |
+
| 35 | blk.3.attn_v.weight | 0x349af980 | 0x240000 |
|
| 135 |
+
| 36 | blk.3.ffn_down.weight | 0x34bef980 | 0x2680000 |
|
| 136 |
+
| 37 | blk.3.ffn_gate.weight | 0x3726f980 | 0x1810000 |
|
| 137 |
+
| 38 | blk.3.ffn_norm.weight | 0x38a7f980 | 0x4000 |
|
| 138 |
+
| 39 | blk.3.ffn_up.weight | 0x38a83980 | 0x1810000 |
|
| 139 |
+
| 40 | blk.4.attn_k.weight | 0x3a293980 | 0x1b8000 |
|
| 140 |
+
| 41 | blk.4.attn_norm.weight | 0x3a44b980 | 0x4000 |
|
| 141 |
+
| 42 | blk.4.attn_output.weight | 0x3a44f980 | 0x900000 |
|
| 142 |
+
| 43 | blk.4.attn_q.weight | 0x3ad4f980 | 0x6e0000 |
|
| 143 |
+
| 44 | blk.4.attn_v.weight | 0x3b42f980 | 0x240000 |
|
| 144 |
+
| 45 | blk.4.ffn_down.weight | 0x3b66f980 | 0x2680000 |
|
| 145 |
+
| 46 | blk.4.ffn_gate.weight | 0x3dcef980 | 0x1810000 |
|
| 146 |
+
| 47 | blk.4.ffn_norm.weight | 0x3f4ff980 | 0x4000 |
|
| 147 |
+
| 48 | blk.4.ffn_up.weight | 0x3f503980 | 0x1810000 |
|
| 148 |
+
| 49 | blk.5.attn_k.weight | 0x40d13980 | 0x1b8000 |
|
| 149 |
+
| 50 | blk.5.attn_norm.weight | 0x40ecb980 | 0x4000 |
|
| 150 |
+
| 51 | blk.5.attn_output.weight | 0x40ecf980 | 0x900000 |
|
| 151 |
+
| 52 | blk.5.attn_q.weight | 0x417cf980 | 0x6e0000 |
|
| 152 |
+
| 53 | blk.5.attn_v.weight | 0x41eaf980 | 0x240000 |
|
| 153 |
+
| 54 | blk.5.ffn_down.weight | 0x420ef980 | 0x2680000 |
|
| 154 |
+
| 55 | blk.5.ffn_gate.weight | 0x4476f980 | 0x1810000 |
|
| 155 |
+
| 56 | blk.5.ffn_norm.weight | 0x45f7f980 | 0x4000 |
|
| 156 |
+
| 57 | blk.5.ffn_up.weight | 0x45f83980 | 0x1810000 |
|
| 157 |
+
| 58 | blk.6.attn_k.weight | 0x47793980 | 0x1b8000 |
|
| 158 |
+
| 59 | blk.6.attn_norm.weight | 0x4794b980 | 0x4000 |
|
| 159 |
+
| 60 | blk.6.attn_output.weight | 0x4794f980 | 0x900000 |
|
| 160 |
+
| 61 | blk.6.attn_q.weight | 0x4824f980 | 0x6e0000 |
|
| 161 |
+
| 62 | blk.6.attn_v.weight | 0x4892f980 | 0x240000 |
|
| 162 |
+
| 63 | blk.6.ffn_down.weight | 0x48b6f980 | 0x2680000 |
|
| 163 |
+
| 64 | blk.6.ffn_gate.weight | 0x4b1ef980 | 0x1810000 |
|
| 164 |
+
| 65 | blk.6.ffn_norm.weight | 0x4c9ff980 | 0x4000 |
|
| 165 |
+
| 66 | blk.6.ffn_up.weight | 0x4ca03980 | 0x1810000 |
|
| 166 |
+
| 67 | blk.7.attn_k.weight | 0x4e213980 | 0x1b8000 |
|
| 167 |
+
| 68 | blk.7.attn_norm.weight | 0x4e3cb980 | 0x4000 |
|
| 168 |
+
| 69 | blk.7.attn_output.weight | 0x4e3cf980 | 0x900000 |
|
| 169 |
+
| 70 | blk.7.attn_q.weight | 0x4eccf980 | 0x6e0000 |
|
| 170 |
+
| 71 | blk.7.attn_v.weight | 0x4f3af980 | 0x240000 |
|
| 171 |
+
| 72 | blk.7.ffn_down.weight | 0x4f5ef980 | 0x2680000 |
|
| 172 |
+
| 73 | blk.7.ffn_gate.weight | 0x51c6f980 | 0x1810000 |
|
| 173 |
+
| 74 | blk.7.ffn_norm.weight | 0x5347f980 | 0x4000 |
|
| 174 |
+
| 75 | blk.7.ffn_up.weight | 0x53483980 | 0x1810000 |
|
| 175 |
+
| 76 | blk.8.attn_k.weight | 0x54c93980 | 0x1b8000 |
|
| 176 |
+
| 77 | blk.8.attn_norm.weight | 0x54e4b980 | 0x4000 |
|
| 177 |
+
| 78 | blk.8.attn_output.weight | 0x54e4f980 | 0x900000 |
|
| 178 |
+
| 79 | blk.8.attn_q.weight | 0x5574f980 | 0x6e0000 |
|
| 179 |
+
| 80 | blk.8.attn_v.weight | 0x55e2f980 | 0x240000 |
|
| 180 |
+
| 81 | blk.8.ffn_down.weight | 0x5606f980 | 0x2680000 |
|
| 181 |
+
| 82 | blk.8.ffn_gate.weight | 0x586ef980 | 0x1810000 |
|
| 182 |
+
| 83 | blk.8.ffn_norm.weight | 0x59eff980 | 0x4000 |
|
| 183 |
+
| 84 | blk.8.ffn_up.weight | 0x59f03980 | 0x1810000 |
|
| 184 |
+
| 85 | blk.9.attn_k.weight | 0x5b713980 | 0x1b8000 |
|
| 185 |
+
| 86 | blk.9.attn_norm.weight | 0x5b8cb980 | 0x4000 |
|
| 186 |
+
| 87 | blk.9.attn_output.weight | 0x5b8cf980 | 0x900000 |
|
| 187 |
+
| 88 | blk.9.attn_q.weight | 0x5c1cf980 | 0x6e0000 |
|
| 188 |
+
| 89 | blk.9.attn_v.weight | 0x5c8af980 | 0x240000 |
|
| 189 |
+
| 90 | blk.9.ffn_down.weight | 0x5caef980 | 0x2680000 |
|
| 190 |
+
| 91 | blk.9.ffn_gate.weight | 0x5f16f980 | 0x1810000 |
|
| 191 |
+
| 92 | blk.9.ffn_norm.weight | 0x6097f980 | 0x4000 |
|
| 192 |
+
| 93 | blk.9.ffn_up.weight | 0x60983980 | 0x1810000 |
|
| 193 |
+
| 94 | blk.10.attn_k.weight | 0x62193980 | 0x1b8000 |
|
| 194 |
+
| 95 | blk.10.attn_norm.weight | 0x6234b980 | 0x4000 |
|
| 195 |
+
| 96 | blk.10.attn_output.weight | 0x6234f980 | 0x900000 |
|
| 196 |
+
| 97 | blk.10.attn_q.weight | 0x62c4f980 | 0x6e0000 |
|
| 197 |
+
| 98 | blk.10.attn_v.weight | 0x6332f980 | 0x240000 |
|
| 198 |
+
| 99 | blk.10.ffn_down.weight | 0x6356f980 | 0x2680000 |
|
| 199 |
+
| 100 | blk.10.ffn_gate.weight | 0x65bef980 | 0x1810000 |
|
| 200 |
+
| 101 | blk.10.ffn_norm.weight | 0x673ff980 | 0x4000 |
|
| 201 |
+
| 102 | blk.10.ffn_up.weight | 0x67403980 | 0x1810000 |
|
| 202 |
+
| 103 | blk.11.attn_k.weight | 0x68c13980 | 0x1b8000 |
|
| 203 |
+
| 104 | blk.11.attn_norm.weight | 0x68dcb980 | 0x4000 |
|
| 204 |
+
| 105 | blk.11.attn_output.weight | 0x68dcf980 | 0x900000 |
|
| 205 |
+
| 106 | blk.11.attn_q.weight | 0x696cf980 | 0x6e0000 |
|
| 206 |
+
| 107 | blk.11.attn_v.weight | 0x69daf980 | 0x240000 |
|
| 207 |
+
| 108 | blk.11.ffn_down.weight | 0x69fef980 | 0x2680000 |
|
| 208 |
+
| 109 | blk.11.ffn_gate.weight | 0x6c66f980 | 0x1810000 |
|
| 209 |
+
| 110 | blk.11.ffn_norm.weight | 0x6de7f980 | 0x4000 |
|
| 210 |
+
| 111 | blk.11.ffn_up.weight | 0x6de83980 | 0x1810000 |
|
| 211 |
+
| 112 | blk.12.attn_k.weight | 0x6f693980 | 0x1b8000 |
|
| 212 |
+
| 113 | blk.12.attn_norm.weight | 0x6f84b980 | 0x4000 |
|
| 213 |
+
| 114 | blk.12.attn_output.weight | 0x6f84f980 | 0x900000 |
|
| 214 |
+
| 115 | blk.12.attn_q.weight | 0x7014f980 | 0x6e0000 |
|
| 215 |
+
| 116 | blk.12.attn_v.weight | 0x7082f980 | 0x240000 |
|
| 216 |
+
| 117 | blk.12.ffn_down.weight | 0x70a6f980 | 0x2680000 |
|
| 217 |
+
| 118 | blk.12.ffn_gate.weight | 0x730ef980 | 0x1810000 |
|
| 218 |
+
| 119 | blk.12.ffn_norm.weight | 0x748ff980 | 0x4000 |
|
| 219 |
+
| 120 | blk.12.ffn_up.weight | 0x74903980 | 0x1810000 |
|
| 220 |
+
| 121 | blk.13.attn_k.weight | 0x76113980 | 0x240000 |
|
| 221 |
+
| 122 | blk.13.attn_norm.weight | 0x76353980 | 0x4000 |
|
| 222 |
+
| 123 | blk.13.attn_output.weight | 0x76357980 | 0x900000 |
|
| 223 |
+
| 124 | blk.13.attn_q.weight | 0x76c57980 | 0x900000 |
|
| 224 |
+
| 125 | blk.13.attn_v.weight | 0x77557980 | 0x2c0000 |
|
| 225 |
+
| 126 | blk.13.ffn_down.weight | 0x77817980 | 0x2680000 |
|
| 226 |
+
| 127 | blk.13.ffn_gate.weight | 0x79e97980 | 0x1810000 |
|
| 227 |
+
| 128 | blk.13.ffn_norm.weight | 0x7b6a7980 | 0x4000 |
|
| 228 |
+
| 129 | blk.13.ffn_up.weight | 0x7b6ab980 | 0x1810000 |
|
| 229 |
+
| 130 | blk.14.attn_k.weight | 0x7cebb980 | 0x240000 |
|
| 230 |
+
| 131 | blk.14.attn_norm.weight | 0x7d0fb980 | 0x4000 |
|
| 231 |
+
| 132 | blk.14.attn_output.weight | 0x7d0ff980 | 0x900000 |
|
| 232 |
+
| 133 | blk.14.attn_q.weight | 0x7d9ff980 | 0x900000 |
|
| 233 |
+
| 134 | blk.14.attn_v.weight | 0x7e2ff980 | 0x2c0000 |
|
| 234 |
+
| 135 | blk.14.ffn_down.weight | 0x7e5bf980 | 0x2680000 |
|
| 235 |
+
| 136 | blk.14.ffn_gate.weight | 0x80c3f980 | 0x1810000 |
|
| 236 |
+
| 137 | blk.14.ffn_norm.weight | 0x8244f980 | 0x4000 |
|
| 237 |
+
| 138 | blk.14.ffn_up.weight | 0x82453980 | 0x1810000 |
|
| 238 |
+
| 139 | blk.15.attn_k.weight | 0x83c63980 | 0x1b8000 |
|
| 239 |
+
| 140 | blk.15.attn_norm.weight | 0x83e1b980 | 0x4000 |
|
| 240 |
+
| 141 | blk.15.attn_output.weight | 0x83e1f980 | 0x900000 |
|
| 241 |
+
| 142 | blk.15.attn_q.weight | 0x8471f980 | 0x6e0000 |
|
| 242 |
+
| 143 | blk.15.attn_v.weight | 0x84dff980 | 0x240000 |
|
| 243 |
+
| 144 | blk.15.ffn_down.weight | 0x8503f980 | 0x2680000 |
|
| 244 |
+
| 145 | blk.15.ffn_gate.weight | 0x876bf980 | 0x1810000 |
|
| 245 |
+
| 146 | blk.15.ffn_norm.weight | 0x88ecf980 | 0x4000 |
|
| 246 |
+
| 147 | blk.15.ffn_up.weight | 0x88ed3980 | 0x1810000 |
|
| 247 |
+
| 148 | blk.16.attn_k.weight | 0x8a6e3980 | 0x240000 |
|
| 248 |
+
| 149 | blk.16.attn_norm.weight | 0x8a923980 | 0x4000 |
|
| 249 |
+
| 150 | blk.16.attn_output.weight | 0x8a927980 | 0x900000 |
|
| 250 |
+
| 151 | blk.16.attn_q.weight | 0x8b227980 | 0x900000 |
|
| 251 |
+
| 152 | blk.16.attn_v.weight | 0x8bb27980 | 0x2c0000 |
|
| 252 |
+
| 153 | blk.16.ffn_down.weight | 0x8bde7980 | 0x2680000 |
|
| 253 |
+
| 154 | blk.16.ffn_gate.weight | 0x8e467980 | 0x1f80000 |
|
| 254 |
+
| 155 | blk.16.ffn_norm.weight | 0x903e7980 | 0x4000 |
|
| 255 |
+
| 156 | blk.16.ffn_up.weight | 0x903eb980 | 0x1f80000 |
|
| 256 |
+
| 157 | blk.17.attn_k.weight | 0x9236b980 | 0x1b8000 |
|
| 257 |
+
| 158 | blk.17.attn_norm.weight | 0x92523980 | 0x4000 |
|
| 258 |
+
| 159 | blk.17.attn_output.weight | 0x92527980 | 0x900000 |
|
| 259 |
+
| 160 | blk.17.attn_q.weight | 0x92e27980 | 0x6e0000 |
|
| 260 |
+
| 161 | blk.17.attn_v.weight | 0x93507980 | 0x240000 |
|
| 261 |
+
| 162 | blk.17.ffn_down.weight | 0x93747980 | 0x2680000 |
|
| 262 |
+
| 163 | blk.17.ffn_gate.weight | 0x95dc7980 | 0x1f80000 |
|
| 263 |
+
| 164 | blk.17.ffn_norm.weight | 0x97d47980 | 0x4000 |
|
| 264 |
+
| 165 | blk.17.ffn_up.weight | 0x97d4b980 | 0x1f80000 |
|
| 265 |
+
| 166 | blk.18.attn_k.weight | 0x99ccb980 | 0x240000 |
|
| 266 |
+
| 167 | blk.18.attn_norm.weight | 0x99f0b980 | 0x4000 |
|
| 267 |
+
| 168 | blk.18.attn_output.weight | 0x99f0f980 | 0x900000 |
|
| 268 |
+
| 169 | blk.18.attn_q.weight | 0x9a80f980 | 0x900000 |
|
| 269 |
+
| 170 | blk.18.attn_v.weight | 0x9b10f980 | 0x2c0000 |
|
| 270 |
+
| 171 | blk.18.ffn_down.weight | 0x9b3cf980 | 0x2680000 |
|
| 271 |
+
| 172 | blk.18.ffn_gate.weight | 0x9da4f980 | 0x1f80000 |
|
| 272 |
+
| 173 | blk.18.ffn_norm.weight | 0x9f9cf980 | 0x4000 |
|
| 273 |
+
| 174 | blk.18.ffn_up.weight | 0x9f9d3980 | 0x1f80000 |
|
| 274 |
+
| 175 | blk.19.attn_k.weight | 0xa1953980 | 0x240000 |
|
| 275 |
+
| 176 | blk.19.attn_norm.weight | 0xa1b93980 | 0x4000 |
|
| 276 |
+
| 177 | blk.19.attn_output.weight | 0xa1b97980 | 0x900000 |
|
| 277 |
+
| 178 | blk.19.attn_q.weight | 0xa2497980 | 0x900000 |
|
| 278 |
+
| 179 | blk.19.attn_v.weight | 0xa2d97980 | 0x2c0000 |
|
| 279 |
+
| 180 | blk.19.ffn_down.weight | 0xa3057980 | 0x2680000 |
|
| 280 |
+
| 181 | blk.19.ffn_gate.weight | 0xa56d7980 | 0x1f80000 |
|
| 281 |
+
| 182 | blk.19.ffn_norm.weight | 0xa7657980 | 0x4000 |
|
| 282 |
+
| 183 | blk.19.ffn_up.weight | 0xa765b980 | 0x1f80000 |
|
| 283 |
+
| 184 | blk.20.attn_k.weight | 0xa95db980 | 0x240000 |
|
| 284 |
+
| 185 | blk.20.attn_norm.weight | 0xa981b980 | 0x4000 |
|
| 285 |
+
| 186 | blk.20.attn_output.weight | 0xa981f980 | 0x900000 |
|
| 286 |
+
| 187 | blk.20.attn_q.weight | 0xaa11f980 | 0x900000 |
|
| 287 |
+
| 188 | blk.20.attn_v.weight | 0xaaa1f980 | 0x2c0000 |
|
| 288 |
+
| 189 | blk.20.ffn_down.weight | 0xaacdf980 | 0x2680000 |
|
| 289 |
+
| 190 | blk.20.ffn_gate.weight | 0xad35f980 | 0x1f80000 |
|
| 290 |
+
| 191 | blk.20.ffn_norm.weight | 0xaf2df980 | 0x4000 |
|
| 291 |
+
| 192 | blk.20.ffn_up.weight | 0xaf2e3980 | 0x1f80000 |
|
| 292 |
+
| 193 | blk.21.attn_k.weight | 0xb1263980 | 0x240000 |
|
| 293 |
+
| 194 | blk.21.attn_norm.weight | 0xb14a3980 | 0x4000 |
|
| 294 |
+
| 195 | blk.21.attn_output.weight | 0xb14a7980 | 0x900000 |
|
| 295 |
+
| 196 | blk.21.attn_q.weight | 0xb1da7980 | 0x900000 |
|
| 296 |
+
| 197 | blk.21.attn_v.weight | 0xb26a7980 | 0x2c0000 |
|
| 297 |
+
| 198 | blk.21.ffn_down.weight | 0xb2967980 | 0x2680000 |
|
| 298 |
+
| 199 | blk.21.ffn_gate.weight | 0xb4fe7980 | 0x1f80000 |
|
| 299 |
+
| 200 | blk.21.ffn_norm.weight | 0xb6f67980 | 0x4000 |
|
| 300 |
+
| 201 | blk.21.ffn_up.weight | 0xb6f6b980 | 0x1f80000 |
|
| 301 |
+
| 202 | blk.22.attn_k.weight | 0xb8eeb980 | 0x240000 |
|
| 302 |
+
| 203 | blk.22.attn_norm.weight | 0xb912b980 | 0x4000 |
|
| 303 |
+
| 204 | blk.22.attn_output.weight | 0xb912f980 | 0x900000 |
|
| 304 |
+
| 205 | blk.22.attn_q.weight | 0xb9a2f980 | 0x900000 |
|
| 305 |
+
| 206 | blk.22.attn_v.weight | 0xba32f980 | 0x2c0000 |
|
| 306 |
+
| 207 | blk.22.ffn_down.weight | 0xba5ef980 | 0x2680000 |
|
| 307 |
+
| 208 | blk.22.ffn_gate.weight | 0xbcc6f980 | 0x1f80000 |
|
| 308 |
+
| 209 | blk.22.ffn_norm.weight | 0xbebef980 | 0x4000 |
|
| 309 |
+
| 210 | blk.22.ffn_up.weight | 0xbebf3980 | 0x1f80000 |
|
| 310 |
+
| 211 | blk.23.attn_k.weight | 0xc0b73980 | 0x240000 |
|
| 311 |
+
| 212 | blk.23.attn_norm.weight | 0xc0db3980 | 0x4000 |
|
| 312 |
+
| 213 | blk.23.attn_output.weight | 0xc0db7980 | 0x900000 |
|
| 313 |
+
| 214 | blk.23.attn_q.weight | 0xc16b7980 | 0x900000 |
|
| 314 |
+
| 215 | blk.23.attn_v.weight | 0xc1fb7980 | 0x2c0000 |
|
| 315 |
+
| 216 | blk.23.ffn_down.weight | 0xc2277980 | 0x2680000 |
|
| 316 |
+
| 217 | blk.23.ffn_gate.weight | 0xc48f7980 | 0x1f80000 |
|
| 317 |
+
| 218 | blk.23.ffn_norm.weight | 0xc6877980 | 0x4000 |
|
| 318 |
+
| 219 | blk.23.ffn_up.weight | 0xc687b980 | 0x1f80000 |
|
| 319 |
+
| 220 | blk.24.attn_k.weight | 0xc87fb980 | 0x240000 |
|
| 320 |
+
| 221 | blk.24.attn_norm.weight | 0xc8a3b980 | 0x4000 |
|
| 321 |
+
| 222 | blk.24.attn_output.weight | 0xc8a3f980 | 0x900000 |
|
| 322 |
+
| 223 | blk.24.attn_q.weight | 0xc933f980 | 0x900000 |
|
| 323 |
+
| 224 | blk.24.attn_v.weight | 0xc9c3f980 | 0x2c0000 |
|
| 324 |
+
| 225 | blk.24.ffn_down.weight | 0xc9eff980 | 0x2680000 |
|
| 325 |
+
| 226 | blk.24.ffn_gate.weight | 0xcc57f980 | 0x1f80000 |
|
| 326 |
+
| 227 | blk.24.ffn_norm.weight | 0xce4ff980 | 0x4000 |
|
| 327 |
+
| 228 | blk.24.ffn_up.weight | 0xce503980 | 0x1f80000 |
|
| 328 |
+
| 229 | blk.25.attn_k.weight | 0xd0483980 | 0x240000 |
|
| 329 |
+
| 230 | blk.25.attn_norm.weight | 0xd06c3980 | 0x4000 |
|
| 330 |
+
| 231 | blk.25.attn_output.weight | 0xd06c7980 | 0x900000 |
|
| 331 |
+
| 232 | blk.25.attn_q.weight | 0xd0fc7980 | 0x900000 |
|
| 332 |
+
| 233 | blk.25.attn_v.weight | 0xd18c7980 | 0x2c0000 |
|
| 333 |
+
| 234 | blk.25.ffn_down.weight | 0xd1b87980 | 0x2680000 |
|
| 334 |
+
| 235 | blk.25.ffn_gate.weight | 0xd4207980 | 0x1f80000 |
|
| 335 |
+
| 236 | blk.25.ffn_norm.weight | 0xd6187980 | 0x4000 |
|
| 336 |
+
| 237 | blk.25.ffn_up.weight | 0xd618b980 | 0x1f80000 |
|
| 337 |
+
| 238 | blk.26.attn_k.weight | 0xd810b980 | 0x240000 |
|
| 338 |
+
| 239 | blk.26.attn_norm.weight | 0xd834b980 | 0x4000 |
|
| 339 |
+
| 240 | blk.26.attn_output.weight | 0xd834f980 | 0x900000 |
|
| 340 |
+
| 241 | blk.26.attn_q.weight | 0xd8c4f980 | 0x900000 |
|
| 341 |
+
| 242 | blk.26.attn_v.weight | 0xd954f980 | 0x2c0000 |
|
| 342 |
+
| 243 | blk.26.ffn_down.weight | 0xd980f980 | 0x2680000 |
|
| 343 |
+
| 244 | blk.26.ffn_gate.weight | 0xdbe8f980 | 0x1f80000 |
|
| 344 |
+
| 245 | blk.26.ffn_norm.weight | 0xdde0f980 | 0x4000 |
|
| 345 |
+
| 246 | blk.26.ffn_up.weight | 0xdde13980 | 0x1f80000 |
|
| 346 |
+
| 247 | blk.27.attn_k.weight | 0xdfd93980 | 0x240000 |
|
| 347 |
+
| 248 | blk.27.attn_norm.weight | 0xdffd3980 | 0x4000 |
|
| 348 |
+
| 249 | blk.27.attn_output.weight | 0xdffd7980 | 0x900000 |
|
| 349 |
+
| 250 | blk.27.attn_q.weight | 0xe08d7980 | 0x900000 |
|
| 350 |
+
| 251 | blk.27.attn_v.weight | 0xe11d7980 | 0x2c0000 |
|
| 351 |
+
| 252 | blk.27.ffn_down.weight | 0xe1497980 | 0x2680000 |
|
| 352 |
+
| 253 | blk.27.ffn_gate.weight | 0xe3b17980 | 0x1f80000 |
|
| 353 |
+
| 254 | blk.27.ffn_norm.weight | 0xe5a97980 | 0x4000 |
|
| 354 |
+
| 255 | blk.27.ffn_up.weight | 0xe5a9b980 | 0x1f80000 |
|
| 355 |
+
| 256 | blk.28.attn_k.weight | 0xe7a1b980 | 0x240000 |
|
| 356 |
+
| 257 | blk.28.attn_norm.weight | 0xe7c5b980 | 0x4000 |
|
| 357 |
+
| 258 | blk.28.attn_output.weight | 0xe7c5f980 | 0x900000 |
|
| 358 |
+
| 259 | blk.28.attn_q.weight | 0xe855f980 | 0x900000 |
|
| 359 |
+
| 260 | blk.28.attn_v.weight | 0xe8e5f980 | 0x2c0000 |
|
| 360 |
+
| 261 | blk.28.ffn_down.weight | 0xe911f980 | 0x2680000 |
|
| 361 |
+
| 262 | blk.28.ffn_gate.weight | 0xeb79f980 | 0x1f80000 |
|
| 362 |
+
| 263 | blk.28.ffn_norm.weight | 0xed71f980 | 0x4000 |
|
| 363 |
+
| 264 | blk.28.ffn_up.weight | 0xed723980 | 0x1f80000 |
|
| 364 |
+
| 265 | blk.29.attn_k.weight | 0xef6a3980 | 0x240000 |
|
| 365 |
+
| 266 | blk.29.attn_norm.weight | 0xef8e3980 | 0x4000 |
|
| 366 |
+
| 267 | blk.29.attn_output.weight | 0xef8e7980 | 0x900000 |
|
| 367 |
+
| 268 | blk.29.attn_q.weight | 0xf01e7980 | 0x900000 |
|
| 368 |
+
| 269 | blk.29.attn_v.weight | 0xf0ae7980 | 0x2c0000 |
|
| 369 |
+
| 270 | blk.29.ffn_down.weight | 0xf0da7980 | 0x2680000 |
|
| 370 |
+
| 271 | blk.29.ffn_gate.weight | 0xf3427980 | 0x1f80000 |
|
| 371 |
+
| 272 | blk.29.ffn_norm.weight | 0xf53a7980 | 0x4000 |
|
| 372 |
+
| 273 | blk.29.ffn_up.weight | 0xf53ab980 | 0x1f80000 |
|
| 373 |
+
| 274 | blk.30.attn_k.weight | 0xf732b980 | 0x240000 |
|
| 374 |
+
| 275 | blk.30.attn_norm.weight | 0xf756b980 | 0x4000 |
|
| 375 |
+
| 276 | blk.30.attn_output.weight | 0xf756f980 | 0x900000 |
|
| 376 |
+
| 277 | blk.30.attn_q.weight | 0xf7e6f980 | 0x900000 |
|
| 377 |
+
| 278 | blk.30.attn_v.weight | 0xf876f980 | 0x2c0000 |
|
| 378 |
+
| 279 | blk.30.ffn_down.weight | 0xf8a2f980 | 0x2680000 |
|
| 379 |
+
| 280 | blk.30.ffn_gate.weight | 0xfb0af980 | 0x1f80000 |
|
| 380 |
+
| 281 | blk.30.ffn_norm.weight | 0xfd02f980 | 0x4000 |
|
| 381 |
+
| 282 | blk.30.ffn_up.weight | 0xfd033980 | 0x1f80000 |
|
| 382 |
+
| 283 | blk.31.attn_k.weight | 0xfefb3980 | 0x1b8000 |
|
| 383 |
+
| 284 | blk.31.attn_norm.weight | 0xff16b980 | 0x4000 |
|
| 384 |
+
| 285 | blk.31.attn_output.weight | 0xff16f980 | 0x900000 |
|
| 385 |
+
| 286 | blk.31.attn_q.weight | 0xffa6f980 | 0x6e0000 |
|
| 386 |
+
| 287 | blk.31.attn_v.weight | 0x10014f980 | 0x240000 |
|
| 387 |
+
| 288 | blk.31.ffn_down.weight | 0x10038f980 | 0x2680000 |
|
| 388 |
+
| 289 | blk.31.ffn_gate.weight | 0x102a0f980 | 0x1f80000 |
|
| 389 |
+
| 290 | blk.31.ffn_norm.weight | 0x10498f980 | 0x4000 |
|
| 390 |
+
| 291 | blk.31.ffn_up.weight | 0x104993980 | 0x1f80000 |
|
| 391 |
+
|
| 392 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 393 |
+
|
| 394 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 395 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:-----|
|
| 396 |
+
| 0 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q4_K |
|
| 397 |
+
| 1 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 398 |
+
| 2 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 399 |
+
| 3 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q3_K |
|
| 400 |
+
|
| 401 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 402 |
+
- Percentage of total elements: 13.08%
|
| 403 |
+
|
| 404 |
+
|
| 405 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 406 |
+
|
| 407 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 408 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 409 |
+
| 4 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 410 |
+
| 5 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 411 |
+
| 6 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 412 |
+
| 7 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 413 |
+
| 8 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 414 |
+
| 9 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 415 |
+
| 10 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 416 |
+
| 11 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 417 |
+
| 12 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 418 |
+
|
| 419 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 420 |
+
- Percentage of total elements: 2.72%
|
| 421 |
+
|
| 422 |
+
|
| 423 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 424 |
+
|
| 425 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 426 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 427 |
+
| 13 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 428 |
+
| 14 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 429 |
+
| 15 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 430 |
+
| 16 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 431 |
+
| 17 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 432 |
+
| 18 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 433 |
+
| 19 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 434 |
+
| 20 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 435 |
+
| 21 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 436 |
+
|
| 437 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 438 |
+
- Percentage of total elements: 2.72%
|
| 439 |
+
|
| 440 |
+
|
| 441 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 442 |
+
|
| 443 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 444 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 445 |
+
| 22 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 446 |
+
| 23 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 447 |
+
| 24 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 448 |
+
| 25 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 449 |
+
| 26 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 450 |
+
| 27 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 451 |
+
| 28 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 452 |
+
| 29 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 453 |
+
| 30 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 454 |
+
|
| 455 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 456 |
+
- Percentage of total elements: 2.72%
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 460 |
+
|
| 461 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 462 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 463 |
+
| 31 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 464 |
+
| 32 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 465 |
+
| 33 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 466 |
+
| 34 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 467 |
+
| 35 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 468 |
+
| 36 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 469 |
+
| 37 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 470 |
+
| 38 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 471 |
+
| 39 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 472 |
+
|
| 473 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 474 |
+
- Percentage of total elements: 2.72%
|
| 475 |
+
|
| 476 |
+
|
| 477 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 478 |
+
|
| 479 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 480 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 481 |
+
| 40 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 482 |
+
| 41 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 483 |
+
| 42 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 484 |
+
| 43 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 485 |
+
| 44 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 486 |
+
| 45 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 487 |
+
| 46 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 488 |
+
| 47 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 489 |
+
| 48 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 490 |
+
|
| 491 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 492 |
+
- Percentage of total elements: 2.72%
|
| 493 |
+
|
| 494 |
+
|
| 495 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 496 |
+
|
| 497 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 498 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 499 |
+
| 49 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 500 |
+
| 50 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 501 |
+
| 51 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 502 |
+
| 52 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 503 |
+
| 53 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 504 |
+
| 54 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 505 |
+
| 55 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 506 |
+
| 56 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 507 |
+
| 57 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 508 |
+
|
| 509 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 510 |
+
- Percentage of total elements: 2.72%
|
| 511 |
+
|
| 512 |
+
|
| 513 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 514 |
+
|
| 515 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 516 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 517 |
+
| 58 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 518 |
+
| 59 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 519 |
+
| 60 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 520 |
+
| 61 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 521 |
+
| 62 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 522 |
+
| 63 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 523 |
+
| 64 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 524 |
+
| 65 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 525 |
+
| 66 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 526 |
+
|
| 527 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 528 |
+
- Percentage of total elements: 2.72%
|
| 529 |
+
|
| 530 |
+
|
| 531 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 532 |
+
|
| 533 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 534 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 535 |
+
| 67 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 536 |
+
| 68 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 537 |
+
| 69 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 538 |
+
| 70 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 539 |
+
| 71 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 540 |
+
| 72 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 541 |
+
| 73 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 542 |
+
| 74 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 543 |
+
| 75 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 544 |
+
|
| 545 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 546 |
+
- Percentage of total elements: 2.72%
|
| 547 |
+
|
| 548 |
+
|
| 549 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 550 |
+
|
| 551 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 552 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 553 |
+
| 76 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 554 |
+
| 77 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 555 |
+
| 78 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 556 |
+
| 79 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 557 |
+
| 80 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 558 |
+
| 81 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 559 |
+
| 82 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 560 |
+
| 83 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 561 |
+
| 84 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 562 |
+
|
| 563 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 564 |
+
- Percentage of total elements: 2.72%
|
| 565 |
+
|
| 566 |
+
|
| 567 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 568 |
+
|
| 569 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 570 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 571 |
+
| 85 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 572 |
+
| 86 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 573 |
+
| 87 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 574 |
+
| 88 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 575 |
+
| 89 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 576 |
+
| 90 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 577 |
+
| 91 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 578 |
+
| 92 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 579 |
+
| 93 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 580 |
+
|
| 581 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 582 |
+
- Percentage of total elements: 2.72%
|
| 583 |
+
|
| 584 |
+
|
| 585 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 586 |
+
|
| 587 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 588 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 589 |
+
| 94 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 590 |
+
| 95 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 591 |
+
| 96 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 592 |
+
| 97 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 593 |
+
| 98 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 594 |
+
| 99 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 595 |
+
| 100 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 596 |
+
| 101 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 597 |
+
| 102 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 598 |
+
|
| 599 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 600 |
+
- Percentage of total elements: 2.72%
|
| 601 |
+
|
| 602 |
+
|
| 603 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 604 |
+
|
| 605 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 606 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 607 |
+
| 103 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 608 |
+
| 104 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 609 |
+
| 105 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 610 |
+
| 106 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 611 |
+
| 107 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 612 |
+
| 108 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 613 |
+
| 109 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 614 |
+
| 110 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 615 |
+
| 111 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 616 |
+
|
| 617 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 618 |
+
- Percentage of total elements: 2.72%
|
| 619 |
+
|
| 620 |
+
|
| 621 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 622 |
+
|
| 623 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 624 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 625 |
+
| 112 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 626 |
+
| 113 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 627 |
+
| 114 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 628 |
+
| 115 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 629 |
+
| 116 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 630 |
+
| 117 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 631 |
+
| 118 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 632 |
+
| 119 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 633 |
+
| 120 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 634 |
+
|
| 635 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 636 |
+
- Percentage of total elements: 2.72%
|
| 637 |
+
|
| 638 |
+
|
| 639 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 640 |
+
|
| 641 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 642 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 643 |
+
| 121 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 644 |
+
| 122 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 645 |
+
| 123 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 646 |
+
| 124 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 647 |
+
| 125 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 648 |
+
| 126 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 649 |
+
| 127 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 650 |
+
| 128 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 651 |
+
| 129 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 652 |
+
|
| 653 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 654 |
+
- Percentage of total elements: 2.72%
|
| 655 |
+
|
| 656 |
+
|
| 657 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 658 |
+
|
| 659 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 660 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 661 |
+
| 130 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 662 |
+
| 131 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 663 |
+
| 132 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 664 |
+
| 133 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 665 |
+
| 134 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 666 |
+
| 135 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 667 |
+
| 136 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 668 |
+
| 137 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 669 |
+
| 138 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 670 |
+
|
| 671 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 672 |
+
- Percentage of total elements: 2.72%
|
| 673 |
+
|
| 674 |
+
|
| 675 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 676 |
+
|
| 677 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 678 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 679 |
+
| 139 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 680 |
+
| 140 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 681 |
+
| 141 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 682 |
+
| 142 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 683 |
+
| 143 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 684 |
+
| 144 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 685 |
+
| 145 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 686 |
+
| 146 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 687 |
+
| 147 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 688 |
+
|
| 689 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 690 |
+
- Percentage of total elements: 2.72%
|
| 691 |
+
|
| 692 |
+
|
| 693 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 694 |
+
|
| 695 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 696 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 697 |
+
| 148 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 698 |
+
| 149 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 699 |
+
| 150 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 700 |
+
| 151 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 701 |
+
| 152 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 702 |
+
| 153 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 703 |
+
| 154 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 704 |
+
| 155 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 705 |
+
| 156 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 706 |
+
|
| 707 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 708 |
+
- Percentage of total elements: 2.72%
|
| 709 |
+
|
| 710 |
+
|
| 711 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 712 |
+
|
| 713 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 714 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 715 |
+
| 157 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 716 |
+
| 158 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 717 |
+
| 159 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 718 |
+
| 160 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 719 |
+
| 161 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 720 |
+
| 162 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 721 |
+
| 163 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 722 |
+
| 164 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 723 |
+
| 165 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 724 |
+
|
| 725 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 726 |
+
- Percentage of total elements: 2.72%
|
| 727 |
+
|
| 728 |
+
|
| 729 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 730 |
+
|
| 731 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 732 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 733 |
+
| 166 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 734 |
+
| 167 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 735 |
+
| 168 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 736 |
+
| 169 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 737 |
+
| 170 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 738 |
+
| 171 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 739 |
+
| 172 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 740 |
+
| 173 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 741 |
+
| 174 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 742 |
+
|
| 743 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 744 |
+
- Percentage of total elements: 2.72%
|
| 745 |
+
|
| 746 |
+
|
| 747 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 748 |
+
|
| 749 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 750 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 751 |
+
| 175 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 752 |
+
| 176 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 177 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 754 |
+
| 178 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 755 |
+
| 179 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 756 |
+
| 180 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 757 |
+
| 181 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 758 |
+
| 182 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 759 |
+
| 183 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 760 |
+
|
| 761 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 762 |
+
- Percentage of total elements: 2.72%
|
| 763 |
+
|
| 764 |
+
|
| 765 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 766 |
+
|
| 767 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 768 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 769 |
+
| 184 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 770 |
+
| 185 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 771 |
+
| 186 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 772 |
+
| 187 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 773 |
+
| 188 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 774 |
+
| 189 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 775 |
+
| 190 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 776 |
+
| 191 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 777 |
+
| 192 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 778 |
+
|
| 779 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 780 |
+
- Percentage of total elements: 2.72%
|
| 781 |
+
|
| 782 |
+
|
| 783 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 784 |
+
|
| 785 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 786 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 787 |
+
| 193 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 788 |
+
| 194 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 789 |
+
| 195 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 790 |
+
| 196 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 791 |
+
| 197 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 792 |
+
| 198 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 793 |
+
| 199 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 794 |
+
| 200 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 795 |
+
| 201 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 796 |
+
|
| 797 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 798 |
+
- Percentage of total elements: 2.72%
|
| 799 |
+
|
| 800 |
+
|
| 801 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 802 |
+
|
| 803 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 804 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 805 |
+
| 202 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 806 |
+
| 203 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 807 |
+
| 204 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 808 |
+
| 205 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 809 |
+
| 206 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 810 |
+
| 207 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 811 |
+
| 208 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 812 |
+
| 209 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 813 |
+
| 210 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 814 |
+
|
| 815 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 816 |
+
- Percentage of total elements: 2.72%
|
| 817 |
+
|
| 818 |
+
|
| 819 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 820 |
+
|
| 821 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 822 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 823 |
+
| 211 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 824 |
+
| 212 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 825 |
+
| 213 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 826 |
+
| 214 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 827 |
+
| 215 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 828 |
+
| 216 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 829 |
+
| 217 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 830 |
+
| 218 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 831 |
+
| 219 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 832 |
+
|
| 833 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 834 |
+
- Percentage of total elements: 2.72%
|
| 835 |
+
|
| 836 |
+
|
| 837 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 838 |
+
|
| 839 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 840 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 841 |
+
| 220 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 842 |
+
| 221 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 843 |
+
| 222 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 844 |
+
| 223 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 845 |
+
| 224 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 846 |
+
| 225 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 847 |
+
| 226 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 848 |
+
| 227 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 849 |
+
| 228 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 850 |
+
|
| 851 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 852 |
+
- Percentage of total elements: 2.72%
|
| 853 |
+
|
| 854 |
+
|
| 855 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 856 |
+
|
| 857 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 858 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 859 |
+
| 229 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 860 |
+
| 230 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 861 |
+
| 231 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 862 |
+
| 232 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 863 |
+
| 233 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 864 |
+
| 234 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 865 |
+
| 235 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 866 |
+
| 236 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 867 |
+
| 237 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 868 |
+
|
| 869 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 870 |
+
- Percentage of total elements: 2.72%
|
| 871 |
+
|
| 872 |
+
|
| 873 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 874 |
+
|
| 875 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 876 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 877 |
+
| 238 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 878 |
+
| 239 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 879 |
+
| 240 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 880 |
+
| 241 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 881 |
+
| 242 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 882 |
+
| 243 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 883 |
+
| 244 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 884 |
+
| 245 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 885 |
+
| 246 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 886 |
+
|
| 887 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 888 |
+
- Percentage of total elements: 2.72%
|
| 889 |
+
|
| 890 |
+
|
| 891 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 892 |
+
|
| 893 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 894 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 895 |
+
| 247 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 896 |
+
| 248 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 897 |
+
| 249 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 898 |
+
| 250 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 899 |
+
| 251 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 900 |
+
| 252 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 901 |
+
| 253 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 902 |
+
| 254 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 903 |
+
| 255 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 904 |
+
|
| 905 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 906 |
+
- Percentage of total elements: 2.72%
|
| 907 |
+
|
| 908 |
+
|
| 909 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 910 |
+
|
| 911 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 912 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 913 |
+
| 256 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 914 |
+
| 257 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 915 |
+
| 258 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 916 |
+
| 259 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 917 |
+
| 260 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 918 |
+
| 261 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 919 |
+
| 262 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 920 |
+
| 263 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 921 |
+
| 264 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 922 |
+
|
| 923 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 924 |
+
- Percentage of total elements: 2.72%
|
| 925 |
+
|
| 926 |
+
|
| 927 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 928 |
+
|
| 929 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 930 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 931 |
+
| 265 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 932 |
+
| 266 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 933 |
+
| 267 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 934 |
+
| 268 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 935 |
+
| 269 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 936 |
+
| 270 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 937 |
+
| 271 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 938 |
+
| 272 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 939 |
+
| 273 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 940 |
+
|
| 941 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 942 |
+
- Percentage of total elements: 2.72%
|
| 943 |
+
|
| 944 |
+
|
| 945 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 946 |
+
|
| 947 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 948 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 949 |
+
| 274 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 950 |
+
| 275 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 951 |
+
| 276 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 952 |
+
| 277 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 953 |
+
| 278 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 954 |
+
| 279 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 955 |
+
| 280 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 956 |
+
| 281 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 957 |
+
| 282 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 958 |
+
|
| 959 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 960 |
+
- Percentage of total elements: 2.72%
|
| 961 |
+
|
| 962 |
+
|
| 963 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 964 |
+
|
| 965 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 966 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 967 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 968 |
+
| 284 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 969 |
+
| 285 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 970 |
+
| 286 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 971 |
+
| 287 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 972 |
+
| 288 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 973 |
+
| 289 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 974 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 975 |
+
| 291 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 976 |
+
|
| 977 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 978 |
+
- Percentage of total elements: 2.72%
|
scores/Llama-Guard-3-8B-Q4_K_S.md
ADDED
|
@@ -0,0 +1,978 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-Q4_K_S.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 39 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 36 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 36 |
+
| 26 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 38 |
+
| 28 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 39 |
+
| 29 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 40 |
+
| 30 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 41 |
+
| 31 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 43 |
+
| 33 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 44 |
+
| 34 | UINT32 | 1 | general.quantization_version | 2 |
|
| 45 |
+
| 35 | UINT32 | 1 | general.file_type | 14 |
|
| 46 |
+
| 36 | STRING | 1 | quantize.imatrix.file | `./imatrix/imatrix-Llama-Guard-3-8B-small.dat` |
|
| 47 |
+
| 37 | STRING | 1 | quantize.imatrix.dataset | `../../datasets/imatrix/calibration_eur_small.txt` |
|
| 48 |
+
| 38 | INT32 | 1 | quantize.imatrix.entries_count | 224 |
|
| 49 |
+
| 39 | INT32 | 1 | quantize.imatrix.chunks_count | 962 |
|
| 50 |
+
|
| 51 |
+
## Tensors Overview ~8B Elements
|
| 52 |
+
|
| 53 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 54 |
+
|
| 55 |
+
- [Llama-Guard-3-8B-Q4\_K\_S.gguf - GGUF Internal File Dump](#llama-guard-3-8b-q4_k_sgguf---gguf-internal-file-dump)
|
| 56 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 57 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 58 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 59 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 60 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 61 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 62 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 63 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 64 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 65 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 66 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 67 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 68 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 69 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 70 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 71 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 72 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 73 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 74 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 75 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 76 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 77 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 78 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 79 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 80 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 81 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 82 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 83 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 84 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 85 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 86 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 87 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 88 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 89 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 90 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 91 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 92 |
+
|
| 93 |
+
### Tensor Data Offset
|
| 94 |
+
|
| 95 |
+
This table contains the offset and data segment relative to start of file
|
| 96 |
+
|
| 97 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 98 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 99 |
+
| 0 | output.weight | 0x779880 | 0x119d0000 |
|
| 100 |
+
| 1 | output_norm.weight | 0x12149880 | 0x4000 |
|
| 101 |
+
| 2 | rope_freqs.weight | 0x1214d880 | 0x100 |
|
| 102 |
+
| 3 | token_embd.weight | 0x1214d980 | 0xd746000 |
|
| 103 |
+
| 4 | blk.0.attn_k.weight | 0x1f893980 | 0x1b8000 |
|
| 104 |
+
| 5 | blk.0.attn_norm.weight | 0x1fa4b980 | 0x4000 |
|
| 105 |
+
| 6 | blk.0.attn_output.weight | 0x1fa4f980 | 0x900000 |
|
| 106 |
+
| 7 | blk.0.attn_q.weight | 0x2034f980 | 0x6e0000 |
|
| 107 |
+
| 8 | blk.0.attn_v.weight | 0x20a2f980 | 0x240000 |
|
| 108 |
+
| 9 | blk.0.ffn_down.weight | 0x20c6f980 | 0x1f80000 |
|
| 109 |
+
| 10 | blk.0.ffn_gate.weight | 0x22bef980 | 0x1810000 |
|
| 110 |
+
| 11 | blk.0.ffn_norm.weight | 0x243ff980 | 0x4000 |
|
| 111 |
+
| 12 | blk.0.ffn_up.weight | 0x24403980 | 0x1810000 |
|
| 112 |
+
| 13 | blk.1.attn_k.weight | 0x25c13980 | 0x1b8000 |
|
| 113 |
+
| 14 | blk.1.attn_norm.weight | 0x25dcb980 | 0x4000 |
|
| 114 |
+
| 15 | blk.1.attn_output.weight | 0x25dcf980 | 0x900000 |
|
| 115 |
+
| 16 | blk.1.attn_q.weight | 0x266cf980 | 0x6e0000 |
|
| 116 |
+
| 17 | blk.1.attn_v.weight | 0x26daf980 | 0x240000 |
|
| 117 |
+
| 18 | blk.1.ffn_down.weight | 0x26fef980 | 0x2680000 |
|
| 118 |
+
| 19 | blk.1.ffn_gate.weight | 0x2966f980 | 0x1810000 |
|
| 119 |
+
| 20 | blk.1.ffn_norm.weight | 0x2ae7f980 | 0x4000 |
|
| 120 |
+
| 21 | blk.1.ffn_up.weight | 0x2ae83980 | 0x1810000 |
|
| 121 |
+
| 22 | blk.2.attn_k.weight | 0x2c693980 | 0x1b8000 |
|
| 122 |
+
| 23 | blk.2.attn_norm.weight | 0x2c84b980 | 0x4000 |
|
| 123 |
+
| 24 | blk.2.attn_output.weight | 0x2c84f980 | 0x900000 |
|
| 124 |
+
| 25 | blk.2.attn_q.weight | 0x2d14f980 | 0x6e0000 |
|
| 125 |
+
| 26 | blk.2.attn_v.weight | 0x2d82f980 | 0x240000 |
|
| 126 |
+
| 27 | blk.2.ffn_down.weight | 0x2da6f980 | 0x1f80000 |
|
| 127 |
+
| 28 | blk.2.ffn_gate.weight | 0x2f9ef980 | 0x1810000 |
|
| 128 |
+
| 29 | blk.2.ffn_norm.weight | 0x311ff980 | 0x4000 |
|
| 129 |
+
| 30 | blk.2.ffn_up.weight | 0x31203980 | 0x1810000 |
|
| 130 |
+
| 31 | blk.3.attn_k.weight | 0x32a13980 | 0x1b8000 |
|
| 131 |
+
| 32 | blk.3.attn_norm.weight | 0x32bcb980 | 0x4000 |
|
| 132 |
+
| 33 | blk.3.attn_output.weight | 0x32bcf980 | 0x900000 |
|
| 133 |
+
| 34 | blk.3.attn_q.weight | 0x334cf980 | 0x6e0000 |
|
| 134 |
+
| 35 | blk.3.attn_v.weight | 0x33baf980 | 0x240000 |
|
| 135 |
+
| 36 | blk.3.ffn_down.weight | 0x33def980 | 0x1f80000 |
|
| 136 |
+
| 37 | blk.3.ffn_gate.weight | 0x35d6f980 | 0x1810000 |
|
| 137 |
+
| 38 | blk.3.ffn_norm.weight | 0x3757f980 | 0x4000 |
|
| 138 |
+
| 39 | blk.3.ffn_up.weight | 0x37583980 | 0x1810000 |
|
| 139 |
+
| 40 | blk.4.attn_k.weight | 0x38d93980 | 0x1b8000 |
|
| 140 |
+
| 41 | blk.4.attn_norm.weight | 0x38f4b980 | 0x4000 |
|
| 141 |
+
| 42 | blk.4.attn_output.weight | 0x38f4f980 | 0x900000 |
|
| 142 |
+
| 43 | blk.4.attn_q.weight | 0x3984f980 | 0x6e0000 |
|
| 143 |
+
| 44 | blk.4.attn_v.weight | 0x39f2f980 | 0x240000 |
|
| 144 |
+
| 45 | blk.4.ffn_down.weight | 0x3a16f980 | 0x1f80000 |
|
| 145 |
+
| 46 | blk.4.ffn_gate.weight | 0x3c0ef980 | 0x1810000 |
|
| 146 |
+
| 47 | blk.4.ffn_norm.weight | 0x3d8ff980 | 0x4000 |
|
| 147 |
+
| 48 | blk.4.ffn_up.weight | 0x3d903980 | 0x1810000 |
|
| 148 |
+
| 49 | blk.5.attn_k.weight | 0x3f113980 | 0x1b8000 |
|
| 149 |
+
| 50 | blk.5.attn_norm.weight | 0x3f2cb980 | 0x4000 |
|
| 150 |
+
| 51 | blk.5.attn_output.weight | 0x3f2cf980 | 0x900000 |
|
| 151 |
+
| 52 | blk.5.attn_q.weight | 0x3fbcf980 | 0x6e0000 |
|
| 152 |
+
| 53 | blk.5.attn_v.weight | 0x402af980 | 0x240000 |
|
| 153 |
+
| 54 | blk.5.ffn_down.weight | 0x404ef980 | 0x1f80000 |
|
| 154 |
+
| 55 | blk.5.ffn_gate.weight | 0x4246f980 | 0x1810000 |
|
| 155 |
+
| 56 | blk.5.ffn_norm.weight | 0x43c7f980 | 0x4000 |
|
| 156 |
+
| 57 | blk.5.ffn_up.weight | 0x43c83980 | 0x1810000 |
|
| 157 |
+
| 58 | blk.6.attn_k.weight | 0x45493980 | 0x1b8000 |
|
| 158 |
+
| 59 | blk.6.attn_norm.weight | 0x4564b980 | 0x4000 |
|
| 159 |
+
| 60 | blk.6.attn_output.weight | 0x4564f980 | 0x900000 |
|
| 160 |
+
| 61 | blk.6.attn_q.weight | 0x45f4f980 | 0x6e0000 |
|
| 161 |
+
| 62 | blk.6.attn_v.weight | 0x4662f980 | 0x240000 |
|
| 162 |
+
| 63 | blk.6.ffn_down.weight | 0x4686f980 | 0x1f80000 |
|
| 163 |
+
| 64 | blk.6.ffn_gate.weight | 0x487ef980 | 0x1810000 |
|
| 164 |
+
| 65 | blk.6.ffn_norm.weight | 0x49fff980 | 0x4000 |
|
| 165 |
+
| 66 | blk.6.ffn_up.weight | 0x4a003980 | 0x1810000 |
|
| 166 |
+
| 67 | blk.7.attn_k.weight | 0x4b813980 | 0x1b8000 |
|
| 167 |
+
| 68 | blk.7.attn_norm.weight | 0x4b9cb980 | 0x4000 |
|
| 168 |
+
| 69 | blk.7.attn_output.weight | 0x4b9cf980 | 0x900000 |
|
| 169 |
+
| 70 | blk.7.attn_q.weight | 0x4c2cf980 | 0x6e0000 |
|
| 170 |
+
| 71 | blk.7.attn_v.weight | 0x4c9af980 | 0x240000 |
|
| 171 |
+
| 72 | blk.7.ffn_down.weight | 0x4cbef980 | 0x1f80000 |
|
| 172 |
+
| 73 | blk.7.ffn_gate.weight | 0x4eb6f980 | 0x1810000 |
|
| 173 |
+
| 74 | blk.7.ffn_norm.weight | 0x5037f980 | 0x4000 |
|
| 174 |
+
| 75 | blk.7.ffn_up.weight | 0x50383980 | 0x1810000 |
|
| 175 |
+
| 76 | blk.8.attn_k.weight | 0x51b93980 | 0x1b8000 |
|
| 176 |
+
| 77 | blk.8.attn_norm.weight | 0x51d4b980 | 0x4000 |
|
| 177 |
+
| 78 | blk.8.attn_output.weight | 0x51d4f980 | 0x900000 |
|
| 178 |
+
| 79 | blk.8.attn_q.weight | 0x5264f980 | 0x6e0000 |
|
| 179 |
+
| 80 | blk.8.attn_v.weight | 0x52d2f980 | 0x240000 |
|
| 180 |
+
| 81 | blk.8.ffn_down.weight | 0x52f6f980 | 0x1f80000 |
|
| 181 |
+
| 82 | blk.8.ffn_gate.weight | 0x54eef980 | 0x1810000 |
|
| 182 |
+
| 83 | blk.8.ffn_norm.weight | 0x566ff980 | 0x4000 |
|
| 183 |
+
| 84 | blk.8.ffn_up.weight | 0x56703980 | 0x1810000 |
|
| 184 |
+
| 85 | blk.9.attn_k.weight | 0x57f13980 | 0x1b8000 |
|
| 185 |
+
| 86 | blk.9.attn_norm.weight | 0x580cb980 | 0x4000 |
|
| 186 |
+
| 87 | blk.9.attn_output.weight | 0x580cf980 | 0x900000 |
|
| 187 |
+
| 88 | blk.9.attn_q.weight | 0x589cf980 | 0x6e0000 |
|
| 188 |
+
| 89 | blk.9.attn_v.weight | 0x590af980 | 0x240000 |
|
| 189 |
+
| 90 | blk.9.ffn_down.weight | 0x592ef980 | 0x1f80000 |
|
| 190 |
+
| 91 | blk.9.ffn_gate.weight | 0x5b26f980 | 0x1810000 |
|
| 191 |
+
| 92 | blk.9.ffn_norm.weight | 0x5ca7f980 | 0x4000 |
|
| 192 |
+
| 93 | blk.9.ffn_up.weight | 0x5ca83980 | 0x1810000 |
|
| 193 |
+
| 94 | blk.10.attn_k.weight | 0x5e293980 | 0x1b8000 |
|
| 194 |
+
| 95 | blk.10.attn_norm.weight | 0x5e44b980 | 0x4000 |
|
| 195 |
+
| 96 | blk.10.attn_output.weight | 0x5e44f980 | 0x900000 |
|
| 196 |
+
| 97 | blk.10.attn_q.weight | 0x5ed4f980 | 0x6e0000 |
|
| 197 |
+
| 98 | blk.10.attn_v.weight | 0x5f42f980 | 0x240000 |
|
| 198 |
+
| 99 | blk.10.ffn_down.weight | 0x5f66f980 | 0x1f80000 |
|
| 199 |
+
| 100 | blk.10.ffn_gate.weight | 0x615ef980 | 0x1810000 |
|
| 200 |
+
| 101 | blk.10.ffn_norm.weight | 0x62dff980 | 0x4000 |
|
| 201 |
+
| 102 | blk.10.ffn_up.weight | 0x62e03980 | 0x1810000 |
|
| 202 |
+
| 103 | blk.11.attn_k.weight | 0x64613980 | 0x1b8000 |
|
| 203 |
+
| 104 | blk.11.attn_norm.weight | 0x647cb980 | 0x4000 |
|
| 204 |
+
| 105 | blk.11.attn_output.weight | 0x647cf980 | 0x900000 |
|
| 205 |
+
| 106 | blk.11.attn_q.weight | 0x650cf980 | 0x6e0000 |
|
| 206 |
+
| 107 | blk.11.attn_v.weight | 0x657af980 | 0x240000 |
|
| 207 |
+
| 108 | blk.11.ffn_down.weight | 0x659ef980 | 0x1f80000 |
|
| 208 |
+
| 109 | blk.11.ffn_gate.weight | 0x6796f980 | 0x1810000 |
|
| 209 |
+
| 110 | blk.11.ffn_norm.weight | 0x6917f980 | 0x4000 |
|
| 210 |
+
| 111 | blk.11.ffn_up.weight | 0x69183980 | 0x1810000 |
|
| 211 |
+
| 112 | blk.12.attn_k.weight | 0x6a993980 | 0x1b8000 |
|
| 212 |
+
| 113 | blk.12.attn_norm.weight | 0x6ab4b980 | 0x4000 |
|
| 213 |
+
| 114 | blk.12.attn_output.weight | 0x6ab4f980 | 0x900000 |
|
| 214 |
+
| 115 | blk.12.attn_q.weight | 0x6b44f980 | 0x6e0000 |
|
| 215 |
+
| 116 | blk.12.attn_v.weight | 0x6bb2f980 | 0x240000 |
|
| 216 |
+
| 117 | blk.12.ffn_down.weight | 0x6bd6f980 | 0x1f80000 |
|
| 217 |
+
| 118 | blk.12.ffn_gate.weight | 0x6dcef980 | 0x1810000 |
|
| 218 |
+
| 119 | blk.12.ffn_norm.weight | 0x6f4ff980 | 0x4000 |
|
| 219 |
+
| 120 | blk.12.ffn_up.weight | 0x6f503980 | 0x1810000 |
|
| 220 |
+
| 121 | blk.13.attn_k.weight | 0x70d13980 | 0x240000 |
|
| 221 |
+
| 122 | blk.13.attn_norm.weight | 0x70f53980 | 0x4000 |
|
| 222 |
+
| 123 | blk.13.attn_output.weight | 0x70f57980 | 0x900000 |
|
| 223 |
+
| 124 | blk.13.attn_q.weight | 0x71857980 | 0x900000 |
|
| 224 |
+
| 125 | blk.13.attn_v.weight | 0x72157980 | 0x240000 |
|
| 225 |
+
| 126 | blk.13.ffn_down.weight | 0x72397980 | 0x1f80000 |
|
| 226 |
+
| 127 | blk.13.ffn_gate.weight | 0x74317980 | 0x1810000 |
|
| 227 |
+
| 128 | blk.13.ffn_norm.weight | 0x75b27980 | 0x4000 |
|
| 228 |
+
| 129 | blk.13.ffn_up.weight | 0x75b2b980 | 0x1810000 |
|
| 229 |
+
| 130 | blk.14.attn_k.weight | 0x7733b980 | 0x240000 |
|
| 230 |
+
| 131 | blk.14.attn_norm.weight | 0x7757b980 | 0x4000 |
|
| 231 |
+
| 132 | blk.14.attn_output.weight | 0x7757f980 | 0x900000 |
|
| 232 |
+
| 133 | blk.14.attn_q.weight | 0x77e7f980 | 0x900000 |
|
| 233 |
+
| 134 | blk.14.attn_v.weight | 0x7877f980 | 0x240000 |
|
| 234 |
+
| 135 | blk.14.ffn_down.weight | 0x789bf980 | 0x1f80000 |
|
| 235 |
+
| 136 | blk.14.ffn_gate.weight | 0x7a93f980 | 0x1810000 |
|
| 236 |
+
| 137 | blk.14.ffn_norm.weight | 0x7c14f980 | 0x4000 |
|
| 237 |
+
| 138 | blk.14.ffn_up.weight | 0x7c153980 | 0x1810000 |
|
| 238 |
+
| 139 | blk.15.attn_k.weight | 0x7d963980 | 0x1b8000 |
|
| 239 |
+
| 140 | blk.15.attn_norm.weight | 0x7db1b980 | 0x4000 |
|
| 240 |
+
| 141 | blk.15.attn_output.weight | 0x7db1f980 | 0x900000 |
|
| 241 |
+
| 142 | blk.15.attn_q.weight | 0x7e41f980 | 0x6e0000 |
|
| 242 |
+
| 143 | blk.15.attn_v.weight | 0x7eaff980 | 0x240000 |
|
| 243 |
+
| 144 | blk.15.ffn_down.weight | 0x7ed3f980 | 0x1f80000 |
|
| 244 |
+
| 145 | blk.15.ffn_gate.weight | 0x80cbf980 | 0x1810000 |
|
| 245 |
+
| 146 | blk.15.ffn_norm.weight | 0x824cf980 | 0x4000 |
|
| 246 |
+
| 147 | blk.15.ffn_up.weight | 0x824d3980 | 0x1810000 |
|
| 247 |
+
| 148 | blk.16.attn_k.weight | 0x83ce3980 | 0x240000 |
|
| 248 |
+
| 149 | blk.16.attn_norm.weight | 0x83f23980 | 0x4000 |
|
| 249 |
+
| 150 | blk.16.attn_output.weight | 0x83f27980 | 0x900000 |
|
| 250 |
+
| 151 | blk.16.attn_q.weight | 0x84827980 | 0x900000 |
|
| 251 |
+
| 152 | blk.16.attn_v.weight | 0x85127980 | 0x240000 |
|
| 252 |
+
| 153 | blk.16.ffn_down.weight | 0x85367980 | 0x1f80000 |
|
| 253 |
+
| 154 | blk.16.ffn_gate.weight | 0x872e7980 | 0x1f80000 |
|
| 254 |
+
| 155 | blk.16.ffn_norm.weight | 0x89267980 | 0x4000 |
|
| 255 |
+
| 156 | blk.16.ffn_up.weight | 0x8926b980 | 0x1f80000 |
|
| 256 |
+
| 157 | blk.17.attn_k.weight | 0x8b1eb980 | 0x1b8000 |
|
| 257 |
+
| 158 | blk.17.attn_norm.weight | 0x8b3a3980 | 0x4000 |
|
| 258 |
+
| 159 | blk.17.attn_output.weight | 0x8b3a7980 | 0x900000 |
|
| 259 |
+
| 160 | blk.17.attn_q.weight | 0x8bca7980 | 0x6e0000 |
|
| 260 |
+
| 161 | blk.17.attn_v.weight | 0x8c387980 | 0x240000 |
|
| 261 |
+
| 162 | blk.17.ffn_down.weight | 0x8c5c7980 | 0x2680000 |
|
| 262 |
+
| 163 | blk.17.ffn_gate.weight | 0x8ec47980 | 0x1f80000 |
|
| 263 |
+
| 164 | blk.17.ffn_norm.weight | 0x90bc7980 | 0x4000 |
|
| 264 |
+
| 165 | blk.17.ffn_up.weight | 0x90bcb980 | 0x1f80000 |
|
| 265 |
+
| 166 | blk.18.attn_k.weight | 0x92b4b980 | 0x240000 |
|
| 266 |
+
| 167 | blk.18.attn_norm.weight | 0x92d8b980 | 0x4000 |
|
| 267 |
+
| 168 | blk.18.attn_output.weight | 0x92d8f980 | 0x900000 |
|
| 268 |
+
| 169 | blk.18.attn_q.weight | 0x9368f980 | 0x900000 |
|
| 269 |
+
| 170 | blk.18.attn_v.weight | 0x93f8f980 | 0x240000 |
|
| 270 |
+
| 171 | blk.18.ffn_down.weight | 0x941cf980 | 0x2680000 |
|
| 271 |
+
| 172 | blk.18.ffn_gate.weight | 0x9684f980 | 0x1f80000 |
|
| 272 |
+
| 173 | blk.18.ffn_norm.weight | 0x987cf980 | 0x4000 |
|
| 273 |
+
| 174 | blk.18.ffn_up.weight | 0x987d3980 | 0x1f80000 |
|
| 274 |
+
| 175 | blk.19.attn_k.weight | 0x9a753980 | 0x240000 |
|
| 275 |
+
| 176 | blk.19.attn_norm.weight | 0x9a993980 | 0x4000 |
|
| 276 |
+
| 177 | blk.19.attn_output.weight | 0x9a997980 | 0x900000 |
|
| 277 |
+
| 178 | blk.19.attn_q.weight | 0x9b297980 | 0x900000 |
|
| 278 |
+
| 179 | blk.19.attn_v.weight | 0x9bb97980 | 0x240000 |
|
| 279 |
+
| 180 | blk.19.ffn_down.weight | 0x9bdd7980 | 0x2680000 |
|
| 280 |
+
| 181 | blk.19.ffn_gate.weight | 0x9e457980 | 0x1f80000 |
|
| 281 |
+
| 182 | blk.19.ffn_norm.weight | 0xa03d7980 | 0x4000 |
|
| 282 |
+
| 183 | blk.19.ffn_up.weight | 0xa03db980 | 0x1f80000 |
|
| 283 |
+
| 184 | blk.20.attn_k.weight | 0xa235b980 | 0x240000 |
|
| 284 |
+
| 185 | blk.20.attn_norm.weight | 0xa259b980 | 0x4000 |
|
| 285 |
+
| 186 | blk.20.attn_output.weight | 0xa259f980 | 0x900000 |
|
| 286 |
+
| 187 | blk.20.attn_q.weight | 0xa2e9f980 | 0x900000 |
|
| 287 |
+
| 188 | blk.20.attn_v.weight | 0xa379f980 | 0x240000 |
|
| 288 |
+
| 189 | blk.20.ffn_down.weight | 0xa39df980 | 0x2680000 |
|
| 289 |
+
| 190 | blk.20.ffn_gate.weight | 0xa605f980 | 0x1f80000 |
|
| 290 |
+
| 191 | blk.20.ffn_norm.weight | 0xa7fdf980 | 0x4000 |
|
| 291 |
+
| 192 | blk.20.ffn_up.weight | 0xa7fe3980 | 0x1f80000 |
|
| 292 |
+
| 193 | blk.21.attn_k.weight | 0xa9f63980 | 0x240000 |
|
| 293 |
+
| 194 | blk.21.attn_norm.weight | 0xaa1a3980 | 0x4000 |
|
| 294 |
+
| 195 | blk.21.attn_output.weight | 0xaa1a7980 | 0x900000 |
|
| 295 |
+
| 196 | blk.21.attn_q.weight | 0xaaaa7980 | 0x900000 |
|
| 296 |
+
| 197 | blk.21.attn_v.weight | 0xab3a7980 | 0x240000 |
|
| 297 |
+
| 198 | blk.21.ffn_down.weight | 0xab5e7980 | 0x2680000 |
|
| 298 |
+
| 199 | blk.21.ffn_gate.weight | 0xadc67980 | 0x1f80000 |
|
| 299 |
+
| 200 | blk.21.ffn_norm.weight | 0xafbe7980 | 0x4000 |
|
| 300 |
+
| 201 | blk.21.ffn_up.weight | 0xafbeb980 | 0x1f80000 |
|
| 301 |
+
| 202 | blk.22.attn_k.weight | 0xb1b6b980 | 0x240000 |
|
| 302 |
+
| 203 | blk.22.attn_norm.weight | 0xb1dab980 | 0x4000 |
|
| 303 |
+
| 204 | blk.22.attn_output.weight | 0xb1daf980 | 0x900000 |
|
| 304 |
+
| 205 | blk.22.attn_q.weight | 0xb26af980 | 0x900000 |
|
| 305 |
+
| 206 | blk.22.attn_v.weight | 0xb2faf980 | 0x240000 |
|
| 306 |
+
| 207 | blk.22.ffn_down.weight | 0xb31ef980 | 0x2680000 |
|
| 307 |
+
| 208 | blk.22.ffn_gate.weight | 0xb586f980 | 0x1f80000 |
|
| 308 |
+
| 209 | blk.22.ffn_norm.weight | 0xb77ef980 | 0x4000 |
|
| 309 |
+
| 210 | blk.22.ffn_up.weight | 0xb77f3980 | 0x1f80000 |
|
| 310 |
+
| 211 | blk.23.attn_k.weight | 0xb9773980 | 0x240000 |
|
| 311 |
+
| 212 | blk.23.attn_norm.weight | 0xb99b3980 | 0x4000 |
|
| 312 |
+
| 213 | blk.23.attn_output.weight | 0xb99b7980 | 0x900000 |
|
| 313 |
+
| 214 | blk.23.attn_q.weight | 0xba2b7980 | 0x900000 |
|
| 314 |
+
| 215 | blk.23.attn_v.weight | 0xbabb7980 | 0x240000 |
|
| 315 |
+
| 216 | blk.23.ffn_down.weight | 0xbadf7980 | 0x2680000 |
|
| 316 |
+
| 217 | blk.23.ffn_gate.weight | 0xbd477980 | 0x1f80000 |
|
| 317 |
+
| 218 | blk.23.ffn_norm.weight | 0xbf3f7980 | 0x4000 |
|
| 318 |
+
| 219 | blk.23.ffn_up.weight | 0xbf3fb980 | 0x1f80000 |
|
| 319 |
+
| 220 | blk.24.attn_k.weight | 0xc137b980 | 0x240000 |
|
| 320 |
+
| 221 | blk.24.attn_norm.weight | 0xc15bb980 | 0x4000 |
|
| 321 |
+
| 222 | blk.24.attn_output.weight | 0xc15bf980 | 0x900000 |
|
| 322 |
+
| 223 | blk.24.attn_q.weight | 0xc1ebf980 | 0x900000 |
|
| 323 |
+
| 224 | blk.24.attn_v.weight | 0xc27bf980 | 0x240000 |
|
| 324 |
+
| 225 | blk.24.ffn_down.weight | 0xc29ff980 | 0x2680000 |
|
| 325 |
+
| 226 | blk.24.ffn_gate.weight | 0xc507f980 | 0x1f80000 |
|
| 326 |
+
| 227 | blk.24.ffn_norm.weight | 0xc6fff980 | 0x4000 |
|
| 327 |
+
| 228 | blk.24.ffn_up.weight | 0xc7003980 | 0x1f80000 |
|
| 328 |
+
| 229 | blk.25.attn_k.weight | 0xc8f83980 | 0x240000 |
|
| 329 |
+
| 230 | blk.25.attn_norm.weight | 0xc91c3980 | 0x4000 |
|
| 330 |
+
| 231 | blk.25.attn_output.weight | 0xc91c7980 | 0x900000 |
|
| 331 |
+
| 232 | blk.25.attn_q.weight | 0xc9ac7980 | 0x900000 |
|
| 332 |
+
| 233 | blk.25.attn_v.weight | 0xca3c7980 | 0x240000 |
|
| 333 |
+
| 234 | blk.25.ffn_down.weight | 0xca607980 | 0x2680000 |
|
| 334 |
+
| 235 | blk.25.ffn_gate.weight | 0xccc87980 | 0x1f80000 |
|
| 335 |
+
| 236 | blk.25.ffn_norm.weight | 0xcec07980 | 0x4000 |
|
| 336 |
+
| 237 | blk.25.ffn_up.weight | 0xcec0b980 | 0x1f80000 |
|
| 337 |
+
| 238 | blk.26.attn_k.weight | 0xd0b8b980 | 0x240000 |
|
| 338 |
+
| 239 | blk.26.attn_norm.weight | 0xd0dcb980 | 0x4000 |
|
| 339 |
+
| 240 | blk.26.attn_output.weight | 0xd0dcf980 | 0x900000 |
|
| 340 |
+
| 241 | blk.26.attn_q.weight | 0xd16cf980 | 0x900000 |
|
| 341 |
+
| 242 | blk.26.attn_v.weight | 0xd1fcf980 | 0x240000 |
|
| 342 |
+
| 243 | blk.26.ffn_down.weight | 0xd220f980 | 0x2680000 |
|
| 343 |
+
| 244 | blk.26.ffn_gate.weight | 0xd488f980 | 0x1f80000 |
|
| 344 |
+
| 245 | blk.26.ffn_norm.weight | 0xd680f980 | 0x4000 |
|
| 345 |
+
| 246 | blk.26.ffn_up.weight | 0xd6813980 | 0x1f80000 |
|
| 346 |
+
| 247 | blk.27.attn_k.weight | 0xd8793980 | 0x240000 |
|
| 347 |
+
| 248 | blk.27.attn_norm.weight | 0xd89d3980 | 0x4000 |
|
| 348 |
+
| 249 | blk.27.attn_output.weight | 0xd89d7980 | 0x900000 |
|
| 349 |
+
| 250 | blk.27.attn_q.weight | 0xd92d7980 | 0x900000 |
|
| 350 |
+
| 251 | blk.27.attn_v.weight | 0xd9bd7980 | 0x240000 |
|
| 351 |
+
| 252 | blk.27.ffn_down.weight | 0xd9e17980 | 0x2680000 |
|
| 352 |
+
| 253 | blk.27.ffn_gate.weight | 0xdc497980 | 0x1f80000 |
|
| 353 |
+
| 254 | blk.27.ffn_norm.weight | 0xde417980 | 0x4000 |
|
| 354 |
+
| 255 | blk.27.ffn_up.weight | 0xde41b980 | 0x1f80000 |
|
| 355 |
+
| 256 | blk.28.attn_k.weight | 0xe039b980 | 0x240000 |
|
| 356 |
+
| 257 | blk.28.attn_norm.weight | 0xe05db980 | 0x4000 |
|
| 357 |
+
| 258 | blk.28.attn_output.weight | 0xe05df980 | 0x900000 |
|
| 358 |
+
| 259 | blk.28.attn_q.weight | 0xe0edf980 | 0x900000 |
|
| 359 |
+
| 260 | blk.28.attn_v.weight | 0xe17df980 | 0x240000 |
|
| 360 |
+
| 261 | blk.28.ffn_down.weight | 0xe1a1f980 | 0x2680000 |
|
| 361 |
+
| 262 | blk.28.ffn_gate.weight | 0xe409f980 | 0x1f80000 |
|
| 362 |
+
| 263 | blk.28.ffn_norm.weight | 0xe601f980 | 0x4000 |
|
| 363 |
+
| 264 | blk.28.ffn_up.weight | 0xe6023980 | 0x1f80000 |
|
| 364 |
+
| 265 | blk.29.attn_k.weight | 0xe7fa3980 | 0x240000 |
|
| 365 |
+
| 266 | blk.29.attn_norm.weight | 0xe81e3980 | 0x4000 |
|
| 366 |
+
| 267 | blk.29.attn_output.weight | 0xe81e7980 | 0x900000 |
|
| 367 |
+
| 268 | blk.29.attn_q.weight | 0xe8ae7980 | 0x900000 |
|
| 368 |
+
| 269 | blk.29.attn_v.weight | 0xe93e7980 | 0x240000 |
|
| 369 |
+
| 270 | blk.29.ffn_down.weight | 0xe9627980 | 0x2680000 |
|
| 370 |
+
| 271 | blk.29.ffn_gate.weight | 0xebca7980 | 0x1f80000 |
|
| 371 |
+
| 272 | blk.29.ffn_norm.weight | 0xedc27980 | 0x4000 |
|
| 372 |
+
| 273 | blk.29.ffn_up.weight | 0xedc2b980 | 0x1f80000 |
|
| 373 |
+
| 274 | blk.30.attn_k.weight | 0xefbab980 | 0x240000 |
|
| 374 |
+
| 275 | blk.30.attn_norm.weight | 0xefdeb980 | 0x4000 |
|
| 375 |
+
| 276 | blk.30.attn_output.weight | 0xefdef980 | 0x900000 |
|
| 376 |
+
| 277 | blk.30.attn_q.weight | 0xf06ef980 | 0x900000 |
|
| 377 |
+
| 278 | blk.30.attn_v.weight | 0xf0fef980 | 0x240000 |
|
| 378 |
+
| 279 | blk.30.ffn_down.weight | 0xf122f980 | 0x2680000 |
|
| 379 |
+
| 280 | blk.30.ffn_gate.weight | 0xf38af980 | 0x1f80000 |
|
| 380 |
+
| 281 | blk.30.ffn_norm.weight | 0xf582f980 | 0x4000 |
|
| 381 |
+
| 282 | blk.30.ffn_up.weight | 0xf5833980 | 0x1f80000 |
|
| 382 |
+
| 283 | blk.31.attn_k.weight | 0xf77b3980 | 0x1b8000 |
|
| 383 |
+
| 284 | blk.31.attn_norm.weight | 0xf796b980 | 0x4000 |
|
| 384 |
+
| 285 | blk.31.attn_output.weight | 0xf796f980 | 0x900000 |
|
| 385 |
+
| 286 | blk.31.attn_q.weight | 0xf826f980 | 0x6e0000 |
|
| 386 |
+
| 287 | blk.31.attn_v.weight | 0xf894f980 | 0x240000 |
|
| 387 |
+
| 288 | blk.31.ffn_down.weight | 0xf8b8f980 | 0x2680000 |
|
| 388 |
+
| 289 | blk.31.ffn_gate.weight | 0xfb20f980 | 0x1f80000 |
|
| 389 |
+
| 290 | blk.31.ffn_norm.weight | 0xfd18f980 | 0x4000 |
|
| 390 |
+
| 291 | blk.31.ffn_up.weight | 0xfd193980 | 0x1f80000 |
|
| 391 |
+
|
| 392 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 393 |
+
|
| 394 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 395 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:-----|
|
| 396 |
+
| 0 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q4_K |
|
| 397 |
+
| 1 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 398 |
+
| 2 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 399 |
+
| 3 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q3_K |
|
| 400 |
+
|
| 401 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 402 |
+
- Percentage of total elements: 13.08%
|
| 403 |
+
|
| 404 |
+
|
| 405 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 406 |
+
|
| 407 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 408 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 409 |
+
| 4 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 410 |
+
| 5 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 411 |
+
| 6 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 412 |
+
| 7 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 413 |
+
| 8 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 414 |
+
| 9 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 415 |
+
| 10 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 416 |
+
| 11 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 417 |
+
| 12 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 418 |
+
|
| 419 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 420 |
+
- Percentage of total elements: 2.72%
|
| 421 |
+
|
| 422 |
+
|
| 423 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 424 |
+
|
| 425 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 426 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 427 |
+
| 13 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 428 |
+
| 14 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 429 |
+
| 15 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 430 |
+
| 16 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 431 |
+
| 17 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 432 |
+
| 18 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 433 |
+
| 19 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 434 |
+
| 20 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 435 |
+
| 21 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 436 |
+
|
| 437 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 438 |
+
- Percentage of total elements: 2.72%
|
| 439 |
+
|
| 440 |
+
|
| 441 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 442 |
+
|
| 443 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 444 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 445 |
+
| 22 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 446 |
+
| 23 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 447 |
+
| 24 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 448 |
+
| 25 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 449 |
+
| 26 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 450 |
+
| 27 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 451 |
+
| 28 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 452 |
+
| 29 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 453 |
+
| 30 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 454 |
+
|
| 455 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 456 |
+
- Percentage of total elements: 2.72%
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 460 |
+
|
| 461 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 462 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 463 |
+
| 31 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 464 |
+
| 32 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 465 |
+
| 33 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 466 |
+
| 34 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 467 |
+
| 35 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 468 |
+
| 36 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 469 |
+
| 37 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 470 |
+
| 38 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 471 |
+
| 39 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 472 |
+
|
| 473 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 474 |
+
- Percentage of total elements: 2.72%
|
| 475 |
+
|
| 476 |
+
|
| 477 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 478 |
+
|
| 479 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 480 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 481 |
+
| 40 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 482 |
+
| 41 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 483 |
+
| 42 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 484 |
+
| 43 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 485 |
+
| 44 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 486 |
+
| 45 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 487 |
+
| 46 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 488 |
+
| 47 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 489 |
+
| 48 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 490 |
+
|
| 491 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 492 |
+
- Percentage of total elements: 2.72%
|
| 493 |
+
|
| 494 |
+
|
| 495 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 496 |
+
|
| 497 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 498 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 499 |
+
| 49 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 500 |
+
| 50 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 501 |
+
| 51 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 502 |
+
| 52 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 503 |
+
| 53 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 504 |
+
| 54 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 505 |
+
| 55 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 506 |
+
| 56 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 507 |
+
| 57 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 508 |
+
|
| 509 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 510 |
+
- Percentage of total elements: 2.72%
|
| 511 |
+
|
| 512 |
+
|
| 513 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 514 |
+
|
| 515 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 516 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 517 |
+
| 58 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 518 |
+
| 59 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 519 |
+
| 60 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 520 |
+
| 61 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 521 |
+
| 62 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 522 |
+
| 63 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 523 |
+
| 64 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 524 |
+
| 65 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 525 |
+
| 66 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 526 |
+
|
| 527 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 528 |
+
- Percentage of total elements: 2.72%
|
| 529 |
+
|
| 530 |
+
|
| 531 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 532 |
+
|
| 533 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 534 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 535 |
+
| 67 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 536 |
+
| 68 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 537 |
+
| 69 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 538 |
+
| 70 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 539 |
+
| 71 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 540 |
+
| 72 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 541 |
+
| 73 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 542 |
+
| 74 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 543 |
+
| 75 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 544 |
+
|
| 545 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 546 |
+
- Percentage of total elements: 2.72%
|
| 547 |
+
|
| 548 |
+
|
| 549 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 550 |
+
|
| 551 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 552 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 553 |
+
| 76 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 554 |
+
| 77 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 555 |
+
| 78 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 556 |
+
| 79 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 557 |
+
| 80 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 558 |
+
| 81 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 559 |
+
| 82 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 560 |
+
| 83 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 561 |
+
| 84 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 562 |
+
|
| 563 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 564 |
+
- Percentage of total elements: 2.72%
|
| 565 |
+
|
| 566 |
+
|
| 567 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 568 |
+
|
| 569 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 570 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 571 |
+
| 85 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 572 |
+
| 86 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 573 |
+
| 87 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 574 |
+
| 88 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 575 |
+
| 89 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 576 |
+
| 90 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 577 |
+
| 91 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 578 |
+
| 92 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 579 |
+
| 93 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 580 |
+
|
| 581 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 582 |
+
- Percentage of total elements: 2.72%
|
| 583 |
+
|
| 584 |
+
|
| 585 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 586 |
+
|
| 587 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 588 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 589 |
+
| 94 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 590 |
+
| 95 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 591 |
+
| 96 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 592 |
+
| 97 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 593 |
+
| 98 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 594 |
+
| 99 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 595 |
+
| 100 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 596 |
+
| 101 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 597 |
+
| 102 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 598 |
+
|
| 599 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 600 |
+
- Percentage of total elements: 2.72%
|
| 601 |
+
|
| 602 |
+
|
| 603 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 604 |
+
|
| 605 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 606 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 607 |
+
| 103 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 608 |
+
| 104 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 609 |
+
| 105 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 610 |
+
| 106 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 611 |
+
| 107 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 612 |
+
| 108 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 613 |
+
| 109 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 614 |
+
| 110 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 615 |
+
| 111 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 616 |
+
|
| 617 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 618 |
+
- Percentage of total elements: 2.72%
|
| 619 |
+
|
| 620 |
+
|
| 621 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 622 |
+
|
| 623 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 624 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 625 |
+
| 112 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 626 |
+
| 113 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 627 |
+
| 114 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 628 |
+
| 115 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 629 |
+
| 116 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 630 |
+
| 117 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 631 |
+
| 118 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 632 |
+
| 119 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 633 |
+
| 120 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 634 |
+
|
| 635 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 636 |
+
- Percentage of total elements: 2.72%
|
| 637 |
+
|
| 638 |
+
|
| 639 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 640 |
+
|
| 641 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 642 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 643 |
+
| 121 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 644 |
+
| 122 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 645 |
+
| 123 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 646 |
+
| 124 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 647 |
+
| 125 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 648 |
+
| 126 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 649 |
+
| 127 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 650 |
+
| 128 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 651 |
+
| 129 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 652 |
+
|
| 653 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 654 |
+
- Percentage of total elements: 2.72%
|
| 655 |
+
|
| 656 |
+
|
| 657 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 658 |
+
|
| 659 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 660 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 661 |
+
| 130 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 662 |
+
| 131 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 663 |
+
| 132 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 664 |
+
| 133 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 665 |
+
| 134 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 666 |
+
| 135 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 667 |
+
| 136 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 668 |
+
| 137 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 669 |
+
| 138 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 670 |
+
|
| 671 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 672 |
+
- Percentage of total elements: 2.72%
|
| 673 |
+
|
| 674 |
+
|
| 675 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 676 |
+
|
| 677 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 678 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 679 |
+
| 139 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 680 |
+
| 140 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 681 |
+
| 141 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 682 |
+
| 142 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 683 |
+
| 143 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 684 |
+
| 144 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 685 |
+
| 145 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 686 |
+
| 146 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 687 |
+
| 147 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q3_K |
|
| 688 |
+
|
| 689 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 690 |
+
- Percentage of total elements: 2.72%
|
| 691 |
+
|
| 692 |
+
|
| 693 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 694 |
+
|
| 695 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 696 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 697 |
+
| 148 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 698 |
+
| 149 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 699 |
+
| 150 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 700 |
+
| 151 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 701 |
+
| 152 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 702 |
+
| 153 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q4_K |
|
| 703 |
+
| 154 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 704 |
+
| 155 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 705 |
+
| 156 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 706 |
+
|
| 707 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 708 |
+
- Percentage of total elements: 2.72%
|
| 709 |
+
|
| 710 |
+
|
| 711 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 712 |
+
|
| 713 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 714 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 715 |
+
| 157 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 716 |
+
| 158 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 717 |
+
| 159 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 718 |
+
| 160 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 719 |
+
| 161 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 720 |
+
| 162 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 721 |
+
| 163 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 722 |
+
| 164 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 723 |
+
| 165 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 724 |
+
|
| 725 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 726 |
+
- Percentage of total elements: 2.72%
|
| 727 |
+
|
| 728 |
+
|
| 729 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 730 |
+
|
| 731 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 732 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 733 |
+
| 166 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 734 |
+
| 167 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 735 |
+
| 168 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 736 |
+
| 169 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 737 |
+
| 170 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 738 |
+
| 171 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 739 |
+
| 172 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 740 |
+
| 173 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 741 |
+
| 174 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 742 |
+
|
| 743 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 744 |
+
- Percentage of total elements: 2.72%
|
| 745 |
+
|
| 746 |
+
|
| 747 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 748 |
+
|
| 749 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 750 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 751 |
+
| 175 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 752 |
+
| 176 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 177 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 754 |
+
| 178 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 755 |
+
| 179 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 756 |
+
| 180 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 757 |
+
| 181 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 758 |
+
| 182 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 759 |
+
| 183 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 760 |
+
|
| 761 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 762 |
+
- Percentage of total elements: 2.72%
|
| 763 |
+
|
| 764 |
+
|
| 765 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 766 |
+
|
| 767 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 768 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 769 |
+
| 184 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 770 |
+
| 185 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 771 |
+
| 186 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 772 |
+
| 187 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 773 |
+
| 188 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 774 |
+
| 189 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 775 |
+
| 190 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 776 |
+
| 191 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 777 |
+
| 192 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 778 |
+
|
| 779 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 780 |
+
- Percentage of total elements: 2.72%
|
| 781 |
+
|
| 782 |
+
|
| 783 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 784 |
+
|
| 785 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 786 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 787 |
+
| 193 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 788 |
+
| 194 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 789 |
+
| 195 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 790 |
+
| 196 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 791 |
+
| 197 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 792 |
+
| 198 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 793 |
+
| 199 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 794 |
+
| 200 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 795 |
+
| 201 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 796 |
+
|
| 797 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 798 |
+
- Percentage of total elements: 2.72%
|
| 799 |
+
|
| 800 |
+
|
| 801 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 802 |
+
|
| 803 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 804 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 805 |
+
| 202 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 806 |
+
| 203 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 807 |
+
| 204 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 808 |
+
| 205 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 809 |
+
| 206 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 810 |
+
| 207 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 811 |
+
| 208 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 812 |
+
| 209 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 813 |
+
| 210 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 814 |
+
|
| 815 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 816 |
+
- Percentage of total elements: 2.72%
|
| 817 |
+
|
| 818 |
+
|
| 819 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 820 |
+
|
| 821 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 822 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 823 |
+
| 211 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 824 |
+
| 212 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 825 |
+
| 213 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 826 |
+
| 214 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 827 |
+
| 215 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 828 |
+
| 216 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 829 |
+
| 217 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 830 |
+
| 218 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 831 |
+
| 219 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 832 |
+
|
| 833 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 834 |
+
- Percentage of total elements: 2.72%
|
| 835 |
+
|
| 836 |
+
|
| 837 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 838 |
+
|
| 839 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 840 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 841 |
+
| 220 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 842 |
+
| 221 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 843 |
+
| 222 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 844 |
+
| 223 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 845 |
+
| 224 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 846 |
+
| 225 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 847 |
+
| 226 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 848 |
+
| 227 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 849 |
+
| 228 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 850 |
+
|
| 851 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 852 |
+
- Percentage of total elements: 2.72%
|
| 853 |
+
|
| 854 |
+
|
| 855 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 856 |
+
|
| 857 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 858 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 859 |
+
| 229 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 860 |
+
| 230 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 861 |
+
| 231 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 862 |
+
| 232 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 863 |
+
| 233 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 864 |
+
| 234 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 865 |
+
| 235 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 866 |
+
| 236 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 867 |
+
| 237 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 868 |
+
|
| 869 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 870 |
+
- Percentage of total elements: 2.72%
|
| 871 |
+
|
| 872 |
+
|
| 873 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 874 |
+
|
| 875 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 876 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 877 |
+
| 238 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 878 |
+
| 239 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 879 |
+
| 240 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 880 |
+
| 241 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 881 |
+
| 242 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 882 |
+
| 243 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 883 |
+
| 244 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 884 |
+
| 245 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 885 |
+
| 246 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 886 |
+
|
| 887 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 888 |
+
- Percentage of total elements: 2.72%
|
| 889 |
+
|
| 890 |
+
|
| 891 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 892 |
+
|
| 893 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 894 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 895 |
+
| 247 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 896 |
+
| 248 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 897 |
+
| 249 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 898 |
+
| 250 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 899 |
+
| 251 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 900 |
+
| 252 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 901 |
+
| 253 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 902 |
+
| 254 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 903 |
+
| 255 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 904 |
+
|
| 905 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 906 |
+
- Percentage of total elements: 2.72%
|
| 907 |
+
|
| 908 |
+
|
| 909 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 910 |
+
|
| 911 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 912 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 913 |
+
| 256 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 914 |
+
| 257 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 915 |
+
| 258 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 916 |
+
| 259 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 917 |
+
| 260 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 918 |
+
| 261 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 919 |
+
| 262 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 920 |
+
| 263 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 921 |
+
| 264 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 922 |
+
|
| 923 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 924 |
+
- Percentage of total elements: 2.72%
|
| 925 |
+
|
| 926 |
+
|
| 927 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 928 |
+
|
| 929 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 930 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 931 |
+
| 265 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 932 |
+
| 266 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 933 |
+
| 267 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 934 |
+
| 268 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 935 |
+
| 269 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 936 |
+
| 270 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 937 |
+
| 271 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 938 |
+
| 272 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 939 |
+
| 273 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 940 |
+
|
| 941 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 942 |
+
- Percentage of total elements: 2.72%
|
| 943 |
+
|
| 944 |
+
|
| 945 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 946 |
+
|
| 947 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 948 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 949 |
+
| 274 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 950 |
+
| 275 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 951 |
+
| 276 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 952 |
+
| 277 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 953 |
+
| 278 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 954 |
+
| 279 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 955 |
+
| 280 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 956 |
+
| 281 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 957 |
+
| 282 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 958 |
+
|
| 959 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 960 |
+
- Percentage of total elements: 2.72%
|
| 961 |
+
|
| 962 |
+
|
| 963 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 964 |
+
|
| 965 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 966 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 967 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q3_K |
|
| 968 |
+
| 284 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 969 |
+
| 285 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 970 |
+
| 286 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q3_K |
|
| 971 |
+
| 287 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 972 |
+
| 288 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 973 |
+
| 289 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 974 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 975 |
+
| 291 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 976 |
+
|
| 977 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 978 |
+
- Percentage of total elements: 2.72%
|
scores/Llama-Guard-3-8B-Q5_K_M.md
ADDED
|
@@ -0,0 +1,978 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-Q5_K_M.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 39 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 36 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 36 |
+
| 26 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 38 |
+
| 28 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 39 |
+
| 29 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 40 |
+
| 30 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 41 |
+
| 31 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 43 |
+
| 33 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 44 |
+
| 34 | UINT32 | 1 | general.quantization_version | 2 |
|
| 45 |
+
| 35 | UINT32 | 1 | general.file_type | 17 |
|
| 46 |
+
| 36 | STRING | 1 | quantize.imatrix.file | `./imatrix/imatrix-Llama-Guard-3-8B-small.dat` |
|
| 47 |
+
| 37 | STRING | 1 | quantize.imatrix.dataset | `../../datasets/imatrix/calibration_eur_small.txt` |
|
| 48 |
+
| 38 | INT32 | 1 | quantize.imatrix.entries_count | 224 |
|
| 49 |
+
| 39 | INT32 | 1 | quantize.imatrix.chunks_count | 962 |
|
| 50 |
+
|
| 51 |
+
## Tensors Overview ~8B Elements
|
| 52 |
+
|
| 53 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 54 |
+
|
| 55 |
+
- [Llama-Guard-3-8B-Q5\_K\_M.gguf - GGUF Internal File Dump](#llama-guard-3-8b-q5_k_mgguf---gguf-internal-file-dump)
|
| 56 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 57 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 58 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 59 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 60 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 61 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 62 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 63 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 64 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 65 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 66 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 67 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 68 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 69 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 70 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 71 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 72 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 73 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 74 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 75 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 76 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 77 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 78 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 79 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 80 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 81 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 82 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 83 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 84 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 85 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 86 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 87 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 88 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 89 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 90 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 91 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 92 |
+
|
| 93 |
+
### Tensor Data Offset
|
| 94 |
+
|
| 95 |
+
This table contains the offset and data segment relative to start of file
|
| 96 |
+
|
| 97 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 98 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 99 |
+
| 0 | output.weight | 0x779880 | 0x15870000 |
|
| 100 |
+
| 1 | output_norm.weight | 0x15fe9880 | 0x4000 |
|
| 101 |
+
| 2 | rope_freqs.weight | 0x15fed880 | 0x100 |
|
| 102 |
+
| 3 | token_embd.weight | 0x15fed980 | 0xd746000 |
|
| 103 |
+
| 4 | blk.0.attn_k.weight | 0x23733980 | 0x240000 |
|
| 104 |
+
| 5 | blk.0.attn_norm.weight | 0x23973980 | 0x4000 |
|
| 105 |
+
| 6 | blk.0.attn_output.weight | 0x23977980 | 0xb00000 |
|
| 106 |
+
| 7 | blk.0.attn_q.weight | 0x24477980 | 0x900000 |
|
| 107 |
+
| 8 | blk.0.attn_v.weight | 0x24d77980 | 0x2c0000 |
|
| 108 |
+
| 9 | blk.0.ffn_down.weight | 0x25037980 | 0x2df0000 |
|
| 109 |
+
| 10 | blk.0.ffn_gate.weight | 0x27e27980 | 0x1f80000 |
|
| 110 |
+
| 11 | blk.0.ffn_norm.weight | 0x29da7980 | 0x4000 |
|
| 111 |
+
| 12 | blk.0.ffn_up.weight | 0x29dab980 | 0x1f80000 |
|
| 112 |
+
| 13 | blk.1.attn_k.weight | 0x2bd2b980 | 0x240000 |
|
| 113 |
+
| 14 | blk.1.attn_norm.weight | 0x2bf6b980 | 0x4000 |
|
| 114 |
+
| 15 | blk.1.attn_output.weight | 0x2bf6f980 | 0xb00000 |
|
| 115 |
+
| 16 | blk.1.attn_q.weight | 0x2ca6f980 | 0x900000 |
|
| 116 |
+
| 17 | blk.1.attn_v.weight | 0x2d36f980 | 0x2c0000 |
|
| 117 |
+
| 18 | blk.1.ffn_down.weight | 0x2d62f980 | 0x2df0000 |
|
| 118 |
+
| 19 | blk.1.ffn_gate.weight | 0x3041f980 | 0x1f80000 |
|
| 119 |
+
| 20 | blk.1.ffn_norm.weight | 0x3239f980 | 0x4000 |
|
| 120 |
+
| 21 | blk.1.ffn_up.weight | 0x323a3980 | 0x1f80000 |
|
| 121 |
+
| 22 | blk.2.attn_k.weight | 0x34323980 | 0x240000 |
|
| 122 |
+
| 23 | blk.2.attn_norm.weight | 0x34563980 | 0x4000 |
|
| 123 |
+
| 24 | blk.2.attn_output.weight | 0x34567980 | 0xb00000 |
|
| 124 |
+
| 25 | blk.2.attn_q.weight | 0x35067980 | 0x900000 |
|
| 125 |
+
| 26 | blk.2.attn_v.weight | 0x35967980 | 0x2c0000 |
|
| 126 |
+
| 27 | blk.2.ffn_down.weight | 0x35c27980 | 0x2df0000 |
|
| 127 |
+
| 28 | blk.2.ffn_gate.weight | 0x38a17980 | 0x1f80000 |
|
| 128 |
+
| 29 | blk.2.ffn_norm.weight | 0x3a997980 | 0x4000 |
|
| 129 |
+
| 30 | blk.2.ffn_up.weight | 0x3a99b980 | 0x1f80000 |
|
| 130 |
+
| 31 | blk.3.attn_k.weight | 0x3c91b980 | 0x240000 |
|
| 131 |
+
| 32 | blk.3.attn_norm.weight | 0x3cb5b980 | 0x4000 |
|
| 132 |
+
| 33 | blk.3.attn_output.weight | 0x3cb5f980 | 0xb00000 |
|
| 133 |
+
| 34 | blk.3.attn_q.weight | 0x3d65f980 | 0x900000 |
|
| 134 |
+
| 35 | blk.3.attn_v.weight | 0x3df5f980 | 0x2c0000 |
|
| 135 |
+
| 36 | blk.3.ffn_down.weight | 0x3e21f980 | 0x2df0000 |
|
| 136 |
+
| 37 | blk.3.ffn_gate.weight | 0x4100f980 | 0x1f80000 |
|
| 137 |
+
| 38 | blk.3.ffn_norm.weight | 0x42f8f980 | 0x4000 |
|
| 138 |
+
| 39 | blk.3.ffn_up.weight | 0x42f93980 | 0x1f80000 |
|
| 139 |
+
| 40 | blk.4.attn_k.weight | 0x44f13980 | 0x240000 |
|
| 140 |
+
| 41 | blk.4.attn_norm.weight | 0x45153980 | 0x4000 |
|
| 141 |
+
| 42 | blk.4.attn_output.weight | 0x45157980 | 0xb00000 |
|
| 142 |
+
| 43 | blk.4.attn_q.weight | 0x45c57980 | 0x900000 |
|
| 143 |
+
| 44 | blk.4.attn_v.weight | 0x46557980 | 0x2c0000 |
|
| 144 |
+
| 45 | blk.4.ffn_down.weight | 0x46817980 | 0x2df0000 |
|
| 145 |
+
| 46 | blk.4.ffn_gate.weight | 0x49607980 | 0x1f80000 |
|
| 146 |
+
| 47 | blk.4.ffn_norm.weight | 0x4b587980 | 0x4000 |
|
| 147 |
+
| 48 | blk.4.ffn_up.weight | 0x4b58b980 | 0x1f80000 |
|
| 148 |
+
| 49 | blk.5.attn_k.weight | 0x4d50b980 | 0x240000 |
|
| 149 |
+
| 50 | blk.5.attn_norm.weight | 0x4d74b980 | 0x4000 |
|
| 150 |
+
| 51 | blk.5.attn_output.weight | 0x4d74f980 | 0xb00000 |
|
| 151 |
+
| 52 | blk.5.attn_q.weight | 0x4e24f980 | 0x900000 |
|
| 152 |
+
| 53 | blk.5.attn_v.weight | 0x4eb4f980 | 0x2c0000 |
|
| 153 |
+
| 54 | blk.5.ffn_down.weight | 0x4ee0f980 | 0x2df0000 |
|
| 154 |
+
| 55 | blk.5.ffn_gate.weight | 0x51bff980 | 0x1f80000 |
|
| 155 |
+
| 56 | blk.5.ffn_norm.weight | 0x53b7f980 | 0x4000 |
|
| 156 |
+
| 57 | blk.5.ffn_up.weight | 0x53b83980 | 0x1f80000 |
|
| 157 |
+
| 58 | blk.6.attn_k.weight | 0x55b03980 | 0x240000 |
|
| 158 |
+
| 59 | blk.6.attn_norm.weight | 0x55d43980 | 0x4000 |
|
| 159 |
+
| 60 | blk.6.attn_output.weight | 0x55d47980 | 0xb00000 |
|
| 160 |
+
| 61 | blk.6.attn_q.weight | 0x56847980 | 0x900000 |
|
| 161 |
+
| 62 | blk.6.attn_v.weight | 0x57147980 | 0x2c0000 |
|
| 162 |
+
| 63 | blk.6.ffn_down.weight | 0x57407980 | 0x2df0000 |
|
| 163 |
+
| 64 | blk.6.ffn_gate.weight | 0x5a1f7980 | 0x1f80000 |
|
| 164 |
+
| 65 | blk.6.ffn_norm.weight | 0x5c177980 | 0x4000 |
|
| 165 |
+
| 66 | blk.6.ffn_up.weight | 0x5c17b980 | 0x1f80000 |
|
| 166 |
+
| 67 | blk.7.attn_k.weight | 0x5e0fb980 | 0x240000 |
|
| 167 |
+
| 68 | blk.7.attn_norm.weight | 0x5e33b980 | 0x4000 |
|
| 168 |
+
| 69 | blk.7.attn_output.weight | 0x5e33f980 | 0xb00000 |
|
| 169 |
+
| 70 | blk.7.attn_q.weight | 0x5ee3f980 | 0x900000 |
|
| 170 |
+
| 71 | blk.7.attn_v.weight | 0x5f73f980 | 0x2c0000 |
|
| 171 |
+
| 72 | blk.7.ffn_down.weight | 0x5f9ff980 | 0x2df0000 |
|
| 172 |
+
| 73 | blk.7.ffn_gate.weight | 0x627ef980 | 0x1f80000 |
|
| 173 |
+
| 74 | blk.7.ffn_norm.weight | 0x6476f980 | 0x4000 |
|
| 174 |
+
| 75 | blk.7.ffn_up.weight | 0x64773980 | 0x1f80000 |
|
| 175 |
+
| 76 | blk.8.attn_k.weight | 0x666f3980 | 0x240000 |
|
| 176 |
+
| 77 | blk.8.attn_norm.weight | 0x66933980 | 0x4000 |
|
| 177 |
+
| 78 | blk.8.attn_output.weight | 0x66937980 | 0xb00000 |
|
| 178 |
+
| 79 | blk.8.attn_q.weight | 0x67437980 | 0x900000 |
|
| 179 |
+
| 80 | blk.8.attn_v.weight | 0x67d37980 | 0x2c0000 |
|
| 180 |
+
| 81 | blk.8.ffn_down.weight | 0x67ff7980 | 0x2df0000 |
|
| 181 |
+
| 82 | blk.8.ffn_gate.weight | 0x6ade7980 | 0x1f80000 |
|
| 182 |
+
| 83 | blk.8.ffn_norm.weight | 0x6cd67980 | 0x4000 |
|
| 183 |
+
| 84 | blk.8.ffn_up.weight | 0x6cd6b980 | 0x1f80000 |
|
| 184 |
+
| 85 | blk.9.attn_k.weight | 0x6eceb980 | 0x240000 |
|
| 185 |
+
| 86 | blk.9.attn_norm.weight | 0x6ef2b980 | 0x4000 |
|
| 186 |
+
| 87 | blk.9.attn_output.weight | 0x6ef2f980 | 0xb00000 |
|
| 187 |
+
| 88 | blk.9.attn_q.weight | 0x6fa2f980 | 0x900000 |
|
| 188 |
+
| 89 | blk.9.attn_v.weight | 0x7032f980 | 0x2c0000 |
|
| 189 |
+
| 90 | blk.9.ffn_down.weight | 0x705ef980 | 0x2df0000 |
|
| 190 |
+
| 91 | blk.9.ffn_gate.weight | 0x733df980 | 0x1f80000 |
|
| 191 |
+
| 92 | blk.9.ffn_norm.weight | 0x7535f980 | 0x4000 |
|
| 192 |
+
| 93 | blk.9.ffn_up.weight | 0x75363980 | 0x1f80000 |
|
| 193 |
+
| 94 | blk.10.attn_k.weight | 0x772e3980 | 0x240000 |
|
| 194 |
+
| 95 | blk.10.attn_norm.weight | 0x77523980 | 0x4000 |
|
| 195 |
+
| 96 | blk.10.attn_output.weight | 0x77527980 | 0xb00000 |
|
| 196 |
+
| 97 | blk.10.attn_q.weight | 0x78027980 | 0x900000 |
|
| 197 |
+
| 98 | blk.10.attn_v.weight | 0x78927980 | 0x2c0000 |
|
| 198 |
+
| 99 | blk.10.ffn_down.weight | 0x78be7980 | 0x2df0000 |
|
| 199 |
+
| 100 | blk.10.ffn_gate.weight | 0x7b9d7980 | 0x1f80000 |
|
| 200 |
+
| 101 | blk.10.ffn_norm.weight | 0x7d957980 | 0x4000 |
|
| 201 |
+
| 102 | blk.10.ffn_up.weight | 0x7d95b980 | 0x1f80000 |
|
| 202 |
+
| 103 | blk.11.attn_k.weight | 0x7f8db980 | 0x240000 |
|
| 203 |
+
| 104 | blk.11.attn_norm.weight | 0x7fb1b980 | 0x4000 |
|
| 204 |
+
| 105 | blk.11.attn_output.weight | 0x7fb1f980 | 0xb00000 |
|
| 205 |
+
| 106 | blk.11.attn_q.weight | 0x8061f980 | 0x900000 |
|
| 206 |
+
| 107 | blk.11.attn_v.weight | 0x80f1f980 | 0x2c0000 |
|
| 207 |
+
| 108 | blk.11.ffn_down.weight | 0x811df980 | 0x2df0000 |
|
| 208 |
+
| 109 | blk.11.ffn_gate.weight | 0x83fcf980 | 0x1f80000 |
|
| 209 |
+
| 110 | blk.11.ffn_norm.weight | 0x85f4f980 | 0x4000 |
|
| 210 |
+
| 111 | blk.11.ffn_up.weight | 0x85f53980 | 0x1f80000 |
|
| 211 |
+
| 112 | blk.12.attn_k.weight | 0x87ed3980 | 0x240000 |
|
| 212 |
+
| 113 | blk.12.attn_norm.weight | 0x88113980 | 0x4000 |
|
| 213 |
+
| 114 | blk.12.attn_output.weight | 0x88117980 | 0xb00000 |
|
| 214 |
+
| 115 | blk.12.attn_q.weight | 0x88c17980 | 0x900000 |
|
| 215 |
+
| 116 | blk.12.attn_v.weight | 0x89517980 | 0x2c0000 |
|
| 216 |
+
| 117 | blk.12.ffn_down.weight | 0x897d7980 | 0x2df0000 |
|
| 217 |
+
| 118 | blk.12.ffn_gate.weight | 0x8c5c7980 | 0x1f80000 |
|
| 218 |
+
| 119 | blk.12.ffn_norm.weight | 0x8e547980 | 0x4000 |
|
| 219 |
+
| 120 | blk.12.ffn_up.weight | 0x8e54b980 | 0x1f80000 |
|
| 220 |
+
| 121 | blk.13.attn_k.weight | 0x904cb980 | 0x2c0000 |
|
| 221 |
+
| 122 | blk.13.attn_norm.weight | 0x9078b980 | 0x4000 |
|
| 222 |
+
| 123 | blk.13.attn_output.weight | 0x9078f980 | 0xb00000 |
|
| 223 |
+
| 124 | blk.13.attn_q.weight | 0x9128f980 | 0xb00000 |
|
| 224 |
+
| 125 | blk.13.attn_v.weight | 0x91d8f980 | 0x348000 |
|
| 225 |
+
| 126 | blk.13.ffn_down.weight | 0x920d7980 | 0x2df0000 |
|
| 226 |
+
| 127 | blk.13.ffn_gate.weight | 0x94ec7980 | 0x1f80000 |
|
| 227 |
+
| 128 | blk.13.ffn_norm.weight | 0x96e47980 | 0x4000 |
|
| 228 |
+
| 129 | blk.13.ffn_up.weight | 0x96e4b980 | 0x1f80000 |
|
| 229 |
+
| 130 | blk.14.attn_k.weight | 0x98dcb980 | 0x2c0000 |
|
| 230 |
+
| 131 | blk.14.attn_norm.weight | 0x9908b980 | 0x4000 |
|
| 231 |
+
| 132 | blk.14.attn_output.weight | 0x9908f980 | 0xb00000 |
|
| 232 |
+
| 133 | blk.14.attn_q.weight | 0x99b8f980 | 0xb00000 |
|
| 233 |
+
| 134 | blk.14.attn_v.weight | 0x9a68f980 | 0x348000 |
|
| 234 |
+
| 135 | blk.14.ffn_down.weight | 0x9a9d7980 | 0x2df0000 |
|
| 235 |
+
| 136 | blk.14.ffn_gate.weight | 0x9d7c7980 | 0x1f80000 |
|
| 236 |
+
| 137 | blk.14.ffn_norm.weight | 0x9f747980 | 0x4000 |
|
| 237 |
+
| 138 | blk.14.ffn_up.weight | 0x9f74b980 | 0x1f80000 |
|
| 238 |
+
| 139 | blk.15.attn_k.weight | 0xa16cb980 | 0x240000 |
|
| 239 |
+
| 140 | blk.15.attn_norm.weight | 0xa190b980 | 0x4000 |
|
| 240 |
+
| 141 | blk.15.attn_output.weight | 0xa190f980 | 0xb00000 |
|
| 241 |
+
| 142 | blk.15.attn_q.weight | 0xa240f980 | 0x900000 |
|
| 242 |
+
| 143 | blk.15.attn_v.weight | 0xa2d0f980 | 0x2c0000 |
|
| 243 |
+
| 144 | blk.15.ffn_down.weight | 0xa2fcf980 | 0x2df0000 |
|
| 244 |
+
| 145 | blk.15.ffn_gate.weight | 0xa5dbf980 | 0x1f80000 |
|
| 245 |
+
| 146 | blk.15.ffn_norm.weight | 0xa7d3f980 | 0x4000 |
|
| 246 |
+
| 147 | blk.15.ffn_up.weight | 0xa7d43980 | 0x1f80000 |
|
| 247 |
+
| 148 | blk.16.attn_k.weight | 0xa9cc3980 | 0x2c0000 |
|
| 248 |
+
| 149 | blk.16.attn_norm.weight | 0xa9f83980 | 0x4000 |
|
| 249 |
+
| 150 | blk.16.attn_output.weight | 0xa9f87980 | 0xb00000 |
|
| 250 |
+
| 151 | blk.16.attn_q.weight | 0xaaa87980 | 0xb00000 |
|
| 251 |
+
| 152 | blk.16.attn_v.weight | 0xab587980 | 0x348000 |
|
| 252 |
+
| 153 | blk.16.ffn_down.weight | 0xab8cf980 | 0x2df0000 |
|
| 253 |
+
| 154 | blk.16.ffn_gate.weight | 0xae6bf980 | 0x2680000 |
|
| 254 |
+
| 155 | blk.16.ffn_norm.weight | 0xb0d3f980 | 0x4000 |
|
| 255 |
+
| 156 | blk.16.ffn_up.weight | 0xb0d43980 | 0x2680000 |
|
| 256 |
+
| 157 | blk.17.attn_k.weight | 0xb33c3980 | 0x240000 |
|
| 257 |
+
| 158 | blk.17.attn_norm.weight | 0xb3603980 | 0x4000 |
|
| 258 |
+
| 159 | blk.17.attn_output.weight | 0xb3607980 | 0xb00000 |
|
| 259 |
+
| 160 | blk.17.attn_q.weight | 0xb4107980 | 0x900000 |
|
| 260 |
+
| 161 | blk.17.attn_v.weight | 0xb4a07980 | 0x2c0000 |
|
| 261 |
+
| 162 | blk.17.ffn_down.weight | 0xb4cc7980 | 0x2df0000 |
|
| 262 |
+
| 163 | blk.17.ffn_gate.weight | 0xb7ab7980 | 0x2680000 |
|
| 263 |
+
| 164 | blk.17.ffn_norm.weight | 0xba137980 | 0x4000 |
|
| 264 |
+
| 165 | blk.17.ffn_up.weight | 0xba13b980 | 0x2680000 |
|
| 265 |
+
| 166 | blk.18.attn_k.weight | 0xbc7bb980 | 0x2c0000 |
|
| 266 |
+
| 167 | blk.18.attn_norm.weight | 0xbca7b980 | 0x4000 |
|
| 267 |
+
| 168 | blk.18.attn_output.weight | 0xbca7f980 | 0xb00000 |
|
| 268 |
+
| 169 | blk.18.attn_q.weight | 0xbd57f980 | 0xb00000 |
|
| 269 |
+
| 170 | blk.18.attn_v.weight | 0xbe07f980 | 0x348000 |
|
| 270 |
+
| 171 | blk.18.ffn_down.weight | 0xbe3c7980 | 0x2df0000 |
|
| 271 |
+
| 172 | blk.18.ffn_gate.weight | 0xc11b7980 | 0x2680000 |
|
| 272 |
+
| 173 | blk.18.ffn_norm.weight | 0xc3837980 | 0x4000 |
|
| 273 |
+
| 174 | blk.18.ffn_up.weight | 0xc383b980 | 0x2680000 |
|
| 274 |
+
| 175 | blk.19.attn_k.weight | 0xc5ebb980 | 0x2c0000 |
|
| 275 |
+
| 176 | blk.19.attn_norm.weight | 0xc617b980 | 0x4000 |
|
| 276 |
+
| 177 | blk.19.attn_output.weight | 0xc617f980 | 0xb00000 |
|
| 277 |
+
| 178 | blk.19.attn_q.weight | 0xc6c7f980 | 0xb00000 |
|
| 278 |
+
| 179 | blk.19.attn_v.weight | 0xc777f980 | 0x348000 |
|
| 279 |
+
| 180 | blk.19.ffn_down.weight | 0xc7ac7980 | 0x2df0000 |
|
| 280 |
+
| 181 | blk.19.ffn_gate.weight | 0xca8b7980 | 0x2680000 |
|
| 281 |
+
| 182 | blk.19.ffn_norm.weight | 0xccf37980 | 0x4000 |
|
| 282 |
+
| 183 | blk.19.ffn_up.weight | 0xccf3b980 | 0x2680000 |
|
| 283 |
+
| 184 | blk.20.attn_k.weight | 0xcf5bb980 | 0x2c0000 |
|
| 284 |
+
| 185 | blk.20.attn_norm.weight | 0xcf87b980 | 0x4000 |
|
| 285 |
+
| 186 | blk.20.attn_output.weight | 0xcf87f980 | 0xb00000 |
|
| 286 |
+
| 187 | blk.20.attn_q.weight | 0xd037f980 | 0xb00000 |
|
| 287 |
+
| 188 | blk.20.attn_v.weight | 0xd0e7f980 | 0x348000 |
|
| 288 |
+
| 189 | blk.20.ffn_down.weight | 0xd11c7980 | 0x2df0000 |
|
| 289 |
+
| 190 | blk.20.ffn_gate.weight | 0xd3fb7980 | 0x2680000 |
|
| 290 |
+
| 191 | blk.20.ffn_norm.weight | 0xd6637980 | 0x4000 |
|
| 291 |
+
| 192 | blk.20.ffn_up.weight | 0xd663b980 | 0x2680000 |
|
| 292 |
+
| 193 | blk.21.attn_k.weight | 0xd8cbb980 | 0x2c0000 |
|
| 293 |
+
| 194 | blk.21.attn_norm.weight | 0xd8f7b980 | 0x4000 |
|
| 294 |
+
| 195 | blk.21.attn_output.weight | 0xd8f7f980 | 0xb00000 |
|
| 295 |
+
| 196 | blk.21.attn_q.weight | 0xd9a7f980 | 0xb00000 |
|
| 296 |
+
| 197 | blk.21.attn_v.weight | 0xda57f980 | 0x348000 |
|
| 297 |
+
| 198 | blk.21.ffn_down.weight | 0xda8c7980 | 0x2df0000 |
|
| 298 |
+
| 199 | blk.21.ffn_gate.weight | 0xdd6b7980 | 0x2680000 |
|
| 299 |
+
| 200 | blk.21.ffn_norm.weight | 0xdfd37980 | 0x4000 |
|
| 300 |
+
| 201 | blk.21.ffn_up.weight | 0xdfd3b980 | 0x2680000 |
|
| 301 |
+
| 202 | blk.22.attn_k.weight | 0xe23bb980 | 0x2c0000 |
|
| 302 |
+
| 203 | blk.22.attn_norm.weight | 0xe267b980 | 0x4000 |
|
| 303 |
+
| 204 | blk.22.attn_output.weight | 0xe267f980 | 0xb00000 |
|
| 304 |
+
| 205 | blk.22.attn_q.weight | 0xe317f980 | 0xb00000 |
|
| 305 |
+
| 206 | blk.22.attn_v.weight | 0xe3c7f980 | 0x348000 |
|
| 306 |
+
| 207 | blk.22.ffn_down.weight | 0xe3fc7980 | 0x2df0000 |
|
| 307 |
+
| 208 | blk.22.ffn_gate.weight | 0xe6db7980 | 0x2680000 |
|
| 308 |
+
| 209 | blk.22.ffn_norm.weight | 0xe9437980 | 0x4000 |
|
| 309 |
+
| 210 | blk.22.ffn_up.weight | 0xe943b980 | 0x2680000 |
|
| 310 |
+
| 211 | blk.23.attn_k.weight | 0xebabb980 | 0x2c0000 |
|
| 311 |
+
| 212 | blk.23.attn_norm.weight | 0xebd7b980 | 0x4000 |
|
| 312 |
+
| 213 | blk.23.attn_output.weight | 0xebd7f980 | 0xb00000 |
|
| 313 |
+
| 214 | blk.23.attn_q.weight | 0xec87f980 | 0xb00000 |
|
| 314 |
+
| 215 | blk.23.attn_v.weight | 0xed37f980 | 0x348000 |
|
| 315 |
+
| 216 | blk.23.ffn_down.weight | 0xed6c7980 | 0x2df0000 |
|
| 316 |
+
| 217 | blk.23.ffn_gate.weight | 0xf04b7980 | 0x2680000 |
|
| 317 |
+
| 218 | blk.23.ffn_norm.weight | 0xf2b37980 | 0x4000 |
|
| 318 |
+
| 219 | blk.23.ffn_up.weight | 0xf2b3b980 | 0x2680000 |
|
| 319 |
+
| 220 | blk.24.attn_k.weight | 0xf51bb980 | 0x2c0000 |
|
| 320 |
+
| 221 | blk.24.attn_norm.weight | 0xf547b980 | 0x4000 |
|
| 321 |
+
| 222 | blk.24.attn_output.weight | 0xf547f980 | 0xb00000 |
|
| 322 |
+
| 223 | blk.24.attn_q.weight | 0xf5f7f980 | 0xb00000 |
|
| 323 |
+
| 224 | blk.24.attn_v.weight | 0xf6a7f980 | 0x348000 |
|
| 324 |
+
| 225 | blk.24.ffn_down.weight | 0xf6dc7980 | 0x2df0000 |
|
| 325 |
+
| 226 | blk.24.ffn_gate.weight | 0xf9bb7980 | 0x2680000 |
|
| 326 |
+
| 227 | blk.24.ffn_norm.weight | 0xfc237980 | 0x4000 |
|
| 327 |
+
| 228 | blk.24.ffn_up.weight | 0xfc23b980 | 0x2680000 |
|
| 328 |
+
| 229 | blk.25.attn_k.weight | 0xfe8bb980 | 0x2c0000 |
|
| 329 |
+
| 230 | blk.25.attn_norm.weight | 0xfeb7b980 | 0x4000 |
|
| 330 |
+
| 231 | blk.25.attn_output.weight | 0xfeb7f980 | 0xb00000 |
|
| 331 |
+
| 232 | blk.25.attn_q.weight | 0xff67f980 | 0xb00000 |
|
| 332 |
+
| 233 | blk.25.attn_v.weight | 0x10017f980 | 0x348000 |
|
| 333 |
+
| 234 | blk.25.ffn_down.weight | 0x1004c7980 | 0x2df0000 |
|
| 334 |
+
| 235 | blk.25.ffn_gate.weight | 0x1032b7980 | 0x2680000 |
|
| 335 |
+
| 236 | blk.25.ffn_norm.weight | 0x105937980 | 0x4000 |
|
| 336 |
+
| 237 | blk.25.ffn_up.weight | 0x10593b980 | 0x2680000 |
|
| 337 |
+
| 238 | blk.26.attn_k.weight | 0x107fbb980 | 0x2c0000 |
|
| 338 |
+
| 239 | blk.26.attn_norm.weight | 0x10827b980 | 0x4000 |
|
| 339 |
+
| 240 | blk.26.attn_output.weight | 0x10827f980 | 0xb00000 |
|
| 340 |
+
| 241 | blk.26.attn_q.weight | 0x108d7f980 | 0xb00000 |
|
| 341 |
+
| 242 | blk.26.attn_v.weight | 0x10987f980 | 0x348000 |
|
| 342 |
+
| 243 | blk.26.ffn_down.weight | 0x109bc7980 | 0x2df0000 |
|
| 343 |
+
| 244 | blk.26.ffn_gate.weight | 0x10c9b7980 | 0x2680000 |
|
| 344 |
+
| 245 | blk.26.ffn_norm.weight | 0x10f037980 | 0x4000 |
|
| 345 |
+
| 246 | blk.26.ffn_up.weight | 0x10f03b980 | 0x2680000 |
|
| 346 |
+
| 247 | blk.27.attn_k.weight | 0x1116bb980 | 0x2c0000 |
|
| 347 |
+
| 248 | blk.27.attn_norm.weight | 0x11197b980 | 0x4000 |
|
| 348 |
+
| 249 | blk.27.attn_output.weight | 0x11197f980 | 0xb00000 |
|
| 349 |
+
| 250 | blk.27.attn_q.weight | 0x11247f980 | 0xb00000 |
|
| 350 |
+
| 251 | blk.27.attn_v.weight | 0x112f7f980 | 0x348000 |
|
| 351 |
+
| 252 | blk.27.ffn_down.weight | 0x1132c7980 | 0x2df0000 |
|
| 352 |
+
| 253 | blk.27.ffn_gate.weight | 0x1160b7980 | 0x2680000 |
|
| 353 |
+
| 254 | blk.27.ffn_norm.weight | 0x118737980 | 0x4000 |
|
| 354 |
+
| 255 | blk.27.ffn_up.weight | 0x11873b980 | 0x2680000 |
|
| 355 |
+
| 256 | blk.28.attn_k.weight | 0x11adbb980 | 0x2c0000 |
|
| 356 |
+
| 257 | blk.28.attn_norm.weight | 0x11b07b980 | 0x4000 |
|
| 357 |
+
| 258 | blk.28.attn_output.weight | 0x11b07f980 | 0xb00000 |
|
| 358 |
+
| 259 | blk.28.attn_q.weight | 0x11bb7f980 | 0xb00000 |
|
| 359 |
+
| 260 | blk.28.attn_v.weight | 0x11c67f980 | 0x348000 |
|
| 360 |
+
| 261 | blk.28.ffn_down.weight | 0x11c9c7980 | 0x2df0000 |
|
| 361 |
+
| 262 | blk.28.ffn_gate.weight | 0x11f7b7980 | 0x2680000 |
|
| 362 |
+
| 263 | blk.28.ffn_norm.weight | 0x121e37980 | 0x4000 |
|
| 363 |
+
| 264 | blk.28.ffn_up.weight | 0x121e3b980 | 0x2680000 |
|
| 364 |
+
| 265 | blk.29.attn_k.weight | 0x1244bb980 | 0x2c0000 |
|
| 365 |
+
| 266 | blk.29.attn_norm.weight | 0x12477b980 | 0x4000 |
|
| 366 |
+
| 267 | blk.29.attn_output.weight | 0x12477f980 | 0xb00000 |
|
| 367 |
+
| 268 | blk.29.attn_q.weight | 0x12527f980 | 0xb00000 |
|
| 368 |
+
| 269 | blk.29.attn_v.weight | 0x125d7f980 | 0x348000 |
|
| 369 |
+
| 270 | blk.29.ffn_down.weight | 0x1260c7980 | 0x2df0000 |
|
| 370 |
+
| 271 | blk.29.ffn_gate.weight | 0x128eb7980 | 0x2680000 |
|
| 371 |
+
| 272 | blk.29.ffn_norm.weight | 0x12b537980 | 0x4000 |
|
| 372 |
+
| 273 | blk.29.ffn_up.weight | 0x12b53b980 | 0x2680000 |
|
| 373 |
+
| 274 | blk.30.attn_k.weight | 0x12dbbb980 | 0x2c0000 |
|
| 374 |
+
| 275 | blk.30.attn_norm.weight | 0x12de7b980 | 0x4000 |
|
| 375 |
+
| 276 | blk.30.attn_output.weight | 0x12de7f980 | 0xb00000 |
|
| 376 |
+
| 277 | blk.30.attn_q.weight | 0x12e97f980 | 0xb00000 |
|
| 377 |
+
| 278 | blk.30.attn_v.weight | 0x12f47f980 | 0x348000 |
|
| 378 |
+
| 279 | blk.30.ffn_down.weight | 0x12f7c7980 | 0x2df0000 |
|
| 379 |
+
| 280 | blk.30.ffn_gate.weight | 0x1325b7980 | 0x2680000 |
|
| 380 |
+
| 281 | blk.30.ffn_norm.weight | 0x134c37980 | 0x4000 |
|
| 381 |
+
| 282 | blk.30.ffn_up.weight | 0x134c3b980 | 0x2680000 |
|
| 382 |
+
| 283 | blk.31.attn_k.weight | 0x1372bb980 | 0x240000 |
|
| 383 |
+
| 284 | blk.31.attn_norm.weight | 0x1374fb980 | 0x4000 |
|
| 384 |
+
| 285 | blk.31.attn_output.weight | 0x1374ff980 | 0xb00000 |
|
| 385 |
+
| 286 | blk.31.attn_q.weight | 0x137fff980 | 0x900000 |
|
| 386 |
+
| 287 | blk.31.attn_v.weight | 0x1388ff980 | 0x2c0000 |
|
| 387 |
+
| 288 | blk.31.ffn_down.weight | 0x138bbf980 | 0x2df0000 |
|
| 388 |
+
| 289 | blk.31.ffn_gate.weight | 0x13b9af980 | 0x2680000 |
|
| 389 |
+
| 290 | blk.31.ffn_norm.weight | 0x13e02f980 | 0x4000 |
|
| 390 |
+
| 291 | blk.31.ffn_up.weight | 0x13e033980 | 0x2680000 |
|
| 391 |
+
|
| 392 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 393 |
+
|
| 394 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 395 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:-----|
|
| 396 |
+
| 0 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q5_K |
|
| 397 |
+
| 1 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 398 |
+
| 2 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 399 |
+
| 3 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q3_K |
|
| 400 |
+
|
| 401 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 402 |
+
- Percentage of total elements: 13.08%
|
| 403 |
+
|
| 404 |
+
|
| 405 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 406 |
+
|
| 407 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 408 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 409 |
+
| 4 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 410 |
+
| 5 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 411 |
+
| 6 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 412 |
+
| 7 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 413 |
+
| 8 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 414 |
+
| 9 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 415 |
+
| 10 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 416 |
+
| 11 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 417 |
+
| 12 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 418 |
+
|
| 419 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 420 |
+
- Percentage of total elements: 2.72%
|
| 421 |
+
|
| 422 |
+
|
| 423 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 424 |
+
|
| 425 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 426 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 427 |
+
| 13 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 428 |
+
| 14 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 429 |
+
| 15 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 430 |
+
| 16 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 431 |
+
| 17 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 432 |
+
| 18 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 433 |
+
| 19 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 434 |
+
| 20 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 435 |
+
| 21 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 436 |
+
|
| 437 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 438 |
+
- Percentage of total elements: 2.72%
|
| 439 |
+
|
| 440 |
+
|
| 441 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 442 |
+
|
| 443 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 444 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 445 |
+
| 22 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 446 |
+
| 23 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 447 |
+
| 24 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 448 |
+
| 25 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 449 |
+
| 26 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 450 |
+
| 27 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 451 |
+
| 28 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 452 |
+
| 29 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 453 |
+
| 30 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 454 |
+
|
| 455 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 456 |
+
- Percentage of total elements: 2.72%
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 460 |
+
|
| 461 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 462 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 463 |
+
| 31 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 464 |
+
| 32 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 465 |
+
| 33 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 466 |
+
| 34 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 467 |
+
| 35 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 468 |
+
| 36 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 469 |
+
| 37 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 470 |
+
| 38 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 471 |
+
| 39 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 472 |
+
|
| 473 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 474 |
+
- Percentage of total elements: 2.72%
|
| 475 |
+
|
| 476 |
+
|
| 477 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 478 |
+
|
| 479 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 480 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 481 |
+
| 40 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 482 |
+
| 41 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 483 |
+
| 42 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 484 |
+
| 43 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 485 |
+
| 44 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 486 |
+
| 45 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 487 |
+
| 46 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 488 |
+
| 47 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 489 |
+
| 48 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 490 |
+
|
| 491 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 492 |
+
- Percentage of total elements: 2.72%
|
| 493 |
+
|
| 494 |
+
|
| 495 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 496 |
+
|
| 497 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 498 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 499 |
+
| 49 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 500 |
+
| 50 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 501 |
+
| 51 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 502 |
+
| 52 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 503 |
+
| 53 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 504 |
+
| 54 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 505 |
+
| 55 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 506 |
+
| 56 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 507 |
+
| 57 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 508 |
+
|
| 509 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 510 |
+
- Percentage of total elements: 2.72%
|
| 511 |
+
|
| 512 |
+
|
| 513 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 514 |
+
|
| 515 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 516 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 517 |
+
| 58 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 518 |
+
| 59 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 519 |
+
| 60 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 520 |
+
| 61 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 521 |
+
| 62 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 522 |
+
| 63 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 523 |
+
| 64 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 524 |
+
| 65 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 525 |
+
| 66 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 526 |
+
|
| 527 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 528 |
+
- Percentage of total elements: 2.72%
|
| 529 |
+
|
| 530 |
+
|
| 531 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 532 |
+
|
| 533 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 534 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 535 |
+
| 67 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 536 |
+
| 68 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 537 |
+
| 69 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 538 |
+
| 70 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 539 |
+
| 71 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 540 |
+
| 72 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 541 |
+
| 73 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 542 |
+
| 74 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 543 |
+
| 75 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 544 |
+
|
| 545 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 546 |
+
- Percentage of total elements: 2.72%
|
| 547 |
+
|
| 548 |
+
|
| 549 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 550 |
+
|
| 551 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 552 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 553 |
+
| 76 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 554 |
+
| 77 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 555 |
+
| 78 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 556 |
+
| 79 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 557 |
+
| 80 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 558 |
+
| 81 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 559 |
+
| 82 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 560 |
+
| 83 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 561 |
+
| 84 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 562 |
+
|
| 563 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 564 |
+
- Percentage of total elements: 2.72%
|
| 565 |
+
|
| 566 |
+
|
| 567 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 568 |
+
|
| 569 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 570 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 571 |
+
| 85 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 572 |
+
| 86 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 573 |
+
| 87 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 574 |
+
| 88 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 575 |
+
| 89 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 576 |
+
| 90 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 577 |
+
| 91 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 578 |
+
| 92 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 579 |
+
| 93 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 580 |
+
|
| 581 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 582 |
+
- Percentage of total elements: 2.72%
|
| 583 |
+
|
| 584 |
+
|
| 585 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 586 |
+
|
| 587 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 588 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 589 |
+
| 94 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 590 |
+
| 95 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 591 |
+
| 96 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 592 |
+
| 97 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 593 |
+
| 98 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 594 |
+
| 99 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 595 |
+
| 100 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 596 |
+
| 101 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 597 |
+
| 102 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 598 |
+
|
| 599 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 600 |
+
- Percentage of total elements: 2.72%
|
| 601 |
+
|
| 602 |
+
|
| 603 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 604 |
+
|
| 605 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 606 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 607 |
+
| 103 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 608 |
+
| 104 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 609 |
+
| 105 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 610 |
+
| 106 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 611 |
+
| 107 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 612 |
+
| 108 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 613 |
+
| 109 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 614 |
+
| 110 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 615 |
+
| 111 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 616 |
+
|
| 617 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 618 |
+
- Percentage of total elements: 2.72%
|
| 619 |
+
|
| 620 |
+
|
| 621 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 622 |
+
|
| 623 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 624 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 625 |
+
| 112 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 626 |
+
| 113 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 627 |
+
| 114 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 628 |
+
| 115 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 629 |
+
| 116 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 630 |
+
| 117 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 631 |
+
| 118 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 632 |
+
| 119 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 633 |
+
| 120 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 634 |
+
|
| 635 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 636 |
+
- Percentage of total elements: 2.72%
|
| 637 |
+
|
| 638 |
+
|
| 639 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 640 |
+
|
| 641 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 642 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 643 |
+
| 121 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 644 |
+
| 122 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 645 |
+
| 123 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 646 |
+
| 124 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 647 |
+
| 125 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 648 |
+
| 126 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 649 |
+
| 127 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 650 |
+
| 128 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 651 |
+
| 129 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 652 |
+
|
| 653 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 654 |
+
- Percentage of total elements: 2.72%
|
| 655 |
+
|
| 656 |
+
|
| 657 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 658 |
+
|
| 659 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 660 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 661 |
+
| 130 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 662 |
+
| 131 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 663 |
+
| 132 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 664 |
+
| 133 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 665 |
+
| 134 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 666 |
+
| 135 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 667 |
+
| 136 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 668 |
+
| 137 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 669 |
+
| 138 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 670 |
+
|
| 671 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 672 |
+
- Percentage of total elements: 2.72%
|
| 673 |
+
|
| 674 |
+
|
| 675 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 676 |
+
|
| 677 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 678 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 679 |
+
| 139 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 680 |
+
| 140 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 681 |
+
| 141 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 682 |
+
| 142 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 683 |
+
| 143 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 684 |
+
| 144 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 685 |
+
| 145 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 686 |
+
| 146 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 687 |
+
| 147 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 688 |
+
|
| 689 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 690 |
+
- Percentage of total elements: 2.72%
|
| 691 |
+
|
| 692 |
+
|
| 693 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 694 |
+
|
| 695 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 696 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 697 |
+
| 148 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 698 |
+
| 149 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 699 |
+
| 150 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 700 |
+
| 151 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 701 |
+
| 152 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 702 |
+
| 153 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 703 |
+
| 154 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 704 |
+
| 155 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 705 |
+
| 156 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 706 |
+
|
| 707 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 708 |
+
- Percentage of total elements: 2.72%
|
| 709 |
+
|
| 710 |
+
|
| 711 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 712 |
+
|
| 713 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 714 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 715 |
+
| 157 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 716 |
+
| 158 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 717 |
+
| 159 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 718 |
+
| 160 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 719 |
+
| 161 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 720 |
+
| 162 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 721 |
+
| 163 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 722 |
+
| 164 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 723 |
+
| 165 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 724 |
+
|
| 725 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 726 |
+
- Percentage of total elements: 2.72%
|
| 727 |
+
|
| 728 |
+
|
| 729 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 730 |
+
|
| 731 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 732 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 733 |
+
| 166 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 734 |
+
| 167 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 735 |
+
| 168 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 736 |
+
| 169 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 737 |
+
| 170 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 738 |
+
| 171 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 739 |
+
| 172 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 740 |
+
| 173 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 741 |
+
| 174 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 742 |
+
|
| 743 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 744 |
+
- Percentage of total elements: 2.72%
|
| 745 |
+
|
| 746 |
+
|
| 747 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 748 |
+
|
| 749 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 750 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 751 |
+
| 175 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 752 |
+
| 176 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 177 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 754 |
+
| 178 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 755 |
+
| 179 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 756 |
+
| 180 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 757 |
+
| 181 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 758 |
+
| 182 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 759 |
+
| 183 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 760 |
+
|
| 761 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 762 |
+
- Percentage of total elements: 2.72%
|
| 763 |
+
|
| 764 |
+
|
| 765 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 766 |
+
|
| 767 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 768 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 769 |
+
| 184 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 770 |
+
| 185 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 771 |
+
| 186 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 772 |
+
| 187 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 773 |
+
| 188 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 774 |
+
| 189 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 775 |
+
| 190 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 776 |
+
| 191 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 777 |
+
| 192 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 778 |
+
|
| 779 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 780 |
+
- Percentage of total elements: 2.72%
|
| 781 |
+
|
| 782 |
+
|
| 783 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 784 |
+
|
| 785 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 786 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 787 |
+
| 193 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 788 |
+
| 194 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 789 |
+
| 195 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 790 |
+
| 196 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 791 |
+
| 197 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 792 |
+
| 198 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 793 |
+
| 199 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 794 |
+
| 200 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 795 |
+
| 201 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 796 |
+
|
| 797 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 798 |
+
- Percentage of total elements: 2.72%
|
| 799 |
+
|
| 800 |
+
|
| 801 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 802 |
+
|
| 803 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 804 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 805 |
+
| 202 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 806 |
+
| 203 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 807 |
+
| 204 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 808 |
+
| 205 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 809 |
+
| 206 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 810 |
+
| 207 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 811 |
+
| 208 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 812 |
+
| 209 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 813 |
+
| 210 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 814 |
+
|
| 815 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 816 |
+
- Percentage of total elements: 2.72%
|
| 817 |
+
|
| 818 |
+
|
| 819 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 820 |
+
|
| 821 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 822 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 823 |
+
| 211 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 824 |
+
| 212 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 825 |
+
| 213 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 826 |
+
| 214 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 827 |
+
| 215 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 828 |
+
| 216 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 829 |
+
| 217 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 830 |
+
| 218 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 831 |
+
| 219 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 832 |
+
|
| 833 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 834 |
+
- Percentage of total elements: 2.72%
|
| 835 |
+
|
| 836 |
+
|
| 837 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 838 |
+
|
| 839 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 840 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 841 |
+
| 220 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 842 |
+
| 221 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 843 |
+
| 222 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 844 |
+
| 223 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 845 |
+
| 224 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 846 |
+
| 225 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 847 |
+
| 226 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 848 |
+
| 227 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 849 |
+
| 228 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 850 |
+
|
| 851 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 852 |
+
- Percentage of total elements: 2.72%
|
| 853 |
+
|
| 854 |
+
|
| 855 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 856 |
+
|
| 857 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 858 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 859 |
+
| 229 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 860 |
+
| 230 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 861 |
+
| 231 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 862 |
+
| 232 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 863 |
+
| 233 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 864 |
+
| 234 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 865 |
+
| 235 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 866 |
+
| 236 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 867 |
+
| 237 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 868 |
+
|
| 869 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 870 |
+
- Percentage of total elements: 2.72%
|
| 871 |
+
|
| 872 |
+
|
| 873 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 874 |
+
|
| 875 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 876 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 877 |
+
| 238 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 878 |
+
| 239 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 879 |
+
| 240 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 880 |
+
| 241 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 881 |
+
| 242 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 882 |
+
| 243 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 883 |
+
| 244 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 884 |
+
| 245 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 885 |
+
| 246 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 886 |
+
|
| 887 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 888 |
+
- Percentage of total elements: 2.72%
|
| 889 |
+
|
| 890 |
+
|
| 891 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 892 |
+
|
| 893 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 894 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 895 |
+
| 247 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 896 |
+
| 248 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 897 |
+
| 249 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 898 |
+
| 250 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 899 |
+
| 251 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 900 |
+
| 252 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 901 |
+
| 253 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 902 |
+
| 254 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 903 |
+
| 255 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 904 |
+
|
| 905 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 906 |
+
- Percentage of total elements: 2.72%
|
| 907 |
+
|
| 908 |
+
|
| 909 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 910 |
+
|
| 911 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 912 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 913 |
+
| 256 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 914 |
+
| 257 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 915 |
+
| 258 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 916 |
+
| 259 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 917 |
+
| 260 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 918 |
+
| 261 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 919 |
+
| 262 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 920 |
+
| 263 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 921 |
+
| 264 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 922 |
+
|
| 923 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 924 |
+
- Percentage of total elements: 2.72%
|
| 925 |
+
|
| 926 |
+
|
| 927 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 928 |
+
|
| 929 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 930 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 931 |
+
| 265 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 932 |
+
| 266 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 933 |
+
| 267 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 934 |
+
| 268 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 935 |
+
| 269 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 936 |
+
| 270 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 937 |
+
| 271 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 938 |
+
| 272 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 939 |
+
| 273 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 940 |
+
|
| 941 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 942 |
+
- Percentage of total elements: 2.72%
|
| 943 |
+
|
| 944 |
+
|
| 945 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 946 |
+
|
| 947 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 948 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 949 |
+
| 274 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 950 |
+
| 275 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 951 |
+
| 276 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 952 |
+
| 277 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 953 |
+
| 278 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 954 |
+
| 279 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 955 |
+
| 280 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 956 |
+
| 281 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 957 |
+
| 282 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 958 |
+
|
| 959 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 960 |
+
- Percentage of total elements: 2.72%
|
| 961 |
+
|
| 962 |
+
|
| 963 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 964 |
+
|
| 965 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 966 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 967 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 968 |
+
| 284 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 969 |
+
| 285 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 970 |
+
| 286 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 971 |
+
| 287 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 972 |
+
| 288 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 973 |
+
| 289 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 974 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 975 |
+
| 291 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 976 |
+
|
| 977 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 978 |
+
- Percentage of total elements: 2.72%
|
scores/Llama-Guard-3-8B-Q5_K_S.md
ADDED
|
@@ -0,0 +1,978 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-Q5_K_S.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 39 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 36 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 36 |
+
| 26 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 38 |
+
| 28 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 39 |
+
| 29 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 40 |
+
| 30 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 41 |
+
| 31 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 43 |
+
| 33 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 44 |
+
| 34 | UINT32 | 1 | general.quantization_version | 2 |
|
| 45 |
+
| 35 | UINT32 | 1 | general.file_type | 16 |
|
| 46 |
+
| 36 | STRING | 1 | quantize.imatrix.file | `./imatrix/imatrix-Llama-Guard-3-8B-small.dat` |
|
| 47 |
+
| 37 | STRING | 1 | quantize.imatrix.dataset | `../../datasets/imatrix/calibration_eur_small.txt` |
|
| 48 |
+
| 38 | INT32 | 1 | quantize.imatrix.entries_count | 224 |
|
| 49 |
+
| 39 | INT32 | 1 | quantize.imatrix.chunks_count | 962 |
|
| 50 |
+
|
| 51 |
+
## Tensors Overview ~8B Elements
|
| 52 |
+
|
| 53 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 54 |
+
|
| 55 |
+
- [Llama-Guard-3-8B-Q5\_K\_S.gguf - GGUF Internal File Dump](#llama-guard-3-8b-q5_k_sgguf---gguf-internal-file-dump)
|
| 56 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 57 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 58 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 59 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 60 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 61 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 62 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 63 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 64 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 65 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 66 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 67 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 68 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 69 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 70 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 71 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 72 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 73 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 74 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 75 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 76 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 77 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 78 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 79 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 80 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 81 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 82 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 83 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 84 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 85 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 86 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 87 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 88 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 89 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 90 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 91 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 92 |
+
|
| 93 |
+
### Tensor Data Offset
|
| 94 |
+
|
| 95 |
+
This table contains the offset and data segment relative to start of file
|
| 96 |
+
|
| 97 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 98 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 99 |
+
| 0 | output.weight | 0x779880 | 0x15870000 |
|
| 100 |
+
| 1 | output_norm.weight | 0x15fe9880 | 0x4000 |
|
| 101 |
+
| 2 | rope_freqs.weight | 0x15fed880 | 0x100 |
|
| 102 |
+
| 3 | token_embd.weight | 0x15fed980 | 0xd746000 |
|
| 103 |
+
| 4 | blk.0.attn_k.weight | 0x23733980 | 0x240000 |
|
| 104 |
+
| 5 | blk.0.attn_norm.weight | 0x23973980 | 0x4000 |
|
| 105 |
+
| 6 | blk.0.attn_output.weight | 0x23977980 | 0xb00000 |
|
| 106 |
+
| 7 | blk.0.attn_q.weight | 0x24477980 | 0x900000 |
|
| 107 |
+
| 8 | blk.0.attn_v.weight | 0x24d77980 | 0x2c0000 |
|
| 108 |
+
| 9 | blk.0.ffn_down.weight | 0x25037980 | 0x2680000 |
|
| 109 |
+
| 10 | blk.0.ffn_gate.weight | 0x276b7980 | 0x1f80000 |
|
| 110 |
+
| 11 | blk.0.ffn_norm.weight | 0x29637980 | 0x4000 |
|
| 111 |
+
| 12 | blk.0.ffn_up.weight | 0x2963b980 | 0x1f80000 |
|
| 112 |
+
| 13 | blk.1.attn_k.weight | 0x2b5bb980 | 0x240000 |
|
| 113 |
+
| 14 | blk.1.attn_norm.weight | 0x2b7fb980 | 0x4000 |
|
| 114 |
+
| 15 | blk.1.attn_output.weight | 0x2b7ff980 | 0xb00000 |
|
| 115 |
+
| 16 | blk.1.attn_q.weight | 0x2c2ff980 | 0x900000 |
|
| 116 |
+
| 17 | blk.1.attn_v.weight | 0x2cbff980 | 0x2c0000 |
|
| 117 |
+
| 18 | blk.1.ffn_down.weight | 0x2cebf980 | 0x2df0000 |
|
| 118 |
+
| 19 | blk.1.ffn_gate.weight | 0x2fcaf980 | 0x1f80000 |
|
| 119 |
+
| 20 | blk.1.ffn_norm.weight | 0x31c2f980 | 0x4000 |
|
| 120 |
+
| 21 | blk.1.ffn_up.weight | 0x31c33980 | 0x1f80000 |
|
| 121 |
+
| 22 | blk.2.attn_k.weight | 0x33bb3980 | 0x240000 |
|
| 122 |
+
| 23 | blk.2.attn_norm.weight | 0x33df3980 | 0x4000 |
|
| 123 |
+
| 24 | blk.2.attn_output.weight | 0x33df7980 | 0xb00000 |
|
| 124 |
+
| 25 | blk.2.attn_q.weight | 0x348f7980 | 0x900000 |
|
| 125 |
+
| 26 | blk.2.attn_v.weight | 0x351f7980 | 0x2c0000 |
|
| 126 |
+
| 27 | blk.2.ffn_down.weight | 0x354b7980 | 0x2680000 |
|
| 127 |
+
| 28 | blk.2.ffn_gate.weight | 0x37b37980 | 0x1f80000 |
|
| 128 |
+
| 29 | blk.2.ffn_norm.weight | 0x39ab7980 | 0x4000 |
|
| 129 |
+
| 30 | blk.2.ffn_up.weight | 0x39abb980 | 0x1f80000 |
|
| 130 |
+
| 31 | blk.3.attn_k.weight | 0x3ba3b980 | 0x240000 |
|
| 131 |
+
| 32 | blk.3.attn_norm.weight | 0x3bc7b980 | 0x4000 |
|
| 132 |
+
| 33 | blk.3.attn_output.weight | 0x3bc7f980 | 0xb00000 |
|
| 133 |
+
| 34 | blk.3.attn_q.weight | 0x3c77f980 | 0x900000 |
|
| 134 |
+
| 35 | blk.3.attn_v.weight | 0x3d07f980 | 0x2c0000 |
|
| 135 |
+
| 36 | blk.3.ffn_down.weight | 0x3d33f980 | 0x2680000 |
|
| 136 |
+
| 37 | blk.3.ffn_gate.weight | 0x3f9bf980 | 0x1f80000 |
|
| 137 |
+
| 38 | blk.3.ffn_norm.weight | 0x4193f980 | 0x4000 |
|
| 138 |
+
| 39 | blk.3.ffn_up.weight | 0x41943980 | 0x1f80000 |
|
| 139 |
+
| 40 | blk.4.attn_k.weight | 0x438c3980 | 0x240000 |
|
| 140 |
+
| 41 | blk.4.attn_norm.weight | 0x43b03980 | 0x4000 |
|
| 141 |
+
| 42 | blk.4.attn_output.weight | 0x43b07980 | 0xb00000 |
|
| 142 |
+
| 43 | blk.4.attn_q.weight | 0x44607980 | 0x900000 |
|
| 143 |
+
| 44 | blk.4.attn_v.weight | 0x44f07980 | 0x2c0000 |
|
| 144 |
+
| 45 | blk.4.ffn_down.weight | 0x451c7980 | 0x2680000 |
|
| 145 |
+
| 46 | blk.4.ffn_gate.weight | 0x47847980 | 0x1f80000 |
|
| 146 |
+
| 47 | blk.4.ffn_norm.weight | 0x497c7980 | 0x4000 |
|
| 147 |
+
| 48 | blk.4.ffn_up.weight | 0x497cb980 | 0x1f80000 |
|
| 148 |
+
| 49 | blk.5.attn_k.weight | 0x4b74b980 | 0x240000 |
|
| 149 |
+
| 50 | blk.5.attn_norm.weight | 0x4b98b980 | 0x4000 |
|
| 150 |
+
| 51 | blk.5.attn_output.weight | 0x4b98f980 | 0xb00000 |
|
| 151 |
+
| 52 | blk.5.attn_q.weight | 0x4c48f980 | 0x900000 |
|
| 152 |
+
| 53 | blk.5.attn_v.weight | 0x4cd8f980 | 0x2c0000 |
|
| 153 |
+
| 54 | blk.5.ffn_down.weight | 0x4d04f980 | 0x2680000 |
|
| 154 |
+
| 55 | blk.5.ffn_gate.weight | 0x4f6cf980 | 0x1f80000 |
|
| 155 |
+
| 56 | blk.5.ffn_norm.weight | 0x5164f980 | 0x4000 |
|
| 156 |
+
| 57 | blk.5.ffn_up.weight | 0x51653980 | 0x1f80000 |
|
| 157 |
+
| 58 | blk.6.attn_k.weight | 0x535d3980 | 0x240000 |
|
| 158 |
+
| 59 | blk.6.attn_norm.weight | 0x53813980 | 0x4000 |
|
| 159 |
+
| 60 | blk.6.attn_output.weight | 0x53817980 | 0xb00000 |
|
| 160 |
+
| 61 | blk.6.attn_q.weight | 0x54317980 | 0x900000 |
|
| 161 |
+
| 62 | blk.6.attn_v.weight | 0x54c17980 | 0x2c0000 |
|
| 162 |
+
| 63 | blk.6.ffn_down.weight | 0x54ed7980 | 0x2680000 |
|
| 163 |
+
| 64 | blk.6.ffn_gate.weight | 0x57557980 | 0x1f80000 |
|
| 164 |
+
| 65 | blk.6.ffn_norm.weight | 0x594d7980 | 0x4000 |
|
| 165 |
+
| 66 | blk.6.ffn_up.weight | 0x594db980 | 0x1f80000 |
|
| 166 |
+
| 67 | blk.7.attn_k.weight | 0x5b45b980 | 0x240000 |
|
| 167 |
+
| 68 | blk.7.attn_norm.weight | 0x5b69b980 | 0x4000 |
|
| 168 |
+
| 69 | blk.7.attn_output.weight | 0x5b69f980 | 0xb00000 |
|
| 169 |
+
| 70 | blk.7.attn_q.weight | 0x5c19f980 | 0x900000 |
|
| 170 |
+
| 71 | blk.7.attn_v.weight | 0x5ca9f980 | 0x2c0000 |
|
| 171 |
+
| 72 | blk.7.ffn_down.weight | 0x5cd5f980 | 0x2680000 |
|
| 172 |
+
| 73 | blk.7.ffn_gate.weight | 0x5f3df980 | 0x1f80000 |
|
| 173 |
+
| 74 | blk.7.ffn_norm.weight | 0x6135f980 | 0x4000 |
|
| 174 |
+
| 75 | blk.7.ffn_up.weight | 0x61363980 | 0x1f80000 |
|
| 175 |
+
| 76 | blk.8.attn_k.weight | 0x632e3980 | 0x240000 |
|
| 176 |
+
| 77 | blk.8.attn_norm.weight | 0x63523980 | 0x4000 |
|
| 177 |
+
| 78 | blk.8.attn_output.weight | 0x63527980 | 0xb00000 |
|
| 178 |
+
| 79 | blk.8.attn_q.weight | 0x64027980 | 0x900000 |
|
| 179 |
+
| 80 | blk.8.attn_v.weight | 0x64927980 | 0x2c0000 |
|
| 180 |
+
| 81 | blk.8.ffn_down.weight | 0x64be7980 | 0x2680000 |
|
| 181 |
+
| 82 | blk.8.ffn_gate.weight | 0x67267980 | 0x1f80000 |
|
| 182 |
+
| 83 | blk.8.ffn_norm.weight | 0x691e7980 | 0x4000 |
|
| 183 |
+
| 84 | blk.8.ffn_up.weight | 0x691eb980 | 0x1f80000 |
|
| 184 |
+
| 85 | blk.9.attn_k.weight | 0x6b16b980 | 0x240000 |
|
| 185 |
+
| 86 | blk.9.attn_norm.weight | 0x6b3ab980 | 0x4000 |
|
| 186 |
+
| 87 | blk.9.attn_output.weight | 0x6b3af980 | 0xb00000 |
|
| 187 |
+
| 88 | blk.9.attn_q.weight | 0x6beaf980 | 0x900000 |
|
| 188 |
+
| 89 | blk.9.attn_v.weight | 0x6c7af980 | 0x2c0000 |
|
| 189 |
+
| 90 | blk.9.ffn_down.weight | 0x6ca6f980 | 0x2680000 |
|
| 190 |
+
| 91 | blk.9.ffn_gate.weight | 0x6f0ef980 | 0x1f80000 |
|
| 191 |
+
| 92 | blk.9.ffn_norm.weight | 0x7106f980 | 0x4000 |
|
| 192 |
+
| 93 | blk.9.ffn_up.weight | 0x71073980 | 0x1f80000 |
|
| 193 |
+
| 94 | blk.10.attn_k.weight | 0x72ff3980 | 0x240000 |
|
| 194 |
+
| 95 | blk.10.attn_norm.weight | 0x73233980 | 0x4000 |
|
| 195 |
+
| 96 | blk.10.attn_output.weight | 0x73237980 | 0xb00000 |
|
| 196 |
+
| 97 | blk.10.attn_q.weight | 0x73d37980 | 0x900000 |
|
| 197 |
+
| 98 | blk.10.attn_v.weight | 0x74637980 | 0x2c0000 |
|
| 198 |
+
| 99 | blk.10.ffn_down.weight | 0x748f7980 | 0x2680000 |
|
| 199 |
+
| 100 | blk.10.ffn_gate.weight | 0x76f77980 | 0x1f80000 |
|
| 200 |
+
| 101 | blk.10.ffn_norm.weight | 0x78ef7980 | 0x4000 |
|
| 201 |
+
| 102 | blk.10.ffn_up.weight | 0x78efb980 | 0x1f80000 |
|
| 202 |
+
| 103 | blk.11.attn_k.weight | 0x7ae7b980 | 0x240000 |
|
| 203 |
+
| 104 | blk.11.attn_norm.weight | 0x7b0bb980 | 0x4000 |
|
| 204 |
+
| 105 | blk.11.attn_output.weight | 0x7b0bf980 | 0xb00000 |
|
| 205 |
+
| 106 | blk.11.attn_q.weight | 0x7bbbf980 | 0x900000 |
|
| 206 |
+
| 107 | blk.11.attn_v.weight | 0x7c4bf980 | 0x2c0000 |
|
| 207 |
+
| 108 | blk.11.ffn_down.weight | 0x7c77f980 | 0x2680000 |
|
| 208 |
+
| 109 | blk.11.ffn_gate.weight | 0x7edff980 | 0x1f80000 |
|
| 209 |
+
| 110 | blk.11.ffn_norm.weight | 0x80d7f980 | 0x4000 |
|
| 210 |
+
| 111 | blk.11.ffn_up.weight | 0x80d83980 | 0x1f80000 |
|
| 211 |
+
| 112 | blk.12.attn_k.weight | 0x82d03980 | 0x240000 |
|
| 212 |
+
| 113 | blk.12.attn_norm.weight | 0x82f43980 | 0x4000 |
|
| 213 |
+
| 114 | blk.12.attn_output.weight | 0x82f47980 | 0xb00000 |
|
| 214 |
+
| 115 | blk.12.attn_q.weight | 0x83a47980 | 0x900000 |
|
| 215 |
+
| 116 | blk.12.attn_v.weight | 0x84347980 | 0x2c0000 |
|
| 216 |
+
| 117 | blk.12.ffn_down.weight | 0x84607980 | 0x2680000 |
|
| 217 |
+
| 118 | blk.12.ffn_gate.weight | 0x86c87980 | 0x1f80000 |
|
| 218 |
+
| 119 | blk.12.ffn_norm.weight | 0x88c07980 | 0x4000 |
|
| 219 |
+
| 120 | blk.12.ffn_up.weight | 0x88c0b980 | 0x1f80000 |
|
| 220 |
+
| 121 | blk.13.attn_k.weight | 0x8ab8b980 | 0x2c0000 |
|
| 221 |
+
| 122 | blk.13.attn_norm.weight | 0x8ae4b980 | 0x4000 |
|
| 222 |
+
| 123 | blk.13.attn_output.weight | 0x8ae4f980 | 0xb00000 |
|
| 223 |
+
| 124 | blk.13.attn_q.weight | 0x8b94f980 | 0xb00000 |
|
| 224 |
+
| 125 | blk.13.attn_v.weight | 0x8c44f980 | 0x2c0000 |
|
| 225 |
+
| 126 | blk.13.ffn_down.weight | 0x8c70f980 | 0x2680000 |
|
| 226 |
+
| 127 | blk.13.ffn_gate.weight | 0x8ed8f980 | 0x1f80000 |
|
| 227 |
+
| 128 | blk.13.ffn_norm.weight | 0x90d0f980 | 0x4000 |
|
| 228 |
+
| 129 | blk.13.ffn_up.weight | 0x90d13980 | 0x1f80000 |
|
| 229 |
+
| 130 | blk.14.attn_k.weight | 0x92c93980 | 0x2c0000 |
|
| 230 |
+
| 131 | blk.14.attn_norm.weight | 0x92f53980 | 0x4000 |
|
| 231 |
+
| 132 | blk.14.attn_output.weight | 0x92f57980 | 0xb00000 |
|
| 232 |
+
| 133 | blk.14.attn_q.weight | 0x93a57980 | 0xb00000 |
|
| 233 |
+
| 134 | blk.14.attn_v.weight | 0x94557980 | 0x2c0000 |
|
| 234 |
+
| 135 | blk.14.ffn_down.weight | 0x94817980 | 0x2680000 |
|
| 235 |
+
| 136 | blk.14.ffn_gate.weight | 0x96e97980 | 0x1f80000 |
|
| 236 |
+
| 137 | blk.14.ffn_norm.weight | 0x98e17980 | 0x4000 |
|
| 237 |
+
| 138 | blk.14.ffn_up.weight | 0x98e1b980 | 0x1f80000 |
|
| 238 |
+
| 139 | blk.15.attn_k.weight | 0x9ad9b980 | 0x240000 |
|
| 239 |
+
| 140 | blk.15.attn_norm.weight | 0x9afdb980 | 0x4000 |
|
| 240 |
+
| 141 | blk.15.attn_output.weight | 0x9afdf980 | 0xb00000 |
|
| 241 |
+
| 142 | blk.15.attn_q.weight | 0x9badf980 | 0x900000 |
|
| 242 |
+
| 143 | blk.15.attn_v.weight | 0x9c3df980 | 0x2c0000 |
|
| 243 |
+
| 144 | blk.15.ffn_down.weight | 0x9c69f980 | 0x2680000 |
|
| 244 |
+
| 145 | blk.15.ffn_gate.weight | 0x9ed1f980 | 0x1f80000 |
|
| 245 |
+
| 146 | blk.15.ffn_norm.weight | 0xa0c9f980 | 0x4000 |
|
| 246 |
+
| 147 | blk.15.ffn_up.weight | 0xa0ca3980 | 0x1f80000 |
|
| 247 |
+
| 148 | blk.16.attn_k.weight | 0xa2c23980 | 0x2c0000 |
|
| 248 |
+
| 149 | blk.16.attn_norm.weight | 0xa2ee3980 | 0x4000 |
|
| 249 |
+
| 150 | blk.16.attn_output.weight | 0xa2ee7980 | 0xb00000 |
|
| 250 |
+
| 151 | blk.16.attn_q.weight | 0xa39e7980 | 0xb00000 |
|
| 251 |
+
| 152 | blk.16.attn_v.weight | 0xa44e7980 | 0x2c0000 |
|
| 252 |
+
| 153 | blk.16.ffn_down.weight | 0xa47a7980 | 0x2680000 |
|
| 253 |
+
| 154 | blk.16.ffn_gate.weight | 0xa6e27980 | 0x2680000 |
|
| 254 |
+
| 155 | blk.16.ffn_norm.weight | 0xa94a7980 | 0x4000 |
|
| 255 |
+
| 156 | blk.16.ffn_up.weight | 0xa94ab980 | 0x2680000 |
|
| 256 |
+
| 157 | blk.17.attn_k.weight | 0xabb2b980 | 0x240000 |
|
| 257 |
+
| 158 | blk.17.attn_norm.weight | 0xabd6b980 | 0x4000 |
|
| 258 |
+
| 159 | blk.17.attn_output.weight | 0xabd6f980 | 0xb00000 |
|
| 259 |
+
| 160 | blk.17.attn_q.weight | 0xac86f980 | 0x900000 |
|
| 260 |
+
| 161 | blk.17.attn_v.weight | 0xad16f980 | 0x2c0000 |
|
| 261 |
+
| 162 | blk.17.ffn_down.weight | 0xad42f980 | 0x2df0000 |
|
| 262 |
+
| 163 | blk.17.ffn_gate.weight | 0xb021f980 | 0x2680000 |
|
| 263 |
+
| 164 | blk.17.ffn_norm.weight | 0xb289f980 | 0x4000 |
|
| 264 |
+
| 165 | blk.17.ffn_up.weight | 0xb28a3980 | 0x2680000 |
|
| 265 |
+
| 166 | blk.18.attn_k.weight | 0xb4f23980 | 0x2c0000 |
|
| 266 |
+
| 167 | blk.18.attn_norm.weight | 0xb51e3980 | 0x4000 |
|
| 267 |
+
| 168 | blk.18.attn_output.weight | 0xb51e7980 | 0xb00000 |
|
| 268 |
+
| 169 | blk.18.attn_q.weight | 0xb5ce7980 | 0xb00000 |
|
| 269 |
+
| 170 | blk.18.attn_v.weight | 0xb67e7980 | 0x2c0000 |
|
| 270 |
+
| 171 | blk.18.ffn_down.weight | 0xb6aa7980 | 0x2df0000 |
|
| 271 |
+
| 172 | blk.18.ffn_gate.weight | 0xb9897980 | 0x2680000 |
|
| 272 |
+
| 173 | blk.18.ffn_norm.weight | 0xbbf17980 | 0x4000 |
|
| 273 |
+
| 174 | blk.18.ffn_up.weight | 0xbbf1b980 | 0x2680000 |
|
| 274 |
+
| 175 | blk.19.attn_k.weight | 0xbe59b980 | 0x2c0000 |
|
| 275 |
+
| 176 | blk.19.attn_norm.weight | 0xbe85b980 | 0x4000 |
|
| 276 |
+
| 177 | blk.19.attn_output.weight | 0xbe85f980 | 0xb00000 |
|
| 277 |
+
| 178 | blk.19.attn_q.weight | 0xbf35f980 | 0xb00000 |
|
| 278 |
+
| 179 | blk.19.attn_v.weight | 0xbfe5f980 | 0x2c0000 |
|
| 279 |
+
| 180 | blk.19.ffn_down.weight | 0xc011f980 | 0x2df0000 |
|
| 280 |
+
| 181 | blk.19.ffn_gate.weight | 0xc2f0f980 | 0x2680000 |
|
| 281 |
+
| 182 | blk.19.ffn_norm.weight | 0xc558f980 | 0x4000 |
|
| 282 |
+
| 183 | blk.19.ffn_up.weight | 0xc5593980 | 0x2680000 |
|
| 283 |
+
| 184 | blk.20.attn_k.weight | 0xc7c13980 | 0x2c0000 |
|
| 284 |
+
| 185 | blk.20.attn_norm.weight | 0xc7ed3980 | 0x4000 |
|
| 285 |
+
| 186 | blk.20.attn_output.weight | 0xc7ed7980 | 0xb00000 |
|
| 286 |
+
| 187 | blk.20.attn_q.weight | 0xc89d7980 | 0xb00000 |
|
| 287 |
+
| 188 | blk.20.attn_v.weight | 0xc94d7980 | 0x2c0000 |
|
| 288 |
+
| 189 | blk.20.ffn_down.weight | 0xc9797980 | 0x2df0000 |
|
| 289 |
+
| 190 | blk.20.ffn_gate.weight | 0xcc587980 | 0x2680000 |
|
| 290 |
+
| 191 | blk.20.ffn_norm.weight | 0xcec07980 | 0x4000 |
|
| 291 |
+
| 192 | blk.20.ffn_up.weight | 0xcec0b980 | 0x2680000 |
|
| 292 |
+
| 193 | blk.21.attn_k.weight | 0xd128b980 | 0x2c0000 |
|
| 293 |
+
| 194 | blk.21.attn_norm.weight | 0xd154b980 | 0x4000 |
|
| 294 |
+
| 195 | blk.21.attn_output.weight | 0xd154f980 | 0xb00000 |
|
| 295 |
+
| 196 | blk.21.attn_q.weight | 0xd204f980 | 0xb00000 |
|
| 296 |
+
| 197 | blk.21.attn_v.weight | 0xd2b4f980 | 0x2c0000 |
|
| 297 |
+
| 198 | blk.21.ffn_down.weight | 0xd2e0f980 | 0x2df0000 |
|
| 298 |
+
| 199 | blk.21.ffn_gate.weight | 0xd5bff980 | 0x2680000 |
|
| 299 |
+
| 200 | blk.21.ffn_norm.weight | 0xd827f980 | 0x4000 |
|
| 300 |
+
| 201 | blk.21.ffn_up.weight | 0xd8283980 | 0x2680000 |
|
| 301 |
+
| 202 | blk.22.attn_k.weight | 0xda903980 | 0x2c0000 |
|
| 302 |
+
| 203 | blk.22.attn_norm.weight | 0xdabc3980 | 0x4000 |
|
| 303 |
+
| 204 | blk.22.attn_output.weight | 0xdabc7980 | 0xb00000 |
|
| 304 |
+
| 205 | blk.22.attn_q.weight | 0xdb6c7980 | 0xb00000 |
|
| 305 |
+
| 206 | blk.22.attn_v.weight | 0xdc1c7980 | 0x2c0000 |
|
| 306 |
+
| 207 | blk.22.ffn_down.weight | 0xdc487980 | 0x2df0000 |
|
| 307 |
+
| 208 | blk.22.ffn_gate.weight | 0xdf277980 | 0x2680000 |
|
| 308 |
+
| 209 | blk.22.ffn_norm.weight | 0xe18f7980 | 0x4000 |
|
| 309 |
+
| 210 | blk.22.ffn_up.weight | 0xe18fb980 | 0x2680000 |
|
| 310 |
+
| 211 | blk.23.attn_k.weight | 0xe3f7b980 | 0x2c0000 |
|
| 311 |
+
| 212 | blk.23.attn_norm.weight | 0xe423b980 | 0x4000 |
|
| 312 |
+
| 213 | blk.23.attn_output.weight | 0xe423f980 | 0xb00000 |
|
| 313 |
+
| 214 | blk.23.attn_q.weight | 0xe4d3f980 | 0xb00000 |
|
| 314 |
+
| 215 | blk.23.attn_v.weight | 0xe583f980 | 0x2c0000 |
|
| 315 |
+
| 216 | blk.23.ffn_down.weight | 0xe5aff980 | 0x2df0000 |
|
| 316 |
+
| 217 | blk.23.ffn_gate.weight | 0xe88ef980 | 0x2680000 |
|
| 317 |
+
| 218 | blk.23.ffn_norm.weight | 0xeaf6f980 | 0x4000 |
|
| 318 |
+
| 219 | blk.23.ffn_up.weight | 0xeaf73980 | 0x2680000 |
|
| 319 |
+
| 220 | blk.24.attn_k.weight | 0xed5f3980 | 0x2c0000 |
|
| 320 |
+
| 221 | blk.24.attn_norm.weight | 0xed8b3980 | 0x4000 |
|
| 321 |
+
| 222 | blk.24.attn_output.weight | 0xed8b7980 | 0xb00000 |
|
| 322 |
+
| 223 | blk.24.attn_q.weight | 0xee3b7980 | 0xb00000 |
|
| 323 |
+
| 224 | blk.24.attn_v.weight | 0xeeeb7980 | 0x2c0000 |
|
| 324 |
+
| 225 | blk.24.ffn_down.weight | 0xef177980 | 0x2df0000 |
|
| 325 |
+
| 226 | blk.24.ffn_gate.weight | 0xf1f67980 | 0x2680000 |
|
| 326 |
+
| 227 | blk.24.ffn_norm.weight | 0xf45e7980 | 0x4000 |
|
| 327 |
+
| 228 | blk.24.ffn_up.weight | 0xf45eb980 | 0x2680000 |
|
| 328 |
+
| 229 | blk.25.attn_k.weight | 0xf6c6b980 | 0x2c0000 |
|
| 329 |
+
| 230 | blk.25.attn_norm.weight | 0xf6f2b980 | 0x4000 |
|
| 330 |
+
| 231 | blk.25.attn_output.weight | 0xf6f2f980 | 0xb00000 |
|
| 331 |
+
| 232 | blk.25.attn_q.weight | 0xf7a2f980 | 0xb00000 |
|
| 332 |
+
| 233 | blk.25.attn_v.weight | 0xf852f980 | 0x2c0000 |
|
| 333 |
+
| 234 | blk.25.ffn_down.weight | 0xf87ef980 | 0x2df0000 |
|
| 334 |
+
| 235 | blk.25.ffn_gate.weight | 0xfb5df980 | 0x2680000 |
|
| 335 |
+
| 236 | blk.25.ffn_norm.weight | 0xfdc5f980 | 0x4000 |
|
| 336 |
+
| 237 | blk.25.ffn_up.weight | 0xfdc63980 | 0x2680000 |
|
| 337 |
+
| 238 | blk.26.attn_k.weight | 0x1002e3980 | 0x2c0000 |
|
| 338 |
+
| 239 | blk.26.attn_norm.weight | 0x1005a3980 | 0x4000 |
|
| 339 |
+
| 240 | blk.26.attn_output.weight | 0x1005a7980 | 0xb00000 |
|
| 340 |
+
| 241 | blk.26.attn_q.weight | 0x1010a7980 | 0xb00000 |
|
| 341 |
+
| 242 | blk.26.attn_v.weight | 0x101ba7980 | 0x2c0000 |
|
| 342 |
+
| 243 | blk.26.ffn_down.weight | 0x101e67980 | 0x2df0000 |
|
| 343 |
+
| 244 | blk.26.ffn_gate.weight | 0x104c57980 | 0x2680000 |
|
| 344 |
+
| 245 | blk.26.ffn_norm.weight | 0x1072d7980 | 0x4000 |
|
| 345 |
+
| 246 | blk.26.ffn_up.weight | 0x1072db980 | 0x2680000 |
|
| 346 |
+
| 247 | blk.27.attn_k.weight | 0x10995b980 | 0x2c0000 |
|
| 347 |
+
| 248 | blk.27.attn_norm.weight | 0x109c1b980 | 0x4000 |
|
| 348 |
+
| 249 | blk.27.attn_output.weight | 0x109c1f980 | 0xb00000 |
|
| 349 |
+
| 250 | blk.27.attn_q.weight | 0x10a71f980 | 0xb00000 |
|
| 350 |
+
| 251 | blk.27.attn_v.weight | 0x10b21f980 | 0x2c0000 |
|
| 351 |
+
| 252 | blk.27.ffn_down.weight | 0x10b4df980 | 0x2df0000 |
|
| 352 |
+
| 253 | blk.27.ffn_gate.weight | 0x10e2cf980 | 0x2680000 |
|
| 353 |
+
| 254 | blk.27.ffn_norm.weight | 0x11094f980 | 0x4000 |
|
| 354 |
+
| 255 | blk.27.ffn_up.weight | 0x110953980 | 0x2680000 |
|
| 355 |
+
| 256 | blk.28.attn_k.weight | 0x112fd3980 | 0x2c0000 |
|
| 356 |
+
| 257 | blk.28.attn_norm.weight | 0x113293980 | 0x4000 |
|
| 357 |
+
| 258 | blk.28.attn_output.weight | 0x113297980 | 0xb00000 |
|
| 358 |
+
| 259 | blk.28.attn_q.weight | 0x113d97980 | 0xb00000 |
|
| 359 |
+
| 260 | blk.28.attn_v.weight | 0x114897980 | 0x2c0000 |
|
| 360 |
+
| 261 | blk.28.ffn_down.weight | 0x114b57980 | 0x2df0000 |
|
| 361 |
+
| 262 | blk.28.ffn_gate.weight | 0x117947980 | 0x2680000 |
|
| 362 |
+
| 263 | blk.28.ffn_norm.weight | 0x119fc7980 | 0x4000 |
|
| 363 |
+
| 264 | blk.28.ffn_up.weight | 0x119fcb980 | 0x2680000 |
|
| 364 |
+
| 265 | blk.29.attn_k.weight | 0x11c64b980 | 0x2c0000 |
|
| 365 |
+
| 266 | blk.29.attn_norm.weight | 0x11c90b980 | 0x4000 |
|
| 366 |
+
| 267 | blk.29.attn_output.weight | 0x11c90f980 | 0xb00000 |
|
| 367 |
+
| 268 | blk.29.attn_q.weight | 0x11d40f980 | 0xb00000 |
|
| 368 |
+
| 269 | blk.29.attn_v.weight | 0x11df0f980 | 0x2c0000 |
|
| 369 |
+
| 270 | blk.29.ffn_down.weight | 0x11e1cf980 | 0x2df0000 |
|
| 370 |
+
| 271 | blk.29.ffn_gate.weight | 0x120fbf980 | 0x2680000 |
|
| 371 |
+
| 272 | blk.29.ffn_norm.weight | 0x12363f980 | 0x4000 |
|
| 372 |
+
| 273 | blk.29.ffn_up.weight | 0x123643980 | 0x2680000 |
|
| 373 |
+
| 274 | blk.30.attn_k.weight | 0x125cc3980 | 0x2c0000 |
|
| 374 |
+
| 275 | blk.30.attn_norm.weight | 0x125f83980 | 0x4000 |
|
| 375 |
+
| 276 | blk.30.attn_output.weight | 0x125f87980 | 0xb00000 |
|
| 376 |
+
| 277 | blk.30.attn_q.weight | 0x126a87980 | 0xb00000 |
|
| 377 |
+
| 278 | blk.30.attn_v.weight | 0x127587980 | 0x2c0000 |
|
| 378 |
+
| 279 | blk.30.ffn_down.weight | 0x127847980 | 0x2df0000 |
|
| 379 |
+
| 280 | blk.30.ffn_gate.weight | 0x12a637980 | 0x2680000 |
|
| 380 |
+
| 281 | blk.30.ffn_norm.weight | 0x12ccb7980 | 0x4000 |
|
| 381 |
+
| 282 | blk.30.ffn_up.weight | 0x12ccbb980 | 0x2680000 |
|
| 382 |
+
| 283 | blk.31.attn_k.weight | 0x12f33b980 | 0x240000 |
|
| 383 |
+
| 284 | blk.31.attn_norm.weight | 0x12f57b980 | 0x4000 |
|
| 384 |
+
| 285 | blk.31.attn_output.weight | 0x12f57f980 | 0xb00000 |
|
| 385 |
+
| 286 | blk.31.attn_q.weight | 0x13007f980 | 0x900000 |
|
| 386 |
+
| 287 | blk.31.attn_v.weight | 0x13097f980 | 0x2c0000 |
|
| 387 |
+
| 288 | blk.31.ffn_down.weight | 0x130c3f980 | 0x2df0000 |
|
| 388 |
+
| 289 | blk.31.ffn_gate.weight | 0x133a2f980 | 0x2680000 |
|
| 389 |
+
| 290 | blk.31.ffn_norm.weight | 0x1360af980 | 0x4000 |
|
| 390 |
+
| 291 | blk.31.ffn_up.weight | 0x1360b3980 | 0x2680000 |
|
| 391 |
+
|
| 392 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 393 |
+
|
| 394 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 395 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:-----|
|
| 396 |
+
| 0 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q5_K |
|
| 397 |
+
| 1 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 398 |
+
| 2 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 399 |
+
| 3 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q3_K |
|
| 400 |
+
|
| 401 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 402 |
+
- Percentage of total elements: 13.08%
|
| 403 |
+
|
| 404 |
+
|
| 405 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 406 |
+
|
| 407 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 408 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 409 |
+
| 4 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 410 |
+
| 5 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 411 |
+
| 6 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 412 |
+
| 7 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 413 |
+
| 8 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 414 |
+
| 9 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 415 |
+
| 10 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 416 |
+
| 11 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 417 |
+
| 12 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 418 |
+
|
| 419 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 420 |
+
- Percentage of total elements: 2.72%
|
| 421 |
+
|
| 422 |
+
|
| 423 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 424 |
+
|
| 425 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 426 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 427 |
+
| 13 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 428 |
+
| 14 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 429 |
+
| 15 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 430 |
+
| 16 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 431 |
+
| 17 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 432 |
+
| 18 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 433 |
+
| 19 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 434 |
+
| 20 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 435 |
+
| 21 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 436 |
+
|
| 437 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 438 |
+
- Percentage of total elements: 2.72%
|
| 439 |
+
|
| 440 |
+
|
| 441 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 442 |
+
|
| 443 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 444 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 445 |
+
| 22 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 446 |
+
| 23 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 447 |
+
| 24 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 448 |
+
| 25 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 449 |
+
| 26 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 450 |
+
| 27 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 451 |
+
| 28 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 452 |
+
| 29 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 453 |
+
| 30 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 454 |
+
|
| 455 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 456 |
+
- Percentage of total elements: 2.72%
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 460 |
+
|
| 461 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 462 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 463 |
+
| 31 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 464 |
+
| 32 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 465 |
+
| 33 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 466 |
+
| 34 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 467 |
+
| 35 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 468 |
+
| 36 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 469 |
+
| 37 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 470 |
+
| 38 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 471 |
+
| 39 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 472 |
+
|
| 473 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 474 |
+
- Percentage of total elements: 2.72%
|
| 475 |
+
|
| 476 |
+
|
| 477 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 478 |
+
|
| 479 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 480 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 481 |
+
| 40 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 482 |
+
| 41 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 483 |
+
| 42 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 484 |
+
| 43 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 485 |
+
| 44 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 486 |
+
| 45 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 487 |
+
| 46 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 488 |
+
| 47 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 489 |
+
| 48 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 490 |
+
|
| 491 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 492 |
+
- Percentage of total elements: 2.72%
|
| 493 |
+
|
| 494 |
+
|
| 495 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 496 |
+
|
| 497 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 498 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 499 |
+
| 49 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 500 |
+
| 50 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 501 |
+
| 51 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 502 |
+
| 52 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 503 |
+
| 53 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 504 |
+
| 54 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 505 |
+
| 55 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 506 |
+
| 56 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 507 |
+
| 57 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 508 |
+
|
| 509 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 510 |
+
- Percentage of total elements: 2.72%
|
| 511 |
+
|
| 512 |
+
|
| 513 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 514 |
+
|
| 515 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 516 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 517 |
+
| 58 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 518 |
+
| 59 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 519 |
+
| 60 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 520 |
+
| 61 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 521 |
+
| 62 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 522 |
+
| 63 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 523 |
+
| 64 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 524 |
+
| 65 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 525 |
+
| 66 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 526 |
+
|
| 527 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 528 |
+
- Percentage of total elements: 2.72%
|
| 529 |
+
|
| 530 |
+
|
| 531 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 532 |
+
|
| 533 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 534 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 535 |
+
| 67 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 536 |
+
| 68 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 537 |
+
| 69 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 538 |
+
| 70 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 539 |
+
| 71 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 540 |
+
| 72 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 541 |
+
| 73 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 542 |
+
| 74 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 543 |
+
| 75 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 544 |
+
|
| 545 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 546 |
+
- Percentage of total elements: 2.72%
|
| 547 |
+
|
| 548 |
+
|
| 549 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 550 |
+
|
| 551 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 552 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 553 |
+
| 76 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 554 |
+
| 77 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 555 |
+
| 78 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 556 |
+
| 79 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 557 |
+
| 80 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 558 |
+
| 81 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 559 |
+
| 82 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 560 |
+
| 83 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 561 |
+
| 84 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 562 |
+
|
| 563 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 564 |
+
- Percentage of total elements: 2.72%
|
| 565 |
+
|
| 566 |
+
|
| 567 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 568 |
+
|
| 569 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 570 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 571 |
+
| 85 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 572 |
+
| 86 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 573 |
+
| 87 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 574 |
+
| 88 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 575 |
+
| 89 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 576 |
+
| 90 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 577 |
+
| 91 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 578 |
+
| 92 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 579 |
+
| 93 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 580 |
+
|
| 581 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 582 |
+
- Percentage of total elements: 2.72%
|
| 583 |
+
|
| 584 |
+
|
| 585 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 586 |
+
|
| 587 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 588 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 589 |
+
| 94 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 590 |
+
| 95 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 591 |
+
| 96 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 592 |
+
| 97 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 593 |
+
| 98 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 594 |
+
| 99 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 595 |
+
| 100 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 596 |
+
| 101 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 597 |
+
| 102 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 598 |
+
|
| 599 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 600 |
+
- Percentage of total elements: 2.72%
|
| 601 |
+
|
| 602 |
+
|
| 603 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 604 |
+
|
| 605 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 606 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 607 |
+
| 103 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 608 |
+
| 104 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 609 |
+
| 105 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 610 |
+
| 106 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 611 |
+
| 107 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 612 |
+
| 108 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 613 |
+
| 109 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 614 |
+
| 110 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 615 |
+
| 111 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 616 |
+
|
| 617 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 618 |
+
- Percentage of total elements: 2.72%
|
| 619 |
+
|
| 620 |
+
|
| 621 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 622 |
+
|
| 623 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 624 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 625 |
+
| 112 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 626 |
+
| 113 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 627 |
+
| 114 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 628 |
+
| 115 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 629 |
+
| 116 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 630 |
+
| 117 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 631 |
+
| 118 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 632 |
+
| 119 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 633 |
+
| 120 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 634 |
+
|
| 635 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 636 |
+
- Percentage of total elements: 2.72%
|
| 637 |
+
|
| 638 |
+
|
| 639 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 640 |
+
|
| 641 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 642 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 643 |
+
| 121 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 644 |
+
| 122 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 645 |
+
| 123 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 646 |
+
| 124 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 647 |
+
| 125 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 648 |
+
| 126 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 649 |
+
| 127 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 650 |
+
| 128 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 651 |
+
| 129 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 652 |
+
|
| 653 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 654 |
+
- Percentage of total elements: 2.72%
|
| 655 |
+
|
| 656 |
+
|
| 657 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 658 |
+
|
| 659 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 660 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 661 |
+
| 130 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 662 |
+
| 131 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 663 |
+
| 132 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 664 |
+
| 133 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 665 |
+
| 134 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 666 |
+
| 135 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 667 |
+
| 136 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 668 |
+
| 137 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 669 |
+
| 138 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 670 |
+
|
| 671 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 672 |
+
- Percentage of total elements: 2.72%
|
| 673 |
+
|
| 674 |
+
|
| 675 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 676 |
+
|
| 677 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 678 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 679 |
+
| 139 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 680 |
+
| 140 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 681 |
+
| 141 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 682 |
+
| 142 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 683 |
+
| 143 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 684 |
+
| 144 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 685 |
+
| 145 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 686 |
+
| 146 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 687 |
+
| 147 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q4_K |
|
| 688 |
+
|
| 689 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 690 |
+
- Percentage of total elements: 2.72%
|
| 691 |
+
|
| 692 |
+
|
| 693 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 694 |
+
|
| 695 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 696 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 697 |
+
| 148 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 698 |
+
| 149 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 699 |
+
| 150 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 700 |
+
| 151 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 701 |
+
| 152 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 702 |
+
| 153 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q5_K |
|
| 703 |
+
| 154 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 704 |
+
| 155 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 705 |
+
| 156 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 706 |
+
|
| 707 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 708 |
+
- Percentage of total elements: 2.72%
|
| 709 |
+
|
| 710 |
+
|
| 711 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 712 |
+
|
| 713 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 714 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 715 |
+
| 157 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 716 |
+
| 158 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 717 |
+
| 159 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 718 |
+
| 160 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 719 |
+
| 161 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 720 |
+
| 162 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 721 |
+
| 163 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 722 |
+
| 164 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 723 |
+
| 165 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 724 |
+
|
| 725 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 726 |
+
- Percentage of total elements: 2.72%
|
| 727 |
+
|
| 728 |
+
|
| 729 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 730 |
+
|
| 731 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 732 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 733 |
+
| 166 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 734 |
+
| 167 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 735 |
+
| 168 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 736 |
+
| 169 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 737 |
+
| 170 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 738 |
+
| 171 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 739 |
+
| 172 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 740 |
+
| 173 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 741 |
+
| 174 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 742 |
+
|
| 743 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 744 |
+
- Percentage of total elements: 2.72%
|
| 745 |
+
|
| 746 |
+
|
| 747 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 748 |
+
|
| 749 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 750 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 751 |
+
| 175 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 752 |
+
| 176 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 177 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 754 |
+
| 178 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 755 |
+
| 179 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 756 |
+
| 180 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 757 |
+
| 181 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 758 |
+
| 182 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 759 |
+
| 183 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 760 |
+
|
| 761 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 762 |
+
- Percentage of total elements: 2.72%
|
| 763 |
+
|
| 764 |
+
|
| 765 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 766 |
+
|
| 767 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 768 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 769 |
+
| 184 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 770 |
+
| 185 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 771 |
+
| 186 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 772 |
+
| 187 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 773 |
+
| 188 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 774 |
+
| 189 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 775 |
+
| 190 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 776 |
+
| 191 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 777 |
+
| 192 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 778 |
+
|
| 779 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 780 |
+
- Percentage of total elements: 2.72%
|
| 781 |
+
|
| 782 |
+
|
| 783 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 784 |
+
|
| 785 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 786 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 787 |
+
| 193 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 788 |
+
| 194 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 789 |
+
| 195 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 790 |
+
| 196 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 791 |
+
| 197 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 792 |
+
| 198 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 793 |
+
| 199 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 794 |
+
| 200 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 795 |
+
| 201 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 796 |
+
|
| 797 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 798 |
+
- Percentage of total elements: 2.72%
|
| 799 |
+
|
| 800 |
+
|
| 801 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 802 |
+
|
| 803 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 804 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 805 |
+
| 202 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 806 |
+
| 203 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 807 |
+
| 204 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 808 |
+
| 205 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 809 |
+
| 206 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 810 |
+
| 207 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 811 |
+
| 208 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 812 |
+
| 209 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 813 |
+
| 210 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 814 |
+
|
| 815 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 816 |
+
- Percentage of total elements: 2.72%
|
| 817 |
+
|
| 818 |
+
|
| 819 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 820 |
+
|
| 821 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 822 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 823 |
+
| 211 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 824 |
+
| 212 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 825 |
+
| 213 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 826 |
+
| 214 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 827 |
+
| 215 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 828 |
+
| 216 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 829 |
+
| 217 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 830 |
+
| 218 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 831 |
+
| 219 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 832 |
+
|
| 833 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 834 |
+
- Percentage of total elements: 2.72%
|
| 835 |
+
|
| 836 |
+
|
| 837 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 838 |
+
|
| 839 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 840 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 841 |
+
| 220 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 842 |
+
| 221 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 843 |
+
| 222 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 844 |
+
| 223 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 845 |
+
| 224 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 846 |
+
| 225 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 847 |
+
| 226 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 848 |
+
| 227 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 849 |
+
| 228 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 850 |
+
|
| 851 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 852 |
+
- Percentage of total elements: 2.72%
|
| 853 |
+
|
| 854 |
+
|
| 855 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 856 |
+
|
| 857 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 858 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 859 |
+
| 229 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 860 |
+
| 230 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 861 |
+
| 231 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 862 |
+
| 232 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 863 |
+
| 233 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 864 |
+
| 234 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 865 |
+
| 235 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 866 |
+
| 236 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 867 |
+
| 237 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 868 |
+
|
| 869 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 870 |
+
- Percentage of total elements: 2.72%
|
| 871 |
+
|
| 872 |
+
|
| 873 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 874 |
+
|
| 875 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 876 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 877 |
+
| 238 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 878 |
+
| 239 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 879 |
+
| 240 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 880 |
+
| 241 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 881 |
+
| 242 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 882 |
+
| 243 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 883 |
+
| 244 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 884 |
+
| 245 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 885 |
+
| 246 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 886 |
+
|
| 887 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 888 |
+
- Percentage of total elements: 2.72%
|
| 889 |
+
|
| 890 |
+
|
| 891 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 892 |
+
|
| 893 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 894 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 895 |
+
| 247 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 896 |
+
| 248 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 897 |
+
| 249 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 898 |
+
| 250 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 899 |
+
| 251 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 900 |
+
| 252 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 901 |
+
| 253 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 902 |
+
| 254 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 903 |
+
| 255 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 904 |
+
|
| 905 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 906 |
+
- Percentage of total elements: 2.72%
|
| 907 |
+
|
| 908 |
+
|
| 909 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 910 |
+
|
| 911 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 912 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 913 |
+
| 256 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 914 |
+
| 257 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 915 |
+
| 258 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 916 |
+
| 259 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 917 |
+
| 260 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 918 |
+
| 261 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 919 |
+
| 262 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 920 |
+
| 263 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 921 |
+
| 264 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 922 |
+
|
| 923 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 924 |
+
- Percentage of total elements: 2.72%
|
| 925 |
+
|
| 926 |
+
|
| 927 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 928 |
+
|
| 929 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 930 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 931 |
+
| 265 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 932 |
+
| 266 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 933 |
+
| 267 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 934 |
+
| 268 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 935 |
+
| 269 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 936 |
+
| 270 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 937 |
+
| 271 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 938 |
+
| 272 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 939 |
+
| 273 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 940 |
+
|
| 941 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 942 |
+
- Percentage of total elements: 2.72%
|
| 943 |
+
|
| 944 |
+
|
| 945 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 946 |
+
|
| 947 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 948 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 949 |
+
| 274 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 950 |
+
| 275 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 951 |
+
| 276 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 952 |
+
| 277 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 953 |
+
| 278 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 954 |
+
| 279 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 955 |
+
| 280 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 956 |
+
| 281 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 957 |
+
| 282 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 958 |
+
|
| 959 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 960 |
+
- Percentage of total elements: 2.72%
|
| 961 |
+
|
| 962 |
+
|
| 963 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 964 |
+
|
| 965 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 966 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 967 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q4_K |
|
| 968 |
+
| 284 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 969 |
+
| 285 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 970 |
+
| 286 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q4_K |
|
| 971 |
+
| 287 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 972 |
+
| 288 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q6_K |
|
| 973 |
+
| 289 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 974 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 975 |
+
| 291 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 976 |
+
|
| 977 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 978 |
+
- Percentage of total elements: 2.72%
|
scores/Llama-Guard-3-8B-Q6_K.md
ADDED
|
@@ -0,0 +1,978 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-Q6_K.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 39 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 36 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 36 |
+
| 26 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 38 |
+
| 28 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 39 |
+
| 29 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 40 |
+
| 30 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 41 |
+
| 31 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 43 |
+
| 33 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 44 |
+
| 34 | UINT32 | 1 | general.quantization_version | 2 |
|
| 45 |
+
| 35 | UINT32 | 1 | general.file_type | 18 |
|
| 46 |
+
| 36 | STRING | 1 | quantize.imatrix.file | `./imatrix/imatrix-Llama-Guard-3-8B-small.dat` |
|
| 47 |
+
| 37 | STRING | 1 | quantize.imatrix.dataset | `../../datasets/imatrix/calibration_eur_small.txt` |
|
| 48 |
+
| 38 | INT32 | 1 | quantize.imatrix.entries_count | 224 |
|
| 49 |
+
| 39 | INT32 | 1 | quantize.imatrix.chunks_count | 962 |
|
| 50 |
+
|
| 51 |
+
## Tensors Overview ~8B Elements
|
| 52 |
+
|
| 53 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 54 |
+
|
| 55 |
+
- [Llama-Guard-3-8B-Q6\_K.gguf - GGUF Internal File Dump](#llama-guard-3-8b-q6_kgguf---gguf-internal-file-dump)
|
| 56 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 57 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 58 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 59 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 60 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 61 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 62 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 63 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 64 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 65 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 66 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 67 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 68 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 69 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 70 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 71 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 72 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 73 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 74 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 75 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 76 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 77 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 78 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 79 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 80 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 81 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 82 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 83 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 84 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 85 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 86 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 87 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 88 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 89 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 90 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 91 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 92 |
+
|
| 93 |
+
### Tensor Data Offset
|
| 94 |
+
|
| 95 |
+
This table contains the offset and data segment relative to start of file
|
| 96 |
+
|
| 97 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 98 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 99 |
+
| 0 | output.weight | 0x779880 | 0x19afa000 |
|
| 100 |
+
| 1 | output_norm.weight | 0x1a273880 | 0x4000 |
|
| 101 |
+
| 2 | rope_freqs.weight | 0x1a277880 | 0x100 |
|
| 102 |
+
| 3 | token_embd.weight | 0x1a277980 | 0xd746000 |
|
| 103 |
+
| 4 | blk.0.attn_k.weight | 0x279bd980 | 0x2c0000 |
|
| 104 |
+
| 5 | blk.0.attn_norm.weight | 0x27c7d980 | 0x4000 |
|
| 105 |
+
| 6 | blk.0.attn_output.weight | 0x27c81980 | 0xd20000 |
|
| 106 |
+
| 7 | blk.0.attn_q.weight | 0x289a1980 | 0xb00000 |
|
| 107 |
+
| 8 | blk.0.attn_v.weight | 0x294a1980 | 0x348000 |
|
| 108 |
+
| 9 | blk.0.ffn_down.weight | 0x297e9980 | 0x3b80000 |
|
| 109 |
+
| 10 | blk.0.ffn_gate.weight | 0x2d369980 | 0x2680000 |
|
| 110 |
+
| 11 | blk.0.ffn_norm.weight | 0x2f9e9980 | 0x4000 |
|
| 111 |
+
| 12 | blk.0.ffn_up.weight | 0x2f9ed980 | 0x2680000 |
|
| 112 |
+
| 13 | blk.1.attn_k.weight | 0x3206d980 | 0x2c0000 |
|
| 113 |
+
| 14 | blk.1.attn_norm.weight | 0x3232d980 | 0x4000 |
|
| 114 |
+
| 15 | blk.1.attn_output.weight | 0x32331980 | 0xd20000 |
|
| 115 |
+
| 16 | blk.1.attn_q.weight | 0x33051980 | 0xb00000 |
|
| 116 |
+
| 17 | blk.1.attn_v.weight | 0x33b51980 | 0x348000 |
|
| 117 |
+
| 18 | blk.1.ffn_down.weight | 0x33e99980 | 0x3b80000 |
|
| 118 |
+
| 19 | blk.1.ffn_gate.weight | 0x37a19980 | 0x2680000 |
|
| 119 |
+
| 20 | blk.1.ffn_norm.weight | 0x3a099980 | 0x4000 |
|
| 120 |
+
| 21 | blk.1.ffn_up.weight | 0x3a09d980 | 0x2680000 |
|
| 121 |
+
| 22 | blk.2.attn_k.weight | 0x3c71d980 | 0x2c0000 |
|
| 122 |
+
| 23 | blk.2.attn_norm.weight | 0x3c9dd980 | 0x4000 |
|
| 123 |
+
| 24 | blk.2.attn_output.weight | 0x3c9e1980 | 0xd20000 |
|
| 124 |
+
| 25 | blk.2.attn_q.weight | 0x3d701980 | 0xb00000 |
|
| 125 |
+
| 26 | blk.2.attn_v.weight | 0x3e201980 | 0x348000 |
|
| 126 |
+
| 27 | blk.2.ffn_down.weight | 0x3e549980 | 0x3b80000 |
|
| 127 |
+
| 28 | blk.2.ffn_gate.weight | 0x420c9980 | 0x2680000 |
|
| 128 |
+
| 29 | blk.2.ffn_norm.weight | 0x44749980 | 0x4000 |
|
| 129 |
+
| 30 | blk.2.ffn_up.weight | 0x4474d980 | 0x2680000 |
|
| 130 |
+
| 31 | blk.3.attn_k.weight | 0x46dcd980 | 0x2c0000 |
|
| 131 |
+
| 32 | blk.3.attn_norm.weight | 0x4708d980 | 0x4000 |
|
| 132 |
+
| 33 | blk.3.attn_output.weight | 0x47091980 | 0xd20000 |
|
| 133 |
+
| 34 | blk.3.attn_q.weight | 0x47db1980 | 0xb00000 |
|
| 134 |
+
| 35 | blk.3.attn_v.weight | 0x488b1980 | 0x348000 |
|
| 135 |
+
| 36 | blk.3.ffn_down.weight | 0x48bf9980 | 0x3b80000 |
|
| 136 |
+
| 37 | blk.3.ffn_gate.weight | 0x4c779980 | 0x2680000 |
|
| 137 |
+
| 38 | blk.3.ffn_norm.weight | 0x4edf9980 | 0x4000 |
|
| 138 |
+
| 39 | blk.3.ffn_up.weight | 0x4edfd980 | 0x2680000 |
|
| 139 |
+
| 40 | blk.4.attn_k.weight | 0x5147d980 | 0x2c0000 |
|
| 140 |
+
| 41 | blk.4.attn_norm.weight | 0x5173d980 | 0x4000 |
|
| 141 |
+
| 42 | blk.4.attn_output.weight | 0x51741980 | 0xd20000 |
|
| 142 |
+
| 43 | blk.4.attn_q.weight | 0x52461980 | 0xb00000 |
|
| 143 |
+
| 44 | blk.4.attn_v.weight | 0x52f61980 | 0x348000 |
|
| 144 |
+
| 45 | blk.4.ffn_down.weight | 0x532a9980 | 0x3b80000 |
|
| 145 |
+
| 46 | blk.4.ffn_gate.weight | 0x56e29980 | 0x2680000 |
|
| 146 |
+
| 47 | blk.4.ffn_norm.weight | 0x594a9980 | 0x4000 |
|
| 147 |
+
| 48 | blk.4.ffn_up.weight | 0x594ad980 | 0x2680000 |
|
| 148 |
+
| 49 | blk.5.attn_k.weight | 0x5bb2d980 | 0x2c0000 |
|
| 149 |
+
| 50 | blk.5.attn_norm.weight | 0x5bded980 | 0x4000 |
|
| 150 |
+
| 51 | blk.5.attn_output.weight | 0x5bdf1980 | 0xd20000 |
|
| 151 |
+
| 52 | blk.5.attn_q.weight | 0x5cb11980 | 0xb00000 |
|
| 152 |
+
| 53 | blk.5.attn_v.weight | 0x5d611980 | 0x348000 |
|
| 153 |
+
| 54 | blk.5.ffn_down.weight | 0x5d959980 | 0x3b80000 |
|
| 154 |
+
| 55 | blk.5.ffn_gate.weight | 0x614d9980 | 0x2680000 |
|
| 155 |
+
| 56 | blk.5.ffn_norm.weight | 0x63b59980 | 0x4000 |
|
| 156 |
+
| 57 | blk.5.ffn_up.weight | 0x63b5d980 | 0x2680000 |
|
| 157 |
+
| 58 | blk.6.attn_k.weight | 0x661dd980 | 0x2c0000 |
|
| 158 |
+
| 59 | blk.6.attn_norm.weight | 0x6649d980 | 0x4000 |
|
| 159 |
+
| 60 | blk.6.attn_output.weight | 0x664a1980 | 0xd20000 |
|
| 160 |
+
| 61 | blk.6.attn_q.weight | 0x671c1980 | 0xb00000 |
|
| 161 |
+
| 62 | blk.6.attn_v.weight | 0x67cc1980 | 0x348000 |
|
| 162 |
+
| 63 | blk.6.ffn_down.weight | 0x68009980 | 0x3b80000 |
|
| 163 |
+
| 64 | blk.6.ffn_gate.weight | 0x6bb89980 | 0x2680000 |
|
| 164 |
+
| 65 | blk.6.ffn_norm.weight | 0x6e209980 | 0x4000 |
|
| 165 |
+
| 66 | blk.6.ffn_up.weight | 0x6e20d980 | 0x2680000 |
|
| 166 |
+
| 67 | blk.7.attn_k.weight | 0x7088d980 | 0x2c0000 |
|
| 167 |
+
| 68 | blk.7.attn_norm.weight | 0x70b4d980 | 0x4000 |
|
| 168 |
+
| 69 | blk.7.attn_output.weight | 0x70b51980 | 0xd20000 |
|
| 169 |
+
| 70 | blk.7.attn_q.weight | 0x71871980 | 0xb00000 |
|
| 170 |
+
| 71 | blk.7.attn_v.weight | 0x72371980 | 0x348000 |
|
| 171 |
+
| 72 | blk.7.ffn_down.weight | 0x726b9980 | 0x3b80000 |
|
| 172 |
+
| 73 | blk.7.ffn_gate.weight | 0x76239980 | 0x2680000 |
|
| 173 |
+
| 74 | blk.7.ffn_norm.weight | 0x788b9980 | 0x4000 |
|
| 174 |
+
| 75 | blk.7.ffn_up.weight | 0x788bd980 | 0x2680000 |
|
| 175 |
+
| 76 | blk.8.attn_k.weight | 0x7af3d980 | 0x2c0000 |
|
| 176 |
+
| 77 | blk.8.attn_norm.weight | 0x7b1fd980 | 0x4000 |
|
| 177 |
+
| 78 | blk.8.attn_output.weight | 0x7b201980 | 0xd20000 |
|
| 178 |
+
| 79 | blk.8.attn_q.weight | 0x7bf21980 | 0xb00000 |
|
| 179 |
+
| 80 | blk.8.attn_v.weight | 0x7ca21980 | 0x348000 |
|
| 180 |
+
| 81 | blk.8.ffn_down.weight | 0x7cd69980 | 0x3b80000 |
|
| 181 |
+
| 82 | blk.8.ffn_gate.weight | 0x808e9980 | 0x2680000 |
|
| 182 |
+
| 83 | blk.8.ffn_norm.weight | 0x82f69980 | 0x4000 |
|
| 183 |
+
| 84 | blk.8.ffn_up.weight | 0x82f6d980 | 0x2680000 |
|
| 184 |
+
| 85 | blk.9.attn_k.weight | 0x855ed980 | 0x2c0000 |
|
| 185 |
+
| 86 | blk.9.attn_norm.weight | 0x858ad980 | 0x4000 |
|
| 186 |
+
| 87 | blk.9.attn_output.weight | 0x858b1980 | 0xd20000 |
|
| 187 |
+
| 88 | blk.9.attn_q.weight | 0x865d1980 | 0xb00000 |
|
| 188 |
+
| 89 | blk.9.attn_v.weight | 0x870d1980 | 0x348000 |
|
| 189 |
+
| 90 | blk.9.ffn_down.weight | 0x87419980 | 0x3b80000 |
|
| 190 |
+
| 91 | blk.9.ffn_gate.weight | 0x8af99980 | 0x2680000 |
|
| 191 |
+
| 92 | blk.9.ffn_norm.weight | 0x8d619980 | 0x4000 |
|
| 192 |
+
| 93 | blk.9.ffn_up.weight | 0x8d61d980 | 0x2680000 |
|
| 193 |
+
| 94 | blk.10.attn_k.weight | 0x8fc9d980 | 0x2c0000 |
|
| 194 |
+
| 95 | blk.10.attn_norm.weight | 0x8ff5d980 | 0x4000 |
|
| 195 |
+
| 96 | blk.10.attn_output.weight | 0x8ff61980 | 0xd20000 |
|
| 196 |
+
| 97 | blk.10.attn_q.weight | 0x90c81980 | 0xb00000 |
|
| 197 |
+
| 98 | blk.10.attn_v.weight | 0x91781980 | 0x348000 |
|
| 198 |
+
| 99 | blk.10.ffn_down.weight | 0x91ac9980 | 0x3b80000 |
|
| 199 |
+
| 100 | blk.10.ffn_gate.weight | 0x95649980 | 0x2680000 |
|
| 200 |
+
| 101 | blk.10.ffn_norm.weight | 0x97cc9980 | 0x4000 |
|
| 201 |
+
| 102 | blk.10.ffn_up.weight | 0x97ccd980 | 0x2680000 |
|
| 202 |
+
| 103 | blk.11.attn_k.weight | 0x9a34d980 | 0x2c0000 |
|
| 203 |
+
| 104 | blk.11.attn_norm.weight | 0x9a60d980 | 0x4000 |
|
| 204 |
+
| 105 | blk.11.attn_output.weight | 0x9a611980 | 0xd20000 |
|
| 205 |
+
| 106 | blk.11.attn_q.weight | 0x9b331980 | 0xb00000 |
|
| 206 |
+
| 107 | blk.11.attn_v.weight | 0x9be31980 | 0x348000 |
|
| 207 |
+
| 108 | blk.11.ffn_down.weight | 0x9c179980 | 0x3b80000 |
|
| 208 |
+
| 109 | blk.11.ffn_gate.weight | 0x9fcf9980 | 0x2680000 |
|
| 209 |
+
| 110 | blk.11.ffn_norm.weight | 0xa2379980 | 0x4000 |
|
| 210 |
+
| 111 | blk.11.ffn_up.weight | 0xa237d980 | 0x2680000 |
|
| 211 |
+
| 112 | blk.12.attn_k.weight | 0xa49fd980 | 0x2c0000 |
|
| 212 |
+
| 113 | blk.12.attn_norm.weight | 0xa4cbd980 | 0x4000 |
|
| 213 |
+
| 114 | blk.12.attn_output.weight | 0xa4cc1980 | 0xd20000 |
|
| 214 |
+
| 115 | blk.12.attn_q.weight | 0xa59e1980 | 0xb00000 |
|
| 215 |
+
| 116 | blk.12.attn_v.weight | 0xa64e1980 | 0x348000 |
|
| 216 |
+
| 117 | blk.12.ffn_down.weight | 0xa6829980 | 0x3b80000 |
|
| 217 |
+
| 118 | blk.12.ffn_gate.weight | 0xaa3a9980 | 0x2680000 |
|
| 218 |
+
| 119 | blk.12.ffn_norm.weight | 0xaca29980 | 0x4000 |
|
| 219 |
+
| 120 | blk.12.ffn_up.weight | 0xaca2d980 | 0x2680000 |
|
| 220 |
+
| 121 | blk.13.attn_k.weight | 0xaf0ad980 | 0x348000 |
|
| 221 |
+
| 122 | blk.13.attn_norm.weight | 0xaf3f5980 | 0x4000 |
|
| 222 |
+
| 123 | blk.13.attn_output.weight | 0xaf3f9980 | 0xd20000 |
|
| 223 |
+
| 124 | blk.13.attn_q.weight | 0xb0119980 | 0xd20000 |
|
| 224 |
+
| 125 | blk.13.attn_v.weight | 0xb0e39980 | 0x440000 |
|
| 225 |
+
| 126 | blk.13.ffn_down.weight | 0xb1279980 | 0x3b80000 |
|
| 226 |
+
| 127 | blk.13.ffn_gate.weight | 0xb4df9980 | 0x2680000 |
|
| 227 |
+
| 128 | blk.13.ffn_norm.weight | 0xb7479980 | 0x4000 |
|
| 228 |
+
| 129 | blk.13.ffn_up.weight | 0xb747d980 | 0x2680000 |
|
| 229 |
+
| 130 | blk.14.attn_k.weight | 0xb9afd980 | 0x348000 |
|
| 230 |
+
| 131 | blk.14.attn_norm.weight | 0xb9e45980 | 0x4000 |
|
| 231 |
+
| 132 | blk.14.attn_output.weight | 0xb9e49980 | 0xd20000 |
|
| 232 |
+
| 133 | blk.14.attn_q.weight | 0xbab69980 | 0xd20000 |
|
| 233 |
+
| 134 | blk.14.attn_v.weight | 0xbb889980 | 0x440000 |
|
| 234 |
+
| 135 | blk.14.ffn_down.weight | 0xbbcc9980 | 0x3b80000 |
|
| 235 |
+
| 136 | blk.14.ffn_gate.weight | 0xbf849980 | 0x2680000 |
|
| 236 |
+
| 137 | blk.14.ffn_norm.weight | 0xc1ec9980 | 0x4000 |
|
| 237 |
+
| 138 | blk.14.ffn_up.weight | 0xc1ecd980 | 0x2680000 |
|
| 238 |
+
| 139 | blk.15.attn_k.weight | 0xc454d980 | 0x2c0000 |
|
| 239 |
+
| 140 | blk.15.attn_norm.weight | 0xc480d980 | 0x4000 |
|
| 240 |
+
| 141 | blk.15.attn_output.weight | 0xc4811980 | 0xd20000 |
|
| 241 |
+
| 142 | blk.15.attn_q.weight | 0xc5531980 | 0xb00000 |
|
| 242 |
+
| 143 | blk.15.attn_v.weight | 0xc6031980 | 0x348000 |
|
| 243 |
+
| 144 | blk.15.ffn_down.weight | 0xc6379980 | 0x3b80000 |
|
| 244 |
+
| 145 | blk.15.ffn_gate.weight | 0xc9ef9980 | 0x2680000 |
|
| 245 |
+
| 146 | blk.15.ffn_norm.weight | 0xcc579980 | 0x4000 |
|
| 246 |
+
| 147 | blk.15.ffn_up.weight | 0xcc57d980 | 0x2680000 |
|
| 247 |
+
| 148 | blk.16.attn_k.weight | 0xcebfd980 | 0x348000 |
|
| 248 |
+
| 149 | blk.16.attn_norm.weight | 0xcef45980 | 0x4000 |
|
| 249 |
+
| 150 | blk.16.attn_output.weight | 0xcef49980 | 0xd20000 |
|
| 250 |
+
| 151 | blk.16.attn_q.weight | 0xcfc69980 | 0xd20000 |
|
| 251 |
+
| 152 | blk.16.attn_v.weight | 0xd0989980 | 0x440000 |
|
| 252 |
+
| 153 | blk.16.ffn_down.weight | 0xd0dc9980 | 0x3b80000 |
|
| 253 |
+
| 154 | blk.16.ffn_gate.weight | 0xd4949980 | 0x2df0000 |
|
| 254 |
+
| 155 | blk.16.ffn_norm.weight | 0xd7739980 | 0x4000 |
|
| 255 |
+
| 156 | blk.16.ffn_up.weight | 0xd773d980 | 0x2df0000 |
|
| 256 |
+
| 157 | blk.17.attn_k.weight | 0xda52d980 | 0x2c0000 |
|
| 257 |
+
| 158 | blk.17.attn_norm.weight | 0xda7ed980 | 0x4000 |
|
| 258 |
+
| 159 | blk.17.attn_output.weight | 0xda7f1980 | 0xd20000 |
|
| 259 |
+
| 160 | blk.17.attn_q.weight | 0xdb511980 | 0xb00000 |
|
| 260 |
+
| 161 | blk.17.attn_v.weight | 0xdc011980 | 0x348000 |
|
| 261 |
+
| 162 | blk.17.ffn_down.weight | 0xdc359980 | 0x3b80000 |
|
| 262 |
+
| 163 | blk.17.ffn_gate.weight | 0xdfed9980 | 0x2df0000 |
|
| 263 |
+
| 164 | blk.17.ffn_norm.weight | 0xe2cc9980 | 0x4000 |
|
| 264 |
+
| 165 | blk.17.ffn_up.weight | 0xe2ccd980 | 0x2df0000 |
|
| 265 |
+
| 166 | blk.18.attn_k.weight | 0xe5abd980 | 0x348000 |
|
| 266 |
+
| 167 | blk.18.attn_norm.weight | 0xe5e05980 | 0x4000 |
|
| 267 |
+
| 168 | blk.18.attn_output.weight | 0xe5e09980 | 0xd20000 |
|
| 268 |
+
| 169 | blk.18.attn_q.weight | 0xe6b29980 | 0xd20000 |
|
| 269 |
+
| 170 | blk.18.attn_v.weight | 0xe7849980 | 0x440000 |
|
| 270 |
+
| 171 | blk.18.ffn_down.weight | 0xe7c89980 | 0x3b80000 |
|
| 271 |
+
| 172 | blk.18.ffn_gate.weight | 0xeb809980 | 0x2df0000 |
|
| 272 |
+
| 173 | blk.18.ffn_norm.weight | 0xee5f9980 | 0x4000 |
|
| 273 |
+
| 174 | blk.18.ffn_up.weight | 0xee5fd980 | 0x2df0000 |
|
| 274 |
+
| 175 | blk.19.attn_k.weight | 0xf13ed980 | 0x348000 |
|
| 275 |
+
| 176 | blk.19.attn_norm.weight | 0xf1735980 | 0x4000 |
|
| 276 |
+
| 177 | blk.19.attn_output.weight | 0xf1739980 | 0xd20000 |
|
| 277 |
+
| 178 | blk.19.attn_q.weight | 0xf2459980 | 0xd20000 |
|
| 278 |
+
| 179 | blk.19.attn_v.weight | 0xf3179980 | 0x440000 |
|
| 279 |
+
| 180 | blk.19.ffn_down.weight | 0xf35b9980 | 0x3b80000 |
|
| 280 |
+
| 181 | blk.19.ffn_gate.weight | 0xf7139980 | 0x2df0000 |
|
| 281 |
+
| 182 | blk.19.ffn_norm.weight | 0xf9f29980 | 0x4000 |
|
| 282 |
+
| 183 | blk.19.ffn_up.weight | 0xf9f2d980 | 0x2df0000 |
|
| 283 |
+
| 184 | blk.20.attn_k.weight | 0xfcd1d980 | 0x348000 |
|
| 284 |
+
| 185 | blk.20.attn_norm.weight | 0xfd065980 | 0x4000 |
|
| 285 |
+
| 186 | blk.20.attn_output.weight | 0xfd069980 | 0xd20000 |
|
| 286 |
+
| 187 | blk.20.attn_q.weight | 0xfdd89980 | 0xd20000 |
|
| 287 |
+
| 188 | blk.20.attn_v.weight | 0xfeaa9980 | 0x440000 |
|
| 288 |
+
| 189 | blk.20.ffn_down.weight | 0xfeee9980 | 0x3b80000 |
|
| 289 |
+
| 190 | blk.20.ffn_gate.weight | 0x102a69980 | 0x2df0000 |
|
| 290 |
+
| 191 | blk.20.ffn_norm.weight | 0x105859980 | 0x4000 |
|
| 291 |
+
| 192 | blk.20.ffn_up.weight | 0x10585d980 | 0x2df0000 |
|
| 292 |
+
| 193 | blk.21.attn_k.weight | 0x10864d980 | 0x348000 |
|
| 293 |
+
| 194 | blk.21.attn_norm.weight | 0x108995980 | 0x4000 |
|
| 294 |
+
| 195 | blk.21.attn_output.weight | 0x108999980 | 0xd20000 |
|
| 295 |
+
| 196 | blk.21.attn_q.weight | 0x1096b9980 | 0xd20000 |
|
| 296 |
+
| 197 | blk.21.attn_v.weight | 0x10a3d9980 | 0x440000 |
|
| 297 |
+
| 198 | blk.21.ffn_down.weight | 0x10a819980 | 0x3b80000 |
|
| 298 |
+
| 199 | blk.21.ffn_gate.weight | 0x10e399980 | 0x2df0000 |
|
| 299 |
+
| 200 | blk.21.ffn_norm.weight | 0x111189980 | 0x4000 |
|
| 300 |
+
| 201 | blk.21.ffn_up.weight | 0x11118d980 | 0x2df0000 |
|
| 301 |
+
| 202 | blk.22.attn_k.weight | 0x113f7d980 | 0x348000 |
|
| 302 |
+
| 203 | blk.22.attn_norm.weight | 0x1142c5980 | 0x4000 |
|
| 303 |
+
| 204 | blk.22.attn_output.weight | 0x1142c9980 | 0xd20000 |
|
| 304 |
+
| 205 | blk.22.attn_q.weight | 0x114fe9980 | 0xd20000 |
|
| 305 |
+
| 206 | blk.22.attn_v.weight | 0x115d09980 | 0x440000 |
|
| 306 |
+
| 207 | blk.22.ffn_down.weight | 0x116149980 | 0x3b80000 |
|
| 307 |
+
| 208 | blk.22.ffn_gate.weight | 0x119cc9980 | 0x2df0000 |
|
| 308 |
+
| 209 | blk.22.ffn_norm.weight | 0x11cab9980 | 0x4000 |
|
| 309 |
+
| 210 | blk.22.ffn_up.weight | 0x11cabd980 | 0x2df0000 |
|
| 310 |
+
| 211 | blk.23.attn_k.weight | 0x11f8ad980 | 0x348000 |
|
| 311 |
+
| 212 | blk.23.attn_norm.weight | 0x11fbf5980 | 0x4000 |
|
| 312 |
+
| 213 | blk.23.attn_output.weight | 0x11fbf9980 | 0xd20000 |
|
| 313 |
+
| 214 | blk.23.attn_q.weight | 0x120919980 | 0xd20000 |
|
| 314 |
+
| 215 | blk.23.attn_v.weight | 0x121639980 | 0x440000 |
|
| 315 |
+
| 216 | blk.23.ffn_down.weight | 0x121a79980 | 0x3b80000 |
|
| 316 |
+
| 217 | blk.23.ffn_gate.weight | 0x1255f9980 | 0x2df0000 |
|
| 317 |
+
| 218 | blk.23.ffn_norm.weight | 0x1283e9980 | 0x4000 |
|
| 318 |
+
| 219 | blk.23.ffn_up.weight | 0x1283ed980 | 0x2df0000 |
|
| 319 |
+
| 220 | blk.24.attn_k.weight | 0x12b1dd980 | 0x348000 |
|
| 320 |
+
| 221 | blk.24.attn_norm.weight | 0x12b525980 | 0x4000 |
|
| 321 |
+
| 222 | blk.24.attn_output.weight | 0x12b529980 | 0xd20000 |
|
| 322 |
+
| 223 | blk.24.attn_q.weight | 0x12c249980 | 0xd20000 |
|
| 323 |
+
| 224 | blk.24.attn_v.weight | 0x12cf69980 | 0x440000 |
|
| 324 |
+
| 225 | blk.24.ffn_down.weight | 0x12d3a9980 | 0x3b80000 |
|
| 325 |
+
| 226 | blk.24.ffn_gate.weight | 0x130f29980 | 0x2df0000 |
|
| 326 |
+
| 227 | blk.24.ffn_norm.weight | 0x133d19980 | 0x4000 |
|
| 327 |
+
| 228 | blk.24.ffn_up.weight | 0x133d1d980 | 0x2df0000 |
|
| 328 |
+
| 229 | blk.25.attn_k.weight | 0x136b0d980 | 0x348000 |
|
| 329 |
+
| 230 | blk.25.attn_norm.weight | 0x136e55980 | 0x4000 |
|
| 330 |
+
| 231 | blk.25.attn_output.weight | 0x136e59980 | 0xd20000 |
|
| 331 |
+
| 232 | blk.25.attn_q.weight | 0x137b79980 | 0xd20000 |
|
| 332 |
+
| 233 | blk.25.attn_v.weight | 0x138899980 | 0x440000 |
|
| 333 |
+
| 234 | blk.25.ffn_down.weight | 0x138cd9980 | 0x3b80000 |
|
| 334 |
+
| 235 | blk.25.ffn_gate.weight | 0x13c859980 | 0x2df0000 |
|
| 335 |
+
| 236 | blk.25.ffn_norm.weight | 0x13f649980 | 0x4000 |
|
| 336 |
+
| 237 | blk.25.ffn_up.weight | 0x13f64d980 | 0x2df0000 |
|
| 337 |
+
| 238 | blk.26.attn_k.weight | 0x14243d980 | 0x348000 |
|
| 338 |
+
| 239 | blk.26.attn_norm.weight | 0x142785980 | 0x4000 |
|
| 339 |
+
| 240 | blk.26.attn_output.weight | 0x142789980 | 0xd20000 |
|
| 340 |
+
| 241 | blk.26.attn_q.weight | 0x1434a9980 | 0xd20000 |
|
| 341 |
+
| 242 | blk.26.attn_v.weight | 0x1441c9980 | 0x440000 |
|
| 342 |
+
| 243 | blk.26.ffn_down.weight | 0x144609980 | 0x3b80000 |
|
| 343 |
+
| 244 | blk.26.ffn_gate.weight | 0x148189980 | 0x2df0000 |
|
| 344 |
+
| 245 | blk.26.ffn_norm.weight | 0x14af79980 | 0x4000 |
|
| 345 |
+
| 246 | blk.26.ffn_up.weight | 0x14af7d980 | 0x2df0000 |
|
| 346 |
+
| 247 | blk.27.attn_k.weight | 0x14dd6d980 | 0x348000 |
|
| 347 |
+
| 248 | blk.27.attn_norm.weight | 0x14e0b5980 | 0x4000 |
|
| 348 |
+
| 249 | blk.27.attn_output.weight | 0x14e0b9980 | 0xd20000 |
|
| 349 |
+
| 250 | blk.27.attn_q.weight | 0x14edd9980 | 0xd20000 |
|
| 350 |
+
| 251 | blk.27.attn_v.weight | 0x14faf9980 | 0x440000 |
|
| 351 |
+
| 252 | blk.27.ffn_down.weight | 0x14ff39980 | 0x3b80000 |
|
| 352 |
+
| 253 | blk.27.ffn_gate.weight | 0x153ab9980 | 0x2df0000 |
|
| 353 |
+
| 254 | blk.27.ffn_norm.weight | 0x1568a9980 | 0x4000 |
|
| 354 |
+
| 255 | blk.27.ffn_up.weight | 0x1568ad980 | 0x2df0000 |
|
| 355 |
+
| 256 | blk.28.attn_k.weight | 0x15969d980 | 0x348000 |
|
| 356 |
+
| 257 | blk.28.attn_norm.weight | 0x1599e5980 | 0x4000 |
|
| 357 |
+
| 258 | blk.28.attn_output.weight | 0x1599e9980 | 0xd20000 |
|
| 358 |
+
| 259 | blk.28.attn_q.weight | 0x15a709980 | 0xd20000 |
|
| 359 |
+
| 260 | blk.28.attn_v.weight | 0x15b429980 | 0x440000 |
|
| 360 |
+
| 261 | blk.28.ffn_down.weight | 0x15b869980 | 0x3b80000 |
|
| 361 |
+
| 262 | blk.28.ffn_gate.weight | 0x15f3e9980 | 0x2df0000 |
|
| 362 |
+
| 263 | blk.28.ffn_norm.weight | 0x1621d9980 | 0x4000 |
|
| 363 |
+
| 264 | blk.28.ffn_up.weight | 0x1621dd980 | 0x2df0000 |
|
| 364 |
+
| 265 | blk.29.attn_k.weight | 0x164fcd980 | 0x348000 |
|
| 365 |
+
| 266 | blk.29.attn_norm.weight | 0x165315980 | 0x4000 |
|
| 366 |
+
| 267 | blk.29.attn_output.weight | 0x165319980 | 0xd20000 |
|
| 367 |
+
| 268 | blk.29.attn_q.weight | 0x166039980 | 0xd20000 |
|
| 368 |
+
| 269 | blk.29.attn_v.weight | 0x166d59980 | 0x440000 |
|
| 369 |
+
| 270 | blk.29.ffn_down.weight | 0x167199980 | 0x3b80000 |
|
| 370 |
+
| 271 | blk.29.ffn_gate.weight | 0x16ad19980 | 0x2df0000 |
|
| 371 |
+
| 272 | blk.29.ffn_norm.weight | 0x16db09980 | 0x4000 |
|
| 372 |
+
| 273 | blk.29.ffn_up.weight | 0x16db0d980 | 0x2df0000 |
|
| 373 |
+
| 274 | blk.30.attn_k.weight | 0x1708fd980 | 0x348000 |
|
| 374 |
+
| 275 | blk.30.attn_norm.weight | 0x170c45980 | 0x4000 |
|
| 375 |
+
| 276 | blk.30.attn_output.weight | 0x170c49980 | 0xd20000 |
|
| 376 |
+
| 277 | blk.30.attn_q.weight | 0x171969980 | 0xd20000 |
|
| 377 |
+
| 278 | blk.30.attn_v.weight | 0x172689980 | 0x440000 |
|
| 378 |
+
| 279 | blk.30.ffn_down.weight | 0x172ac9980 | 0x3b80000 |
|
| 379 |
+
| 280 | blk.30.ffn_gate.weight | 0x176649980 | 0x2df0000 |
|
| 380 |
+
| 281 | blk.30.ffn_norm.weight | 0x179439980 | 0x4000 |
|
| 381 |
+
| 282 | blk.30.ffn_up.weight | 0x17943d980 | 0x2df0000 |
|
| 382 |
+
| 283 | blk.31.attn_k.weight | 0x17c22d980 | 0x2c0000 |
|
| 383 |
+
| 284 | blk.31.attn_norm.weight | 0x17c4ed980 | 0x4000 |
|
| 384 |
+
| 285 | blk.31.attn_output.weight | 0x17c4f1980 | 0xd20000 |
|
| 385 |
+
| 286 | blk.31.attn_q.weight | 0x17d211980 | 0xb00000 |
|
| 386 |
+
| 287 | blk.31.attn_v.weight | 0x17dd11980 | 0x348000 |
|
| 387 |
+
| 288 | blk.31.ffn_down.weight | 0x17e059980 | 0x3b80000 |
|
| 388 |
+
| 289 | blk.31.ffn_gate.weight | 0x181bd9980 | 0x2df0000 |
|
| 389 |
+
| 290 | blk.31.ffn_norm.weight | 0x1849c9980 | 0x4000 |
|
| 390 |
+
| 291 | blk.31.ffn_up.weight | 0x1849cd980 | 0x2df0000 |
|
| 391 |
+
|
| 392 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 393 |
+
|
| 394 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 395 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:-----|
|
| 396 |
+
| 0 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q6_K |
|
| 397 |
+
| 1 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 398 |
+
| 2 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 399 |
+
| 3 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q3_K |
|
| 400 |
+
|
| 401 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 402 |
+
- Percentage of total elements: 13.08%
|
| 403 |
+
|
| 404 |
+
|
| 405 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 406 |
+
|
| 407 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 408 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 409 |
+
| 4 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 410 |
+
| 5 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 411 |
+
| 6 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 412 |
+
| 7 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 413 |
+
| 8 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 414 |
+
| 9 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 415 |
+
| 10 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 416 |
+
| 11 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 417 |
+
| 12 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 418 |
+
|
| 419 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 420 |
+
- Percentage of total elements: 2.72%
|
| 421 |
+
|
| 422 |
+
|
| 423 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 424 |
+
|
| 425 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 426 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 427 |
+
| 13 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 428 |
+
| 14 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 429 |
+
| 15 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 430 |
+
| 16 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 431 |
+
| 17 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 432 |
+
| 18 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 433 |
+
| 19 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 434 |
+
| 20 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 435 |
+
| 21 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 436 |
+
|
| 437 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 438 |
+
- Percentage of total elements: 2.72%
|
| 439 |
+
|
| 440 |
+
|
| 441 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 442 |
+
|
| 443 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 444 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 445 |
+
| 22 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 446 |
+
| 23 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 447 |
+
| 24 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 448 |
+
| 25 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 449 |
+
| 26 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 450 |
+
| 27 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 451 |
+
| 28 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 452 |
+
| 29 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 453 |
+
| 30 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 454 |
+
|
| 455 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 456 |
+
- Percentage of total elements: 2.72%
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 460 |
+
|
| 461 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 462 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 463 |
+
| 31 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 464 |
+
| 32 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 465 |
+
| 33 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 466 |
+
| 34 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 467 |
+
| 35 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 468 |
+
| 36 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 469 |
+
| 37 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 470 |
+
| 38 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 471 |
+
| 39 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 472 |
+
|
| 473 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 474 |
+
- Percentage of total elements: 2.72%
|
| 475 |
+
|
| 476 |
+
|
| 477 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 478 |
+
|
| 479 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 480 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 481 |
+
| 40 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 482 |
+
| 41 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 483 |
+
| 42 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 484 |
+
| 43 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 485 |
+
| 44 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 486 |
+
| 45 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 487 |
+
| 46 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 488 |
+
| 47 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 489 |
+
| 48 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 490 |
+
|
| 491 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 492 |
+
- Percentage of total elements: 2.72%
|
| 493 |
+
|
| 494 |
+
|
| 495 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 496 |
+
|
| 497 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 498 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 499 |
+
| 49 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 500 |
+
| 50 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 501 |
+
| 51 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 502 |
+
| 52 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 503 |
+
| 53 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 504 |
+
| 54 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 505 |
+
| 55 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 506 |
+
| 56 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 507 |
+
| 57 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 508 |
+
|
| 509 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 510 |
+
- Percentage of total elements: 2.72%
|
| 511 |
+
|
| 512 |
+
|
| 513 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 514 |
+
|
| 515 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 516 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 517 |
+
| 58 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 518 |
+
| 59 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 519 |
+
| 60 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 520 |
+
| 61 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 521 |
+
| 62 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 522 |
+
| 63 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 523 |
+
| 64 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 524 |
+
| 65 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 525 |
+
| 66 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 526 |
+
|
| 527 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 528 |
+
- Percentage of total elements: 2.72%
|
| 529 |
+
|
| 530 |
+
|
| 531 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 532 |
+
|
| 533 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 534 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 535 |
+
| 67 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 536 |
+
| 68 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 537 |
+
| 69 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 538 |
+
| 70 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 539 |
+
| 71 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 540 |
+
| 72 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 541 |
+
| 73 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 542 |
+
| 74 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 543 |
+
| 75 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 544 |
+
|
| 545 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 546 |
+
- Percentage of total elements: 2.72%
|
| 547 |
+
|
| 548 |
+
|
| 549 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 550 |
+
|
| 551 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 552 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 553 |
+
| 76 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 554 |
+
| 77 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 555 |
+
| 78 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 556 |
+
| 79 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 557 |
+
| 80 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 558 |
+
| 81 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 559 |
+
| 82 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 560 |
+
| 83 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 561 |
+
| 84 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 562 |
+
|
| 563 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 564 |
+
- Percentage of total elements: 2.72%
|
| 565 |
+
|
| 566 |
+
|
| 567 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 568 |
+
|
| 569 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 570 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 571 |
+
| 85 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 572 |
+
| 86 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 573 |
+
| 87 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 574 |
+
| 88 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 575 |
+
| 89 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 576 |
+
| 90 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 577 |
+
| 91 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 578 |
+
| 92 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 579 |
+
| 93 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 580 |
+
|
| 581 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 582 |
+
- Percentage of total elements: 2.72%
|
| 583 |
+
|
| 584 |
+
|
| 585 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 586 |
+
|
| 587 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 588 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 589 |
+
| 94 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 590 |
+
| 95 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 591 |
+
| 96 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 592 |
+
| 97 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 593 |
+
| 98 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 594 |
+
| 99 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 595 |
+
| 100 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 596 |
+
| 101 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 597 |
+
| 102 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 598 |
+
|
| 599 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 600 |
+
- Percentage of total elements: 2.72%
|
| 601 |
+
|
| 602 |
+
|
| 603 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 604 |
+
|
| 605 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 606 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 607 |
+
| 103 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 608 |
+
| 104 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 609 |
+
| 105 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 610 |
+
| 106 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 611 |
+
| 107 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 612 |
+
| 108 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 613 |
+
| 109 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 614 |
+
| 110 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 615 |
+
| 111 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 616 |
+
|
| 617 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 618 |
+
- Percentage of total elements: 2.72%
|
| 619 |
+
|
| 620 |
+
|
| 621 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 622 |
+
|
| 623 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 624 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 625 |
+
| 112 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 626 |
+
| 113 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 627 |
+
| 114 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 628 |
+
| 115 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 629 |
+
| 116 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 630 |
+
| 117 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 631 |
+
| 118 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 632 |
+
| 119 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 633 |
+
| 120 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 634 |
+
|
| 635 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 636 |
+
- Percentage of total elements: 2.72%
|
| 637 |
+
|
| 638 |
+
|
| 639 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 640 |
+
|
| 641 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 642 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 643 |
+
| 121 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 644 |
+
| 122 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 645 |
+
| 123 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 646 |
+
| 124 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 647 |
+
| 125 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 648 |
+
| 126 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 649 |
+
| 127 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 650 |
+
| 128 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 651 |
+
| 129 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 652 |
+
|
| 653 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 654 |
+
- Percentage of total elements: 2.72%
|
| 655 |
+
|
| 656 |
+
|
| 657 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 658 |
+
|
| 659 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 660 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 661 |
+
| 130 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 662 |
+
| 131 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 663 |
+
| 132 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 664 |
+
| 133 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 665 |
+
| 134 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 666 |
+
| 135 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 667 |
+
| 136 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 668 |
+
| 137 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 669 |
+
| 138 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 670 |
+
|
| 671 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 672 |
+
- Percentage of total elements: 2.72%
|
| 673 |
+
|
| 674 |
+
|
| 675 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 676 |
+
|
| 677 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 678 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 679 |
+
| 139 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 680 |
+
| 140 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 681 |
+
| 141 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 682 |
+
| 142 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 683 |
+
| 143 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 684 |
+
| 144 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 685 |
+
| 145 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 686 |
+
| 146 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 687 |
+
| 147 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q5_K |
|
| 688 |
+
|
| 689 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 690 |
+
- Percentage of total elements: 2.72%
|
| 691 |
+
|
| 692 |
+
|
| 693 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 694 |
+
|
| 695 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 696 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 697 |
+
| 148 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 698 |
+
| 149 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 699 |
+
| 150 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 700 |
+
| 151 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 701 |
+
| 152 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 702 |
+
| 153 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 703 |
+
| 154 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 704 |
+
| 155 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 705 |
+
| 156 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 706 |
+
|
| 707 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 708 |
+
- Percentage of total elements: 2.72%
|
| 709 |
+
|
| 710 |
+
|
| 711 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 712 |
+
|
| 713 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 714 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 715 |
+
| 157 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 716 |
+
| 158 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 717 |
+
| 159 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 718 |
+
| 160 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 719 |
+
| 161 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 720 |
+
| 162 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 721 |
+
| 163 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 722 |
+
| 164 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 723 |
+
| 165 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 724 |
+
|
| 725 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 726 |
+
- Percentage of total elements: 2.72%
|
| 727 |
+
|
| 728 |
+
|
| 729 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 730 |
+
|
| 731 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 732 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 733 |
+
| 166 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 734 |
+
| 167 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 735 |
+
| 168 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 736 |
+
| 169 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 737 |
+
| 170 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 738 |
+
| 171 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 739 |
+
| 172 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 740 |
+
| 173 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 741 |
+
| 174 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 742 |
+
|
| 743 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 744 |
+
- Percentage of total elements: 2.72%
|
| 745 |
+
|
| 746 |
+
|
| 747 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 748 |
+
|
| 749 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 750 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 751 |
+
| 175 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 752 |
+
| 176 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 177 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 754 |
+
| 178 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 755 |
+
| 179 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 756 |
+
| 180 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 757 |
+
| 181 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 758 |
+
| 182 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 759 |
+
| 183 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 760 |
+
|
| 761 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 762 |
+
- Percentage of total elements: 2.72%
|
| 763 |
+
|
| 764 |
+
|
| 765 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 766 |
+
|
| 767 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 768 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 769 |
+
| 184 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 770 |
+
| 185 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 771 |
+
| 186 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 772 |
+
| 187 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 773 |
+
| 188 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 774 |
+
| 189 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 775 |
+
| 190 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 776 |
+
| 191 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 777 |
+
| 192 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 778 |
+
|
| 779 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 780 |
+
- Percentage of total elements: 2.72%
|
| 781 |
+
|
| 782 |
+
|
| 783 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 784 |
+
|
| 785 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 786 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 787 |
+
| 193 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 788 |
+
| 194 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 789 |
+
| 195 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 790 |
+
| 196 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 791 |
+
| 197 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 792 |
+
| 198 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 793 |
+
| 199 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 794 |
+
| 200 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 795 |
+
| 201 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 796 |
+
|
| 797 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 798 |
+
- Percentage of total elements: 2.72%
|
| 799 |
+
|
| 800 |
+
|
| 801 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 802 |
+
|
| 803 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 804 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 805 |
+
| 202 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 806 |
+
| 203 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 807 |
+
| 204 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 808 |
+
| 205 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 809 |
+
| 206 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 810 |
+
| 207 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 811 |
+
| 208 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 812 |
+
| 209 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 813 |
+
| 210 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 814 |
+
|
| 815 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 816 |
+
- Percentage of total elements: 2.72%
|
| 817 |
+
|
| 818 |
+
|
| 819 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 820 |
+
|
| 821 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 822 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 823 |
+
| 211 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 824 |
+
| 212 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 825 |
+
| 213 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 826 |
+
| 214 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 827 |
+
| 215 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 828 |
+
| 216 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 829 |
+
| 217 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 830 |
+
| 218 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 831 |
+
| 219 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 832 |
+
|
| 833 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 834 |
+
- Percentage of total elements: 2.72%
|
| 835 |
+
|
| 836 |
+
|
| 837 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 838 |
+
|
| 839 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 840 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 841 |
+
| 220 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 842 |
+
| 221 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 843 |
+
| 222 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 844 |
+
| 223 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 845 |
+
| 224 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 846 |
+
| 225 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 847 |
+
| 226 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 848 |
+
| 227 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 849 |
+
| 228 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 850 |
+
|
| 851 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 852 |
+
- Percentage of total elements: 2.72%
|
| 853 |
+
|
| 854 |
+
|
| 855 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 856 |
+
|
| 857 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 858 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 859 |
+
| 229 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 860 |
+
| 230 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 861 |
+
| 231 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 862 |
+
| 232 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 863 |
+
| 233 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 864 |
+
| 234 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 865 |
+
| 235 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 866 |
+
| 236 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 867 |
+
| 237 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 868 |
+
|
| 869 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 870 |
+
- Percentage of total elements: 2.72%
|
| 871 |
+
|
| 872 |
+
|
| 873 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 874 |
+
|
| 875 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 876 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 877 |
+
| 238 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 878 |
+
| 239 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 879 |
+
| 240 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 880 |
+
| 241 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 881 |
+
| 242 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 882 |
+
| 243 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 883 |
+
| 244 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 884 |
+
| 245 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 885 |
+
| 246 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 886 |
+
|
| 887 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 888 |
+
- Percentage of total elements: 2.72%
|
| 889 |
+
|
| 890 |
+
|
| 891 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 892 |
+
|
| 893 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 894 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 895 |
+
| 247 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 896 |
+
| 248 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 897 |
+
| 249 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 898 |
+
| 250 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 899 |
+
| 251 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 900 |
+
| 252 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 901 |
+
| 253 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 902 |
+
| 254 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 903 |
+
| 255 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 904 |
+
|
| 905 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 906 |
+
- Percentage of total elements: 2.72%
|
| 907 |
+
|
| 908 |
+
|
| 909 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 910 |
+
|
| 911 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 912 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 913 |
+
| 256 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 914 |
+
| 257 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 915 |
+
| 258 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 916 |
+
| 259 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 917 |
+
| 260 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 918 |
+
| 261 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 919 |
+
| 262 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 920 |
+
| 263 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 921 |
+
| 264 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 922 |
+
|
| 923 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 924 |
+
- Percentage of total elements: 2.72%
|
| 925 |
+
|
| 926 |
+
|
| 927 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 928 |
+
|
| 929 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 930 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 931 |
+
| 265 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 932 |
+
| 266 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 933 |
+
| 267 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 934 |
+
| 268 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 935 |
+
| 269 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 936 |
+
| 270 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 937 |
+
| 271 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 938 |
+
| 272 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 939 |
+
| 273 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 940 |
+
|
| 941 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 942 |
+
- Percentage of total elements: 2.72%
|
| 943 |
+
|
| 944 |
+
|
| 945 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 946 |
+
|
| 947 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 948 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 949 |
+
| 274 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 950 |
+
| 275 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 951 |
+
| 276 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 952 |
+
| 277 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 953 |
+
| 278 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 954 |
+
| 279 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 955 |
+
| 280 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 956 |
+
| 281 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 957 |
+
| 282 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 958 |
+
|
| 959 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 960 |
+
- Percentage of total elements: 2.72%
|
| 961 |
+
|
| 962 |
+
|
| 963 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 964 |
+
|
| 965 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 966 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 967 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q5_K |
|
| 968 |
+
| 284 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 969 |
+
| 285 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 970 |
+
| 286 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q5_K |
|
| 971 |
+
| 287 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 972 |
+
| 288 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 973 |
+
| 289 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 974 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 975 |
+
| 291 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 976 |
+
|
| 977 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 978 |
+
- Percentage of total elements: 2.72%
|
scores/Llama-Guard-3-8B-Q8_0.md
ADDED
|
@@ -0,0 +1,978 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Llama-Guard-3-8B-Q8_0.gguf - GGUF Internal File Dump
|
| 2 |
+
|
| 3 |
+
- Endian: LITTLE endian
|
| 4 |
+
|
| 5 |
+
## Key Value Metadata Store
|
| 6 |
+
|
| 7 |
+
There are 39 key-value pairs in this file
|
| 8 |
+
|
| 9 |
+
| POS | TYPE | Count | Key | Value |
|
| 10 |
+
|----:|:---------|-------:|:---------------------------------------|:--------------------------------------------------------------------|
|
| 11 |
+
| 1 | UINT32 | 1 | GGUF.version | 3 |
|
| 12 |
+
| 2 | UINT64 | 1 | GGUF.tensor_count | 292 |
|
| 13 |
+
| 3 | UINT64 | 1 | GGUF.kv_count | 36 |
|
| 14 |
+
| 4 | STRING | 1 | general.architecture | `llama` |
|
| 15 |
+
| 5 | STRING | 1 | general.type | `model` |
|
| 16 |
+
| 6 | STRING | 1 | general.name | `Llama Guard 3 8B` |
|
| 17 |
+
| 7 | STRING | 1 | general.basename | `Llama-Guard-3` |
|
| 18 |
+
| 8 | STRING | 1 | general.size_label | `8B` |
|
| 19 |
+
| 9 | STRING | 1 | general.license | `llama3.1` |
|
| 20 |
+
| 10 | UINT32 | 1 | general.base_model.count | 1 |
|
| 21 |
+
| 11 | STRING | 1 | general.base_model.0.name | `Meta Llama 3.1 8B` |
|
| 22 |
+
| 12 | STRING | 1 | general.base_model.0.organization | `Meta Llama` |
|
| 23 |
+
| 13 | STRING | 1 | general.base_model.0.repo_url | `https://huggingface.co/meta-llama/Meta-Llama-3.1-8B` |
|
| 24 |
+
| 14 | [STRING] | 6 | general.tags | [ `facebook`, `meta`, `pytorch`, `llama`, `llama-3`, ... ] |
|
| 25 |
+
| 15 | [STRING] | 1 | general.languages | [ `en` ] |
|
| 26 |
+
| 16 | UINT32 | 1 | llama.block_count | 32 |
|
| 27 |
+
| 17 | UINT32 | 1 | llama.context_length | 131072 |
|
| 28 |
+
| 18 | UINT32 | 1 | llama.embedding_length | 4096 |
|
| 29 |
+
| 19 | UINT32 | 1 | llama.feed_forward_length | 14336 |
|
| 30 |
+
| 20 | UINT32 | 1 | llama.attention.head_count | 32 |
|
| 31 |
+
| 21 | UINT32 | 1 | llama.attention.head_count_kv | 8 |
|
| 32 |
+
| 22 | FLOAT32 | 1 | llama.rope.freq_base | 500000.0 |
|
| 33 |
+
| 23 | FLOAT32 | 1 | llama.attention.layer_norm_rms_epsilon | 1e-05 |
|
| 34 |
+
| 24 | UINT32 | 1 | llama.vocab_size | 128256 |
|
| 35 |
+
| 25 | UINT32 | 1 | llama.rope.dimension_count | 128 |
|
| 36 |
+
| 26 | STRING | 1 | tokenizer.ggml.model | `gpt2` |
|
| 37 |
+
| 27 | STRING | 1 | tokenizer.ggml.pre | `smaug-bpe` |
|
| 38 |
+
| 28 | [STRING] | 128256 | tokenizer.ggml.tokens | [ `!`, `"`, `#`, `$`, `%`, ... ] |
|
| 39 |
+
| 29 | [INT32] | 128256 | tokenizer.ggml.token_type | [ 1, 1, 1, 1, 1, 1, 1, ... ] |
|
| 40 |
+
| 30 | [STRING] | 280147 | tokenizer.ggml.merges | [ `Ġ Ġ`, `Ġ ĠĠĠ`, `ĠĠ ĠĠ`, `ĠĠĠ Ġ`, `i n`, ... ] |
|
| 41 |
+
| 31 | UINT32 | 1 | tokenizer.ggml.bos_token_id | 128000 |
|
| 42 |
+
| 32 | UINT32 | 1 | tokenizer.ggml.eos_token_id | 128009 |
|
| 43 |
+
| 33 | STRING | 1 | tokenizer.chat_template | `{% if messages|length % 2 == 0`...`assistant<|end_header_id|>" }}` |
|
| 44 |
+
| 34 | UINT32 | 1 | general.quantization_version | 2 |
|
| 45 |
+
| 35 | UINT32 | 1 | general.file_type | 7 |
|
| 46 |
+
| 36 | STRING | 1 | quantize.imatrix.file | `./imatrix/imatrix-Llama-Guard-3-8B-small.dat` |
|
| 47 |
+
| 37 | STRING | 1 | quantize.imatrix.dataset | `../../datasets/imatrix/calibration_eur_small.txt` |
|
| 48 |
+
| 38 | INT32 | 1 | quantize.imatrix.entries_count | 224 |
|
| 49 |
+
| 39 | INT32 | 1 | quantize.imatrix.chunks_count | 962 |
|
| 50 |
+
|
| 51 |
+
## Tensors Overview ~8B Elements
|
| 52 |
+
|
| 53 |
+
Total number of elements in all tensors: 8030261312 Elements
|
| 54 |
+
|
| 55 |
+
- [Llama-Guard-3-8B-Q8\_0.gguf - GGUF Internal File Dump](#llama-guard-3-8b-q8_0gguf---gguf-internal-file-dump)
|
| 56 |
+
- [Key Value Metadata Store](#key-value-metadata-store)
|
| 57 |
+
- [Tensors Overview ~8B Elements](#tensors-overview-8b-elements)
|
| 58 |
+
- [Tensor Data Offset](#tensor-data-offset)
|
| 59 |
+
- [Base Tensor Group : ~1B Elements](#base-tensor-group--1b-elements)
|
| 60 |
+
- [Block 0 Tensor Group : ~218M Elements](#block-0-tensor-group--218m-elements)
|
| 61 |
+
- [Block 1 Tensor Group : ~218M Elements](#block-1-tensor-group--218m-elements)
|
| 62 |
+
- [Block 2 Tensor Group : ~218M Elements](#block-2-tensor-group--218m-elements)
|
| 63 |
+
- [Block 3 Tensor Group : ~218M Elements](#block-3-tensor-group--218m-elements)
|
| 64 |
+
- [Block 4 Tensor Group : ~218M Elements](#block-4-tensor-group--218m-elements)
|
| 65 |
+
- [Block 5 Tensor Group : ~218M Elements](#block-5-tensor-group--218m-elements)
|
| 66 |
+
- [Block 6 Tensor Group : ~218M Elements](#block-6-tensor-group--218m-elements)
|
| 67 |
+
- [Block 7 Tensor Group : ~218M Elements](#block-7-tensor-group--218m-elements)
|
| 68 |
+
- [Block 8 Tensor Group : ~218M Elements](#block-8-tensor-group--218m-elements)
|
| 69 |
+
- [Block 9 Tensor Group : ~218M Elements](#block-9-tensor-group--218m-elements)
|
| 70 |
+
- [Block 10 Tensor Group : ~218M Elements](#block-10-tensor-group--218m-elements)
|
| 71 |
+
- [Block 11 Tensor Group : ~218M Elements](#block-11-tensor-group--218m-elements)
|
| 72 |
+
- [Block 12 Tensor Group : ~218M Elements](#block-12-tensor-group--218m-elements)
|
| 73 |
+
- [Block 13 Tensor Group : ~218M Elements](#block-13-tensor-group--218m-elements)
|
| 74 |
+
- [Block 14 Tensor Group : ~218M Elements](#block-14-tensor-group--218m-elements)
|
| 75 |
+
- [Block 15 Tensor Group : ~218M Elements](#block-15-tensor-group--218m-elements)
|
| 76 |
+
- [Block 16 Tensor Group : ~218M Elements](#block-16-tensor-group--218m-elements)
|
| 77 |
+
- [Block 17 Tensor Group : ~218M Elements](#block-17-tensor-group--218m-elements)
|
| 78 |
+
- [Block 18 Tensor Group : ~218M Elements](#block-18-tensor-group--218m-elements)
|
| 79 |
+
- [Block 19 Tensor Group : ~218M Elements](#block-19-tensor-group--218m-elements)
|
| 80 |
+
- [Block 20 Tensor Group : ~218M Elements](#block-20-tensor-group--218m-elements)
|
| 81 |
+
- [Block 21 Tensor Group : ~218M Elements](#block-21-tensor-group--218m-elements)
|
| 82 |
+
- [Block 22 Tensor Group : ~218M Elements](#block-22-tensor-group--218m-elements)
|
| 83 |
+
- [Block 23 Tensor Group : ~218M Elements](#block-23-tensor-group--218m-elements)
|
| 84 |
+
- [Block 24 Tensor Group : ~218M Elements](#block-24-tensor-group--218m-elements)
|
| 85 |
+
- [Block 25 Tensor Group : ~218M Elements](#block-25-tensor-group--218m-elements)
|
| 86 |
+
- [Block 26 Tensor Group : ~218M Elements](#block-26-tensor-group--218m-elements)
|
| 87 |
+
- [Block 27 Tensor Group : ~218M Elements](#block-27-tensor-group--218m-elements)
|
| 88 |
+
- [Block 28 Tensor Group : ~218M Elements](#block-28-tensor-group--218m-elements)
|
| 89 |
+
- [Block 29 Tensor Group : ~218M Elements](#block-29-tensor-group--218m-elements)
|
| 90 |
+
- [Block 30 Tensor Group : ~218M Elements](#block-30-tensor-group--218m-elements)
|
| 91 |
+
- [Block 31 Tensor Group : ~218M Elements](#block-31-tensor-group--218m-elements)
|
| 92 |
+
|
| 93 |
+
### Tensor Data Offset
|
| 94 |
+
|
| 95 |
+
This table contains the offset and data segment relative to start of file
|
| 96 |
+
|
| 97 |
+
| T_ID | Tensor Layer Name | Data Offset (B) | Data Size (B) |
|
| 98 |
+
|-----:|:--------------------------|-----------------:|-----------------:|
|
| 99 |
+
| 0 | output.weight | 0x779880 | 0x21450000 |
|
| 100 |
+
| 1 | output_norm.weight | 0x21bc9880 | 0x4000 |
|
| 101 |
+
| 2 | rope_freqs.weight | 0x21bcd880 | 0x100 |
|
| 102 |
+
| 3 | token_embd.weight | 0x21bcd980 | 0xd746000 |
|
| 103 |
+
| 4 | blk.0.attn_k.weight | 0x2f313980 | 0x348000 |
|
| 104 |
+
| 5 | blk.0.attn_norm.weight | 0x2f65b980 | 0x4000 |
|
| 105 |
+
| 6 | blk.0.attn_output.weight | 0x2f65f980 | 0x1100000 |
|
| 106 |
+
| 7 | blk.0.attn_q.weight | 0x3075f980 | 0xd20000 |
|
| 107 |
+
| 8 | blk.0.attn_v.weight | 0x3147f980 | 0x440000 |
|
| 108 |
+
| 9 | blk.0.ffn_down.weight | 0x318bf980 | 0x3b80000 |
|
| 109 |
+
| 10 | blk.0.ffn_gate.weight | 0x3543f980 | 0x2df0000 |
|
| 110 |
+
| 11 | blk.0.ffn_norm.weight | 0x3822f980 | 0x4000 |
|
| 111 |
+
| 12 | blk.0.ffn_up.weight | 0x38233980 | 0x2df0000 |
|
| 112 |
+
| 13 | blk.1.attn_k.weight | 0x3b023980 | 0x348000 |
|
| 113 |
+
| 14 | blk.1.attn_norm.weight | 0x3b36b980 | 0x4000 |
|
| 114 |
+
| 15 | blk.1.attn_output.weight | 0x3b36f980 | 0x1100000 |
|
| 115 |
+
| 16 | blk.1.attn_q.weight | 0x3c46f980 | 0xd20000 |
|
| 116 |
+
| 17 | blk.1.attn_v.weight | 0x3d18f980 | 0x440000 |
|
| 117 |
+
| 18 | blk.1.ffn_down.weight | 0x3d5cf980 | 0x3b80000 |
|
| 118 |
+
| 19 | blk.1.ffn_gate.weight | 0x4114f980 | 0x2df0000 |
|
| 119 |
+
| 20 | blk.1.ffn_norm.weight | 0x43f3f980 | 0x4000 |
|
| 120 |
+
| 21 | blk.1.ffn_up.weight | 0x43f43980 | 0x2df0000 |
|
| 121 |
+
| 22 | blk.2.attn_k.weight | 0x46d33980 | 0x348000 |
|
| 122 |
+
| 23 | blk.2.attn_norm.weight | 0x4707b980 | 0x4000 |
|
| 123 |
+
| 24 | blk.2.attn_output.weight | 0x4707f980 | 0x1100000 |
|
| 124 |
+
| 25 | blk.2.attn_q.weight | 0x4817f980 | 0xd20000 |
|
| 125 |
+
| 26 | blk.2.attn_v.weight | 0x48e9f980 | 0x440000 |
|
| 126 |
+
| 27 | blk.2.ffn_down.weight | 0x492df980 | 0x3b80000 |
|
| 127 |
+
| 28 | blk.2.ffn_gate.weight | 0x4ce5f980 | 0x2df0000 |
|
| 128 |
+
| 29 | blk.2.ffn_norm.weight | 0x4fc4f980 | 0x4000 |
|
| 129 |
+
| 30 | blk.2.ffn_up.weight | 0x4fc53980 | 0x2df0000 |
|
| 130 |
+
| 31 | blk.3.attn_k.weight | 0x52a43980 | 0x348000 |
|
| 131 |
+
| 32 | blk.3.attn_norm.weight | 0x52d8b980 | 0x4000 |
|
| 132 |
+
| 33 | blk.3.attn_output.weight | 0x52d8f980 | 0x1100000 |
|
| 133 |
+
| 34 | blk.3.attn_q.weight | 0x53e8f980 | 0xd20000 |
|
| 134 |
+
| 35 | blk.3.attn_v.weight | 0x54baf980 | 0x440000 |
|
| 135 |
+
| 36 | blk.3.ffn_down.weight | 0x54fef980 | 0x3b80000 |
|
| 136 |
+
| 37 | blk.3.ffn_gate.weight | 0x58b6f980 | 0x2df0000 |
|
| 137 |
+
| 38 | blk.3.ffn_norm.weight | 0x5b95f980 | 0x4000 |
|
| 138 |
+
| 39 | blk.3.ffn_up.weight | 0x5b963980 | 0x2df0000 |
|
| 139 |
+
| 40 | blk.4.attn_k.weight | 0x5e753980 | 0x348000 |
|
| 140 |
+
| 41 | blk.4.attn_norm.weight | 0x5ea9b980 | 0x4000 |
|
| 141 |
+
| 42 | blk.4.attn_output.weight | 0x5ea9f980 | 0x1100000 |
|
| 142 |
+
| 43 | blk.4.attn_q.weight | 0x5fb9f980 | 0xd20000 |
|
| 143 |
+
| 44 | blk.4.attn_v.weight | 0x608bf980 | 0x440000 |
|
| 144 |
+
| 45 | blk.4.ffn_down.weight | 0x60cff980 | 0x3b80000 |
|
| 145 |
+
| 46 | blk.4.ffn_gate.weight | 0x6487f980 | 0x2df0000 |
|
| 146 |
+
| 47 | blk.4.ffn_norm.weight | 0x6766f980 | 0x4000 |
|
| 147 |
+
| 48 | blk.4.ffn_up.weight | 0x67673980 | 0x2df0000 |
|
| 148 |
+
| 49 | blk.5.attn_k.weight | 0x6a463980 | 0x348000 |
|
| 149 |
+
| 50 | blk.5.attn_norm.weight | 0x6a7ab980 | 0x4000 |
|
| 150 |
+
| 51 | blk.5.attn_output.weight | 0x6a7af980 | 0x1100000 |
|
| 151 |
+
| 52 | blk.5.attn_q.weight | 0x6b8af980 | 0xd20000 |
|
| 152 |
+
| 53 | blk.5.attn_v.weight | 0x6c5cf980 | 0x440000 |
|
| 153 |
+
| 54 | blk.5.ffn_down.weight | 0x6ca0f980 | 0x3b80000 |
|
| 154 |
+
| 55 | blk.5.ffn_gate.weight | 0x7058f980 | 0x2df0000 |
|
| 155 |
+
| 56 | blk.5.ffn_norm.weight | 0x7337f980 | 0x4000 |
|
| 156 |
+
| 57 | blk.5.ffn_up.weight | 0x73383980 | 0x2df0000 |
|
| 157 |
+
| 58 | blk.6.attn_k.weight | 0x76173980 | 0x348000 |
|
| 158 |
+
| 59 | blk.6.attn_norm.weight | 0x764bb980 | 0x4000 |
|
| 159 |
+
| 60 | blk.6.attn_output.weight | 0x764bf980 | 0x1100000 |
|
| 160 |
+
| 61 | blk.6.attn_q.weight | 0x775bf980 | 0xd20000 |
|
| 161 |
+
| 62 | blk.6.attn_v.weight | 0x782df980 | 0x440000 |
|
| 162 |
+
| 63 | blk.6.ffn_down.weight | 0x7871f980 | 0x3b80000 |
|
| 163 |
+
| 64 | blk.6.ffn_gate.weight | 0x7c29f980 | 0x2df0000 |
|
| 164 |
+
| 65 | blk.6.ffn_norm.weight | 0x7f08f980 | 0x4000 |
|
| 165 |
+
| 66 | blk.6.ffn_up.weight | 0x7f093980 | 0x2df0000 |
|
| 166 |
+
| 67 | blk.7.attn_k.weight | 0x81e83980 | 0x348000 |
|
| 167 |
+
| 68 | blk.7.attn_norm.weight | 0x821cb980 | 0x4000 |
|
| 168 |
+
| 69 | blk.7.attn_output.weight | 0x821cf980 | 0x1100000 |
|
| 169 |
+
| 70 | blk.7.attn_q.weight | 0x832cf980 | 0xd20000 |
|
| 170 |
+
| 71 | blk.7.attn_v.weight | 0x83fef980 | 0x440000 |
|
| 171 |
+
| 72 | blk.7.ffn_down.weight | 0x8442f980 | 0x3b80000 |
|
| 172 |
+
| 73 | blk.7.ffn_gate.weight | 0x87faf980 | 0x2df0000 |
|
| 173 |
+
| 74 | blk.7.ffn_norm.weight | 0x8ad9f980 | 0x4000 |
|
| 174 |
+
| 75 | blk.7.ffn_up.weight | 0x8ada3980 | 0x2df0000 |
|
| 175 |
+
| 76 | blk.8.attn_k.weight | 0x8db93980 | 0x348000 |
|
| 176 |
+
| 77 | blk.8.attn_norm.weight | 0x8dedb980 | 0x4000 |
|
| 177 |
+
| 78 | blk.8.attn_output.weight | 0x8dedf980 | 0x1100000 |
|
| 178 |
+
| 79 | blk.8.attn_q.weight | 0x8efdf980 | 0xd20000 |
|
| 179 |
+
| 80 | blk.8.attn_v.weight | 0x8fcff980 | 0x440000 |
|
| 180 |
+
| 81 | blk.8.ffn_down.weight | 0x9013f980 | 0x3b80000 |
|
| 181 |
+
| 82 | blk.8.ffn_gate.weight | 0x93cbf980 | 0x2df0000 |
|
| 182 |
+
| 83 | blk.8.ffn_norm.weight | 0x96aaf980 | 0x4000 |
|
| 183 |
+
| 84 | blk.8.ffn_up.weight | 0x96ab3980 | 0x2df0000 |
|
| 184 |
+
| 85 | blk.9.attn_k.weight | 0x998a3980 | 0x348000 |
|
| 185 |
+
| 86 | blk.9.attn_norm.weight | 0x99beb980 | 0x4000 |
|
| 186 |
+
| 87 | blk.9.attn_output.weight | 0x99bef980 | 0x1100000 |
|
| 187 |
+
| 88 | blk.9.attn_q.weight | 0x9acef980 | 0xd20000 |
|
| 188 |
+
| 89 | blk.9.attn_v.weight | 0x9ba0f980 | 0x440000 |
|
| 189 |
+
| 90 | blk.9.ffn_down.weight | 0x9be4f980 | 0x3b80000 |
|
| 190 |
+
| 91 | blk.9.ffn_gate.weight | 0x9f9cf980 | 0x2df0000 |
|
| 191 |
+
| 92 | blk.9.ffn_norm.weight | 0xa27bf980 | 0x4000 |
|
| 192 |
+
| 93 | blk.9.ffn_up.weight | 0xa27c3980 | 0x2df0000 |
|
| 193 |
+
| 94 | blk.10.attn_k.weight | 0xa55b3980 | 0x348000 |
|
| 194 |
+
| 95 | blk.10.attn_norm.weight | 0xa58fb980 | 0x4000 |
|
| 195 |
+
| 96 | blk.10.attn_output.weight | 0xa58ff980 | 0x1100000 |
|
| 196 |
+
| 97 | blk.10.attn_q.weight | 0xa69ff980 | 0xd20000 |
|
| 197 |
+
| 98 | blk.10.attn_v.weight | 0xa771f980 | 0x440000 |
|
| 198 |
+
| 99 | blk.10.ffn_down.weight | 0xa7b5f980 | 0x3b80000 |
|
| 199 |
+
| 100 | blk.10.ffn_gate.weight | 0xab6df980 | 0x2df0000 |
|
| 200 |
+
| 101 | blk.10.ffn_norm.weight | 0xae4cf980 | 0x4000 |
|
| 201 |
+
| 102 | blk.10.ffn_up.weight | 0xae4d3980 | 0x2df0000 |
|
| 202 |
+
| 103 | blk.11.attn_k.weight | 0xb12c3980 | 0x348000 |
|
| 203 |
+
| 104 | blk.11.attn_norm.weight | 0xb160b980 | 0x4000 |
|
| 204 |
+
| 105 | blk.11.attn_output.weight | 0xb160f980 | 0x1100000 |
|
| 205 |
+
| 106 | blk.11.attn_q.weight | 0xb270f980 | 0xd20000 |
|
| 206 |
+
| 107 | blk.11.attn_v.weight | 0xb342f980 | 0x440000 |
|
| 207 |
+
| 108 | blk.11.ffn_down.weight | 0xb386f980 | 0x3b80000 |
|
| 208 |
+
| 109 | blk.11.ffn_gate.weight | 0xb73ef980 | 0x2df0000 |
|
| 209 |
+
| 110 | blk.11.ffn_norm.weight | 0xba1df980 | 0x4000 |
|
| 210 |
+
| 111 | blk.11.ffn_up.weight | 0xba1e3980 | 0x2df0000 |
|
| 211 |
+
| 112 | blk.12.attn_k.weight | 0xbcfd3980 | 0x348000 |
|
| 212 |
+
| 113 | blk.12.attn_norm.weight | 0xbd31b980 | 0x4000 |
|
| 213 |
+
| 114 | blk.12.attn_output.weight | 0xbd31f980 | 0x1100000 |
|
| 214 |
+
| 115 | blk.12.attn_q.weight | 0xbe41f980 | 0xd20000 |
|
| 215 |
+
| 116 | blk.12.attn_v.weight | 0xbf13f980 | 0x440000 |
|
| 216 |
+
| 117 | blk.12.ffn_down.weight | 0xbf57f980 | 0x3b80000 |
|
| 217 |
+
| 118 | blk.12.ffn_gate.weight | 0xc30ff980 | 0x2df0000 |
|
| 218 |
+
| 119 | blk.12.ffn_norm.weight | 0xc5eef980 | 0x4000 |
|
| 219 |
+
| 120 | blk.12.ffn_up.weight | 0xc5ef3980 | 0x2df0000 |
|
| 220 |
+
| 121 | blk.13.attn_k.weight | 0xc8ce3980 | 0x440000 |
|
| 221 |
+
| 122 | blk.13.attn_norm.weight | 0xc9123980 | 0x4000 |
|
| 222 |
+
| 123 | blk.13.attn_output.weight | 0xc9127980 | 0x1100000 |
|
| 223 |
+
| 124 | blk.13.attn_q.weight | 0xca227980 | 0x1100000 |
|
| 224 |
+
| 125 | blk.13.attn_v.weight | 0xcb327980 | 0x800000 |
|
| 225 |
+
| 126 | blk.13.ffn_down.weight | 0xcbb27980 | 0x3b80000 |
|
| 226 |
+
| 127 | blk.13.ffn_gate.weight | 0xcf6a7980 | 0x2df0000 |
|
| 227 |
+
| 128 | blk.13.ffn_norm.weight | 0xd2497980 | 0x4000 |
|
| 228 |
+
| 129 | blk.13.ffn_up.weight | 0xd249b980 | 0x2df0000 |
|
| 229 |
+
| 130 | blk.14.attn_k.weight | 0xd528b980 | 0x440000 |
|
| 230 |
+
| 131 | blk.14.attn_norm.weight | 0xd56cb980 | 0x4000 |
|
| 231 |
+
| 132 | blk.14.attn_output.weight | 0xd56cf980 | 0x1100000 |
|
| 232 |
+
| 133 | blk.14.attn_q.weight | 0xd67cf980 | 0x1100000 |
|
| 233 |
+
| 134 | blk.14.attn_v.weight | 0xd78cf980 | 0x800000 |
|
| 234 |
+
| 135 | blk.14.ffn_down.weight | 0xd80cf980 | 0x3b80000 |
|
| 235 |
+
| 136 | blk.14.ffn_gate.weight | 0xdbc4f980 | 0x2df0000 |
|
| 236 |
+
| 137 | blk.14.ffn_norm.weight | 0xdea3f980 | 0x4000 |
|
| 237 |
+
| 138 | blk.14.ffn_up.weight | 0xdea43980 | 0x2df0000 |
|
| 238 |
+
| 139 | blk.15.attn_k.weight | 0xe1833980 | 0x348000 |
|
| 239 |
+
| 140 | blk.15.attn_norm.weight | 0xe1b7b980 | 0x4000 |
|
| 240 |
+
| 141 | blk.15.attn_output.weight | 0xe1b7f980 | 0x1100000 |
|
| 241 |
+
| 142 | blk.15.attn_q.weight | 0xe2c7f980 | 0xd20000 |
|
| 242 |
+
| 143 | blk.15.attn_v.weight | 0xe399f980 | 0x440000 |
|
| 243 |
+
| 144 | blk.15.ffn_down.weight | 0xe3ddf980 | 0x3b80000 |
|
| 244 |
+
| 145 | blk.15.ffn_gate.weight | 0xe795f980 | 0x2df0000 |
|
| 245 |
+
| 146 | blk.15.ffn_norm.weight | 0xea74f980 | 0x4000 |
|
| 246 |
+
| 147 | blk.15.ffn_up.weight | 0xea753980 | 0x2df0000 |
|
| 247 |
+
| 148 | blk.16.attn_k.weight | 0xed543980 | 0x440000 |
|
| 248 |
+
| 149 | blk.16.attn_norm.weight | 0xed983980 | 0x4000 |
|
| 249 |
+
| 150 | blk.16.attn_output.weight | 0xed987980 | 0x1100000 |
|
| 250 |
+
| 151 | blk.16.attn_q.weight | 0xeea87980 | 0x1100000 |
|
| 251 |
+
| 152 | blk.16.attn_v.weight | 0xefb87980 | 0x800000 |
|
| 252 |
+
| 153 | blk.16.ffn_down.weight | 0xf0387980 | 0x3b80000 |
|
| 253 |
+
| 154 | blk.16.ffn_gate.weight | 0xf3f07980 | 0x3b80000 |
|
| 254 |
+
| 155 | blk.16.ffn_norm.weight | 0xf7a87980 | 0x4000 |
|
| 255 |
+
| 156 | blk.16.ffn_up.weight | 0xf7a8b980 | 0x3b80000 |
|
| 256 |
+
| 157 | blk.17.attn_k.weight | 0xfb60b980 | 0x348000 |
|
| 257 |
+
| 158 | blk.17.attn_norm.weight | 0xfb953980 | 0x4000 |
|
| 258 |
+
| 159 | blk.17.attn_output.weight | 0xfb957980 | 0x1100000 |
|
| 259 |
+
| 160 | blk.17.attn_q.weight | 0xfca57980 | 0xd20000 |
|
| 260 |
+
| 161 | blk.17.attn_v.weight | 0xfd777980 | 0x440000 |
|
| 261 |
+
| 162 | blk.17.ffn_down.weight | 0xfdbb7980 | 0x3b80000 |
|
| 262 |
+
| 163 | blk.17.ffn_gate.weight | 0x101737980 | 0x3b80000 |
|
| 263 |
+
| 164 | blk.17.ffn_norm.weight | 0x1052b7980 | 0x4000 |
|
| 264 |
+
| 165 | blk.17.ffn_up.weight | 0x1052bb980 | 0x3b80000 |
|
| 265 |
+
| 166 | blk.18.attn_k.weight | 0x108e3b980 | 0x440000 |
|
| 266 |
+
| 167 | blk.18.attn_norm.weight | 0x10927b980 | 0x4000 |
|
| 267 |
+
| 168 | blk.18.attn_output.weight | 0x10927f980 | 0x1100000 |
|
| 268 |
+
| 169 | blk.18.attn_q.weight | 0x10a37f980 | 0x1100000 |
|
| 269 |
+
| 170 | blk.18.attn_v.weight | 0x10b47f980 | 0x800000 |
|
| 270 |
+
| 171 | blk.18.ffn_down.weight | 0x10bc7f980 | 0x3b80000 |
|
| 271 |
+
| 172 | blk.18.ffn_gate.weight | 0x10f7ff980 | 0x3b80000 |
|
| 272 |
+
| 173 | blk.18.ffn_norm.weight | 0x11337f980 | 0x4000 |
|
| 273 |
+
| 174 | blk.18.ffn_up.weight | 0x113383980 | 0x3b80000 |
|
| 274 |
+
| 175 | blk.19.attn_k.weight | 0x116f03980 | 0x440000 |
|
| 275 |
+
| 176 | blk.19.attn_norm.weight | 0x117343980 | 0x4000 |
|
| 276 |
+
| 177 | blk.19.attn_output.weight | 0x117347980 | 0x1100000 |
|
| 277 |
+
| 178 | blk.19.attn_q.weight | 0x118447980 | 0x1100000 |
|
| 278 |
+
| 179 | blk.19.attn_v.weight | 0x119547980 | 0x800000 |
|
| 279 |
+
| 180 | blk.19.ffn_down.weight | 0x119d47980 | 0x3b80000 |
|
| 280 |
+
| 181 | blk.19.ffn_gate.weight | 0x11d8c7980 | 0x3b80000 |
|
| 281 |
+
| 182 | blk.19.ffn_norm.weight | 0x121447980 | 0x4000 |
|
| 282 |
+
| 183 | blk.19.ffn_up.weight | 0x12144b980 | 0x3b80000 |
|
| 283 |
+
| 184 | blk.20.attn_k.weight | 0x124fcb980 | 0x440000 |
|
| 284 |
+
| 185 | blk.20.attn_norm.weight | 0x12540b980 | 0x4000 |
|
| 285 |
+
| 186 | blk.20.attn_output.weight | 0x12540f980 | 0x1100000 |
|
| 286 |
+
| 187 | blk.20.attn_q.weight | 0x12650f980 | 0x1100000 |
|
| 287 |
+
| 188 | blk.20.attn_v.weight | 0x12760f980 | 0x800000 |
|
| 288 |
+
| 189 | blk.20.ffn_down.weight | 0x127e0f980 | 0x3b80000 |
|
| 289 |
+
| 190 | blk.20.ffn_gate.weight | 0x12b98f980 | 0x3b80000 |
|
| 290 |
+
| 191 | blk.20.ffn_norm.weight | 0x12f50f980 | 0x4000 |
|
| 291 |
+
| 192 | blk.20.ffn_up.weight | 0x12f513980 | 0x3b80000 |
|
| 292 |
+
| 193 | blk.21.attn_k.weight | 0x133093980 | 0x440000 |
|
| 293 |
+
| 194 | blk.21.attn_norm.weight | 0x1334d3980 | 0x4000 |
|
| 294 |
+
| 195 | blk.21.attn_output.weight | 0x1334d7980 | 0x1100000 |
|
| 295 |
+
| 196 | blk.21.attn_q.weight | 0x1345d7980 | 0x1100000 |
|
| 296 |
+
| 197 | blk.21.attn_v.weight | 0x1356d7980 | 0x800000 |
|
| 297 |
+
| 198 | blk.21.ffn_down.weight | 0x135ed7980 | 0x3b80000 |
|
| 298 |
+
| 199 | blk.21.ffn_gate.weight | 0x139a57980 | 0x3b80000 |
|
| 299 |
+
| 200 | blk.21.ffn_norm.weight | 0x13d5d7980 | 0x4000 |
|
| 300 |
+
| 201 | blk.21.ffn_up.weight | 0x13d5db980 | 0x3b80000 |
|
| 301 |
+
| 202 | blk.22.attn_k.weight | 0x14115b980 | 0x440000 |
|
| 302 |
+
| 203 | blk.22.attn_norm.weight | 0x14159b980 | 0x4000 |
|
| 303 |
+
| 204 | blk.22.attn_output.weight | 0x14159f980 | 0x1100000 |
|
| 304 |
+
| 205 | blk.22.attn_q.weight | 0x14269f980 | 0x1100000 |
|
| 305 |
+
| 206 | blk.22.attn_v.weight | 0x14379f980 | 0x800000 |
|
| 306 |
+
| 207 | blk.22.ffn_down.weight | 0x143f9f980 | 0x3b80000 |
|
| 307 |
+
| 208 | blk.22.ffn_gate.weight | 0x147b1f980 | 0x3b80000 |
|
| 308 |
+
| 209 | blk.22.ffn_norm.weight | 0x14b69f980 | 0x4000 |
|
| 309 |
+
| 210 | blk.22.ffn_up.weight | 0x14b6a3980 | 0x3b80000 |
|
| 310 |
+
| 211 | blk.23.attn_k.weight | 0x14f223980 | 0x440000 |
|
| 311 |
+
| 212 | blk.23.attn_norm.weight | 0x14f663980 | 0x4000 |
|
| 312 |
+
| 213 | blk.23.attn_output.weight | 0x14f667980 | 0x1100000 |
|
| 313 |
+
| 214 | blk.23.attn_q.weight | 0x150767980 | 0x1100000 |
|
| 314 |
+
| 215 | blk.23.attn_v.weight | 0x151867980 | 0x800000 |
|
| 315 |
+
| 216 | blk.23.ffn_down.weight | 0x152067980 | 0x3b80000 |
|
| 316 |
+
| 217 | blk.23.ffn_gate.weight | 0x155be7980 | 0x3b80000 |
|
| 317 |
+
| 218 | blk.23.ffn_norm.weight | 0x159767980 | 0x4000 |
|
| 318 |
+
| 219 | blk.23.ffn_up.weight | 0x15976b980 | 0x3b80000 |
|
| 319 |
+
| 220 | blk.24.attn_k.weight | 0x15d2eb980 | 0x440000 |
|
| 320 |
+
| 221 | blk.24.attn_norm.weight | 0x15d72b980 | 0x4000 |
|
| 321 |
+
| 222 | blk.24.attn_output.weight | 0x15d72f980 | 0x1100000 |
|
| 322 |
+
| 223 | blk.24.attn_q.weight | 0x15e82f980 | 0x1100000 |
|
| 323 |
+
| 224 | blk.24.attn_v.weight | 0x15f92f980 | 0x800000 |
|
| 324 |
+
| 225 | blk.24.ffn_down.weight | 0x16012f980 | 0x3b80000 |
|
| 325 |
+
| 226 | blk.24.ffn_gate.weight | 0x163caf980 | 0x3b80000 |
|
| 326 |
+
| 227 | blk.24.ffn_norm.weight | 0x16782f980 | 0x4000 |
|
| 327 |
+
| 228 | blk.24.ffn_up.weight | 0x167833980 | 0x3b80000 |
|
| 328 |
+
| 229 | blk.25.attn_k.weight | 0x16b3b3980 | 0x440000 |
|
| 329 |
+
| 230 | blk.25.attn_norm.weight | 0x16b7f3980 | 0x4000 |
|
| 330 |
+
| 231 | blk.25.attn_output.weight | 0x16b7f7980 | 0x1100000 |
|
| 331 |
+
| 232 | blk.25.attn_q.weight | 0x16c8f7980 | 0x1100000 |
|
| 332 |
+
| 233 | blk.25.attn_v.weight | 0x16d9f7980 | 0x800000 |
|
| 333 |
+
| 234 | blk.25.ffn_down.weight | 0x16e1f7980 | 0x3b80000 |
|
| 334 |
+
| 235 | blk.25.ffn_gate.weight | 0x171d77980 | 0x3b80000 |
|
| 335 |
+
| 236 | blk.25.ffn_norm.weight | 0x1758f7980 | 0x4000 |
|
| 336 |
+
| 237 | blk.25.ffn_up.weight | 0x1758fb980 | 0x3b80000 |
|
| 337 |
+
| 238 | blk.26.attn_k.weight | 0x17947b980 | 0x440000 |
|
| 338 |
+
| 239 | blk.26.attn_norm.weight | 0x1798bb980 | 0x4000 |
|
| 339 |
+
| 240 | blk.26.attn_output.weight | 0x1798bf980 | 0x1100000 |
|
| 340 |
+
| 241 | blk.26.attn_q.weight | 0x17a9bf980 | 0x1100000 |
|
| 341 |
+
| 242 | blk.26.attn_v.weight | 0x17babf980 | 0x800000 |
|
| 342 |
+
| 243 | blk.26.ffn_down.weight | 0x17c2bf980 | 0x3b80000 |
|
| 343 |
+
| 244 | blk.26.ffn_gate.weight | 0x17fe3f980 | 0x3b80000 |
|
| 344 |
+
| 245 | blk.26.ffn_norm.weight | 0x1839bf980 | 0x4000 |
|
| 345 |
+
| 246 | blk.26.ffn_up.weight | 0x1839c3980 | 0x3b80000 |
|
| 346 |
+
| 247 | blk.27.attn_k.weight | 0x187543980 | 0x440000 |
|
| 347 |
+
| 248 | blk.27.attn_norm.weight | 0x187983980 | 0x4000 |
|
| 348 |
+
| 249 | blk.27.attn_output.weight | 0x187987980 | 0x1100000 |
|
| 349 |
+
| 250 | blk.27.attn_q.weight | 0x188a87980 | 0x1100000 |
|
| 350 |
+
| 251 | blk.27.attn_v.weight | 0x189b87980 | 0x800000 |
|
| 351 |
+
| 252 | blk.27.ffn_down.weight | 0x18a387980 | 0x3b80000 |
|
| 352 |
+
| 253 | blk.27.ffn_gate.weight | 0x18df07980 | 0x3b80000 |
|
| 353 |
+
| 254 | blk.27.ffn_norm.weight | 0x191a87980 | 0x4000 |
|
| 354 |
+
| 255 | blk.27.ffn_up.weight | 0x191a8b980 | 0x3b80000 |
|
| 355 |
+
| 256 | blk.28.attn_k.weight | 0x19560b980 | 0x440000 |
|
| 356 |
+
| 257 | blk.28.attn_norm.weight | 0x195a4b980 | 0x4000 |
|
| 357 |
+
| 258 | blk.28.attn_output.weight | 0x195a4f980 | 0x1100000 |
|
| 358 |
+
| 259 | blk.28.attn_q.weight | 0x196b4f980 | 0x1100000 |
|
| 359 |
+
| 260 | blk.28.attn_v.weight | 0x197c4f980 | 0x800000 |
|
| 360 |
+
| 261 | blk.28.ffn_down.weight | 0x19844f980 | 0x3b80000 |
|
| 361 |
+
| 262 | blk.28.ffn_gate.weight | 0x19bfcf980 | 0x3b80000 |
|
| 362 |
+
| 263 | blk.28.ffn_norm.weight | 0x19fb4f980 | 0x4000 |
|
| 363 |
+
| 264 | blk.28.ffn_up.weight | 0x19fb53980 | 0x3b80000 |
|
| 364 |
+
| 265 | blk.29.attn_k.weight | 0x1a36d3980 | 0x440000 |
|
| 365 |
+
| 266 | blk.29.attn_norm.weight | 0x1a3b13980 | 0x4000 |
|
| 366 |
+
| 267 | blk.29.attn_output.weight | 0x1a3b17980 | 0x1100000 |
|
| 367 |
+
| 268 | blk.29.attn_q.weight | 0x1a4c17980 | 0x1100000 |
|
| 368 |
+
| 269 | blk.29.attn_v.weight | 0x1a5d17980 | 0x800000 |
|
| 369 |
+
| 270 | blk.29.ffn_down.weight | 0x1a6517980 | 0x3b80000 |
|
| 370 |
+
| 271 | blk.29.ffn_gate.weight | 0x1aa097980 | 0x3b80000 |
|
| 371 |
+
| 272 | blk.29.ffn_norm.weight | 0x1adc17980 | 0x4000 |
|
| 372 |
+
| 273 | blk.29.ffn_up.weight | 0x1adc1b980 | 0x3b80000 |
|
| 373 |
+
| 274 | blk.30.attn_k.weight | 0x1b179b980 | 0x440000 |
|
| 374 |
+
| 275 | blk.30.attn_norm.weight | 0x1b1bdb980 | 0x4000 |
|
| 375 |
+
| 276 | blk.30.attn_output.weight | 0x1b1bdf980 | 0x1100000 |
|
| 376 |
+
| 277 | blk.30.attn_q.weight | 0x1b2cdf980 | 0x1100000 |
|
| 377 |
+
| 278 | blk.30.attn_v.weight | 0x1b3ddf980 | 0x800000 |
|
| 378 |
+
| 279 | blk.30.ffn_down.weight | 0x1b45df980 | 0x3b80000 |
|
| 379 |
+
| 280 | blk.30.ffn_gate.weight | 0x1b815f980 | 0x3b80000 |
|
| 380 |
+
| 281 | blk.30.ffn_norm.weight | 0x1bbcdf980 | 0x4000 |
|
| 381 |
+
| 282 | blk.30.ffn_up.weight | 0x1bbce3980 | 0x3b80000 |
|
| 382 |
+
| 283 | blk.31.attn_k.weight | 0x1bf863980 | 0x348000 |
|
| 383 |
+
| 284 | blk.31.attn_norm.weight | 0x1bfbab980 | 0x4000 |
|
| 384 |
+
| 285 | blk.31.attn_output.weight | 0x1bfbaf980 | 0x1100000 |
|
| 385 |
+
| 286 | blk.31.attn_q.weight | 0x1c0caf980 | 0xd20000 |
|
| 386 |
+
| 287 | blk.31.attn_v.weight | 0x1c19cf980 | 0x440000 |
|
| 387 |
+
| 288 | blk.31.ffn_down.weight | 0x1c1e0f980 | 0x3b80000 |
|
| 388 |
+
| 289 | blk.31.ffn_gate.weight | 0x1c598f980 | 0x3b80000 |
|
| 389 |
+
| 290 | blk.31.ffn_norm.weight | 0x1c950f980 | 0x4000 |
|
| 390 |
+
| 291 | blk.31.ffn_up.weight | 0x1c9513980 | 0x3b80000 |
|
| 391 |
+
|
| 392 |
+
### <a name="base">Base Tensor Group : ~1B Elements</a>
|
| 393 |
+
|
| 394 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 395 |
+
|-----:|:-------------------|:---------------------------------|:------------------|:----------------------|:-----|
|
| 396 |
+
| 0 | output.weight | Output (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q8_0 |
|
| 397 |
+
| 1 | output_norm.weight | Output Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 398 |
+
| 2 | rope_freqs.weight | Rope_Freqs (W) | ( 64) 64 | 64 x 1 x 1 x 1 | F32 |
|
| 399 |
+
| 3 | token_embd.weight | Token Embedding (W) | (~525M) 525336576 | 4096 x 128256 x 1 x 1 | Q3_K |
|
| 400 |
+
|
| 401 |
+
- Total elements in base: ( ~1B) 1050677312
|
| 402 |
+
- Percentage of total elements: 13.08%
|
| 403 |
+
|
| 404 |
+
|
| 405 |
+
### <a name="blk_0">Block 0 Tensor Group : ~218M Elements</a>
|
| 406 |
+
|
| 407 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 408 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 409 |
+
| 4 | blk.0.attn_k.weight | Block 0 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 410 |
+
| 5 | blk.0.attn_norm.weight | Block 0 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 411 |
+
| 6 | blk.0.attn_output.weight | Block 0 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 412 |
+
| 7 | blk.0.attn_q.weight | Block 0 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 413 |
+
| 8 | blk.0.attn_v.weight | Block 0 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 414 |
+
| 9 | blk.0.ffn_down.weight | Block 0 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 415 |
+
| 10 | blk.0.ffn_gate.weight | Block 0 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 416 |
+
| 11 | blk.0.ffn_norm.weight | Block 0 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 417 |
+
| 12 | blk.0.ffn_up.weight | Block 0 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 418 |
+
|
| 419 |
+
- Total elements in blk.0: (~218M) 218112000
|
| 420 |
+
- Percentage of total elements: 2.72%
|
| 421 |
+
|
| 422 |
+
|
| 423 |
+
### <a name="blk_1">Block 1 Tensor Group : ~218M Elements</a>
|
| 424 |
+
|
| 425 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 426 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 427 |
+
| 13 | blk.1.attn_k.weight | Block 1 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 428 |
+
| 14 | blk.1.attn_norm.weight | Block 1 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 429 |
+
| 15 | blk.1.attn_output.weight | Block 1 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 430 |
+
| 16 | blk.1.attn_q.weight | Block 1 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 431 |
+
| 17 | blk.1.attn_v.weight | Block 1 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 432 |
+
| 18 | blk.1.ffn_down.weight | Block 1 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 433 |
+
| 19 | blk.1.ffn_gate.weight | Block 1 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 434 |
+
| 20 | blk.1.ffn_norm.weight | Block 1 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 435 |
+
| 21 | blk.1.ffn_up.weight | Block 1 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 436 |
+
|
| 437 |
+
- Total elements in blk.1: (~218M) 218112000
|
| 438 |
+
- Percentage of total elements: 2.72%
|
| 439 |
+
|
| 440 |
+
|
| 441 |
+
### <a name="blk_2">Block 2 Tensor Group : ~218M Elements</a>
|
| 442 |
+
|
| 443 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 444 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 445 |
+
| 22 | blk.2.attn_k.weight | Block 2 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 446 |
+
| 23 | blk.2.attn_norm.weight | Block 2 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 447 |
+
| 24 | blk.2.attn_output.weight | Block 2 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 448 |
+
| 25 | blk.2.attn_q.weight | Block 2 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 449 |
+
| 26 | blk.2.attn_v.weight | Block 2 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 450 |
+
| 27 | blk.2.ffn_down.weight | Block 2 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 451 |
+
| 28 | blk.2.ffn_gate.weight | Block 2 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 452 |
+
| 29 | blk.2.ffn_norm.weight | Block 2 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 453 |
+
| 30 | blk.2.ffn_up.weight | Block 2 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 454 |
+
|
| 455 |
+
- Total elements in blk.2: (~218M) 218112000
|
| 456 |
+
- Percentage of total elements: 2.72%
|
| 457 |
+
|
| 458 |
+
|
| 459 |
+
### <a name="blk_3">Block 3 Tensor Group : ~218M Elements</a>
|
| 460 |
+
|
| 461 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 462 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 463 |
+
| 31 | blk.3.attn_k.weight | Block 3 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 464 |
+
| 32 | blk.3.attn_norm.weight | Block 3 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 465 |
+
| 33 | blk.3.attn_output.weight | Block 3 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 466 |
+
| 34 | blk.3.attn_q.weight | Block 3 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 467 |
+
| 35 | blk.3.attn_v.weight | Block 3 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 468 |
+
| 36 | blk.3.ffn_down.weight | Block 3 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 469 |
+
| 37 | blk.3.ffn_gate.weight | Block 3 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 470 |
+
| 38 | blk.3.ffn_norm.weight | Block 3 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 471 |
+
| 39 | blk.3.ffn_up.weight | Block 3 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 472 |
+
|
| 473 |
+
- Total elements in blk.3: (~218M) 218112000
|
| 474 |
+
- Percentage of total elements: 2.72%
|
| 475 |
+
|
| 476 |
+
|
| 477 |
+
### <a name="blk_4">Block 4 Tensor Group : ~218M Elements</a>
|
| 478 |
+
|
| 479 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 480 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 481 |
+
| 40 | blk.4.attn_k.weight | Block 4 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 482 |
+
| 41 | blk.4.attn_norm.weight | Block 4 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 483 |
+
| 42 | blk.4.attn_output.weight | Block 4 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 484 |
+
| 43 | blk.4.attn_q.weight | Block 4 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 485 |
+
| 44 | blk.4.attn_v.weight | Block 4 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 486 |
+
| 45 | blk.4.ffn_down.weight | Block 4 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 487 |
+
| 46 | blk.4.ffn_gate.weight | Block 4 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 488 |
+
| 47 | blk.4.ffn_norm.weight | Block 4 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 489 |
+
| 48 | blk.4.ffn_up.weight | Block 4 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 490 |
+
|
| 491 |
+
- Total elements in blk.4: (~218M) 218112000
|
| 492 |
+
- Percentage of total elements: 2.72%
|
| 493 |
+
|
| 494 |
+
|
| 495 |
+
### <a name="blk_5">Block 5 Tensor Group : ~218M Elements</a>
|
| 496 |
+
|
| 497 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 498 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 499 |
+
| 49 | blk.5.attn_k.weight | Block 5 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 500 |
+
| 50 | blk.5.attn_norm.weight | Block 5 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 501 |
+
| 51 | blk.5.attn_output.weight | Block 5 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 502 |
+
| 52 | blk.5.attn_q.weight | Block 5 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 503 |
+
| 53 | blk.5.attn_v.weight | Block 5 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 504 |
+
| 54 | blk.5.ffn_down.weight | Block 5 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 505 |
+
| 55 | blk.5.ffn_gate.weight | Block 5 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 506 |
+
| 56 | blk.5.ffn_norm.weight | Block 5 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 507 |
+
| 57 | blk.5.ffn_up.weight | Block 5 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 508 |
+
|
| 509 |
+
- Total elements in blk.5: (~218M) 218112000
|
| 510 |
+
- Percentage of total elements: 2.72%
|
| 511 |
+
|
| 512 |
+
|
| 513 |
+
### <a name="blk_6">Block 6 Tensor Group : ~218M Elements</a>
|
| 514 |
+
|
| 515 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 516 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 517 |
+
| 58 | blk.6.attn_k.weight | Block 6 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 518 |
+
| 59 | blk.6.attn_norm.weight | Block 6 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 519 |
+
| 60 | blk.6.attn_output.weight | Block 6 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 520 |
+
| 61 | blk.6.attn_q.weight | Block 6 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 521 |
+
| 62 | blk.6.attn_v.weight | Block 6 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 522 |
+
| 63 | blk.6.ffn_down.weight | Block 6 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 523 |
+
| 64 | blk.6.ffn_gate.weight | Block 6 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 524 |
+
| 65 | blk.6.ffn_norm.weight | Block 6 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 525 |
+
| 66 | blk.6.ffn_up.weight | Block 6 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 526 |
+
|
| 527 |
+
- Total elements in blk.6: (~218M) 218112000
|
| 528 |
+
- Percentage of total elements: 2.72%
|
| 529 |
+
|
| 530 |
+
|
| 531 |
+
### <a name="blk_7">Block 7 Tensor Group : ~218M Elements</a>
|
| 532 |
+
|
| 533 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 534 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 535 |
+
| 67 | blk.7.attn_k.weight | Block 7 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 536 |
+
| 68 | blk.7.attn_norm.weight | Block 7 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 537 |
+
| 69 | blk.7.attn_output.weight | Block 7 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 538 |
+
| 70 | blk.7.attn_q.weight | Block 7 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 539 |
+
| 71 | blk.7.attn_v.weight | Block 7 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 540 |
+
| 72 | blk.7.ffn_down.weight | Block 7 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 541 |
+
| 73 | blk.7.ffn_gate.weight | Block 7 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 542 |
+
| 74 | blk.7.ffn_norm.weight | Block 7 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 543 |
+
| 75 | blk.7.ffn_up.weight | Block 7 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 544 |
+
|
| 545 |
+
- Total elements in blk.7: (~218M) 218112000
|
| 546 |
+
- Percentage of total elements: 2.72%
|
| 547 |
+
|
| 548 |
+
|
| 549 |
+
### <a name="blk_8">Block 8 Tensor Group : ~218M Elements</a>
|
| 550 |
+
|
| 551 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 552 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 553 |
+
| 76 | blk.8.attn_k.weight | Block 8 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 554 |
+
| 77 | blk.8.attn_norm.weight | Block 8 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 555 |
+
| 78 | blk.8.attn_output.weight | Block 8 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 556 |
+
| 79 | blk.8.attn_q.weight | Block 8 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 557 |
+
| 80 | blk.8.attn_v.weight | Block 8 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 558 |
+
| 81 | blk.8.ffn_down.weight | Block 8 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 559 |
+
| 82 | blk.8.ffn_gate.weight | Block 8 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 560 |
+
| 83 | blk.8.ffn_norm.weight | Block 8 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 561 |
+
| 84 | blk.8.ffn_up.weight | Block 8 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 562 |
+
|
| 563 |
+
- Total elements in blk.8: (~218M) 218112000
|
| 564 |
+
- Percentage of total elements: 2.72%
|
| 565 |
+
|
| 566 |
+
|
| 567 |
+
### <a name="blk_9">Block 9 Tensor Group : ~218M Elements</a>
|
| 568 |
+
|
| 569 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 570 |
+
|-----:|:-------------------------|:-----------------------------------------------|:----------------|:----------------------|:-----|
|
| 571 |
+
| 85 | blk.9.attn_k.weight | Block 9 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 572 |
+
| 86 | blk.9.attn_norm.weight | Block 9 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 573 |
+
| 87 | blk.9.attn_output.weight | Block 9 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 574 |
+
| 88 | blk.9.attn_q.weight | Block 9 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 575 |
+
| 89 | blk.9.attn_v.weight | Block 9 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 576 |
+
| 90 | blk.9.ffn_down.weight | Block 9 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 577 |
+
| 91 | blk.9.ffn_gate.weight | Block 9 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 578 |
+
| 92 | blk.9.ffn_norm.weight | Block 9 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 579 |
+
| 93 | blk.9.ffn_up.weight | Block 9 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 580 |
+
|
| 581 |
+
- Total elements in blk.9: (~218M) 218112000
|
| 582 |
+
- Percentage of total elements: 2.72%
|
| 583 |
+
|
| 584 |
+
|
| 585 |
+
### <a name="blk_10">Block 10 Tensor Group : ~218M Elements</a>
|
| 586 |
+
|
| 587 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 588 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 589 |
+
| 94 | blk.10.attn_k.weight | Block 10 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 590 |
+
| 95 | blk.10.attn_norm.weight | Block 10 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 591 |
+
| 96 | blk.10.attn_output.weight | Block 10 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 592 |
+
| 97 | blk.10.attn_q.weight | Block 10 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 593 |
+
| 98 | blk.10.attn_v.weight | Block 10 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 594 |
+
| 99 | blk.10.ffn_down.weight | Block 10 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 595 |
+
| 100 | blk.10.ffn_gate.weight | Block 10 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 596 |
+
| 101 | blk.10.ffn_norm.weight | Block 10 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 597 |
+
| 102 | blk.10.ffn_up.weight | Block 10 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 598 |
+
|
| 599 |
+
- Total elements in blk.10: (~218M) 218112000
|
| 600 |
+
- Percentage of total elements: 2.72%
|
| 601 |
+
|
| 602 |
+
|
| 603 |
+
### <a name="blk_11">Block 11 Tensor Group : ~218M Elements</a>
|
| 604 |
+
|
| 605 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 606 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 607 |
+
| 103 | blk.11.attn_k.weight | Block 11 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 608 |
+
| 104 | blk.11.attn_norm.weight | Block 11 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 609 |
+
| 105 | blk.11.attn_output.weight | Block 11 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 610 |
+
| 106 | blk.11.attn_q.weight | Block 11 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 611 |
+
| 107 | blk.11.attn_v.weight | Block 11 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 612 |
+
| 108 | blk.11.ffn_down.weight | Block 11 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 613 |
+
| 109 | blk.11.ffn_gate.weight | Block 11 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 614 |
+
| 110 | blk.11.ffn_norm.weight | Block 11 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 615 |
+
| 111 | blk.11.ffn_up.weight | Block 11 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 616 |
+
|
| 617 |
+
- Total elements in blk.11: (~218M) 218112000
|
| 618 |
+
- Percentage of total elements: 2.72%
|
| 619 |
+
|
| 620 |
+
|
| 621 |
+
### <a name="blk_12">Block 12 Tensor Group : ~218M Elements</a>
|
| 622 |
+
|
| 623 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 624 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 625 |
+
| 112 | blk.12.attn_k.weight | Block 12 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 626 |
+
| 113 | blk.12.attn_norm.weight | Block 12 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 627 |
+
| 114 | blk.12.attn_output.weight | Block 12 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 628 |
+
| 115 | blk.12.attn_q.weight | Block 12 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 629 |
+
| 116 | blk.12.attn_v.weight | Block 12 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 630 |
+
| 117 | blk.12.ffn_down.weight | Block 12 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 631 |
+
| 118 | blk.12.ffn_gate.weight | Block 12 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 632 |
+
| 119 | blk.12.ffn_norm.weight | Block 12 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 633 |
+
| 120 | blk.12.ffn_up.weight | Block 12 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 634 |
+
|
| 635 |
+
- Total elements in blk.12: (~218M) 218112000
|
| 636 |
+
- Percentage of total elements: 2.72%
|
| 637 |
+
|
| 638 |
+
|
| 639 |
+
### <a name="blk_13">Block 13 Tensor Group : ~218M Elements</a>
|
| 640 |
+
|
| 641 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 642 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 643 |
+
| 121 | blk.13.attn_k.weight | Block 13 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 644 |
+
| 122 | blk.13.attn_norm.weight | Block 13 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 645 |
+
| 123 | blk.13.attn_output.weight | Block 13 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 646 |
+
| 124 | blk.13.attn_q.weight | Block 13 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 647 |
+
| 125 | blk.13.attn_v.weight | Block 13 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 648 |
+
| 126 | blk.13.ffn_down.weight | Block 13 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 649 |
+
| 127 | blk.13.ffn_gate.weight | Block 13 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 650 |
+
| 128 | blk.13.ffn_norm.weight | Block 13 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 651 |
+
| 129 | blk.13.ffn_up.weight | Block 13 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 652 |
+
|
| 653 |
+
- Total elements in blk.13: (~218M) 218112000
|
| 654 |
+
- Percentage of total elements: 2.72%
|
| 655 |
+
|
| 656 |
+
|
| 657 |
+
### <a name="blk_14">Block 14 Tensor Group : ~218M Elements</a>
|
| 658 |
+
|
| 659 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 660 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 661 |
+
| 130 | blk.14.attn_k.weight | Block 14 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 662 |
+
| 131 | blk.14.attn_norm.weight | Block 14 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 663 |
+
| 132 | blk.14.attn_output.weight | Block 14 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 664 |
+
| 133 | blk.14.attn_q.weight | Block 14 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 665 |
+
| 134 | blk.14.attn_v.weight | Block 14 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 666 |
+
| 135 | blk.14.ffn_down.weight | Block 14 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 667 |
+
| 136 | blk.14.ffn_gate.weight | Block 14 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 668 |
+
| 137 | blk.14.ffn_norm.weight | Block 14 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 669 |
+
| 138 | blk.14.ffn_up.weight | Block 14 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 670 |
+
|
| 671 |
+
- Total elements in blk.14: (~218M) 218112000
|
| 672 |
+
- Percentage of total elements: 2.72%
|
| 673 |
+
|
| 674 |
+
|
| 675 |
+
### <a name="blk_15">Block 15 Tensor Group : ~218M Elements</a>
|
| 676 |
+
|
| 677 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 678 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 679 |
+
| 139 | blk.15.attn_k.weight | Block 15 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 680 |
+
| 140 | blk.15.attn_norm.weight | Block 15 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 681 |
+
| 141 | blk.15.attn_output.weight | Block 15 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 682 |
+
| 142 | blk.15.attn_q.weight | Block 15 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 683 |
+
| 143 | blk.15.attn_v.weight | Block 15 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 684 |
+
| 144 | blk.15.ffn_down.weight | Block 15 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 685 |
+
| 145 | blk.15.ffn_gate.weight | Block 15 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 686 |
+
| 146 | blk.15.ffn_norm.weight | Block 15 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 687 |
+
| 147 | blk.15.ffn_up.weight | Block 15 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q6_K |
|
| 688 |
+
|
| 689 |
+
- Total elements in blk.15: (~218M) 218112000
|
| 690 |
+
- Percentage of total elements: 2.72%
|
| 691 |
+
|
| 692 |
+
|
| 693 |
+
### <a name="blk_16">Block 16 Tensor Group : ~218M Elements</a>
|
| 694 |
+
|
| 695 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 696 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 697 |
+
| 148 | blk.16.attn_k.weight | Block 16 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 698 |
+
| 149 | blk.16.attn_norm.weight | Block 16 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 699 |
+
| 150 | blk.16.attn_output.weight | Block 16 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 700 |
+
| 151 | blk.16.attn_q.weight | Block 16 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 701 |
+
| 152 | blk.16.attn_v.weight | Block 16 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 702 |
+
| 153 | blk.16.ffn_down.weight | Block 16 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 703 |
+
| 154 | blk.16.ffn_gate.weight | Block 16 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 704 |
+
| 155 | blk.16.ffn_norm.weight | Block 16 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 705 |
+
| 156 | blk.16.ffn_up.weight | Block 16 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 706 |
+
|
| 707 |
+
- Total elements in blk.16: (~218M) 218112000
|
| 708 |
+
- Percentage of total elements: 2.72%
|
| 709 |
+
|
| 710 |
+
|
| 711 |
+
### <a name="blk_17">Block 17 Tensor Group : ~218M Elements</a>
|
| 712 |
+
|
| 713 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 714 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 715 |
+
| 157 | blk.17.attn_k.weight | Block 17 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 716 |
+
| 158 | blk.17.attn_norm.weight | Block 17 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 717 |
+
| 159 | blk.17.attn_output.weight | Block 17 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 718 |
+
| 160 | blk.17.attn_q.weight | Block 17 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 719 |
+
| 161 | blk.17.attn_v.weight | Block 17 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 720 |
+
| 162 | blk.17.ffn_down.weight | Block 17 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 721 |
+
| 163 | blk.17.ffn_gate.weight | Block 17 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 722 |
+
| 164 | blk.17.ffn_norm.weight | Block 17 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 723 |
+
| 165 | blk.17.ffn_up.weight | Block 17 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 724 |
+
|
| 725 |
+
- Total elements in blk.17: (~218M) 218112000
|
| 726 |
+
- Percentage of total elements: 2.72%
|
| 727 |
+
|
| 728 |
+
|
| 729 |
+
### <a name="blk_18">Block 18 Tensor Group : ~218M Elements</a>
|
| 730 |
+
|
| 731 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 732 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 733 |
+
| 166 | blk.18.attn_k.weight | Block 18 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 734 |
+
| 167 | blk.18.attn_norm.weight | Block 18 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 735 |
+
| 168 | blk.18.attn_output.weight | Block 18 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 736 |
+
| 169 | blk.18.attn_q.weight | Block 18 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 737 |
+
| 170 | blk.18.attn_v.weight | Block 18 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 738 |
+
| 171 | blk.18.ffn_down.weight | Block 18 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 739 |
+
| 172 | blk.18.ffn_gate.weight | Block 18 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 740 |
+
| 173 | blk.18.ffn_norm.weight | Block 18 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 741 |
+
| 174 | blk.18.ffn_up.weight | Block 18 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 742 |
+
|
| 743 |
+
- Total elements in blk.18: (~218M) 218112000
|
| 744 |
+
- Percentage of total elements: 2.72%
|
| 745 |
+
|
| 746 |
+
|
| 747 |
+
### <a name="blk_19">Block 19 Tensor Group : ~218M Elements</a>
|
| 748 |
+
|
| 749 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 750 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 751 |
+
| 175 | blk.19.attn_k.weight | Block 19 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 752 |
+
| 176 | blk.19.attn_norm.weight | Block 19 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 753 |
+
| 177 | blk.19.attn_output.weight | Block 19 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 754 |
+
| 178 | blk.19.attn_q.weight | Block 19 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 755 |
+
| 179 | blk.19.attn_v.weight | Block 19 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 756 |
+
| 180 | blk.19.ffn_down.weight | Block 19 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 757 |
+
| 181 | blk.19.ffn_gate.weight | Block 19 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 758 |
+
| 182 | blk.19.ffn_norm.weight | Block 19 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 759 |
+
| 183 | blk.19.ffn_up.weight | Block 19 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 760 |
+
|
| 761 |
+
- Total elements in blk.19: (~218M) 218112000
|
| 762 |
+
- Percentage of total elements: 2.72%
|
| 763 |
+
|
| 764 |
+
|
| 765 |
+
### <a name="blk_20">Block 20 Tensor Group : ~218M Elements</a>
|
| 766 |
+
|
| 767 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 768 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 769 |
+
| 184 | blk.20.attn_k.weight | Block 20 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 770 |
+
| 185 | blk.20.attn_norm.weight | Block 20 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 771 |
+
| 186 | blk.20.attn_output.weight | Block 20 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 772 |
+
| 187 | blk.20.attn_q.weight | Block 20 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 773 |
+
| 188 | blk.20.attn_v.weight | Block 20 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 774 |
+
| 189 | blk.20.ffn_down.weight | Block 20 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 775 |
+
| 190 | blk.20.ffn_gate.weight | Block 20 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 776 |
+
| 191 | blk.20.ffn_norm.weight | Block 20 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 777 |
+
| 192 | blk.20.ffn_up.weight | Block 20 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 778 |
+
|
| 779 |
+
- Total elements in blk.20: (~218M) 218112000
|
| 780 |
+
- Percentage of total elements: 2.72%
|
| 781 |
+
|
| 782 |
+
|
| 783 |
+
### <a name="blk_21">Block 21 Tensor Group : ~218M Elements</a>
|
| 784 |
+
|
| 785 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 786 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 787 |
+
| 193 | blk.21.attn_k.weight | Block 21 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 788 |
+
| 194 | blk.21.attn_norm.weight | Block 21 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 789 |
+
| 195 | blk.21.attn_output.weight | Block 21 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 790 |
+
| 196 | blk.21.attn_q.weight | Block 21 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 791 |
+
| 197 | blk.21.attn_v.weight | Block 21 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 792 |
+
| 198 | blk.21.ffn_down.weight | Block 21 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 793 |
+
| 199 | blk.21.ffn_gate.weight | Block 21 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 794 |
+
| 200 | blk.21.ffn_norm.weight | Block 21 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 795 |
+
| 201 | blk.21.ffn_up.weight | Block 21 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 796 |
+
|
| 797 |
+
- Total elements in blk.21: (~218M) 218112000
|
| 798 |
+
- Percentage of total elements: 2.72%
|
| 799 |
+
|
| 800 |
+
|
| 801 |
+
### <a name="blk_22">Block 22 Tensor Group : ~218M Elements</a>
|
| 802 |
+
|
| 803 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 804 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 805 |
+
| 202 | blk.22.attn_k.weight | Block 22 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 806 |
+
| 203 | blk.22.attn_norm.weight | Block 22 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 807 |
+
| 204 | blk.22.attn_output.weight | Block 22 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 808 |
+
| 205 | blk.22.attn_q.weight | Block 22 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 809 |
+
| 206 | blk.22.attn_v.weight | Block 22 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 810 |
+
| 207 | blk.22.ffn_down.weight | Block 22 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 811 |
+
| 208 | blk.22.ffn_gate.weight | Block 22 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 812 |
+
| 209 | blk.22.ffn_norm.weight | Block 22 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 813 |
+
| 210 | blk.22.ffn_up.weight | Block 22 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 814 |
+
|
| 815 |
+
- Total elements in blk.22: (~218M) 218112000
|
| 816 |
+
- Percentage of total elements: 2.72%
|
| 817 |
+
|
| 818 |
+
|
| 819 |
+
### <a name="blk_23">Block 23 Tensor Group : ~218M Elements</a>
|
| 820 |
+
|
| 821 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 822 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 823 |
+
| 211 | blk.23.attn_k.weight | Block 23 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 824 |
+
| 212 | blk.23.attn_norm.weight | Block 23 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 825 |
+
| 213 | blk.23.attn_output.weight | Block 23 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 826 |
+
| 214 | blk.23.attn_q.weight | Block 23 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 827 |
+
| 215 | blk.23.attn_v.weight | Block 23 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 828 |
+
| 216 | blk.23.ffn_down.weight | Block 23 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 829 |
+
| 217 | blk.23.ffn_gate.weight | Block 23 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 830 |
+
| 218 | blk.23.ffn_norm.weight | Block 23 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 831 |
+
| 219 | blk.23.ffn_up.weight | Block 23 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 832 |
+
|
| 833 |
+
- Total elements in blk.23: (~218M) 218112000
|
| 834 |
+
- Percentage of total elements: 2.72%
|
| 835 |
+
|
| 836 |
+
|
| 837 |
+
### <a name="blk_24">Block 24 Tensor Group : ~218M Elements</a>
|
| 838 |
+
|
| 839 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 840 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 841 |
+
| 220 | blk.24.attn_k.weight | Block 24 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 842 |
+
| 221 | blk.24.attn_norm.weight | Block 24 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 843 |
+
| 222 | blk.24.attn_output.weight | Block 24 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 844 |
+
| 223 | blk.24.attn_q.weight | Block 24 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 845 |
+
| 224 | blk.24.attn_v.weight | Block 24 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 846 |
+
| 225 | blk.24.ffn_down.weight | Block 24 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 847 |
+
| 226 | blk.24.ffn_gate.weight | Block 24 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 848 |
+
| 227 | blk.24.ffn_norm.weight | Block 24 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 849 |
+
| 228 | blk.24.ffn_up.weight | Block 24 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 850 |
+
|
| 851 |
+
- Total elements in blk.24: (~218M) 218112000
|
| 852 |
+
- Percentage of total elements: 2.72%
|
| 853 |
+
|
| 854 |
+
|
| 855 |
+
### <a name="blk_25">Block 25 Tensor Group : ~218M Elements</a>
|
| 856 |
+
|
| 857 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 858 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 859 |
+
| 229 | blk.25.attn_k.weight | Block 25 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 860 |
+
| 230 | blk.25.attn_norm.weight | Block 25 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 861 |
+
| 231 | blk.25.attn_output.weight | Block 25 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 862 |
+
| 232 | blk.25.attn_q.weight | Block 25 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 863 |
+
| 233 | blk.25.attn_v.weight | Block 25 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 864 |
+
| 234 | blk.25.ffn_down.weight | Block 25 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 865 |
+
| 235 | blk.25.ffn_gate.weight | Block 25 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 866 |
+
| 236 | blk.25.ffn_norm.weight | Block 25 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 867 |
+
| 237 | blk.25.ffn_up.weight | Block 25 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 868 |
+
|
| 869 |
+
- Total elements in blk.25: (~218M) 218112000
|
| 870 |
+
- Percentage of total elements: 2.72%
|
| 871 |
+
|
| 872 |
+
|
| 873 |
+
### <a name="blk_26">Block 26 Tensor Group : ~218M Elements</a>
|
| 874 |
+
|
| 875 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 876 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 877 |
+
| 238 | blk.26.attn_k.weight | Block 26 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 878 |
+
| 239 | blk.26.attn_norm.weight | Block 26 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 879 |
+
| 240 | blk.26.attn_output.weight | Block 26 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 880 |
+
| 241 | blk.26.attn_q.weight | Block 26 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 881 |
+
| 242 | blk.26.attn_v.weight | Block 26 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 882 |
+
| 243 | blk.26.ffn_down.weight | Block 26 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 883 |
+
| 244 | blk.26.ffn_gate.weight | Block 26 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 884 |
+
| 245 | blk.26.ffn_norm.weight | Block 26 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 885 |
+
| 246 | blk.26.ffn_up.weight | Block 26 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 886 |
+
|
| 887 |
+
- Total elements in blk.26: (~218M) 218112000
|
| 888 |
+
- Percentage of total elements: 2.72%
|
| 889 |
+
|
| 890 |
+
|
| 891 |
+
### <a name="blk_27">Block 27 Tensor Group : ~218M Elements</a>
|
| 892 |
+
|
| 893 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 894 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 895 |
+
| 247 | blk.27.attn_k.weight | Block 27 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 896 |
+
| 248 | blk.27.attn_norm.weight | Block 27 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 897 |
+
| 249 | blk.27.attn_output.weight | Block 27 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 898 |
+
| 250 | blk.27.attn_q.weight | Block 27 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 899 |
+
| 251 | blk.27.attn_v.weight | Block 27 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 900 |
+
| 252 | blk.27.ffn_down.weight | Block 27 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 901 |
+
| 253 | blk.27.ffn_gate.weight | Block 27 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 902 |
+
| 254 | blk.27.ffn_norm.weight | Block 27 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 903 |
+
| 255 | blk.27.ffn_up.weight | Block 27 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 904 |
+
|
| 905 |
+
- Total elements in blk.27: (~218M) 218112000
|
| 906 |
+
- Percentage of total elements: 2.72%
|
| 907 |
+
|
| 908 |
+
|
| 909 |
+
### <a name="blk_28">Block 28 Tensor Group : ~218M Elements</a>
|
| 910 |
+
|
| 911 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 912 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 913 |
+
| 256 | blk.28.attn_k.weight | Block 28 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 914 |
+
| 257 | blk.28.attn_norm.weight | Block 28 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 915 |
+
| 258 | blk.28.attn_output.weight | Block 28 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 916 |
+
| 259 | blk.28.attn_q.weight | Block 28 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 917 |
+
| 260 | blk.28.attn_v.weight | Block 28 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 918 |
+
| 261 | blk.28.ffn_down.weight | Block 28 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 919 |
+
| 262 | blk.28.ffn_gate.weight | Block 28 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 920 |
+
| 263 | blk.28.ffn_norm.weight | Block 28 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 921 |
+
| 264 | blk.28.ffn_up.weight | Block 28 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 922 |
+
|
| 923 |
+
- Total elements in blk.28: (~218M) 218112000
|
| 924 |
+
- Percentage of total elements: 2.72%
|
| 925 |
+
|
| 926 |
+
|
| 927 |
+
### <a name="blk_29">Block 29 Tensor Group : ~218M Elements</a>
|
| 928 |
+
|
| 929 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 930 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 931 |
+
| 265 | blk.29.attn_k.weight | Block 29 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 932 |
+
| 266 | blk.29.attn_norm.weight | Block 29 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 933 |
+
| 267 | blk.29.attn_output.weight | Block 29 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 934 |
+
| 268 | blk.29.attn_q.weight | Block 29 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 935 |
+
| 269 | blk.29.attn_v.weight | Block 29 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 936 |
+
| 270 | blk.29.ffn_down.weight | Block 29 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 937 |
+
| 271 | blk.29.ffn_gate.weight | Block 29 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 938 |
+
| 272 | blk.29.ffn_norm.weight | Block 29 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 939 |
+
| 273 | blk.29.ffn_up.weight | Block 29 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 940 |
+
|
| 941 |
+
- Total elements in blk.29: (~218M) 218112000
|
| 942 |
+
- Percentage of total elements: 2.72%
|
| 943 |
+
|
| 944 |
+
|
| 945 |
+
### <a name="blk_30">Block 30 Tensor Group : ~218M Elements</a>
|
| 946 |
+
|
| 947 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 948 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 949 |
+
| 274 | blk.30.attn_k.weight | Block 30 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 950 |
+
| 275 | blk.30.attn_norm.weight | Block 30 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 951 |
+
| 276 | blk.30.attn_output.weight | Block 30 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 952 |
+
| 277 | blk.30.attn_q.weight | Block 30 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 953 |
+
| 278 | blk.30.attn_v.weight | Block 30 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | F16 |
|
| 954 |
+
| 279 | blk.30.ffn_down.weight | Block 30 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 955 |
+
| 280 | blk.30.ffn_gate.weight | Block 30 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 956 |
+
| 281 | blk.30.ffn_norm.weight | Block 30 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 957 |
+
| 282 | blk.30.ffn_up.weight | Block 30 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 958 |
+
|
| 959 |
+
- Total elements in blk.30: (~218M) 218112000
|
| 960 |
+
- Percentage of total elements: 2.72%
|
| 961 |
+
|
| 962 |
+
|
| 963 |
+
### <a name="blk_31">Block 31 Tensor Group : ~218M Elements</a>
|
| 964 |
+
|
| 965 |
+
| T_ID | Tensor Layer Name | Human Friendly Tensor Layer Name | Elements | Shape | Type |
|
| 966 |
+
|-----:|:--------------------------|:------------------------------------------------|:----------------|:----------------------|:-----|
|
| 967 |
+
| 283 | blk.31.attn_k.weight | Block 31 Attention Key (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q6_K |
|
| 968 |
+
| 284 | blk.31.attn_norm.weight | Block 31 Attention Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 969 |
+
| 285 | blk.31.attn_output.weight | Block 31 Attention Output (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q8_0 |
|
| 970 |
+
| 286 | blk.31.attn_q.weight | Block 31 Attention Query (W) | (~17M) 16777216 | 4096 x 4096 x 1 x 1 | Q6_K |
|
| 971 |
+
| 287 | blk.31.attn_v.weight | Block 31 Attention Value (W) | ( ~4M) 4194304 | 4096 x 1024 x 1 x 1 | Q8_0 |
|
| 972 |
+
| 288 | blk.31.ffn_down.weight | Block 31 Feed-Forward Network "Down" (W) | (~59M) 58720256 | 14336 x 4096 x 1 x 1 | Q8_0 |
|
| 973 |
+
| 289 | blk.31.ffn_gate.weight | Block 31 Feed-Forward Network "Gate" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 974 |
+
| 290 | blk.31.ffn_norm.weight | Block 31 Feed-Forward Network Normalization (W) | ( ~4K) 4096 | 4096 x 1 x 1 x 1 | F32 |
|
| 975 |
+
| 291 | blk.31.ffn_up.weight | Block 31 Feed-Forward Network "Up" (W) | (~59M) 58720256 | 4096 x 14336 x 1 x 1 | Q8_0 |
|
| 976 |
+
|
| 977 |
+
- Total elements in blk.31: (~218M) 218112000
|
| 978 |
+
- Percentage of total elements: 2.72%
|