/**
 * Model Inspector Component
 *
 * Displays detailed architecture information about the loaded model,
 * including layers, parameters, attention heads, and accessible components.
 * Makes the "black box" transparent by showing what can be visualized.
 *
 * @component
 */
"use client";

import { useState, useEffect, lazy, Suspense } from "react";
import { getApiUrl } from "@/lib/config";
import {
  Brain,
  Layers,
  Eye,
  Cpu,
  Database,
  GitBranch,
  Zap,
  ChevronRight,
  ChevronDown,
  Activity,
  Info,
  Box,
  Network,
  Move3D,
} from "lucide-react";

// Lazy-load the 3D components to avoid SSR issues
const ModelArchitecture3D = lazy(() => import("./ModelArchitecture3D"));
const DecisionPath3D = lazy(() => import("./DecisionPath3DEnhanced"));

interface ModelInfo {
  name: string;
  type: string;
  totalParams: number;
  layers: number;
  heads: number;
  hiddenSize: number;
  vocabSize: number;
  maxPositions: number;
  architecture: string[];
  accessible: string[];
}

export default function ModelInspector({
  hideControlPanel = false,
}: {
  hideControlPanel?: boolean;
}) {
  const [expandedSections, setExpandedSections] = useState<Set<string>>(
    new Set(["overview"])
  );
  const [isConnected, setIsConnected] = useState(false);
  const [isLoading, setIsLoading] = useState(true);
  const [modelInfo, setModelInfo] = useState<ModelInfo>({
    name: "Loading...",
    type: "unknown",
    totalParams: 0,
    layers: 0,
    heads: 0,
    hiddenSize: 0,
    vocabSize: 0,
    maxPositions: 0,
    architecture: [],
    accessible: [],
  });
  const [deviceInfo, setDeviceInfo] = useState("");
  const [modelConfig, setModelConfig] = useState<Record<string, unknown> | null>(
    null
  );

  // Fetch model information from the backend
  useEffect(() => {
    const fetchModelInfo = async () => {
      try {
        const response = await fetch(`${getApiUrl()}/model/info`);
        if (response.ok) {
          const data = await response.json();
          setModelInfo({
            name: data.name,
            type: data.type,
            totalParams: data.totalParams,
            layers: data.layers,
            heads: data.heads,
            hiddenSize: data.hiddenSize,
            vocabSize: data.vocabSize,
            maxPositions: data.maxPositions,
            architecture: [data.architecture],
            accessible: data.accessible,
          });
          setDeviceInfo(data.device);
          setModelConfig(data.config);
          setIsConnected(true);
        }
      } catch (error) {
        console.error("Failed to fetch model info:", error);
        // Keep the default placeholder data if the fetch fails
        setIsConnected(false);
      } finally {
        setIsLoading(false);
      }
    };

    fetchModelInfo();
    // Refresh model info every 10 seconds
    const interval = setInterval(fetchModelInfo, 10000);
    return () => clearInterval(interval);
  }, []);

  const toggleSection = (section: string) => {
    const newExpanded = new Set(expandedSections);
    if (newExpanded.has(section)) {
      newExpanded.delete(section);
    } else {
      newExpanded.add(section);
    }
    setExpandedSections(newExpanded);
  };

  // Abbreviate large counts: 1234 -> "1.2K", 1.5e6 -> "1.5M", 2e9 -> "2.0B"
  const formatNumber = (num: number) => {
    if (num >= 1e9) return `${(num / 1e9).toFixed(1)}B`;
    if (num >= 1e6) return `${(num / 1e6).toFixed(1)}M`;
    if (num >= 1e3) return `${(num / 1e3).toFixed(1)}K`;
    return num.toString();
  };

  return (
    <div>
      {/* Header */}
      <header>
        <h1>
          <Brain /> Model Inspector
        </h1>
        <p>Explore the complete architecture of the loaded model</p>
        <span>
          {isLoading
            ? "Loading..."
            : isConnected
              ? "Model Connected"
              : "Disconnected"}
        </span>
      </header>
      {/* Model Overview Section */}
      <button onClick={() => toggleSection("overview")}>
        {expandedSections.has("overview") ? <ChevronDown /> : <ChevronRight />}
        <Box /> Model Overview
      </button>
      {expandedSections.has("overview") && (
        <section>
          <div>
            <h3>Total Parameters</h3>
            <p>{formatNumber(modelInfo.totalParams)}</p>
            <p>
              {modelInfo.totalParams > 1e9
                ? `${(modelInfo.totalParams / 1e9).toFixed(1)} Billion`
                : modelInfo.totalParams > 1e6
                  ? `${(modelInfo.totalParams / 1e6).toFixed(1)} Million`
                  : formatNumber(modelInfo.totalParams)}
            </p>
          </div>
          <div>
            <h3>
              <Database /> Vocabulary Size
            </h3>
            <p>{formatNumber(modelInfo.vocabSize)}</p>
            <p>Unique tokens</p>
          </div>
          <div>
            <h3>Context Length</h3>
            <p>{formatNumber(modelInfo.maxPositions)}</p>
            <p>Max tokens</p>
          </div>
          <div>
            <h3>Architecture</h3>
            <p>Transformer</p>
            <p>GPT-style</p>
          </div>

          {/* Device Information */}
          {deviceInfo && (
            <div>
              <h3>
                <Cpu /> Device Information
              </h3>
              <p>Running on: {deviceInfo}</p>
            </div>
          )}

          {/* Model Configuration */}
          {modelConfig && (
            <div>
              <h3>Configuration</h3>
              <p>Activation: {String(modelConfig.activation_function)}</p>
              <p>Cache: {modelConfig.use_cache ? "Enabled" : "Disabled"}</p>
            </div>
          )}
        </section>
      )}
      {/* Architecture Details Section */}
      <button onClick={() => toggleSection("architecture")}>
        {expandedSections.has("architecture") ? <ChevronDown /> : <ChevronRight />}
        <Network /> Architecture Details
      </button>
      {expandedSections.has("architecture") && (
        <section>
          {/* Layer Structure */}
          <h3>
            <Layers /> Layer Structure (×{modelInfo.layers})
          </h3>
          <ul>
            <li>
              {/* Guard against 0/0 = NaN before model info has loaded */}
              Multi-Head Attention ({modelInfo.heads} heads,{" "}
              {modelInfo.heads > 0 ? modelInfo.hiddenSize / modelInfo.heads : 0}{" "}
              dims/head)
              <ul>
                <li>
                  QKV Projection: {modelInfo.hiddenSize} → {modelInfo.hiddenSize * 3}
                </li>
                <li>
                  Output Projection: {modelInfo.hiddenSize} → {modelInfo.hiddenSize}
                </li>
              </ul>
            </li>
            <li>
              Feed-Forward Network (4× expansion)
              <ul>
                <li>
                  FC1: {modelInfo.hiddenSize} → {modelInfo.hiddenSize * 4}
                </li>
                <li>
                  FC2: {modelInfo.hiddenSize * 4} → {modelInfo.hiddenSize}
                </li>
              </ul>
            </li>
            <li>Layer Normalization</li>
            <li>Residual Connections</li>
          </ul>

          {/* Data Flow */}
          <h3>
            <Activity /> Data Flow Through Model
          </h3>
          <pre>
            {`Input Text
   ↓
[Token Embeddings] (${modelInfo.vocabSize.toLocaleString()} × ${modelInfo.hiddenSize.toLocaleString()})
   ↓
[+ Rotary Position Embeddings]
   ↓
╔═══════════════════════╗
║ Layer 0               ║
║  ├─ Attention (${modelInfo.heads}h)    ║
║  └─ FFN (${modelInfo.hiddenSize * 4}d)        ║
╚═══════════════════════╝
   ↓
... (${modelInfo.layers - 2} more layers)
   ↓
╔═══════════════════════╗
║ Layer ${modelInfo.layers - 1}               ║
║  ├─ Attention (${modelInfo.heads}h)    ║
║  └─ FFN (${modelInfo.hiddenSize * 4}d)        ║
╚═══════════════════════╝
   ↓
[Layer Norm]
   ↓
[Language Model Head]
   ↓
${modelInfo.vocabSize.toLocaleString()} Token Probabilities`}
          </pre>
        </section>
      )}
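      {/*
        Worked example of the arithmetic above, assuming a GPT-2 Small-sized
        config (hypothetical values, not read from the backend):
          hiddenSize = 768, heads = 12  ->  768 / 12 = 64 dims/head
          QKV projection:  768 -> 2304  (3 × 768, fused Q, K, V)
          FFN expansion:   768 -> 3072  (4 × 768), then FC2: 3072 -> 768
      */}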
      {/* Accessible Components Section */}
      <button onClick={() => toggleSection("accessible")}>
        {expandedSections.has("accessible") ? <ChevronDown /> : <ChevronRight />}
        <Eye /> Accessible Components
      </button>
      {expandedSections.has("accessible") && (
        <section>
          <ul>
            {modelInfo.accessible.map((item, idx) => (
              <li key={idx}>{item}</li>
            ))}
          </ul>
          <div>
            <h4>
              <Info /> Complete Transparency
            </h4>
            <p>
              Every computation, weight, and decision in the model's{" "}
              {formatNumber(modelInfo.totalParams)} parameters is accessible.
              The "black box" becomes a "glass box": we can visualize the
              entire thinking process as tokens flow through the network.
            </p>
          </div>
        </section>
      )}
      {/* Decision Path Visualization Section */}
      <button onClick={() => toggleSection("decision-path")}>
        {expandedSections.has("decision-path") ? <ChevronDown /> : <ChevronRight />}
        <GitBranch /> Decision Path
      </button>
      {expandedSections.has("decision-path") && (
        <Suspense fallback={<div>Loading decision path visualization...</div>}>
          <DecisionPath3D />
        </Suspense>
      )}
      {/* 3D Visualization Section */}
      <button onClick={() => toggleSection("3d")}>
        {expandedSections.has("3d") ? <ChevronDown /> : <ChevronRight />}
        <Move3D /> 3D Architecture
      </button>
      {expandedSections.has("3d") && (
        <Suspense fallback={<div>Loading 3D visualization...</div>}>
          <ModelArchitecture3D />
        </Suspense>
      )}

      {/* Computation Stats Section */}
      <button onClick={() => toggleSection("computation")}>
        {expandedSections.has("computation") ? <ChevronDown /> : <ChevronRight />}
        <Zap /> Computation Stats
      </button>
      {expandedSections.has("computation") && (
        <section>
          <ul>
            <li>Operations per token: ~{formatNumber(modelInfo.totalParams * 2)}</li>
            <li>
              Attention computations: {modelInfo.layers * modelInfo.heads} heads
            </li>
            <li>
              Probability calculations: {modelInfo.vocabSize.toLocaleString()} tokens
            </li>
            <li>
              Memory footprint (FP32):{" "}
              {((modelInfo.totalParams * 4) / (1024 * 1024 * 1024)).toFixed(2)} GB
            </li>
            <li>
              Memory footprint (FP16):{" "}
              {((modelInfo.totalParams * 2) / (1024 * 1024 * 1024)).toFixed(2)} GB
            </li>
          </ul>
          <p>
            Each token generation involves passing through all {modelInfo.layers}{" "}
            layers, computing attention across {modelInfo.layers * modelInfo.heads}{" "}
            heads, and producing probabilities for{" "}
            {modelInfo.vocabSize.toLocaleString()} possible next tokens.
          </p>
        </section>
      )}
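      {/*
        How the stats above are derived, with a hypothetical 124M-parameter
        model for illustration:
          Operations/token ≈ 2 × params (one multiply + one add per weight)
                           ≈ 2 × 124M = 248M ops
          FP32 memory = params × 4 bytes = 496 MB ≈ 0.46 GB
          FP16 memory = params × 2 bytes = 248 MB ≈ 0.23 GB
      */}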
    </div>
  );
}
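/*
 * Sketch of the GET /model/info payload this component assumes, based on the
 * fields read in fetchModelInfo above. The example values are illustrative
 * placeholders, not actual backend output:
 *
 *   {
 *     "name": "gpt2",
 *     "type": "causal-lm",
 *     "totalParams": 124000000,
 *     "layers": 12,
 *     "heads": 12,
 *     "hiddenSize": 768,
 *     "vocabSize": 50257,
 *     "maxPositions": 1024,
 *     "architecture": "GPT2LMHeadModel",   // wrapped in an array by setModelInfo
 *     "accessible": ["embeddings", "attention", "hidden_states", "logits"],
 *     "device": "cuda:0",
 *     "config": { "activation_function": "gelu_new", "use_cache": true }
 *   }
 */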