🧠Gemma-2-2B: Pre-Softmax Surgery Lab
Optimized for Low-VRAM. Model loads once. Greedy cache removal.
Input Text
1. Run Analysis
Baseline Prediction
Pre-Softmax Attention (Gemma 18x8)
Token Indices