Visualize Any Hugging Face Model

rippeltippel 35 points 4 comments May 06, 2026
hfviewer.com · View on Hacker News

Discussion Highlights (2 comments)

anavid7

Where is it capturing the model "structure" from?

aesthesia

This is a neat idea. When I'm looking up models I usually want to see something about the architecture, but also some of the hyperparameters for the specific model---residual dimension, total number of layers, tokenizer configs. There's some of that in the visualization but it's spotty. The results for Nemotron 3 Nano are hard to parse, and I think actually incorrect: https://hfviewer.com/nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-B... I'm guessing this is because the implementation uses layers that are all instances of the same class, with forward passes that branch on the layer type specified at construction time.

Semantic search powered by Rivestack pgvector
8,303 stories · 78,303 chunks indexed