-
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper • 2512.17351 • Published • 24 -
facebook/PhysicsLM4.2__LlamaCanon-8B-Nemo-1T-lr0.003
Updated • 4 -
facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.002
Updated • 2 -
facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.003
Updated • 1
Zeyuan Allen-Zhu
zhuzeyuan
AI & ML interests
None yet
Recent Activity
updated
a model
12 days ago
facebook/PhysicsLM4.2__LlamaCanon-8B-Nemo-1T-lr0.003
updated
a model
12 days ago
facebook/PhysicsLM4.2__LlamaCanon-8B-Nemo-1T-lr0.002
updated
a model
12 days ago
facebook/PhysicsLM4.2__LlamaCanon-3B-Nemo-1T-lr0.003