Is computation run on a Tenstorrent Tensix core accelerator card guaranteed to be deterministic?

noahhein · January 16, 2025, 10:12pm

Question

General Q: Is computation run on a Tenstorrent Tensix core accelerator card guaranteed to be deterministic?

Specifically, imagine you set up several (n+1; n>1) servers with identical hardware + a N300S card and identical Ubuntu 20.04 + TT-BUDA software stack (including all dependencies), then run the same AI model (from tt-buda-demos/model_demos) with the same configuration/inputs on each server. Would you expect to get exactly the same output from the model run on every server?

Assumptions: there is no randomness intentionally added to the model runs, only interested in potential sources of non-determinism in the hardware architecture, multi-core parallel computation or TT-BUDA/Metalium software stack.

Answer

Yes the output should be identical run to run assuming no randomness in the model. Same input => same output.

Topic	Replies	Views
This Thursday: Attend our online event! Community events	17	March 10, 2025
Upcoming webinar: Modeling multi-device and scale-out in a compiler Community events	21	March 26, 2025
System Compatibility FAQ	27	January 16, 2025
Announcing the Tenstorrent Bounty Program! Community	54	February 24, 2025
Office Hours with Tenstorrent: Accelerate AI & Optimize with Our Engineers 🚀 Community	17	January 23, 2025

Is computation run on a Tenstorrent Tensix core accelerator card guaranteed to be deterministic?

Question

Answer

Related topics