Tool/software:
Hi TI Team,
We are using the TDA4VM platform with custom deep learning models running through the TFLite runtime with the TIDL delegate, as part of the EdgeAI SDK (from https://github.com/TexasInstruments/edgeai-tidl-tools).
Our goal is to measure the effective TOPS (Tera Operations Per Second) utilized per model or per inference execution, in order to understand hardware utilization and optimize our deployment.
We have already integrated our models using the edgeai-tidl-tools
flow and can run inference via TFLite in both Python and C++.
Could you please guide us on the following:
-
What is the recommended method to estimate or measure TOPS per inference using the EdgeAI SDK or TIDL tools?
- Can we access real-time TIDL accelerator utilization or profiling data (e.g., via logs or system APIs)?
Regards
Ashay