Hi,
I converted an SSD model for TIDL using the deploy.prototxt and caffemodel from "github.com/tidsp/caffe-jacinto-models/tree/caffe-0.16/trained/object_detection/voc0712/JDetNet/ssd512x512_ds_PSP_dsFac_32_fc_0_hdDS8_1_kerMbox_3_1stHdSameOpCh_1/sparse",
and I changed "keep_top_k: 200" to "keep_top_k: 20" in deploy.prototxt.
However, the converted model runs very slowly on the TDA2x platform. I ran "f: TIDL usecase", "1: TIDL File I/O Usecase", "2: EVE", and then "p: Print Statistics", and the reported timing is as follows:
[IPU1-0] 70.164551 s: ### CPU [ EVE1], LinkID [ 49],
[IPU1-0] 70.164612 s:
[IPU1-0] 70.164673 s: [ ALG_TIDL ] Link Statistics,
[IPU1-0] 70.165192 s: ******************************
[IPU1-0] 70.165283 s:
[IPU1-0] 70.165314 s: Elapsed time = 10549 msec
[IPU1-0] 70.165588 s:
[IPU1-0] 70.165649 s: New data Recv = 0.47 fps
[IPU1-0] 70.165710 s:
[IPU1-0] 70.165771 s: Input Statistics,
[IPU1-0] 70.165832 s:
[IPU1-0] 70.165893 s: CH | In Recv | In Drop | In User Drop | In Process
[IPU1-0] 70.166168 s: | FPS | FPS | FPS | FPS
[IPU1-0] 70.166259 s: --------------------------------------------------
[IPU1-0] 70.166351 s: 0 | 0.28 0. 0 0. 0 0.37
[IPU1-0] 70.166473 s:
[IPU1-0] 70.166687 s: Output Statistics,
[IPU1-0] 70.166748 s:
[IPU1-0] 70.166809 s: CH | Out | Out | Out Drop | Out User Drop
[IPU1-0] 70.166900 s: | ID | FPS | FPS | FPS
[IPU1-0] 70.167144 s: ---------------------------------------------
[IPU1-0] 70.167236 s: 0 | 0 0.37 0. 0 0. 0
[IPU1-0] 70.167510 s:
[IPU1-0] 70.167571 s: [ ALG_TIDL ] LATENCY,
[IPU1-0] 70.167632 s: ********************
[IPU1-0] 70.167693 s: Local Link Latency : Avg = 2587537 us, Min = 2488867 us, Max = 2880833 us,
[IPU1-0] 70.168029 s: Source to Link Latency : Avg = 10172909 us, Min = 10031111 us, Max = 10340694 us,
[IPU1-0] 70.168181 s:
[IPU1-0] 70.168303 s:
[IPU1-0] 70.168334 s: ### CPU [ EVE1], LinkID [ 0],
But when I run the SSD model "tidl_param_jdetNet_ssd.bin" released in "PROCESSOR_SDK_VISION_03_03_00_00\ti_components\algorithms\REL.TIDL.01.01.00.00\modules\ti_dl\test\testvecs\config\tidl_models\jdetnet", it runs fast, needing only 177236 us.
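To put the two figures side by side, here is a minimal Python sketch that turns the per-frame latencies into frame rates and a slowdown factor. It assumes the 177236 us figure for the released model is the same kind of measurement as the "Local Link Latency : Avg" value in the log above; the variable and function names are only illustrative.

```python
# Latencies copied from this post, in microseconds.
converted_us = 2587537  # "Local Link Latency : Avg" for the converted model
reference_us = 177236   # figure reported for tidl_param_jdetNet_ssd.bin


def fps(latency_us):
    """Frames per second implied by a per-frame latency in microseconds."""
    return 1_000_000 / latency_us


# Ratio of the two latencies (assumes both measure the same thing).
slowdown = converted_us / reference_us

print(f"converted model: {fps(converted_us):.2f} fps")
print(f"released model:  {fps(reference_us):.2f} fps")
print(f"slowdown factor: {slowdown:.1f}x")
```

The frame rate implied for the converted model is close to the 0.37 fps shown in the link statistics, so the two log sections are at least consistent with each other.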
My import config file is:
# Default - 0
randParams = 0
# 0: Caffe, 1: TensorFlow, Default - 0
modelType = 0
# 0: Fixed quantization by training framework, 1: Dynamic quantization by TIDL, Default - 1
quantizationStyle = 1
# quantRoundAdd/100 will be added while rounding to integer, Default - 50
quantRoundAdd = 25
numParamBits = 8
# 0 : 8bit Unsigned, 1 : 8bit Signed Default - 1
inElementType = 0
inputNetFile = "..\..\test\testvecs\config\caffe-jacinto-models\trained\image_detection\jdetNet_512x512\deploy.prototxt"
inputParamsFile = "..\..\test\testvecs\config\caffe-jacinto-models\trained\image_detection\jdetNet_512x512\ti-jdetNet_512x512.caffemodel"
outputNetFile = "..\..\test\testvecs\config\tidl_models\tidl_net_jdetNet_ssd_512x512.bin"
outputParamsFile = "..\..\test\testvecs\config\tidl_models\tidl_param_jdetNet_ssd_512x512.bin"
rawSampleInData = 1
preProcType = 4
sampleInData = "..\..\test\testvecs\input\trace_dump_0_512x512.y"
tidlStatsTool = "..\quantStatsTool\eve_test_dl_algo.out.exe"
layersGroupId = 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 1 2 1 2 1 2 1 2 1 2 1 2 1 2 1 2 1 2 2 2 2 2 2 2 0
conv2dKernelType = 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
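One quick sanity check on a config like this is to count the entries in the two per-layer lists. The sketch below assumes that layersGroupId and conv2dKernelType are both meant to carry one value per layer of the imported network, which is not confirmed in this post; list_lengths is a hypothetical helper and the sample text is a shortened, made-up config, not the file above.

```python
def list_lengths(config_text, keys=("layersGroupId", "conv2dKernelType")):
    """Return {key: number of space-separated values} for the given keys."""
    lengths = {}
    for line in config_text.splitlines():
        line = line.strip()
        # Skip blank lines, comments, and anything that is not key = value.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = (part.strip() for part in line.split("=", 1))
        if key in keys:
            lengths[key] = len(value.split())
    return lengths


# Shortened, made-up example; paste the real import config text here instead.
sample = """
# comment lines are ignored
layersGroupId = 0 1 1 2 0
conv2dKernelType = 0 0 1 1 1
"""

lengths = list_lengths(sample)
print(lengths)
if len(set(lengths.values())) > 1:
    print("The two lists have different lengths; worth double-checking.")
```

If the counts differ, or do not match the number of layers the import tool reports, that may be worth ruling out before looking at other causes of the slowdown.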
Am I missing something? Please help!
Regards