Hi,
I'm trying to convert a tensorflow model to TIDL and run it on a AM57x processor. Towards this, I followed the instructions shared in this thread to convert the model and the model got converted.
For simplicity, I have used the network in the TensorflowExample (cifar10_train.py) itself as reference. Only difference in my model is that I'm using MNIST dataset for training and testing instead of CIFAR-10 dataset. I have validated the results of my modified network on tensorflow and so, I expect it to work with TIDL also (given the network in TensorflowExample is validated on TIDL). However, when I tried to run the application using the converted model, I get the following issues -
1. Application terminates with segfault (core dump analysis below) if the target device type is EVE.
(gdb) bt
#0 0xb663b522 in std::local_Rb_tree_decrement (__x=0x5714e0)
at /home/tcwg-buildslave/workspace/tcwg-make-release/label/docker-trusty-amd64-tcwg-build/target/arm-linux-gnueabihf/snapshots/gcc-linaro-6.2-2016.11/libstdc++-v3/src/c++98/tree.cc:98
#1 0xb6d75c5e in Coal::Object::Object(Coal::Object::Type, Coal::Object*) () from /usr/lib/libOpenCL.so.1
#2 0xb6d6f91a in Coal::Event::Event(Coal::CommandQueue*, Coal::Event::Status, unsigned int, _cl_event* const*, int*) () from /usr/lib/libOpenCL.so.1
#3 0xb6d72820 in Coal::KernelEvent::KernelEvent(Coal::CommandQueue*, Coal::Kernel*, unsigned int, unsigned int const*, unsigned int const*, unsigned int const*, unsigned int, _cl_event* const*, int*) () from /usr/lib/libOpenCL.so.1
#4 0xb6d72cca in Coal::TaskEvent::TaskEvent(Coal::CommandQueue*, Coal::Kernel*, unsigned int, _cl_event* const*, int*) () from /usr/lib/libOpenCL.so.1
#5 0xb6d6d51a in clEnqueueTask () from /usr/lib/libOpenCL.so.1
#6 0x0001d8ea in tidl::Kernel::RunAsync() ()
#7 0x0001b0f2 in tidl::ExecutionObject::ProcessFrameStartAsync() ()
#8 0x00014b38 in RunConfiguration (config_file=..., num_devices=num_devices@entry=1, device_type=device_type@entry=tidl::DeviceType::DLA, format=format@entry=0, input_file=...) at main.cpp:229
#9 0x000137a2 in main (argc=3, argv=0xbeb1dc84) at main.cpp:116
2. Application terminates with segmentation fault if target device type is DSP and number of devices configured is 2 instead of 1
ERROR: [ Line: 312] CL_INVALID_PROGRAM_EXECUTABLE
core dump analysis:
(gdb) bt
#0 0xb6d3fd02 in std::_Rb_tree<Coal::Object*, Coal::Object*, std::_Identity<Coal::Object*>, std::less<Coal::Object*>, std::allocator<Coal::Object*> >::_M_erase(std::_Rb_tree_node<Coal::Object*>*) () from /usr/lib/libOpenCL.so.1
#1 0xbee35614 in ?? ()
Is there some step that I'm missing?
Regards,
Manu