Part Number: TDA4VMXEVM
Hi,
I am trying to train a ssd-lite-regnet-fpn model with jacinto-ai/pytorch-mmdetection and I am running into errors. I have installed open-mmlab/mmdetection and am able to train and test models. When I try to follow instructions on jacinto-ai/pytorch-mmdetection, I get the following error:
(open-mmlab) pkambhampati@usrdlx836as001:~/KTM/pytorch-mmdetection$ CUDA_VISIBLE_DEVICES=6,7 ./run_detection_train.sh
Training with: ./configs/ssd/ssd-lite_regnet_fpn_bgr.py @ 2020-09-02_21-13-54
*****************************************
Setting OMP_NUM_THREADS environment variable for each process to be 1 in default, to avoid your system being overloaded, please further tune the variable for optimal performance in your application as needed.
*****************************************
Traceback (most recent call last):
Traceback (most recent call last):
File "./xmmdet/tools/train.py", line 14, in <module>
File "./xmmdet/tools/train.py", line 14, in <module>
from xmmdet.apis import set_random_seed, train_detectorfrom xmmdet.apis import set_random_seed, train_detector
ModuleNotFoundError: ModuleNotFoundErrorNo module named 'xmmdet':
No module named 'xmmdet'
Traceback (most recent call last):
File "./scripts/train_detection_main.py", line 30, in <module>
distributed_launch.main()
File "/data/home/pkambhampati/anaconda3/envs/open-mmlab/lib/python3.7/site-packages/torch/distributed/launch.py", line 257, in main
cmd=cmd)
subprocess.CalledProcessError: Command '['/data/home/pkambhampati/anaconda3/envs/open-mmlab/bin/python', '-u', './xmmdet/tools/train.py', '--local_rank=1', '--launcher=pytorch', './configs/ssd/ssd-lite_regnet_fpn_bgr.py']' returned non-zero exit status 1.
Done.
Are there any further steps that I need to follow to perform the training?
Regards,
Prithvi