Observing that Semantic features learned in an image classification task and Appearance features learned in a similarity matching task complement each other, we build a twofold Siamese network, named SA-Siam, for real-time object tracking. SA-Siam i
GOTURN,Li_High_Performance_Visual_CVPR_2018_paper,MDNEt,MOT,On The Stability of Video Detection and Tracking,SiamFC,Wang_Visual_Tracking_With_ICCV_2015_paper