 The authors use a transfer learning technique to initialize the parameters of the VQA with those obtained from solving a smaller problem, then apply these parameters to a larger problem. Simulations have shown that this method can reduce the occurrence of the baron plateau and improve the training efficiency. This article was authored by Huan Yu Liu, typing Sun, Yu Chen Wu, and others.